during execution. What is the maximum degree of concurrency of the database query examples?
• The average degree of concurrency is the average number of tasks that can be processed in parallel over the execution of the program. Assuming that each task in the database example takes identical processing time, what is the average degree of concurrency in each decomposition?
• The degree of concurrency increases as the decomposition becomes finer in granularity, and vice versa.

Critical Path Length
• A directed path in the task dependency graph represents a sequence of tasks that must be processed one after the other.
• The longest such path determines the shortest time in which the program can be executed in parallel.
• The length of the longest path in a task dependency graph is called the critical path length.

Critical Path Length
Consider the task dependency graphs of the two database query decompositions:
• What are the critical path lengths for the two task dependency graphs?
• If each task takes 10 time units, what is the shortest parallel execution time for each decomposition?
• How many processors are needed in each case to achieve this minimum parallel execution time?
• What is the maximum degree of concurrency?

Limits on Parallel Performance
• It would appear that the parallel time can be made arbitrarily small by making the decomposition finer in granularity.
• There is an inherent bound on how fine the granularity of a computation can be. For example, in the case of multiplying a dense matrix with a vector, there can be no more than O(n²) concurrent tasks.
• Concurrent tasks may also have to exchange data with other tasks. This results in communication overhead. The tradeoff between the granularity of a decomposition and the associated overheads often determines performance bounds.

Task Interaction Graphs
• Subtasks generally exchange data with others in a decomposition. For example, even in the trivial decomposition of the dense matrix-vector product, if the vector is not replicated across all tasks, they will have to communicate elements of the vector.
• The graph of tasks (nodes) and their interactions/data exchanges (edges) is referred to as a task interaction graph.
• Note that task interaction graphs represent data dependencies, whereas task dependency graphs represent control dependencies.

Task Interaction Graphs: An Example
Consider the problem of multiplying a sparse matrix A with a vector b. The following observations can be made:
• As before, the computation of each element of the result vector can be viewed as an independent task.
• Unlike a dense matrix-vector product, though, only the non-zero elements of matrix A participate in the computation.
• If, for memory optimality, we also partition b across tasks, then one can see that the task interaction graph of the computation is identical to the graph of the matrix A (the graph for which A represents the adjacency structure), as the sketch below illustrates.
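To make the correspondence concrete, here is a minimal Python sketch (illustrative only, not from the slides; all names are ours) that derives the task interaction graph for the sparse matrix-vector product y = Ab, assuming task i computes y[i] and owns b[i]:

def interaction_graph(nonzeros):
    """nonzeros: set of (i, j) index pairs where A[i][j] != 0.
    Task i needs b[j] from task j for every off-diagonal nonzero A[i][j],
    so collapsing directions gives the undirected graph of A's
    nonzero structure."""
    edges = set()
    for i, j in nonzeros:
        if i != j:                                 # diagonal entries need no communication
            edges.add((min(i, j), max(i, j)))      # undirected edge {i, j}
    return edges

# A small 5x5 sparse matrix given by its nonzero positions (hypothetical example):
A_nonzeros = {(0, 0), (0, 1), (0, 4),
              (1, 1), (1, 2),
              (2, 2), (2, 3),
              (3, 3), (3, 4),
              (4, 0), (4, 4)}
print(sorted(interaction_graph(A_nonzeros)))
# -> [(0, 1), (0, 4), (1, 2), (2, 3), (3, 4)]  == the graph of A

The edge set that comes out is exactly the nonzero structure of A, which is the observation made in the last bullet above.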
Task Interaction Graphs, Granularity, and Communication
In general, if the granularity of a decomposition is finer, the associated overhead (as a ratio of the useful work associated with a task) increases.
Example: Consider the sparse matrix-vector product example from the previous foil. Assume that each node takes unit time to process and each interaction (edge) causes an overhead of a unit time. Viewing node 0 as an independent task involves useful computation of one time unit and overhead (communication) of three time units, a computation-to-communication ratio of 1:3. Now, if we consider nodes 0, 4, and 5 as one task, then the task has useful computation totaling three time units and communication corresponding to four time units (four edges), a ratio of 3:4. Clearly, this is a more favorable ratio than the former case.

Processes and Mapping
• In general, the number of tasks in a decomposition exceeds the number of processing elements available.
• For this reason, a parallel algorithm must also provide a mapping of tasks to processes.
Note: We refer to the mapping as being from tasks to processes, as opposed to processors. This is because typical programming APIs, as we shall see, do not allow easy binding of tasks to physical processors. Rather, we aggregate tasks into processes and rely on the system to map these processes to physical processors. We use "process" not in the UNIX sense, but simply as a collection of tasks and associated data.

Processes and Mapping
• Appropriate mapping of tasks to processes is critical to the parallel performance of an algorithm.
• Mappings are determined by both the task dependency and task interaction graphs.
• Task dependency graphs can be used to ensure that work is equally spread across all processes at any point (minimum idling and optimal load balance).
• Task interaction graphs can be used to make sure that processes need minimum interaction with other processes (minimum communication).

Processes and Mapping
An appropriate mapping must minimize parallel execution time by:
• Mapping independent tasks to different processes.
• Assigning tasks on the critical path to processes as soon as they become available.
• Minimizing interaction between processes by mapping tasks with dense interactions to the same process.
Note: These criteria often conflict with each other. For example, a decomposition into one task (or no decomposition at all) minimizes interaction but does not result in any speedup at all!

Processes and Mapping: Example
Mapping tasks in the database query decomposition to processes. These mappings were arrived at by viewing the dependency graph in terms of levels (no two nodes in a level have dependencies). Tasks within a single level are then assigned to different processes, as the sketch below shows.
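As a rough illustration of this level-by-level strategy, here is a minimal Python sketch (our own construction, not the slides' exact procedure) that groups the tasks of a dependency graph into topological levels and deals each level's tasks out to distinct processes:

def level_map(num_tasks, edges, num_procs):
    """edges: list of (u, v) pairs meaning task u must finish before task v.
    Returns {task: process}, assigning level by level so that tasks within
    a level land on different processes (round-robin if a level is large)."""
    succ = [[] for _ in range(num_tasks)]
    indeg = [0] * num_tasks
    for u, v in edges:
        succ[u].append(v)
        indeg[v] += 1

    mapping = {}
    level = [t for t in range(num_tasks) if indeg[t] == 0]  # tasks with no predecessors
    while level:
        for slot, task in enumerate(level):      # spread one level's tasks
            mapping[task] = slot % num_procs     # across distinct processes
        nxt = []
        for task in level:                       # peel the level off the graph
            for v in succ[task]:
                indeg[v] -= 1
                if indeg[v] == 0:
                    nxt.append(v)
        level = nxt
    return mapping

# Hypothetical graph shaped like the query decomposition: four leaf tasks
# feeding two intermediate joins, which feed a single root task.
edges = [(0, 4), (1, 4), (2, 5), (3, 5), (4, 6), (5, 6)]
print(level_map(7, edges, num_procs=4))
# Level {0,1,2,3} -> processes 0..3, level {4,5} -> processes 0,1, level {6} -> process 0

Note that this only captures the load-balance criterion; a fuller mapping heuristic would also consult the task interaction graph to co-locate heavily interacting tasks, as the preceding slides point out.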