SLIP 2008 - Wim Heirman - Rent’s Rule and Parallel Programs

Rent’s Rule and Parallel Programs: Characterizing Network Traffic Behavior W. Heirman, J. Dambre, D. Stroobandt, J. Van Campenhout ELIS Department, Ghent University, Belgium Sponsored by IAP-V PHOTON & IAP-VI photonics@be, Belgian Science Policy Office P PHOTONnetwork Outline • • • • Introduction Rent’s rule & traffic locality Time-varying network traffic Conclusions 2 Evolution of Systems design • VLSI systems get ever more complicated • More software, processor IP blocks, hardware/software co-design • Ad-hoc global wiring Network-onChip (“communication IP block”), long wires packets • What with Rent’s rule? 3 Rent’s rule: power law relation Rent’s Rule components (G) log T,B vs. terminals (T) T = tG p processors (N) vs. bandwidth (B) B = bN p [1] [2] log G,N circuits wires processor cores networks-on-chip 4 [1] Landman and Russo, IEEE Trans. on Computers, 1971 [2] D. Greenfield et. al, NOCS 2007. Multiprocessor + Network architecture Shared memory: network is part of memory hierarchy supercomputer CPU cache NetIF NetIF NetIF MEM CPU MEM CPU MEM CPU MEM CPU MEM CPU MEM NetIF NetIF NetIF on-chip CPU MEM CPU MEM CPU MEM NetIF NetIF NetIF server 5 NoC design: problems and opportunities • Simple traffic models: uniform, hot-spot, fixed bandwidth distribution – Ignores locality, time-variance in network traffic – Yields non-optimal NoC designs (uniform vs. non-uniform, static vs. reconfigurable) • Opportunity: better traffic models, analytical tools vs. trial-and-error 6 Outline • • • • Introduction Rent’s rule & traffic locality Time-varying network traffic Conclusions 7 Partitioning nodes by communication intensity • Hierarchically partition nodes according to communication (hMETIS) • Just as for wires, but: • Communication graph is usually fully connected • Weight on each connection = total communication between node pair • Fit power law on (cluster size, bandwidth) distribution 8 Rent exponent measured Rent exponent (dependent on application): • 16 nodes: .55-.65 • 64 nodes: .66-.74 9 “Wire length” distribution Distribution of communication vs. distance distance(A, B) = log2(size of smallest cluster containing both A and B) 10 Outline • • • • Introduction Rent’s rule & traffic locality Time-varying network traffic Conclusions 11 Communication varies through time • Hardware: – fixed function – traffic remains similar through time • Software: – more complex, different phases (e.g. function call) – communication patterns can change trough time 12 Communication varies through time • Repeat partitioning per interval of 100k clock cycles • Periods of high and low communication alternate • Rent exponent badly defined during periods of low communication 13 Communication varies through time • Repeat partitioning per interval of 100k clock cycles • Periods of high and low communication alternate • Rent exponent badly defined during periods of low communication 14 Node placement vs. variable traffic • Node partitioning can lead to optimal node placement (minimal communication distances) • But: varying traffic placement? varying optimal • Compute interval similarity, based on partitionings • Account for traffic intensity (moving noncommunicating nodes has no effect) 15 Similarity of communication between intervals • For time intervals X and Y, each with traffic pattern traffic and optimal partitioning part • part[X] cuts minimal fraction of traffic[X] • assume we use part[X] in interval Y, what fraction of traffic[Y] is cut? cut[X,Y] • always more than part[Y] would = cut[Y,Y] • similarity of partitionings, accounting for traffic intensity: cut[ X , X ] + cut[Y , Y ] sim[ X , Y ] = cut[Y , X ] + cut[ X , Y ] 16 Similarity measure properties sim[ X , Y ] = cut[ X , X ] + cut[Y , Y ] cut[Y , X ] + cut[ X , Y ] • cut[X,X] < cut[Y,X] and cut[Y,Y] < cut[X,Y] 0 ≤ sim[X,Y] ≤ 1 • sim[X,X] = 1 • when traffic[X] >> traffic[Y]: cut[*,Y] ~ 0 sim[X,Y] ~ cut[X,X]/cut[Y,X] (only dependent on traffic[X]) 17 Similarity matrix: FFT 18 Similarity matrix: Water 19 Suitability of a single placement • Static network one single placement • How suitable is this placement through time? • Suitability measure: based on partitionings (as are placements) • Optimal partitioning for traffic[X]: part[X], cutting a bandwidth cut[X,X]. • Suitability of partitioning P: cut[X,X] / cut[P,X] 20 Suitability of a single placement 21 Outline • • • • Introduction Rent’s rule & traffic locality Time-varying network traffic Conclusions 22 Conclusions • Measuring Rent exponents: – small number of nodes: difficult to measure, lots of noise – shared-memory: implicit communication, lots of non-essential communication better/other results with message-passing? • Still, difference in locality is visible, can be traced back to the benchmark’s algorithm • Time-variant communication! • Rent’s Rule (partitioning) is helpful to study communication behavior 23 Thank you! P PHOTONnetwork

Related docs
Slip Fall
Views: 48  |  Downloads: 1
Slip Copy, 2008 WL 5272770 (S
Views: 2  |  Downloads: 0
non slip tiles
Views: 56  |  Downloads: 0
Parallel-Processing-HOWTO
Views: 67  |  Downloads: 10
Parallel Genetic Algorithm
Views: 0  |  Downloads: 0
Hand Gandhi a Pink Slip
Views: 0  |  Downloads: 0
2008 2009
Views: 1  |  Downloads: 0
Packing Slip
Views: 1275  |  Downloads: 217
SLIP LEASE - 2008
Views: 35  |  Downloads: 2
Other docs by equalityluvv
Direct Deposit Enrollment Form
Views: 491  |  Downloads: 25
Harley Davidson Inc Ammendments and Bylaws
Views: 224  |  Downloads: 1
Stock Certificate for Common Stocks
Views: 447  |  Downloads: 17
wel-all
Views: 260  |  Downloads: 3
Minutes of Annual Shareholders Meeting
Views: 339  |  Downloads: 12
Temporary help checklist
Views: 324  |  Downloads: 5
CorpDocs- Corporate Governance Guidelines
Views: 342  |  Downloads: 21
Agreement-Trademark Assignment
Views: 501  |  Downloads: 19
The Home Depot Inc Ammendments and Bylaws
Views: 215  |  Downloads: 1
EMPLOYEE DATA SHEET
Views: 673  |  Downloads: 38