System Interconnection Design Trade-offs in Three-Dimensional (3-D
Document Sample


System Interconnection Design Trade-offs in
Three-Dimensional (3-D) Integrated Circuits
Roshan Weerasekera
roshan@kth.se
Department of Electronics, Computer, and Software Systems
School of Information and Communication Technologies
The Royal Institute of Technology (KTH)
164 40 Kista, Stockholm, Sweden.
Presentation Outline
3-D Integration: what, why and when ?
Thesis Focus and Contributions
− Electrical Modelling and Analysis of 3-D IC interconnects
− Signalling techniques for on-chip global interconnects
− Cost, Performance and Technological Tradeoffs for 3-D Integration
Conclusions
RW, December, 2008 2
3-D Integration: What, Why and When ?
Stacking two or more planar modules/chips/dies and electrically connecting them
Wiring Around Packages Wiring Around dies Wiring through dies
(Package in/on Package) (Multi Die Packages) (Silicon Stacking)
Chip 2
Chip 1
Source: http://imec.be/wwwinter/mediacenter/en/SR2006/681542.html
http://www.solid-state.com/display_article/257485/5/none/none/Dept/Mapping-progress-in-3D-IC-integration
RW, December, 2008 3
3-D Integration: What, Why and When ?
• Interconnect Delay
• More Transistors • Power Consumption
• Increase device density • Increased Die Size • Noise
• Improve Functionality • Mixed Technologies • Temperature
• Complexity
• Disparate Technologies
System-on-Chip
Cost per function
Time to market
System in Package
And 3-D packaging
System Complexity
SiP and 3-D packaging allows incorporation
of other circuit elements such as MEMS,
optoelectronics and bio-electronics into the
package reducing system cost and improving
performance…..
RW, December, 2008 4
3-D Integration: What, Why and When ?
Higher Integration Density Shorter Interconnects
• Direct Vertical Communication
• Integration in horizontal and vertical
directions – Shorter Global/Chip-to-Chip
interconnects
• Smaller footprint
– Reduced number of global interconnects
• Less Resistance and Capacitance
– Less power consumption
– Higher performance
Heterogeneous Integration Short Time to Market
• Exploits the best process technologies for • Can use optimized standard already
fabrication of different functional blocks available products from different vendors
– No yield compromise
– Greater functionality
RW, December, 2008 5
Challenges for 3-D Integration
• Interconnections
Manufacturing • Alignment and bonding
• Signal, Power and Clock Interconnect design
Design • Thermal Management
• CAD Tool
Testing • Pre/post-bond testing
RW, December, 2008 6
Ramifications of 3-D Stacking
Basically two categories:
− Folding
− Stacking (Package and Wafer Level)
Source: http://www.rpi.edu/~luj/
RW, December, 2008 singh,”3D Packaging Developments and Trends”, 2005
Source: Inderjit 7
Major 3-D bonding methods ...
3D Die Stacking (System in Package)
3D Wafer-Level-Processing
− 3D-W2W bonding
− 3D-D2W bonding
Source: Eric Bayne, ” 3D Interconnection and Packaging”, 2006
RW, December, 2008 8
Layer-to-Layer Interconnection technologies
Wire bonding
Through-hole Via
Contactless
Lower Cost Short path length (in m range)
Higher Reliability Many connections
Suitable for stacking identical chips
Interconnects at any point
Long Chip-to-Chip connection
Limited in Numbers High cost
Unsuitable for stacking different size chips Test is more difficult
High frequency operation
Source: http://techon.nikkeibp.co.jp/article/HONSHI/20061122/124183/
RW, December, 2008 9
Thesis Focus and Contributions
Primary focus of this dissertation is the design, modelling and analysis of
system interconnection and their effects in massively integrated 3-D ICs.
Technical Contributions
1. Electrical modelling of Interconnects in 3-D ICs
2. Signalling techniques for global on-chip interconnects
3. Cost, performance and technological trade-offs for 2-D and 3-D ICs
RW, December, 2008 10
1. Electrical modelling of Interconnects in 3-D ICs
RW, December, 2008 11
Interconnect Modelling
The parasitic parameters of interconnects in ICs have complex field dependence
predicated on the physical geometry, and the material constants.
Parasitic information will allow circuit simulations for SI and delay from package
pins to internal circuit pins for a wide variety of physical configurations
Interconnect
Structure
Specifications
Structure Model in a
Accurate, Time and Field Solver Models for Parasitics
memory consuming, (eg: Table-Based, formulae)
Inefficient for full chip
extraction
Efficient, but not
Parasitic Extraction 100% accurate
SI, Delay and PI
Analysis
RW, December, 2008 12
Interconnects in 3-D ICs
Multilevel Wiring Structures
- Models already proposed in literature
(discussed in Chapter 2)
R, L,C Models for Coupled TSVs
-Not available so far in literature.
(Discussed Chapter 3)
Models for interconnects in a 3-D IC
RW, December, 2008 13
Nature of Coupling in a TSV Bundle
The capacitive coupling terms to
nearest neighbors dominate
Inductive coupling is significant within
Coupling terms to nonadjacent lines the entire bundle
are mostly insignificant
Within nearest neighbors the lateral
terms are more significant than the
diagonal terms.
RW, December, 2008 14
Electrical Modelling of TSVs
Model TSV electrical parameters in terms of Physical Dimensions
physical dimensions and material constants & material properties
− Various TSV structures have been
simulated in a field solver and their
Field solver
electrical parameters extracted R,L,C
− Using curve fitting techniques and
analytical insight, semi empirical formulae
for capacitance and inductance with Database No
varying geometrical and physical Enough ?
parameters are being finalised
Yes
Curve Fitting
Empirical formula:
R,Cs,Cc, Ls,Lm ~f(w,d,h,t)
RW, December, 2008 Representative Unit of a TSV bundle 15
TSV Parasitic Models
Capacitance Inductance
Self-Capacitance model is of the form: Self-Inductance model is of the form:
k 2 v k3 v
p p
l
k5
p
k7
lv
C s Ctsv 1 k1e rv
k 4 v
k6 v
r k8 l
rv Ls 0.16lv ln 1 0.9 v
v rv
where Ctsv is the capacitance of an isolated TSV
63.34lv Mutual-Inductance model is of the form:
Ctsv
l lv
log 1 5.26 v
Lm 0.199lv ln 1 0.438
rv dv
Coupling-Capacitance model is of the form:
k1 0 lv pv
k4
l
k6
pv
k8
Cc 1 k 3
r k5 v
r
l
k7
p v v v
ln k 2 v
rv
Max. % Averag
Error e %
Error
k1 k2 k3 k4 k5 k6 k7 k8
Cs _ M 0.1505 -0.0071 -0.091 0.1849 -1.9371 6.9577 -0.0131 -0.0354 48.0 7.8
0.6876 -0.0390 -0.0583 1.8076 -0.2229 11.3537 0.0402 -13.1813 10.2 1.9
Cs _ N
0.3406 -0.0345 -0.0686 5.0708 -0.1530 -5.6346 -0.3859 -0.7643 13.3 2.0
C s _ NE
10.191 0.5490 -0.014 0.796 0.054 -1.157 -0.018 -0.600 8.7 1.9
Cc _ l
Cc _ p 3.180 0.5440 -0.199 0.586 0.122 0.540 2.176 0.110 10.9 1.8
18.117 28.457 -1.734 -2.178 0.600 -0.518 -0.470 0.188 8.0 1.4
Cc _ d
RW, December, 2008 16
Global Interconnect in a 3-D IC
RLC or LC or RC or C ?
Inductive coupling is
vanishingly small. t d _ tsv 0.69 Rdrv (Cs Clat Cdiag )
Capacitive coupling when 8 Switching Pattern
aggressors switch Victim Lateral Diagonal
simultaneously on a silent UP UP UP 0 0
victim net about 15% of UP - - 3.4 5.2
Vdd.
DOWN DOWN DOWN 9.0 10.6
RW, December, 2008 17
Capacitive Crosstalk in TSV Bundle…
S G S
High
Capacitive
coupling G
(Clat)
G S
Low
Capacitive
S G S coupling Victim Eye 10 GBPS
(Cdiag)
counteract capacitive crosstalk at high
signalling speeds with shielding
Shield N, E, W, S TSVs (Marked as G)
Victim Eye 10 GBPS with Lateral TSVs Shielded
RW, December, 2008 18
Results Summary - Key Publications
Roshan Weerasekera, Dinesh Pamunuwa, Matt Grange, Hannu Tenhunen, and Li-Rong
Zheng, ” Closed-Form Equations for Through-Silicon Via (TSV) Parasitics in 3-D Integrated
Circuits (ICs)” , 3D Integration Workshop, The Design, Automation, and Test in Europe
(DATE) conference, 2009. Accepted
Matt Grange, Roshan Weerasekera, Dinesh Pamunuwa, and Hannu Tenhunen “Examination
of Delay and Signal Integrity Metrics in Through Silicon Vias”, 3D Integration Workshop, The
Design, Automation, and Test in Europe (DATE) conference, 2009. Accepted
Roshan Weerasekera, Dinesh Pamunuwa, Hannu Tenhunen, and Li-Rong Zheng, ”Modelling
Through-Silicon-Vias in 3D-ICs,” IET Electronic Letters, September, 2008, Under Review.
Roshan Weerasekera, Dinesh Pamunuwa, Matt Grange, Hannu Tenhunen, and Li-Rong
Zheng, ”Parasitic Parameter Estimation and Electrical Modelling of Through-Silicon Vias in 3-D
ICs,” IEEE International Symposium on Circuits and Systems 2009, Under Review.
Matt Grange, Roshan Weerasekera, Dinesh Pamunuwa and Hannu Tenhunen, ”Exploration of
Through Silicon Via Interconnect Parasitics for 3-Dimensional Integrated Circuits” IEEE
International Symposium on Circuits and Systems 2009, Under Review.
Roshan Weerasekera, Matt Grange, Dinesh Pamunuwa, Hannu Tenhunen, and Li-Rong Zheng
”Modelling and Analysis of Through Silicon Via Interconnects in 3-Dimensional Integrated
Circuits” In submission to IEEE Transactions on Very Large Scale Integration, to be
submitted
2. Signalling techniques for global on-chip
interconnects
RW, December, 2008 20
Design Methodologies for on-chip Interconnects
Already proposed methods can be discussed under several hierarchy
levels:
Layout and Architectural
Technological Circuit
Routing & System
Repeater Insertion, the widely used method for delay reduction …
l
Repeaters increase the total
capacitance by 0.73Cwire,
td = (tdrv+ 0.4rcl2) which Increases power
consumption
repeaters
50% of dynamic power
ls ls consumption in a
microprocessor is due to
interconnects
td = (l/ls)(tb+ 0.4rcls2)
RW, December, 2008 21
Adaptive Smart Repeater Concept
With switching pattern effective
Static load capacitance is varying…
driver With a traditional repeater, the
Ceff C s drive strength is static and hence
there is a variation of delay
With adaptive driver
• For worst-case switching
pattern drive strength is high,
• For best-case switching
Ceff Cs 2Cc pattern drive strength is low.
driver is slower for the switching
Assistant Driver (Ha) combinations that give rise to a
lower capacitive load, and hence
reduces Jitter.
Advantage of the repeater circuit
proposed in this work:
− Energy Saving
− Delay Equalization
Main Driver (Hm)
RW, December, 2008 22
Design Methodology
Delay expression when
Assistant Driver (Ha)
both drivers Switching
(TMA)
Estimate optimum Ht
(=Hm+Ha)
and number of sections
Main Driver (Hm)
Delay expression when
Main Driver (Hm)
Assistant is Quiet
(TM)
Evaluate Ha such
Ht is a function of wire length that TMA-TM=0
Ha depends on coupling capacitance
RW, December, 2008 23
Circuit Realization of Smart Driver
RW, December, 2008 24
Simulation Results
Higher the Ha, the higher the Energy saving, with a higher delay variation.
With a higher Ha, delay is increased due to lack of drive strength for some switching
patterns.
RW, December, 2008 25
Energy Saving for Each Pattern
Group 1 : Both wires switch in the same direction
Group 3/4: One wire switches, other one is quiet
Group 5 : Both switches in opposite direction
Energy Dissipation (fJ)
Eavg
Driver
Group 1 Group 3/4 Group 5 (fF)
Trad. 1530 1735 1997 893
Smart 994 1248 2065 753
Simulation
Selector 57 102 215 71
E 31% 22% -14% 8%
Trad. 1507 1888 2195 939
Smart 934 1274 2195 789
Model
Selector 77 77 77 77
E 33% 28% -4% 7.8%
Energy Dissipation for each switching group (Ha=104)
RW, December, 2008 26
Smart Repeater...
A clear design methodology – as simple as that of repeater insertion
− The abstract delay model allows the repeater to be easily modeled in a CAD
interconnect planning tool.
Low complex driver circuit
A comprehensive delay and energy analysis.
12% reduction of Dynamic Delay
Energy saving that can be achieved by the SMART driver in future nanometer
technologies is found to be in the range of 20% - 25%.
Tech. Node
130 90 65 45 32
(nm)
K 14 24 36 54 84
Ht 325 268 277 278 282
Ha 202 162 163 158 154
Esmrt/Etrad 0.74 0.75 0.77 0.80 0.83
RW, December, 2008 27
Results Summary – Key Publications
Roshan Weerasekera, Li-Rong Zheng, Dinesh Pamunuwa and Hannu Tenhunen,
”Switching Sensitive Interconnect Driver to Combat Dynamic Delay in on-Chip
Buses,” in Lecture Notes in Computer Science (Proceedings of PATMOS), vol. 3728,
pp. 277-285, 2005.
Roshan Weerasekera, Dinesh Pamunuwa, Li-Rong Zheng and Hannu Tenhunen,
”Minimum-Power, Delay-Balanced Drivers for interconnects in the Nanometer
Regime,” in Proceedings of the international workshop on system-level interconnect
prediction, Germany, March, 2006, pp. 113-120.
Roshan Weerasekera, Dinesh Pamunuwa, Li-Rong Zheng and Hannu Tenhunen,
”Delay-Balanced Smart-Repeaters for on-chip Global Signaling”, in Proceedings of
the 20th International Conference on VLSI Design held jointly with 6th
International Conference on Embedded Systems, 2007, pp. 308-313.
Roshan Weerasekera, Dinesh Pamunuwa, Li-Rong Zheng and Hannu Tenhunen,
”Minimal-Power, Delay-Balanced Smart Repeaters for Global Interconnects in the
Nanometer Regime”, IEEE Transactions on Very Large Scale Integration (VLSI)
Systems, vol. 16, no. 5, pp. 589-593, May, 2008.
3. Cost, Performance and Technological trade-offs
for 2-D and 3-D ICs
RW, December, 2008 29
SoC vs SoP vs SiP vs 3-D ?
SoC SoP
SiP 3D
RW, December, 2008 30
Trade-off Parameters
Chip Area, Area
Technology Fusion, Temperature
Mixed-Signal Integration (Footprint)
Worst-Case
Source: http://www.ziptronix.com
Delay
Yield
Wafer, Packaging,
Testing, Rework and Repair Cost
RW, December, 2008 31
Chip/Module Cost Model Flow
Substrate Cost
Gate Count
Technology Die Area
Parameters Assemble
(Cost, yield)
Test, Package
Bare Die Cost
Cost
Test
(Cost, Yield)
Chip Cost
Repair
Detailed models and flow is described in Chapter 5. (Cost, Yield)
RW, December, 2008 32
Performance – Latency ?
L=0.12µm
Wires in
• SoC: small and resistive
• SoP: fat and low resistive (LC transmission lines)
• 3D : shorter in length, direct vertical communication
Performance of SoC is not necessarily better ?
RW, December, 2008 33
Chip/Package Temperature Model
Thermal integrity is a critical issue for
even conventional chips because the
system reliability is strongly dependent
on the temperature.
Assuming the heat dissipates through
the Silicon substrate, the average die
temperature can be usually described Source: Mahajan et al. Cooling a Microprocessor Chip,
Proceedings of the IEEE
t
Tdie Tambient Pchip
kA
If the same assumption is made that yes
T3D>Tmax ? Insert T-vias
the die size is much larger than its
thickness, the maximum temperature in
no
a 3D-IC m m
T3 D Tambient R( i 1),i Pj Done!
i 1 j i
RW, December, 2008 34
Case Studies
Mobile Terminal
128Mb DRAM, 300K logic and 500K ASIC 2mm2 analog/RF, and
CMOS image sensor (8 Megapixel, and pixel size 1.75 m X 1.75 m)
Areas in mm2:
ASIC uP DRAM Image Analog/RF
Sensor
0.750 0.644 5.44 24.5 2
Wireless Sensor
2Mbit DRAM, 300K logic, and 500K ASIC, 2mm2 analog/RF, and
Sensor 1mm2
Areas in mm2:
ASIC uP DRAM Sensor Analog/RF
0.750 0.644 0.084 1 2
RW, December, 2008 35
Case Study 1: Mobile Terminal
Stacking Reduces footprint
3
Norm. Area
2 Chips
2
4 Chips
Single Chip 1
SoC yield is very low..
0
2D-SoP 3D-SiP 3D-W2W 3D-D2W
100
1
Norm. Cost
%Yield
50 0.5
SoC is
0
2D-SoP 3D-SiP 3D-W2W 3D-D2W
0
2D-SoP 3D-SiP 3D-W2W 3D-D2W Expensive!
400
120
Temperature/oC
Delay/ps
100
200 80
60
0 40
2D-SoP 3D-SiP 3D-W2W 3D-D2W 2D-SoP 3D-SiP 3D-W2W 3D-D2W
Lower operating Stacking increases
SoC performance is low temperature operating Temperature
RW, December, 2008 36
Case Study 2: Wireless Sensor
Footprint of SoC is
SoP has a larger area… Much closer to that of 3-D
6
Norm. Area
2 Chips
Yield doesn’t improve 4 Chips
4
as much Single Chip 2
0
2D-SoP 3D-SiP 3D-W2W 3D-D2W
100 SoC is
4 cheaper….!
Norm. Cost
So as 3D W2W
%Yield
50
2
0 0
2D-SoP 3D-SiP 3D-W2W 3D-D2W 2D-SoP 3D-SiP 3D-W2W 3D-D2W
200
120
Temperature/oC
Delay/ps
100
100 80
60
0 40
2D-SoP 3D-SiP 3D-W2W 3D-D2W 2D-SoP 3D-SiP 3D-W2W 3D-D2W
Stacking doesn’t improve Stacking increases
Lower operating
performance… operating Temperature
temperature 37
RW, December, 2008
2D vs. 3D Integration
3-D Integration is cost effective for large designs, but not for smaller
designs.
RW, December, 2008 38
Results Summary – Key Publications
Roshan Weerasekera, Li-Rong Zheng, Dinesh Pamunuwa and Hannu Tenhunen,
”Early selection of system implementation choice among SoC, SoP and 3-D
Integration,” in IEEE International System-on-Chip Conference, September, 2007,
pp.187-190.
Roshan Weerasekera, Li-Rong Zheng, Dinesh Pamunuwa and Hannu Tenhunen, ”
Extending Systems-on-Chip to the Third Dimension: Performance, Cost and
Technological Tradeoffs,” in Proceedings of the IEEE/ACM international conference
on Computer-aided design, IEEE Press, November, 2007, pp. 212-219.
Roshan Weerasekera, Dinesh Pamunuwa, Li-Rong Zheng and Hannu Tenhunen, ”2-
D and 3-D Integration of Heterogeneous Electronic Systems under Cost,
Performance and Technological Constraints,” in IEEE Transactions on Computer
Aided Design of Integrated Circuits and Systems, September, 2008, Under Review
(Third Round Revision)
Conclusions
Continuous scaling of feature size has been challenging 2-D planar IC design
− Power consumption and heat removal
− Interconnect woes
− Disparate technologies
Three-dimensional integration is a promising solution to expected limits of scaling
Discussed major three aspects of 3-D integration:
− Through Silicon Via modelling and subsequent analysis of Signal Integrity
Provide close form models for TSV parasitics
Given guidelines for SI issued in a TSV bundle
− Low power signalling techniques for global on-chip interconnects
Smart repeater circuit is proposed which can save 10% energy and 12%
delay variation
− 3-D IC design in cost and performance of perspectives
A methodology to analysis cost and performance a-priori.
RW, December, 2008 40
Thank you !
RW, December, 2008 41
Related docs
Get documents about "