• No results found

Post-silicon tuning capabilities of 45nm low-power CMOS digital circuits

N/A
N/A
Protected

Academic year: 2021

Share "Post-silicon tuning capabilities of 45nm low-power CMOS digital circuits"

Copied!
3
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

Post-silicon tuning capabilities of 45nm low-power CMOS

digital circuits

Citation for published version (APA):

Meijer, M., Liu, B., Veen, van, R., & Pineda de Gyvez, J. (2009). Post-silicon tuning capabilities of 45nm

low-power CMOS digital circuits. In Proceedings of 2009 Symposium on VLSI Circuits, 16-18 June 2009, Honolulu,

Hawaii (pp. 110-111). Institute of Electrical and Electronics Engineers.

Document status and date:

Published: 01/01/2009

Document Version:

Publisher’s PDF, also known as Version of Record (includes final page, issue and volume numbers)

Please check the document version of this publication:

• A submitted manuscript is the version of the article upon submission and before peer-review. There can be

important differences between the submitted version and the official published version of record. People

interested in the research are advised to contact the author for the final version of the publication, or visit the

DOI to the publisher's website.

• The final author version and the galley proof are versions of the publication after peer review.

• The final published version features the final layout of the paper including the volume, issue and page

numbers.

Link to publication

General rights

Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain

• You may freely distribute the URL identifying the publication in the public portal.

If the publication is distributed under the terms of Article 25fa of the Dutch Copyright Act, indicated by the “Taverne” license above, please follow below link for the End User Agreement:

www.tue.nl/taverne

Take down policy

If you believe that this document breaches copyright please contact us at:

openaccess@tue.nl

providing details and we will investigate your claim.

(2)

110

978-4-86348-010-0 2009 Symposium on VLSI Circuits Digest of Technical Papers

11-2

Post-Silicon Tuning Capabilities of 45nm Low-Power CMOS Digital Circuits

Maurice Meijer1, Bo Liu2, Rutger van Veen1 and Jose Pineda de Gyvez1,2

1NXP Semiconductors, Eindhoven, The Netherlands 2Technical University of Eindhoven, Eindhoven, The Netherlands

Abstract

Adaptive circuit techniques enable modification of power-performance efficient circuit operation. Yet it is unclear if such techniques remain effective in modern deep-submicron CMOS. In this paper we examine the technological boundaries of supply voltage scaling and body biasing in 45nm low-power CMOS. We demonstrate that there exists an effective tuning range for power-performance and power-performance variability control. Our analysis is supported by ring oscillator test-chip measurements.

Introduction

Modern integrated circuits have been equipped with supply voltage scaling (VS) and body bias (BB) tuning approaches for improving power-performance efficient operation [1-3]. Although the benefits of such design technologies are well-established, their effectiveness is strongly process technology dependent. In this paper, we explore the technological boundaries of VS and BB for a state-of-the-art 45nm low-power (LP) CMOS process. In particular, we investigate power-performance trade-offs, leakage power savings, and how far process-dependent performance spread can be tuned. Moreover, we investigate if standard-Vth (SVT) with BB-tuning can eliminate the need for low-Vth (LVT) and high-Vth (HVT) masks.

Test-Chip Design and Tuning Ranges

A test-chip with a size of 1.25x0.44mm2 has been implemented in 45nm LP-CMOS (Fig.1). It contains two copies of 10 inverter-based ring oscillators (ringos) with different chain lengths which are part of the same core. Layout identical ringo instances are placed at a distance of 125µm.The test-chip contains three cores with different threshold voltage options, namely SVT, LVT, and HVT. Each core has independent power supply voltage (VDD) pads for current measurements. The ground pads, and independent body bias voltage (VBB) pads for PMOS (Vnwell) and NMOS (Vpwell) devices are common for all cores. Frequency, power, and leakage have been measured for 57 dies on a 300mm wafer. The measurements were performed for 0.6V-1.3V VDD, and for a range of temperatures in between -40oC and 125oC. The nominal VDD setting equals 1.1V. Moreover, a body biasing was applied ranging from 1.1V reverse body bias (RBB) up to 0.5V forward body bias (FBB). In this paper we will use a symmetrical body bias, e.g. VBB=Vpwell=VDD-Vnwell, and 25oC, unless stated otherwise.

Power-Performance Tuning and Leakage Control

Fig. 2 shows the frequency distributions versus VDD for 57 dies of a LVT, SVT, and HVT 101-stage ringo, respectively. The symbols indicate the frequency of the median sample. We measured a frequency downscaling of 7.8x (LVT), 13.4x (SVT), and 33.3x (HVT) when VDD reduces from 1.1V to 0.6V. Energy and frequency trade-offs for the SVT chip sample are illustrated in Fig.3. Each cloud relates to a unique VDD value, and each dot in a cloud corresponds to a unique VBB. We measured a 44x power reduction, and 3.3x energy savings using VS from 1.1V to 0.6V. The use of BB at VDD=1.1V provided a large frequency tuning range from -19% (1.1V RBB) till 27% (0.5V FBB) w.r.t. the nominal operating point. The frequency tuning index factors are -11% (-31%) up to 17% (41%) for a LVT (HVT) median die sample. The impact of body biasing on energy is small, even for a low circuit activity of 0.3%. For SVT, we obtained 15% energy savings at the same performance w.r.t. the nominal operating point by using combined VS and BB (VDD=1.0V, 0.4V FBB). BB tuning was proposed to eliminate the use of LVT and HVT in 45nm CMOS [3]. At 1.1V VDD our experiments confirm that SVT with 0.5V FBB can achieve LVT

performance (Fig.4). However, this gives a 3.6x higher leakage than LVT for the median samples. VS is not preferred for achieving LVT performance due to the associated power penalty. Furthermore, we observed that SVT with RBB alone can not even achieve nominal HVT leakage (Fig.4). This is due to the small body factor (γ) available, and the presence of gate-induced drain leakage at large RBB values. Alternatively, VS alone or combined with RBB enable SVT circuits to effectively achieve HVT leakage (Fig.5). Fig.5 shows the SVT leakage current distributions versus VBB for two VDD values at 25oC. The symbols indicate the results for the median sample. At 1.1V VDD we measured 1.5x-2.8x leakage savings using optimal RBB settings. Reducing VDD from 1.1V down to 0.6V is more effective (4.0x-4.5x). Combined VS+RBB provided 10x-22x leakage savings. The actual leakage savings are strongly temperature dependent (Fig.6). The dominant leakage components determine if VS or RBB is more effective. VS+RBB showed maximum leakage savings around 75oC for the SVT median sample. The location of this maximum depends on process skew, and Vth option used.

Performance-Spread Compensation

The impact of systematic and random process variability on ringo timing has been determined by means of a clock-period correlation plot (Fig.7). The clock-period of two closely located layout-identical ringos has been correlated for each die sample. The statistical delay spread of an 11-, 21-, 31-, 41- and 101-stage ringo has been calculated for three BB values at 1.1V VDD (Fig.8). In Fig.8, the solid and open symbols relate to the 3σ systematic and random delay spread, respectively. The trend lines are extrapolated from the 101-stage ringo, which are closely matching the results from the other ringos. Observe that the delay spread reduces consistently when FBB is applied, while it increases for RBB. Fig.9 shows a more detailed analysis for the 21-stage ringo. The total delay spread is about 2x lower for 0.5V FBB w.r.t. the nominal BB case. Contrarily, the spread is about 2x higher for 1.1V RBB. Observe that FBB can significantly reduce both systematic and random delay spread in 45nm LP-CMOS. Fig.10 puts in perspective the clock period mean and 3σ-spread versus VBB for the 21-stage ringo. A ±11% spread was observed at the nominal operating point. This spread could be fully compensated through BB tuning using up to 0.2V FBB for slow die samples, and up to 0.7V RBB for fast die samples. Frequency and leakage was measured for all available samples (Fig.11). For each die sample, it was possible to tune its frequency to the nominal target spec through BB tuning. We required a BB range from 0.1V RBB up to 0.2V FBB. This gives basically an enhancement to 100% parametric yield for our sample set. We measured a 32% frequency increase with 0.5V FBB for the slowest die sample at a 26x leakage penalty. This offers sufficient tuning range for compensating process-dependent performance spread.

Conclusions

Test-chip measurements show that VS and BB remain effective in 45nm LP-CMOS. We demonstrated the presence of large power-performance modification capabilities. For SVT circuits, VS enables 3.3x energy savings when the frequency can be reduced by 13.2x. Combined VS+BB results in 15% energy savings at no frequency penalty, and 10x-22x leakage savings at 25oC. BB tuning offers sufficient range to achieve process-related performance spread compensation. Moreover, FBB can effectively reduce timing uncertainty, e.g. we observe a 2x lower delay spread at 0.5V for a 21-stage SVT ringo. Finally, SVT+FBB can mimic LVT performance, while SVT+RBB can not achieve HVT leakage.

(3)

111

2009 Symposium on VLSI Circuits Digest of Technical Papers

References 000E+0 50E+6 100E+6 150E+6 200E+6 250E+6 300E+6 350E+6 400E+6 450E+6

000E+0 50E-15 100E-15 150E-15 200E-15 250E-15 300E-15

Energy consumption [J] Fr eq ue nc y [H z] 101-stage SVT ring-oscillator T=25oC, α=0.003 VDD=1.0V VDD=0.9V VDD=0.8V VDD=0.7V VDD=0.6V VDD=1.1V 0V BB 318MHz 251fJ 0.5V FBB 1.1V RBB 0.9 1.0 1.1 1.2 0 1 10 Normalized Leakage N or m al iz ed F re qu en cy -0.1 0 0.2 N um be r o f S am pl es 10 0

Body Bias Voltage [V]

Nominal frequency specification 101-stage SVT ring-oscillator 57 die samples VDD=1.1V, T=25oC 1E+6 10E+6 100E+ 6 1E+9 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4

Power Supply Voltage [V]

Fr eq ue nc y [H z] HVT SVT LVT 401MHz 212MHz 310MHz 101-stage ring-oscillator 57 die samples nominal body bias, T=25oC

0 5 10 15 20 25 Temperature [degC] Le ak ag e R ed uc tio n Fa ct or VS (1.1V-0.6V) 12.0 5.5 4.2 3.5 3.1 2.9 2.6 BB (Vdd=1.1V) 1.1 1.7 2.6 3.5 4.3 4.7 4.7 BB (Vdd=0.6V) 1.6 3.5 5.3 7.1 7.9 8.1 8.0 VS+BB 19.3 19.5 22.2 24.4 24.6 23.1 21.2 -40 0 25 50 75 100 125

[1] Tschanz et.al., VLSI 2002, pp. 310-311.

[2] Nomura et.al., ISSCC 2008, pp. 262-263.

[3] Gammie et.al., ISSCC 2008, pp. 258-259.

Figure 3. Frequency vs. energy trade-offs for a 101-stage SVT ring-oscillator

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 0 1 10 100 Normalized Leakage N or m al iz ed F re qu en cy HVT SVT LVT FBB RBB 101-stage SVT ring-oscillator median die sample VDD=1.1V, T=25oC

VBB: 1.1V RBB up to 0.5V FBB

0.5V FBB 1.1V RBB

Figure 11. Performance spread compensation

Core A Core B Core C

SVT

Selection circuit 45nm triple-well LP-CMOS process 57 samples from the same 300mm-wafer Die size of 1.25x0.44 mm2

2x30 fanout-1 inverter-based ring-oscillators Nand-2 and inverter

drawn device sizes Nominal VDD Vthoptions Features PMOS: W=415nm, L=40nm NMOS: W=290nm, L=40nm 1.1V SVT, LVT, HVT 45nm triple-well LP-CMOS process 57 samples from the same 300mm-wafer Die size of 1.25x0.44 mm2

2x30 fanout-1 inverter-based ring-oscillators Nand-2 and inverter

drawn device sizes Nominal VDD Vthoptions Features PMOS: W=415nm, L=40nm NMOS: W=290nm, L=40nm 1.1V SVT, LVT, HVT LVT HVT ÷1024 VDD VNWELL VSS VPWELL VDDP VSS enable out Ring-oscillator Periphery 0.01 0.1 1 10 100 -1.2 -1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6

Body Bias Voltage [V]

N or m al iz ed L ea ka ge VDD=1.1V VDD=0.6V HVT-min HVT-max FBB RBB SVT ring-oscillator core 57 die samples T=25oC 1.2µA VS

Figure 1. Test-chip photograph and description Figure 2. Frequency vs. VDD trends

Figure 4. Frequency vs. leakage for a BB-tuned 101-stage ring-oscillator

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 -1.2 -1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6

Body Bias Voltage [V]

N or m al iz ed C lo ck P er io d 21-stage SVT ring-oscillator VDD=1.1V, T=25oC FBB RBB

Figure 10. Estimated mean clock period and spread for 21-stage SVT ring-oscillator

3.0E-09 3.1E-09 3.2E-09 3.3E-09 3.4E-09 3.5E-09

3.0E-09 3.1E-09 3.2E-09 3.3E-09 3.4E-09 3.5E-09

Period Ring-A [s] Pe rio d R in g-B [s ] Random variation Total varia tion 101-stage SVT ring-oscillator 57 pairs of oscillators VDD=1.1V, T=25oC

Figure 7. Clock period correlation plot of two layout-identical 101-stage SVT ring-oscillators

000E+0 20E-12 40E-12 60E-12 80E-12 100E-12 120E-12 140E-12 160E-12 0 5 10 15 20 25 30 35 40 Number of Stages 3-si gm a de la y sp re ad [s ] SVT ring-oscillators VDD=1.1V, T=25oC 1.1V RBB 0.5V FBB 0V BB Systematic 3σ-spread Random 3σ-spread

Figure 8. Systematic and random delay spread vs. number of ring-oscillator stages

000E+0 20E-12 40E-12 60E-12 80E-12

Body Bias Voltage [V]

3-si gm a de la y sp re ad [s ]

Total spread 71.50E-12 34.36E-12 18.59E-12

Systematic spread 67.68E-12 31.43E-12 16.52E-12

Random spread 23.05E-12 13.88E-12 8.52E-12

-1.1 0 0.5

21-stage SVT ring-oscillator VDD=1.1V, T=25oC

Figure 6. VS and BB dependent leakage savings vs. temperature for an SVT ring-oscillator core

Figure 9. Estimated delay spread for the available 21-stage SVT ring-oscillators

Figure 5. SVT leakage vs. VBB for two distinct VDD values at 25oC

Referenties

GERELATEERDE DOCUMENTEN

resolution as stated by the BRRD. Cost of failure is to be internalized by the banking sector, public funds are not to be used, spillover effects are to be minimized. DB announces

Table 11 reports the value of expected government support defined in probabilities of default (difference in trading notches between the foreign long-term credit assessment and

te doen ontstaan, kunnen we als kri terium voor doorsl.ag stellen, dat deze plaats vindt, zodra zich ergens in de vloeistof een dampbelletje heeft gevormd. Voor

spoordefinities: beerput, bezinkput, boomval, gracht, graf (crematie), graf (inhumatie), greppel, haard, houten paal, karrenspoor, kelder, kuil, muur, ophogingslaag, oven,

This investigation focuses on the analysis and performance evaluation of a novel perturbation signal, namely the Pseudo-Random Impulse Sequence (PRIS), for in situ application in

Ham wordt voor het eerst vermeld in 784 en zou afkomstig zijn van het Germaanse "hamma", wat zou wijzen op een landtong uitspringend in overstromingsgebied. Oostham

Zeven jaar na de oorspronkelijke uitgave blijkt het werk ‘De Nederlandse zoetwatermollusken’ nog bepaald niet bekend onder verzamelaars van strandfossielen..

Pared down to their bare essence, the principles of action learning selectively applied and adapted in the rap programme were the following (see especially