1. Trang chủ
  2. » Kỹ Thuật - Công Nghệ

Lecture VLSI Digital signal processing systems: Chapter 17 - Keshab K. Parhi

34 58 0

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 34
Dung lượng 156,3 KB

Các công cụ chuyển đổi và chỉnh sửa cho tài liệu này

Nội dung

Lecture VLSI Digital signal processing systems - Chapter 17: Low-power design includes content as: VLSI digital signal processing systems, power consumption in DSP, power dissipation, CMOS power consumption, dynamic power consumption, switching activity (α), increased switching activity due to glitching,…

Trang 1

Chapter 17: Low-Power Design

Keshab K Parhi and Viktor Owall

Trang 2

New Design Space

Trang 3

VLSI Digital Signal Processing

Systems

• Technology trends:

– 200-300M chips by 2010 (0.07 micron CMOS)

• Challenges:

– Low-power DSP algorithms and architectures

– Low-power dedicated / programmable systems

– Multimedia & wireless system-driven architectures – Convergence of Voice, Video and Data

– LAN, MAN, WAN, PAN

– Telephone Lines, Cables, Fiber, Wireless

– Standards and Interoperability

Trang 4

Power Consumption in DSP

• Low performance portable applications:

– Cellular phones, personal digital assistants

– Reasonable battery lifetime, low weight

• High performance portable systems:

– Laptops, notebook computers

• Non-portable systems:

– Workstations, communication systems

– DEC alpha: 1 GHz, 120 Watts

– Packaging costs, system reliability

Trang 5

Power Dissipation

Two measures are important

• Peak power (Sets dimensions)

Average power (Battery and cooling)

dt (t)

i T

V P

T

0 DD

DD

max DD

DD

Trang 6

CMOS Power Consumption

switching for

y probabilit α

V I

I V

V C

f α

P P

P P

DD leakage

sc DD

2 DD L

leakage sc

dyn tot

=

+ +

=

= +

+

=

Trang 7

Dynamic Power Consumption

Energy charged in a capacitor

Trang 8

Off-Chip Connections have High Capacitive

Load

Reduced off Chip Data Transfers by

System Integration

Ideally a Single Chip Solution

Reduced Power Consumption

Trang 9

P z = =

0.375 8

3

P z = =

P d =0.5

Due to correlation

Trang 10

Increased Switching Activity due to

Glitching

Extra transition due to race

Dissipates energy

a

c x

Trang 11

Clock Gating and Power Down

Module A

Enable A

CL

K

Module B

Enable B

Module C

Enable C

Only active modules should be clocked!

Control circuitry is needed for clock gating and power down

and Needs wake-up

Trang 12

Add i+1

Trang 13

S

Trang 14

Delay as function of Supply

Trang 15

Delay as function of Threshold

Trang 16

Dual V T Technology

Reduced VDD αααα Increased delay

Low VT αααα Faster but Increased Leakage

Trang 17

High VT αααα low leakage

High VT αααα low leakage

Low leakage in stand by when high VT tansistors turned off

Low VTFast high leakage

Trang 18

Low Power Gate Resizing

Systematic capture and elimination of slack using fictitious entities called Unit

Delay Fictitious Buffers.

• Replace unnecessary fast gates by slower lower power gates from an

underlying gate library.

• Use a simple relation between a gate’s speed and power and the UDF’s in its

fanout nets Model the problem as an efficiently solvable ILP similar to

retiming

In Proceedings of ARVLSI’99 Georgia Tech.

4 1

3

1

3 3

0 0

Trang 19

Dual Supply Voltages for Low

Power

• Components on the Critical Path exhibit no slack but components off the critical path exhibit

excessive slack.

• A high supply voltage VDDH for critical path

components and a low supply voltage VDDL for non critical path components.

• Throughput is maintained and power consumption

is lowered.

V Sundararajan and K.K Parhi, "Synthesis of Low Power CMOS VLSI Circuits using Dual Supply

Voltages", Prof of ACM\/IEEE Design Automation Conference, pp 72-75, New Orleans, June 1999

Trang 20

Dual Supply Voltages for Low

Power

Systematic capture and elimination of slack using fictitious entities called Unit

Delay Fictitious Buffers.

• Switch unnecessarily fast gates to to lower supply voltage VDDL thereby

saving power, critical path gates have a high supply voltage of VDDH.

• Use a simple relation between a gate’s speed/power and supply voltage with

the UDF’s in its fanout nets Model the problem as an approximately solvable

ILP.

4 1

1 1

3

1

3 3

0 0

7 Critical Path = 8, UDF’s in Boxes

VDDH VDDH

VDDH

VDDH

VDDL VDDH

LC = Level Converter

Trang 21

Chapter 17 21

Dual Threshold CMOS VLSI for

Low Power

Systematic capture and elimination of slack using fictitious entities called Unit

Delay Fictitious Buffers.

• Gates on the critical path have a low threshold voltage VTL and unnecessarily

fast gates are switched to a high threshold voltage VTH

• Use a simple relation between a gate’s speed /power and threshold voltage

with the UDF’s in its fanout nets Model the problem as an efficiently

approximable 0-1 ILP.

4 1

1 1

3

1

3 3

0 0

Trang 22

Experimental Results

• Table :ISCAS’85 Benchmark Ckts

Resizing (20 Sizes) Dual VDD Dual

Trang 23

HEAT: Hierarchical Energy

Analysis Tool

• Salient features:

– Based on stochastic techniques

– Transistor-level analysis

– Effectively models glitching activity

– Reasonably fast due to its hierarchical nature

Trang 24

1 0 0

i i

x x

x x

p p

p p

0 0 1 0 1 1 0 1

1 0

1

= +

+ +

i i

i

x x

x x

NS

j

i i

N x

p p

p p

NS

j x j x p

( )

1 0

1 1

1

lim

i i

i

x x

NS

j i

N x

p p

NS

j x p

Trang 25

State Transition Diagram

Modeling

) ( )

( ) ( )) ( 1 ( ) 1

2 n x n x n x n node n Node + = − + ⋅ ⋅

) ( )

( )

( ))

( 1

( )

( ))

( 1

( )

1

Trang 26

The HEAT algorithm

• Partitioning of systems unit into smaller sub-units

• State transition diagram modeling

• Edge energy computation (HSPICE)

• Computation of steady-state probabilities

(MATLAB)

• Edge activity computation

• Computation of average energy

Energy = WjEAj

Trang 27

0 1000 2000 3000 4000 5000 6000 7000 8000 9000

Trang 28

Finite field arithmetic Addition

Trang 29

Programmable finite field

multiplier

MAC2 MAC2 DEGRED2 DEGRED2

MAC2 + DEGRED2

Four Instr.

Trang 30

Finite field programmable finite field multipliers

L Song and K K Parhi, “Low-energy digit-serial/parallel finite field multipliers”, Journal of VLSI Signal Processing, 19(2), pp 149-166, June 1998

Trang 31

Data-path architectures for low

energy RS codecs

• Advantages of having two separate sub-arrays

– Example: Vector-vector multiplication over GF(2 )

– Assume energy(parallel multiplier)=Eng

1 1

ê ê ê ë é

Trang 32

Data-path architectures for

low-power RS encoder

• Data-paths

– One parallel finite field multiplier

– Digit-serial multiplication: MACx and DEGREDy

Trang 33

Data-path architectures for low

energy RS codecs

• Data-path:

– one parallel finite field multiplier

– Digit-serial multiplication: MACx and DEGREDy

Energy

MAC8 + DEGRED2 MAC8 + DEGRED1 MAC4 + DEGRED2 MAC4 + DEGRED1

Energy-delay MAC8 + DEGRED4

MAC8 + DEGRED2

L Song, K.K Parhi, I Kuroda, T Nishitani, "Hardware/Software Codesign of Finite Field Datapath for Low-Energy

Reed-Solomon Codecs", IEEE Trans on VLSI Systems, 8(2), pp 160-172, Apr 2000

Trang 34

• Low-Power Architectures driven by

Interconnect, Crosstalk in DSM technology

• How Far are we away from PDAs/Cell

Phones for wireless video, internet access and e-commerce?

Ngày đăng: 12/02/2020, 21:12

TỪ KHÓA LIÊN QUAN

w