Home    Bloggers    Messages    Webinars    Resources   
Tw  |  Fb  |  In  |  Rss
William Murray

Synthesizing for Low Power in FPGAs

William Murray
Paul A. Clayton
Paul A. Clayton
8/5/2012 5:10:52 PM
User Rank
Beginner
Power and bandwidth/latency
Max wrote: "I was just thinking about how originallt gate count was the limiting factor, then timing and area, now power...  what next?"

Related to the power issues are bandwidth and latency issues.  Like power (connections and heat removal), off-chip bandwidth is limited by surface area (optical interconnects could be a big help in the not too distant future).

On-chip bandwidth and latency are also influenced (like power density) by the shrinking transistor--while keeping chip area more-or-less constant.  The number of times a transistor can switch in the time required to send a signal across the chip has been increasing.  The number of nodes of N transistors that can fit on a chip is increasing, so a direct all-to-all network will take up more space and power (and more space and power leads to higher latency).  A simpler network like a grid will have an impact on latency and bandwidth between arbitrary nodes.

3D can help with "on chip" bandwidth and latency, but would seem to make off-chip bandwidth worse (n-cubed computation and local storage with n-squared surface area for off-chip interconnect vs. n-squared for both with 2D).  (The same applies for 3D and power.  Local communication will use less power--and reduced latency and improved bandwidth will mean less semi-idle time with static power consumption implications--, but the power/heat use/production [barring dark silicon] would increase at n-cubed while the input/extraction would increase at n-squared.)

One might also note the "software wall".  The constraints on single-thread performance (power and memory walls being major contributors) seem to be pushing for explicit parallelism (vectorization and multithreading/multiprogramming) which introduces difficulties in programming (better languages/libraries and programmer education can help, hardware features can help [e.g., performance counters and transactional memory], but some problems are probably just difficult).  Use of more power-efficient accelerators can also help, but such also has implications for programming difficulty.

The technology for power-efficiency optimization of software is likely extremely immature (even less mature than that for parallelization).

50%
50%
William Murray
William Murray
8/5/2012 3:49:20 PM
User Rank
Blogger
Re: Before power was a consideration
Now days there are three orders of magintude of gates more in each class of device than there were a few years back -- power has to matter -- or the devices would be hotter than the surface of the sun!

50%
50%
Max Maxfield
Max Maxfield
8/5/2012 1:21:15 PM
User Rank
Blogger
Re: Before power was a consideration
@Adam: I agree totally -- I was just thinking about how originallt gate count was the limiting factor, then timing and area, now power...  what next?

50%
50%
Adam Taylor
Adam Taylor
8/5/2012 11:13:01 AM
User Rank
Blogger
Re: Before power was a consideration
@Max in these days power seems to be one of the major driving factors 

50%
50%
Max Maxfield
Max Maxfield
8/5/2012 11:06:45 AM
User Rank
Blogger
Before power was a consideration
It's funny -- when I started designing ASICs back in 1980, the one thing we never worried about was power -- everything we designed was the size of  fridge/freezer and power simply wasn't on our list of thngs to worry about. I wonder what the next thing will be...

50%
50%
William Murray
William Murray
8/3/2012 8:06:48 PM
User Rank
Blogger
Re: Here's a good whitepaper
That white-paper looks like another good trick for the new parts -- It runs after synthesis, unlike the Synopsis, or Cadence, during synthesis optimizations--

50%
50%
Max Maxfield
Max Maxfield
8/3/2012 1:39:43 PM
User Rank
Blogger
Here's a good whitepaper
@William: FYI Xilinx have a good whitepaper on thsi topic titled: Reducing Switching Power with Intelligent Clock Gating http://bit.ly/Qnu1pd

50%
50%
T34
T34
8/2/2012 9:24:28 AM
User Rank
Beginner
Power again
Great timing :)

As I mentioned under Warren's blog, the race is on. And by eliminating the gates in eg 8051 IP, you can get 2000 gates instead of 10 or 15k. And then you get just 0.081mW/MHz in 180nm process.

 

50%
50%
Max Maxfield
Max Maxfield
8/2/2012 8:57:45 AM
User Rank
Blogger
Strange coincidence
@Warren: Thanks for this article -- very timely -- and what a strange coincidence that this came so close to Warren's blog: FPGAs & Low Power: Where Have We Been? http://bit.ly/MeyXrm

50%
50%
More Blogs from William Murray
Today's FPGAs already integrate a substantial amount of "stuff" (MCU cores, programmable fabric, on-chip memory, etc.), so what's left to integrate and why is this being left for the future?
When extreme thermal cycling causes circuit boards and chip packages and the silicon die in the packages to expand and contract at different rates, problems may ensue.
In order to simulate a design we need models that represent the functionality and timing characteristics of our design elements, but the timing aspects of these models may be based on uncertain data.
A large amount of skill is required to write custom test code for custom hardware. Even more skill is required to test a CPU, RAM, or ROM.
Designing high-temperature electronics can present many challenges for "down-hole" petroleum equipment, ovens and micro-waves, automotive, medical, aerospace, and other applications.
flash poll
follow us on twitter
follow Xilinx on twitter
like us on facebook
like Xilinx on facebook
All Programmable Planet     About Us     Contact Us     Help     Register     Twitter     Facebook     RSS