Wire Topology Optimization for Low Power (TopCool and TopCoolViewer)*

Paul Zuber, Robert Hartl, Thomas Ilnseher and Walter Stechele

TU München, Institute for Integrated Systems, Arcisstr. 21, 80290 München, Germany
\{Paul.Zuber, Walter.Stechele\}@tum.de

Abstract

TopCool, activity-driven local wire spacing for up to 9% less chip-wide wire power is demonstrated. Routed layouts are im-/exported from/to EDA tools. Area and timing are not adversely affected.

1. Introduction

Despite the increasing importance of capacitive power consumption, today’s design tools do not offer switching activity driven layout synthesis for low power digital CMOS. TopCool was developed to reduce the wire power consumption of detail-routed circuits by locally modifying the wire topology. Globally relaxing the routing pitch [1] or specifying the temporal routing sequence of the nets [3] shows limited success on many circuits due to routability problems. There exists related work focused to optimizing the wire power consumption of buses [2]. Its principle: individual, non-uniform wire spacing driven by activities, is leveraged and applied to entire circuits.

2. TopCool and TopCoolViewer

TopCool searches a given layout for all instances of groups of parallel wire segments that can be shifted laterally. The wires are then spaced through integer-convex programming. The result depends on activity and availability of surrounding whitespace. Unlike prior spacing techniques, the presented approach routes connections between the old and new wire endings. Finally, the best subset of all groups is selected. A detailed description can be found in [4].

Switching activities and layout data are accessed through appropriate interfaces to commercial physical design tools. Figure 1 illustrates the flow of the proposed method.

The OpenGL-based viewer TopCoolViewer was developed to investigate on-chip interconnects. It supports wire coloring according to activity, 3-D operations, starting the optimization and visualizing its results. The new model can be overlaid on the reference for comparison. Figure 2 was created with this tool.

3. Results

Benchmark circuits were synthesized from RT-level, placed and routed with a commercial tool, BlastChip by Magma∗. Two types of timing and area constraints were used, moderate, and hard, cf. the table. To obtain switching activities, high-effort propagation including gate- and wire delays was used. Power was evaluated by BlastChip after full-chip 3-D capacitance extraction with QuickCap.

The table compares the interconnect switching power of the routing results of the commercial layout synthesis tool to the six techniques spreading, Sanyo [1], Intel [3], TopCool, and combinations of TopCool with [1] and [3].

Both wire spreading and TopCool are post routing operations. They can be applied to any design, no matter how congested the routing area is. This is not true for [3] and [1] which modify the routing parameters of an existing router. This can save remarkable amounts of power but fails on larger or more tightly constrained designs due to congestion problems, limiting their universal applicability.

*The authors would like to thank Magma Eindhoven for their support.
### TopCool

TopCool improves the yield limit of the metal layers but does not achieve the same values as spreading. A hybrid optimization for yield and low power is feasible by not rejecting wire groups with low power reduction as currently done. Improvements in the power saving results are possible by relocating vias or ordering wires prior to spacing. More research is required, in this regard.

## 5. Conclusions

Activity-driven wire spacing applied in-place in already detail-routed layouts has been demonstrated. TopCool reads layouts from existing physical design tools and switching activities from simulation. Wires are then relocated depending on the activities and the amount of surrounding whitespace available. Optimization is fast as multiple wires are treated simultaneously with a tailored integer-convex program. After optimization, a new layout is exported and can be read back by EDA tools. Reductions of up to 9.41% of a modern benchmark design were achieved after 3-D RC-extraction without affecting area or timing. Commercial spreading which does not consider activities achieves only half of the savings and is four to eight times slower. TopCoolViewer allows the user to investigate the resulting interconnect power and the optimization steps.

### References


