Asynchronous logic has long looked promising, yet it has never become mainstream. Is there a fundamental obstacle, or has it simply been bad luck?
Even now, when power consumption has become a core design constraint, skepticism remains about whether asynchronous logic can play a substantial role. The design style offers significant advantages, but its practical benefits have yet to be convincingly demonstrated.
Synchronous design relies on a clock, and the clock frequency is limited by the longest, slowest path in the design, with extra margin added for variability introduced during manufacturing. At test, it is common practice to bin chips into performance grades; otherwise, any die that cannot meet the target frequency has to be discarded as out of spec.
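As a rough illustration of that timing budget (not taken from the article, and using hypothetical delay values), the minimum clock period of a synchronous design is the sum of the delays along its slowest register-to-register path plus a margin for skew:

```go
package main

import "fmt"

func main() {
	// Hypothetical delays (in ns) for the slowest register-to-register path.
	tClkToQ := 0.10     // launch flip-flop clock-to-Q delay
	tLogic := 0.55      // worst-case combinational delay on the critical path
	tSetup := 0.05      // capture flip-flop setup time
	tSkewMargin := 0.05 // margin reserved for clock skew and jitter

	// The clock period must cover the whole path plus the margin.
	tMin := tClkToQ + tLogic + tSetup + tSkewMargin
	fmt.Printf("minimum period: %.2f ns, max frequency: %.2f GHz\n", tMin, 1.0/tMin)
}
```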
Clock skew complicates matters further. Although the clock originates from a single point, it experiences different delays as it is distributed across the chip. Clock skew is this deviation of the clock from its intended arrival time, and it, too, is heavily affected by manufacturing variation.
To mitigate these issues, designers often resort to multiple clocks or other elaborate schemes. While these create domains that are only loosely coupled, they also give rise to a new class of problems: clock domain crossings.
Moreover, the clock is one of the main consumers of power. Because the clock has to be distributed across the entire chip, a large amount of capacitance accumulates on the clock lines. Every clock edge charges or discharges that capacitance, which both limits speed and burns considerable power. To keep the load on each buffer manageable, still more buffers are inserted, which adds yet more power.
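To put a number on that, dynamic switching power is commonly estimated as P = a*C*V^2*f, and a clock net switches every cycle, so its activity factor is as high as it gets. A minimal sketch with made-up values:

```go
package main

import "fmt"

// dynamicPower estimates switching power in watts: P = alpha * C * V^2 * f.
func dynamicPower(alpha, capF, volts, freqHz float64) float64 {
	return alpha * capF * volts * volts * freqHz
}

func main() {
	// Hypothetical values: 1 nF of total clock-tree capacitance, 0.8 V supply,
	// 2 GHz clock. A clock net switches every cycle, so its activity factor is
	// the maximum; typical data nets switch far less often.
	clock := dynamicPower(1.0, 1e-9, 0.8, 2e9)
	data := dynamicPower(0.1, 1e-9, 0.8, 2e9) // same capacitance, 10% activity
	fmt.Printf("clock net: %.2f W, comparable data net: %.2f W\n", clock, data)
}
```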
As chips push up against process limits, running an entire device off a single synchronous clock has become impractical. Michael Frank, a retired senior engineer at Arteris, pointed out: "If a signal cannot traverse the entire chip within a single clock cycle, the design has to be treated as a hybrid: locally synchronous, but asynchronous over long distances. That requires synchronizers, or it means borrowing a strategy from the early CPU era and building a clock grid with very low skew. The problem is that clocks consume a lot of power and require a massive re-buffering tree to drive a huge number of flip-flops."
Synchronous design is attractive because, once the longest path is determined, timing becomes a relatively secondary concern: all operations are broken down into a series of discrete steps. That matters enormously to design tools such as synthesis.

Rob Aitken, a senior engineer at Synopsys, said: "Asynchronous design is often seen as a technology full of promise but difficult to implement. Apart from a few specific cases, it is really hard to apply broadly. That is an overly general statement, but if you take a piece of RTL optimized for synchronous design and try to implement the same functionality asynchronously, you first have to adjust the RTL to fit the asynchronous environment. Then you do the asynchronous implementation and evaluate whether the change has brought any actual benefit. In the long run, the world will eventually find ways to benefit from asynchronous design, but for now fully synchronous design still dominates, because all the tools and technologies are optimized around it."
Before any revolutionary change, tool support is crucial. Marly Roncken, director of the Asynchronous Research Center at Portland State University, pointed out: "Asynchronous design faces the classic chicken-and-egg problem: without suitable tools there are no users, and without users the tools never get built. That makes large companies hesitate, leaving asynchronous design the preserve of startups and research centers. Synchronous designs also contain elements of asynchronous logic, even if synchronous designers pay little attention to them. What I hope to see is seamless integration in the tools, so that synchronous and asynchronous design can complement each other's strengths and each be used where it works best."
So, will there be a decisive moment that makes asynchronous design an indispensable choice? Rajit Manohar, a professor of Electrical Engineering and Computer Science at Yale University, believes: "If given a set of design goals that include power consumption and energy, using asynchronous methods may achieve these goals faster than synchronous methods. This is particularly significant when the goals are very challenging. As long as there is enough time and resources, engineers can optimize any design. Although I cannot assert that a specific performance point can never be achieved, with the right tools, support, and capabilities, engineers can be creative and optimize their designs."
Historical Attempts
Back in the 1980s and 1990s, many top system companies actively explored the potential of asynchronous design. However, limited by design tools that were only suitable for synchronous logic at the time, these companies tried various techniques but ultimately did not adopt asynchronous design practices.
In the 1980s, most design work still had to be done manually. Professor Manohar from Yale University pointed out: "Quality tools allow us to design more complex chips, and more efficient processors enable us to run more advanced tools, forming a virtuous cycle of development. Today, we have complex EDA tools to design highly complex synchronous chips. Unfortunately, asynchronous design methods are not mature enough to effectively integrate into this development cycle. The first synchronous processor was born in the 1970s, and the first asynchronous processor was late to the game, not appearing until 1989, with a significant time gap between the two."
One research paper detailed 10 different ways of describing asynchronous systems, along with their associated synthesis methods. Its author, Scott Hauck, wrote: "Making a comprehensive, in-depth comparison of each method, especially on core issues such as performance, area, and power, is a daunting task. Unfortunately, not enough real comparisons have been done. Worse, despite some impressive individual results, there is no conclusive evidence that asynchronous circuits hold a significant advantage over synchronous methods. Which approach performs best on performance, area, or power, and whether it is worth the extra effort of abandoning the widely used synchronous model, remain matters of dispute."
Notably, these asynchronous techniques do not include the approach most common in early experiments. Ron Lavallee, President of You Know Solutions, pointed out: "Existing asynchronous methods essentially convert a synchronous Turing machine into an asynchronous system. The ideal starting point, however, would be a stateless, asynchronous foundation, with matching circuits built on top of it. Developing asynchronous systems is already harder than conventional synchronous design, and converting a synchronous Turing-machine design into an asynchronous one is a difficult technical problem in its own right."
Asynchronous design has also long suffered from the lack of a settled design methodology. "Asynchronous design is prone to errors," said Manohar. "In synchronous design, I can open a VLSI textbook showing 50 different types of latches and numerous circuit styles, even if commercial tools do not support many of them. Synchronous logic has an established design style, and engineers know how to make it work efficiently and get the results they want. In asynchronous logic there are many different methods with very different outcomes, and for an outsider it is hard to tell which one is appropriate. Choose the wrong one and you are in trouble. That is part of the problem. The lack of regular asynchronous chip production, of people with the relevant expertise, and of reliable design tools and automated methodologies are all significant factors holding asynchronous design back."

The circuits best suited to asynchronous design are those whose operation time depends on the data. Their defining characteristic is that some results can be computed quickly while others take longer. If every computation has to finish within a fixed clock period, the design must be sized for the longest possible computation.
Professor Manohar further explains: "Multiplication is a simple example. Suppose I am writing a piece of software to calculate the product of two numbers. By analyzing the code, I find that the multiplier is the performance bottleneck. I notice that, in most cases, the value of variable X is zero. As a software engineer, I might add a conditional check: if X equals zero, then return zero directly; otherwise, perform the multiplication operation. However, in clock-driven synchronous designs, this is not a good idea. Because, in the worst case, adding this check condition could lead to a decrease in frequency. In asynchronous design, this is an optimization because, on average, our performance is improved. This is the kind of problem we need to think about from an algorithmic perspective."
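A minimal sketch of that idea in software terms (the function names and cycle counts below are purely illustrative, not from any actual design): the zero check shortens the common case, which improves average latency in a self-timed pipeline but would only lengthen the worst-case path under a fixed clock.

```go
package main

import "fmt"

// slowMultiply stands in for a full multiplier; the second return value
// models its latency as a fixed worst-case cost (illustrative numbers).
func slowMultiply(x, y int64) (int64, int) {
	return x * y, 10
}

// earlyOutMultiply adds the software-style check: if x is zero, answer at
// once. An asynchronous consumer can accept the result whenever it is ready;
// a synchronous design would have to budget for the slowest path anyway.
func earlyOutMultiply(x, y int64) (int64, int) {
	if x == 0 {
		return 0, 1 // fast path
	}
	r, lat := slowMultiply(x, y)
	return r, lat + 1 // slow path now also pays for the check
}

func main() {
	_, fast := earlyOutMultiply(0, 12345)
	_, slow := earlyOutMultiply(7, 12345)
	// If 90% of the inputs have x == 0, average latency drops sharply even
	// though the worst case got slightly worse.
	avg := 0.9*float64(fast) + 0.1*float64(slow)
	fmt.Printf("fast: %d, slow: %d, average: %.1f cycles (vs. 10 fixed)\n", fast, slow, avg)
}
```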
Language and Tools
Language support is a significant barrier, because every design language in mainstream EDA use today is deeply optimized for synchronous input.
"We initially tried to introduce decision flowcharts," Mr. Lavallee from You Know Solutions explains, "These flowcharts have been widely used in General Motors' powertrain systems, involving thousands of manufacturing systems. However, one of the main challenges we faced was how to guide people to understand these flowcharts with parallel thinking. Their working principle is to propagate multiple decision flowcharts simultaneously and trigger event functions during the propagation process. This propagation can occur in physical, biological, or chemical substrates. In short, flowcharts (as shown in Figure 1) consist of a series of events, actions, and tests. Some have complained that flowcharts can become as complex and intractable as spaghetti code.
"Years ago we solved this problem by turning the flowcharts into a true parallel programming language. You simply draw a separate flowchart for each task or function, which breaks one complex, sprawling flowchart into multiple manageable small ones. We also introduced the concept of objects. Objects let us encapsulate action and test structures into higher-level actions and tests, which further improves readability and maintainability, and you can carry that encapsulation through as many levels as you need."
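As a loose software analogy only (this is not Lavallee's actual tooling, and the event, test, and action names below are invented), each flowchart can be pictured as an independent process that waits for events, applies a test, and fires an action, with several flowcharts propagating concurrently:

```go
package main

import (
	"fmt"
	"sync"
)

// event is whatever a flowchart waits on; the "test" and "action" below are
// the illustrative building blocks mentioned in the text.
type event struct{ name string }

// runFlowchart sketches one flowchart: it propagates on its own, testing each
// incoming event and firing an action when the test passes.
func runFlowchart(id string, events <-chan event, wg *sync.WaitGroup) {
	defer wg.Done()
	for ev := range events {
		if ev.name == "start" { // test
			fmt.Printf("flowchart %s: action fired on %q\n", id, ev.name) // action
		}
	}
}

func main() {
	var wg sync.WaitGroup
	a, b := make(chan event), make(chan event)
	wg.Add(2)
	// Two small flowcharts propagate concurrently instead of one large one.
	go runFlowchart("A", a, &wg)
	go runFlowchart("B", b, &wg)
	a <- event{"start"}
	b <- event{"start"}
	close(a)
	close(b)
	wg.Wait()
}
```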
Yale University has developed its own hardware description language. "It is essentially a message-passing programming language, where messages handle the communication between components," Manohar explained. "It lets us describe dataflow designs with a clear syntax for loops and communication. The language is based on CSP, which Tony Hoare developed in 1978, but with semantic innovations."
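Yale's language itself is not shown here, but the CSP model it builds on is straightforward to sketch: components communicate only through channels, and data moves when both sender and receiver are ready, with no clock involved. Go's channels are directly CSP-inspired, so a toy dataflow stage (illustrative names only) looks like this:

```go
package main

import "fmt"

// double is one dataflow component: it repeatedly receives a token on its
// input channel, transforms it, and sends it on its output channel. There is
// no clock; the send/receive handshake provides the synchronization.
func double(in <-chan int, out chan<- int) {
	for v := range in {
		out <- 2 * v
	}
	close(out)
}

func main() {
	in, out := make(chan int), make(chan int)
	go double(in, out)
	go func() {
		for i := 1; i <= 3; i++ {
			in <- i
		}
		close(in)
	}()
	for v := range out {
		fmt.Println(v) // 2, 4, 6: in order, yet with no global clock
	}
}
```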
However, transitioning from synchronous languages like Verilog is challenging. "Many synchronous design tools do not support asynchronous operation in their core software," Roncken pointed out. "That is especially true of timing analysis and test tools, which designers depend on. Our research is closely tied to the asynchronous tools developed at Yale, which in turn draw on deep knowledge and experience in asynchronous design from the California Institute of Technology, Philips Electronics, the University of Manchester, Intel, and others."

Yale's research project has been funded by DARPA's Electronics Resurgence Initiative (ERI). "We have built a complete ASIC flow for asynchronous circuits," said Manohar. "We have developed a suite of tools aimed at designing asynchronous circuits with less effort than the complicated design of a synchronous chip requires. Our goal is to show that we can automate the design of high-quality chips, or at least significantly reduce the clock design workload."
The verification process, however, brings some unique challenges. One is reproducibility. Simulation is deterministic, in that the order of events is the same every time a given simulation is run, but coordinating multiple asynchronous activities is difficult, and that is a significant problem in live systems. Even in a simulation environment, capturing and understanding the system state can be extremely challenging. Small changes can produce very different results in an asynchronous design, whereas a synchronous design is largely insulated from such effects.
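A tiny illustration of why the real system is harder to pin down than the simulation (a hypothetical sketch, not from the article): once two activities share no common clock, their completion order can change from one run to the next, which is exactly what makes a failure hard to capture.

```go
package main

import (
	"fmt"
	"math/rand"
	"time"
)

// worker finishes after an unpredictable delay, like a self-timed block whose
// completion time depends on the data and the silicon.
func worker(name string, done chan<- string) {
	time.Sleep(time.Duration(rand.Intn(5)) * time.Millisecond)
	done <- name
}

func main() {
	done := make(chan string, 2)
	go worker("A", done)
	go worker("B", done)
	// The order printed here can differ between runs; a deterministic
	// simulation of the same two blocks would always report the same order.
	fmt.Println(<-done, <-done)
}
```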
Additionally, problems can arise when using reference models for verification. Even though both models may be correct, their behavior patterns may differ, especially in the presence of asynchronous activities. Extra care must be taken to ensure that the reference model stays in sync with the design model.
"There are aspects where they are similar, and aspects where they are vastly different," said Professor Manohar, "We are leveraging formal methods and theorem provers to conduct research in this area to verify various properties of asynchronous designs. At a higher level of abstraction, we need different types of verification because we must check whether asynchronous computations are correctly implemented by gate circuits. We have developed some verification strategies that look more like strategies used for software verification."
Some aspects may be simpler. "The verification of clockless flowchart systems is easier because there is no need to verify every signal path," Mr. Lavallee pointed out, "Once the actions or test block structures used in the substrate have been thoroughly verified, there is no need to verify them again. Verification only involves the signal paths of the flow line and the overall behavior of the system."
Few designs can be entirely asynchronous, however, which means synchronous and asynchronous techniques must be combined. "Adaptive systems may be able to address variability," said Aitken. "Here you run into some more general problems. Suppose I build an asynchronous system and its performance improves; there are still two classic failure points for asynchronous design. One is test. Although this has changed recently, the standard way to test asynchronous circuits historically was to synchronize them and then run scan. The other is a technique often used in synchronous design: borrowing time across the clock cycle, as long as the integrity of the clock waveform does not degrade too far. Some things are neither entirely asynchronous nor entirely synchronous. These capabilities let synchronous designs capture some significant performance gains and power reductions, which means the advantage of a fully asynchronous system is not as large as it could be in theory."
Clock domain crossing (CDC) has become the bridge between the two worlds. "Subtle issues arise when you have two asynchronous clocks and the relative separation of their active edges changes dynamically," explained Prakash Narain, president and CEO of Real Intent. "At some point that separation becomes small enough that the path between the clocked flops cannot meet timing. To compensate, you must make sure the CDC crossings follow a specific set of logic design principles. For relatively slow chip-wide interconnect we have adopted the globally asynchronous, locally synchronous (GALS) approach. You create a clock domain, and within it you build islands of fully synchronous logic that close timing. Between the islands, because the clock tree is not balanced across them, the crossings are treated as asynchronous. They use the same source clock and frequency but may differ in phase. That could in principle bring some simplification, but usually it does not."
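A hedged sketch of the GALS idea (a software analogy, not Real Intent's tooling): two locally synchronous islands each pace themselves with their own local "clock", and the data crossing between them is a handshake that makes no assumption about their relative phase.

```go
package main

import (
	"fmt"
	"time"
)

// producer is one locally synchronous island: it advances on its own local
// ticker and pushes data into the crossing.
func producer(period time.Duration, crossing chan<- int) {
	tick := time.NewTicker(period)
	defer tick.Stop()
	for i := 0; i < 3; i++ {
		<-tick.C
		crossing <- i // blocks until the other island accepts (the "ack")
	}
	close(crossing)
}

// consumer is the second island, running at an unrelated rate; it samples the
// crossing on its own local ticks.
func consumer(period time.Duration, crossing <-chan int, done chan<- struct{}) {
	tick := time.NewTicker(period)
	defer tick.Stop()
	for v := range crossing {
		<-tick.C
		fmt.Println("island B received", v)
	}
	close(done)
}

func main() {
	crossing := make(chan int) // unbuffered: one request/acknowledge per item
	done := make(chan struct{})
	go producer(3*time.Millisecond, crossing)       // island A's local rate
	go consumer(5*time.Millisecond, crossing, done) // island B's slower rate
	<-done
}
```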
Simply carving a small piece out of a synchronous design and making it asynchronous does not work. "In my experience, the greatest advantage we gain from asynchronous design is that our approach to the entire problem is different," Manohar concluded. "We often come up with solutions that would not work well in a synchronous method, but because we adopted asynchronous logic, they turn out well."
Conclusion

So, does asynchronous design have a chance? "If what we understand today about asynchronous design, and about how to automate the various steps, had been known back in 1988, the situation might have been different," said Manohar. "We are at an interesting juncture where companies traditionally seen as software companies are now building silicon. That is an interesting opportunity, because there may be a group of people looking at chip design problems from a completely new perspective. That is an opportunity for asynchronous design."