This report explores an analog-based approach to CNN acceleration, emphasizing energy efficiency in resource-constrained embedded environments. The architecture integrates mixed-signal elements for CNN convolution, multiplication, accumulation, and activation, leveraging the power and speed of analog circuits. This design targets applications requiring low power consumption, real-time processing, and high-speed inference, such as embedded AI in image processing.
The growth of AI and deep learning has amplified the demand for high-performance hardware accelerators capable of efficiently handling large computations. Convolutional Neural Networks (CNNs), in particular, are widely used in tasks like image recognition but often require significant computational power, leading to challenges in latency and power consumption. Traditional digital accelerators handle these computations but can be inefficient for real-time applications. This report presents a low-power analog accelerator that performs core CNN operations, including multiplication, accumulation, activation (ReLU), and pooling, while avoiding digital conversion where possible.
In digital CNN accelerators, the multiply-and-accumulate (MAC) operations consume considerable power. Analog circuits, by contrast, can perform these operations directly with minimal energy consumption and without the need for digital representation, potentially providing real-time inference with lower power consumption.
The DACs convert binary digital values to analog currents, essential for initiating the convolution operation. This project includes an 8-bit DAC multiplier testing and a 4-bit DAC multiplier testing to explore the trade-offs in precision and power consumption.
Schematic (8-bit Current Steering DAC):

Schematic (4-bit Current Steering DAC):

For an
-
$$( D )$$ : Decimal representation of the binary input. -
$$( n )$$ : Number of bits in the DAC. -
$$( I_{ref} )$$ : Reference current.
The 8-bit DAC provides higher precision, while the 4-bit DAC is more power-efficient, making it suitable for applications where lower precision is acceptable.
Waveform (8-bit DAC tb):
Waveform (8-bit DAC waveform):

The multiplier circuit uses current mirrors to multiply the DAC output with another analog input. The circuit configuration allows precise scaling through transistor sizing, supporting both 8-bit and 4-bit multipliers.
The output current
where:
-
$$( k_1 )$$ and$$( k_2 )$$ represent scaling factors from transistor (W/L) ratios. -
$$( I_{DAC} )$$ : DAC output current. -
$$( I_{in} )$$ : Secondary input current.
This output is directly proportional to the product of
The integrator sums the products generated by the multiplier circuit, combining results from multiple operations without A/D conversion, which reduces power and latency.
For an integrator with capacitance
This formulation shows the continuous accumulation of input current over time, providing the summation needed in MAC operations.
The max pooling circuit performs down-sampling by retaining the maximum value within a region. This circuit outputs the highest current value from a set, representing the maximum activation in analog form.
For inputs
This reduces data while retaining essential features, making it a crucial component in CNNs.
The Rectified Linear Unit (ReLU) introduces non-linearity into the network by zeroing out negative values while allowing positive values to pass.
For an input current
The waveform shows accurate digital-to-analog conversion for the 8-bit DAC, reflecting high precision.
These waveforms validate the accuracy and functionality of the multiplier circuits for both the 8-bit and 4-bit configurations.
The ReLU waveform shows that the circuit accurately outputs zero for negative inputs and allows positive currents to pass.
This analog accelerator design provides efficient MAC operations for CNNs in resource-constrained environments. Through analog implementation, the design reduces power consumption and latency. Future improvements include:
- Component optimization for power and speed.
- Support for multi-channel convolutions.
- Expanded testing with diverse embedded applications.
- Razavi, B., “The Current-Steering DAC [A Circuit for All Seasons],” IEEE Solid-State Circuits Magazine, vol. 10, no. 1, pp. 11-15, Winter 2018.
- Asghar, M. S., et al., “A Digitally Controlled Analog Kernel for Convolutional Neural Networks,” ISOCC 2021.
- Zhu, J., et al., “Analog Implementation of Reconfigurable Convolutional Neural Network Kernels,” APCCAS 2019.







