• Journal of Semiconductors
  • Vol. 41, Issue 2, 022404 (2020)
Chunyou Su1, Sheng Zhou2, Liang Feng1, and Wei Zhang1
Author Affiliations
  • 1Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology, Hong Kong, China
  • 2Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong, China
    DOI: 10.1088/1674-4926/41/2/022404
    Chunyou Su, Sheng Zhou, Liang Feng, Wei Zhang. Towards high performance low bitwidth training for deep neural networks[J]. Journal of Semiconductors, 2020, 41(2): 022404
    Fig. 1. NR simulation.
    Fig. 2. SR simulation.
    Fig. 3. Execution modules.
    Fig. 4. Whole design structure.
    Fig. 5. Module structure example.
    Fig. 6. Random number generator.
    Model     | 8-bit model (SR) | 8-bit model (NR) | Acc. drop
    AlexNet   | 54.34%           | 52.46%           | 1.88%
    ResNet-18 | 65.96%           | 65.72%           | 0.24%
    Table 1. Top-1 accuracy of 8-bit AlexNet and ResNet-18, SR versus NR.
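Table 1 contrasts stochastic rounding (SR) with nearest rounding (NR) for 8-bit training. A minimal sketch of the two rounding modes follows; the function name, per-tensor symmetric scaling, and signed 8-bit range are our illustrative assumptions, not necessarily the paper's exact quantizer.

```python
import numpy as np

def quantize(x, bits=8, stochastic=False, rng=None):
    """Quantize x to a signed `bits`-bit fixed-point grid and dequantize back.

    stochastic=False: nearest rounding (NR).
    stochastic=True:  stochastic rounding (SR) -- round up with probability
                      equal to the fractional part, so E[SR(y)] = y.
    """
    if rng is None:
        rng = np.random.default_rng()
    qmax = 2 ** (bits - 1) - 1            # 127 for 8 bits
    scale = np.max(np.abs(x)) / qmax      # per-tensor scale (assumption)
    y = x / scale
    if stochastic:
        lo = np.floor(y)
        y = lo + (rng.random(y.shape) < (y - lo))  # probabilistic round-up
    else:
        y = np.round(y)
    return np.clip(y, -qmax - 1, qmax) * scale
```

SR introduces more per-sample noise than NR, but its rounding error is zero in expectation, which is why low-bitwidth training tends to preserve accuracy better with SR (as Table 1 shows).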
    Model     | Full   | 8-bit model | Acc. drop
    ResNet-20 | 92.24% | 92.12%      | 0.12%
    ResNet-56 | 94.14% | 93.75%      | 0.39%
    Table 2. Top-1 accuracy on CIFAR-10 dataset.
    Model                | Full   | 8-bit model | Acc. drop
    AlexNet (DoReFa[14]) | 55.9%  | 53.0%       | 2.9%
    AlexNet              | 54.76% | 54.34%      | 0.42%
    ResNet-50            | 75.46% | 74.14%      | 1.32%
    Inception V3         | 76.95% | 75.03%      | 1.92%
    Table 3. Top-1 accuracy on ImageNet dataset.
    Parameter  | BRAM | DSP | FF     | LUT
    Used       | 238  | 610 | 434213 | 564233
    Percentage | 5%   | 8%  | 18%    | 47%
    Table 4. Resource usage of FPGA prototyping.