Research on Hardware Acceleration Optimisation Strategies for Deep Learning in Computer Vision

Yifan Gao

doi:10.54097/cnqx4b90

Authors

Yifan Gao

DOI:

https://doi.org/10.54097/cnqx4b90

Keywords:

Deep neural network, Computer vision, Hardware accelerator.

Abstract

As deep neural network (DNN) models get larger and more complicated, the importance of hardware acceleration becomes more and more apparent. This paper discusses various hardware acceleration strategies for deep learning, especially in the area of computer vision. It explores the use of GPUs, FPGAs, and ASICs, detailing their respective strengths and weaknesses in accelerating DNNs. This paper argues that the future of DNN hardware acceleration lies in hybrid approaches that combine the advantages of different architectures. Software advances such as improved compilers and synthesis tools will also play a critical role in making these techniques more accessible. By utilizing the appropriate hardware technology for a given task and continuing to innovate in both hardware and software, computer vision will make significant advances in performance, efficiency, and scalability. This hybrid approach is key to the future of DNN hardware acceleration, offering a path to overcome the limitations of any single type of hardware.

Downloads

Download data is not yet available.

References

Bianco, S.; Cadene, R.; Celona, L.; Napoletano, T. Benchmark Analysis of Representative Deep Neural Network Architectures. IEEE Access. 2018, 6, 64270 – 67277.

Ridnik T, Lawen H, Noy A, et al. Tresnet: High performance gpu-dedicated architecture [C]//proceedings of the IEEE/CVF winter conference on applications of computer vision. 2021: 1400 - 1409.

Feng X, Jiang Y, Yang X, et al. Computer vision algorithms and hardware implementations: A survey[J]. Integration, 2019, 69: 309 - 320.

Magnus Halvorsen, Hardware Acceleration of Convolutional Neural Networks, MS thesis, Norwegian University of Science Technology, 2015.

Jouppi N, Kurian G, Li S, et al. Tpu v4: An optically reconfigurable supercomputer for machine learning with hardware support for embeddings[C]//Proceedings of the 50th Annual International Symposium on Computer Architecture. 2023: 1 - 14.

Eriko Nurvitadhi, et al., Can FPGAs beat GPUs in accelerating next-generation deep neural networks? in: International Symposium on Field-Programmable Gate Arrays, 2017.

Wu X, Ma Y, Wang M, et al. A flexible and efficient FPGA accelerator for various large-scale and lightweight CNNs [J]. IEEE Transactions on Circuits and Systems I: Regular Papers, 2021, 69 (3): 1185 - 1198.

Redmon J, Farhadi A. Yolov3: An incremental improvement [J]. arXiv preprint arXiv: 1804.02767, 2018.

Joe Osborne, Google's Tensor Processing Unit Explained: This Is what the Future Computing Looks like, TechRadar, 2016.

Norm Jouppi, Google Supercharges Machine Learning Tasks with TPU Custom Chip, Google Could, 2017.

Naveen Rao, Intel Nervana Neural Network Processor (NNP) Redefine AI Silicon, Intel Website, 2017.

Research on Hardware Acceleration Optimisation Strategies for Deep Learning in Computer Vision

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

Issue

Section

License

How to Cite

Indexing

Latest publications