Key Technologies, Challenges and Future Prospects of Neural Network in Automatic Driving System
DOI:
https://doi.org/10.54097/4b5yhf59Keywords:
Automatic driving; Deep learning; Convolutional neural network; Transformer; End to end learning.Abstract
Automatic driving technology is a disruptive force to reshape the future travel and transportation system. Its development has undergone a fundamental transformation from a rule-based modular approach to a data-driven deep learning paradigm. This paper systematically summarizes the key role, frontier progress and core challenges of neural network as the core technology engine in the whole technology stack of automatic driving perception, decision-making, planning and control. Firstly, this paper summarizes the application foundation of convolutional neural network, cyclic neural network, transformer and graph neural network in the field of automatic driving; Then the representative work and implementation path in the core tasks of multi-sensor fusion 3D target detection, interactive behavior prediction, end-to-end driving are analyzed in detail. The analysis shows that although the neural network has greatly improved the performance ceiling of the system, its inherent black box characteristics, vulnerability to adversarial attacks, difficult to deal with the "long tail problem" and high computing costs are still the fundamental obstacles restricting its safe landing. Finally, this paper looks forward to the future research directions of multimodal large-scale model, causal reasoning, neural radiation field simulation and vehicle road coordination, aiming to provide a comprehensive and profound perspective of technology development for researchers in the field.
Downloads
References
[1] Tian Y, Pei K, Jana S, et al. DeepTest: Automated Testing of Deep-Neural-Network-driven Autonomous Cars. Proceedings of the 27th ACM SIGSOFT International Symposium on Software Testing and Analysis, 2018.
[2] Bojarski M, et al. End to end learning for self-driving cars. arXiv preprint arXiv:1604.07316, 2016.
[3] Chen X, Ma H, Wan J, Li B, & Xia T. Multi-view 3d object detection network for autonomous driving. In Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (pp. 1907-1915), 2017.
[4] Redmon J, Divvala S, Girshick R, & Farhadi A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2016.
[5] Lang A H, Vora S, Caesar H, Zhou L, Yang J, & Beijbom O. PointPillars: Fast Encoders for Object Detection from Point Clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.
[6] Long J, Shelhamer E, & Darrell T. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2015.
[7] Prakash A, Chitta K, & Geiger A. Multi-Modal Fusion Transformer for End-to-End Autonomous Driving. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021.
[8] Gao J, Sun C, Zhao H, Shen Y, Anguelov D, Li C, & Schmid C. VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.
[9] Szegedy C, Zaremba W, Sutskever I, Bruna J, Erhan D, Goodfellow I, & Fergus R. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199, 2013.
[10] Wang Z, et al. DriveGPT: Towards Scalable Autonomous Driving with Large-Scale Video Pre-Training. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Frontiers in Computing and Intelligent Systems

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

