Neural Network-Based Multimodal Fusion for Navigation

Authors

  • Xingwei Mao

DOI:

https://doi.org/10.54097/6t7rmv72

Keywords:

Navigation, Fusion, Multimodal.

Abstract

This article is based on deep neural networks and explores how to effectively integrate multiple sensor data to improve navigation accuracy and environmental perception capabilities. With the popularity of sensors such as LiDAR, cameras, and inertial measurement units, various data sources provide rich information, which provides possibilities for multimodal fusion. This paper presented a new multimodal data fusion framework that combines a convolutional neural network (CNN) and a long short-term memory network (LSTM) to process spatial and temporal data features. This effectively enhances the robustness and adaptability of navigation systems in changing environments. Research has shown that multimodal fusion navigation methods based on neural networks not only improve positioning accuracy but also increase response speed by 30% in dynamic environments. In addition, by analyzing the data fusion process of different sensors, we can find that the synergy between sensors can effectively reduce the measurement error and improve the robustness of the system, especially in complex terrain and a changing environment.

Downloads

Download data is not yet available.

References

[1] Liu J, Guo X L. Intelligent connected vehicle environment obstacle perception method based on machine learning. Industrial Control Computer, 2025, 38 (06): 101–102+106.

[2] Huang D, Liu X, Xu G. Research progress of deep learning applications in mass spectrometry imaging data analysis. Chinese Journal of Chromatography, 2024, 42: 669–680.

[3] Gao C, Yang Y, Chen S C, et al. Survey on multimodal model-driven embodied intelligence research. Intelligent Perception Engineering, 2025, 2 (02): 1–12.

[4] Du J, Luo Y L. Robot environment perception and path planning by Velodyne HDL-32E 3D LiDAR. Journal of Machine Design, 2025, 42 (07): 178–184.

[5] Jia X, Li S L, She Y S, et al. Research on environmental perception information unified fusion method of intelligent vehicle based on interactive multiple models. Automotive Engineering, 2025, 47 (6): 1144–1154.

[6] Zheng K, Huang D Y, Li Y P. Research on an efficient multi-modal fusion bird’s-eye view perception algorithm. China Auto, 2025, 35 (06): 341–350.

[7] Zhao X, Wang L, Zhang Y, et al. A review of convolutional neural networks in computer vision. Artificial Intelligence Review, 2024, 57 (4): 43.

[8] Li W, Wei Y, An D, et al. LSTM-TCN: dissolved oxygen prediction in aquaculture, based on combined model of long short-term memory network and temporal convolutional network. Environmental Science and Pollution Research, 2022, 29 (26): 1–29.

[9] Baltrusaitis T, Ahuja C, Morency L P. Multimodal machine learning: a survey and taxonomy. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, PP (99): 1–1.

[10] Xing H T, Guo J L, Liu S A, et al. NOx emission forecasting based on CNN-LSTM hybrid neural network. Electronic Measurement Technology, 2022, 45 (02): 98–103.

Downloads

Published

13-03-2026

Issue

Section

Articles

How to Cite

Mao, X. (2026). Neural Network-Based Multimodal Fusion for Navigation. Academic Journal of Science and Technology, 19(3), 84-90. https://doi.org/10.54097/6t7rmv72