A Study on Multimodal Large Models for Wireless Channels

Authors

  • Qianyi Qian

DOI:

https://doi.org/10.54097/x7skbe36

Keywords:

Wireless channels, multimodal model, communication technology.

Abstract

The rapid evolution of Fifth-Generation (5G) and Sixth-Generation (6G) communication technologies, coupled with complex and dynamic wireless propagation environments and diverse application scenarios, has imposed increasingly stringent demands on channel modeling. Traditional approaches, which predominantly rely on limited measurements or a restricted set of environmental parameters, often fail to comprehensively capture channel variations caused by the superposition of factors in real world such as diverse building shapes and terrain. This inherent limitation constrains the generalization capability and prediction accuracy of such models. In response, Multi-Modal Large Models (MMLMs) have emerged as a prominent research focus in wireless channel modeling nowadays. By integrating multi-source environmental data—such as visual, radar, and sensor information, MMLMs can more comprehensively characterize propagation environments, uncover correlations between different modalities, and enhance modeling accuracy and generalization as well. Hence, this paper will systematically review the fundamental methodologies and practical challenges of traditional wireless channel modeling. It analyzes the advantages of MMLMs in terms of input information richness, multi-perspective environmental comprehension, spatiotemporal sequence modeling, and generalization in complex scenarios, while also comparing their similarities and differences with conventional approaches. Furthermore, the paper synthesizes recent advancements in the application of MMLMs to channel modeling, evaluates their practical utility and developmental prospects, and discusses key implementation challenges along with potential future research directions.

Downloads

Download data is not yet available.

References

[1] Chen Huang, Ruisi He, Bo Ai, Andreas F. Molisch, Buon Kiong Lau, Katsuyuki Haneda, Bo Liu, Cheng-Xiang Wang, Mi Yang, Claude Oestges, Zhangdui Zhong. IEEE Transactions on Antennas and Propagation, Artificial Intelligence Enabled Radio Propagation for Communications—Part II: Scenario Identification and Channel Modeling, 2022. DOI: https://doi.org/10.1109/TAP.2022.3149665

[2] Ethan Zhu, Haijian Sun, Mingyue Ji. Physics-Informed Generalizable Wireless Channel Modeling with Segmentation and Deep Learning Fundamentals, Methodologies, and Challenges, 2024. DOI: https://doi.org/10.1109/MWC.015.2300603

[3] Mi Yang, Ruisi He, Bo Ai, Chen Huang, Chenlong Wang, Yuxin Zhang, Zhangdui Zhong. IEEE Communications Magazine. AI-Enabled Data-Driven Channel Modeling for Future Communications, 2024. DOI: https://doi.org/10.1109/MCOM.019.2300072

[4] Jun Jiang, Wenjun Yu, Yunfan Li, Yuan Gao, Shugong Xu. ICMLCN. MIMO Wireless Channel Foundation Model via CIR-CSI Consistency, 2025. DOI: https://doi.org/10.1109/ICMLCN64995.2025.11140262

[5] Kai Wang, Li Yu, Jianhua Zhang, Yixuan Tian, Eryu Guo, Guangyi Liu. China Mobile Research Institution. Multi-Modal Environmental Sensing Based Path Loss Prediction for V2I Communications, 2024. DOI: https://doi.org/10.1109/VTC2025-Spring65109.2025.11174652

[6] Ethan Zhu, Haijian Sun, and Mingyue Ji. arXiv: 2401.01288v1 [cs.IT] 2 Jan 2024. Physics-Informed_Generalizable_Wireless_Channel_Modeling_with_Segmentation_and_Deep_Learning_Fundamentals_Methodologies_and_Challenges, 2024.

[7] Can Zheng, Jiguang He, Guofa Cai, Zitong Yu, Chung G. Kang. arXiv: 2503.10432v2 [cs.LG] 27 Jun 2025. BeamLLM: Vision-Empowered mmWave Beam Prediction with Large Language Models, 2025. DOI: https://doi.org/10.1109/VTC2025-Fall65116.2025.11309918

[8] Tribhuvanesh Orekondy, Arash Behboodi, Joseph B. Soriaga. IEEE International Conference on Communications. MIMO-GAN: Generative MIMO Channel Modeling, 2022. DOI: https://doi.org/10.1109/ICC45855.2022.9839123

[9] Xiongwen Zhao, Fei Du, Suiyan Geng, Zihao Fu, Zhongyu Wang, Yu Zhang, Zhenyu Zhou, Lei Zhang, Liuqing Yang. IEEE Journal on Selected Areas in Communications. Playback of 5G and Beyond Measured MIMO Channels by an ANN-Based Modeling and Simulation Framework, 2020. DOI: https://doi.org/10.1109/JSAC.2020.3000827

[10] Chenlong Wang, Bo Ai, Ruisi He, Mi Yang, Shun Zhou, Long Yu, Yuxin Zhang, Zhicheng Qiu, Zhangdui Zhong, Jianhua Fan. IEEE Transactions on Machine Learning in Communications and Networking. Channel Path Loss Prediction Using Satellite Images: A Deep Learning Approach, 2024. DOI: https://doi.org/10.1109/TMLCN.2024.3454019

[11] Sreelal M. Mana, Kerolos G. K. Gabra, Sepideh M. Kouhini, Malte Hinrichs, Dominic Schulz, Peter Hellwig, Anagnostis Paraskevopoulos, Ronald Freund, Volker Jungnickel. IEEE Access. LIDAR-Assisted Channel Modelling for LiFi in Realistic Indoor Scenarios, 2022.

[12] Hasan F. Ates, Syed Muhammad Hashir, Tonçer Baykas, Bahadır K. Gunturk. IEEE Access. Path Loss Exponent and Shadowing Factor Prediction from Satellite Images Using Deep Learning, 2019. DOI: https://doi.org/10.1109/ACCESS.2019.2931072

[13] Omar Ahmadien, Hasan F. Ates, Tuncer Baykas, and Bahadır K. Gunturk. IEEE Access. Predicting Path Loss Distribution of an Area from Satellite Images Using Deep Learning, 2020. DOI: https://doi.org/10.1109/ACCESS.2020.2985929

[14] Shudong Zhou, Yu Liu, Rui Wang, Ziheng Li, Zhichao Xin, Jie Huang, and Ji Bian. IEEE Internet of Things Journal. A Multimodal Predictive Channel Model Based on Dual-Camera Images for IIoT Communications, 2025. DOI: https://doi.org/10.1109/JIOT.2025.3545469

Downloads

Published

29-01-2026

Issue

Section

Articles

How to Cite

Qian, Q. (2026). A Study on Multimodal Large Models for Wireless Channels. Academic Journal of Science and Technology, 19(2), 351-363. https://doi.org/10.54097/x7skbe36