Scalable Multi-View Stereo Camera Array for Real-Time Image Capture and 3D Display in Real-World Applications
DOI:
https://doi.org/10.54097/cyjs1142Keywords:
Multi-view stereo camera array, Real-time image capture, AI-based object tracking, Image calibration, Geometric correction.Abstract
3D display technology has advanced, finding applications in entertainment, healthcare, and education. However, existing multi-view content capture devices are limited by their reliance on single-camera setups or synthetic animations, constraining their flexibility and application range. This study proposes a scalable multi-view stereo camera array for real-time image capture and 3D display. The system uses 16 CMOS cameras, each with a resolution of 1920x1080 pixels, to synchronously capture multi-view images at 30 frames per second. Innovations include improved image calibration and geometric correction algorithms, completing each set of image calibration within 0.5 seconds with geometric correction accuracy of 0.1 pixels. The system also incorporates AI-based object tracking, capable of tracking targets moving at speeds up to 5 meters per second with 90% accuracy, and high-speed data transmission to ensure real-time image transfer with latency below 1 second. AI algorithms enhance performance in image calibration and object tracking. Machine learning techniques improve geometric correction accuracy and efficiency, while deep learning models ensure robust tracking in dynamic scenes. This system overcomes limitations of traditional single-camera setups and synthetic animations, offering improved capture efficiency and higher quality 3D images. It shows potential in multi-view facial recognition, stereo surgical training, and drone stereo monitoring. Future research will optimize image calibration and geometric correction algorithms, enhance object tracking stability, and explore additional application scenarios to improve system practicality and reliability.
References
Yoon, J. S., Ceylan, D., Wang, T. Y., Lu, J., Yang, J., Shu, Z., & Park, H. S. (2022). Learning motion-dependent appearance for high-fidelity rendering of dynamic humans from a single camera. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 3407-3417).
Yang, Y., Guo, Z., Gellman, A. J., & Kitchin, J. R. (2022). Simulating segregation in a ternary Cu–Pd–Au alloy with density functional theory, machine learning, and Monte Carlo simulations. The Journal of Physical Chemistry C, 126(4), 1800-1808.
Gao, H., Li, R., Tulsiani, S., Russell, B., & Kanazawa, A. (2022). Monocular dynamic view synthesis: A reality check. Advances in Neural Information Processing Systems, 35, 33768-33780.
Xu, T. (2024). Comparative Analysis of Machine Learning Algorithms for Consumer Credit Risk Assessment. Transactions on Computer Science and Intelligent Systems Research, 4, 60-67.
He, F., & Habib, A. (2018). Three-point-based solution for automated motion parameter estimation of a multi-camera indoor mapping system with planar motion constraint. ISPRS Journal of Photogrammetry and Remote Sensing, 142, 278-291.
Yao, Y. (2022). A Review of the Comprehensive Application of Big Data, Artificial Intelligence, and Internet of Things Technologies in Smart Cities. Journal of Computational Methods in Engineering Applications, 1-10.
Xu, X., Li, K., Xu, C., & He, S. (2020, April). GDFace: Gated deformation for multi-view face image synthesis. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 34, No. 07, pp. 12532-12540).
Xia, Y., Liu, S., Yu, Q., Deng, L., Zhang, Y., Su, H., & Zheng, K. (2023). Parameterized Decision-making with Multi-modal Perception for Autonomous Driving. arXiv preprint arXiv:2312.11935.
Harfouche, M., Kim, K., Zhou, K. C., Konda, P. C., Sharma, S., Thomson, E. E., ... & Horstmeyer, R. (2023). Imaging across multiple spatial scales with the multi-camera array microscope. Optica, 10(4), 471-480.
Zhang, Y., Yang, K., Wang, Y., Yang, P., & Liu, X. (2023, July). Speculative ECC and LCIM Enabled NUMA Device Core. In 2023 3rd International Symposium on Computer Technology and Information Science (ISCTIS) (pp. 624-631). IEEE.
Machicoane, N., Aliseda, A., Volk, R., & Bourgoin, M. (2019). A simplified and versatile calibration method for multi-camera optical systems in 3D particle imaging. Review of Scientific Instruments, 90(3).
Lin, Y. Discussion on the Development of Artificial Intelligence by Computer Information Technology.
Lin, Y. (2023). Optimization and Use of Cloud Computing in Big Data Science. Computing, Performance and Communication Systems, 7(1), 119-124.
Lin, Y. (2023). Construction of Computer Network Security System in the Era of Big Data. Advances in Computer and Communication, 4(3).
Ullah, H., Zia, O., Kim, J. H., Han, K., & Lee, J. W. (2020). Automatic 360 mono-stereo panorama generation using a cost-effective multi-camera system. Sensors, 20(11), 3097.
Qiu, L., & Liu, M. (2024). Innovative Design of Cultural Souvenirs Based on Deep Learning and CAD.
Stathopoulou, E. K., & Remondino, F. (2023). A survey on conventional and learning‐based methods for multi‐view stereo. The Photogrammetric Record, 38(183), 374-407.
Yao, Y. (2024). Digital Government Information Platform Construction: Technology, Challenges and Prospects. International Journal of Social Sciences and Public Administration, 2(3), 48-56.
Yao, Y. (2024). Application of Artificial Intelligence in Smart Cities: Current Status, Challenges and Future Trends. International Journal of Computer Science and Information Technology, 2(2), 324-333.
Bortolon, M., Bazzanella, L., & Poiesi, F. (2021). Multi-view data capture for dynamic object reconstruction using handheld augmented reality mobiles. Journal of Real-Time Image Processing, 18(2), 345-355.
Xu, T. (2024). Credit Risk Assessment Using a Combined Approach of Supervised and Unsupervised Learning. Journal of Computational Methods in Engineering Applications, 1-12.
Olagoke, A. S., Ibrahim, H., & Teoh, S. S. (2020). Literature survey on multi-camera system and its application. IEEE Access, 8, 172892-172922.
Liu, M., & Li, Y. (2023, October). Numerical analysis and calculation of urban landscape spatial pattern. In 2nd International Conference on Intelligent Design and Innovative Technology (ICIDIT 2023) (pp. 113-119). Atlantis Press.
Christodoulou, L. (2013). Overview: 3D stereo vision camera-sensors-systems, advancements, and technologies. 3D Stereo Vision Camera-sensors, Advancements, and Technologies, 73.
Lin, Y. (2024). Application and Challenges of Computer Networks in Distance Education. Computing, Performance and Communication Systems, 8(1), 17-24.
Lin, Y. (2024). Design of urban road fault detection system based on artificial neural network and deep learning. Frontiers in neuroscience, 18, 1369832.
Nocerino, E., Dubbini, M., Menna, F., Remondino, F., Gattelli, M., & Covi, D. (2017). Geometric calibration and radiometric correction of the maia multispectral camera. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 42, 149-156.
Yang, Y., Guo, Z., Gellman, A. J., & Kitchin, J. (2022, November). Modeling Ternary Alloy Segregation with Density Functional Theory and Machine Learning. In 2022 AIChE Annual Meeting. AIChE.
Yang, Y., Liu, M., & Kitchin, J. R. (2022). Neural network embeddings based similarity search method for atomistic systems. Digital Discovery, 1(5), 636-644.
Yang, Y., Achar, S. K., & Kitchin, J. R. (2022). Evaluation of the degree of rate control via automatic differentiation. AIChE Journal, 68(6), e17653.
Rathore, M. M., Paul, A., Ahmad, A., Chilamkurti, N., Hong, W. H., & Seo, H. (2018). Real-time secure communication for Smart City in high-speed Big Data environment. Future Generation Computer Systems, 83, 638-652.
Yang, J. (2024). Data-Driven Investment Strategies in International Real Estate Markets: A Predictive Analytics Approach. International Journal of Computer Science and Information Technology, 3(1), 247-258.
Yang, J. (2024). Comparative Analysis of the Impact of Advanced Information Technologies on the International Real Estate Market. Transactions on Economics, Business and Management Research, 7, 102-108.
Yang, J. (2024). Application of Business Information Management in Cross-border Real Estate Project Management. International Journal of Social Sciences and Public Administration, 3(2), 204-213.
Wang, C., Yang, H., Chen, Y., Sun, L., Wang, H., & Zhou, Y. (2012). Identification of Image-spam Based on Perimetric Complexity Analysis and SIFT Image Matching Algorithm. JOURNAL OF INFORMATION &COMPUTATIONAL SCIENCE, 9(4), 1073-1081.
Amosa, T. I., Sebastian, P., Izhar, L. I., Ibrahim, O., Ayinla, L. S., Bahashwan, A. A., ... & Samaila, Y. A. (2023). Multi-camera multi-object tracking: a review of current trends and future advances. Neurocomputing, 552, 126558.
Wang, C., Yang, H., Chen, Y., Sun, L., Zhou, Y., & Wang, H. (2010). Identification of Image-spam Based on SIFT Image Matching Algorithm. JOURNAL OF INFORMATION &COMPUTATIONAL SCIENCE, 7(14), 3153-3160.
Yang, Y., Jiménez-Negrón, O. A., & Kitchin, J. R. (2021). Machine-learning accelerated geometry optimization in molecular simulation. The Journal of Chemical Physics, 154(23).
Kim, D., Comandur, B., Medeiros, H., Elfiky, N. M., & Kak, A. C. (2017). Multi-view face recognition from single RGBD models of the faces. Computer Vision and Image Understanding, 160, 114-132.
Tu, H., Shi, Y., & Xu, M. (2023, May). Integrating conditional shape embedding with generative adversarial network-to assess raster format architectural sketch. In 2023 Annual Modeling and Simulation Conference (ANNSIM) (pp. 560-571). IEEE.
Jiang, X., Shokri-Ghadikolaei, H., Fodor, G., Modiano, E., Pang, Z., Zorzi, M., & Fischione, C. (2018). Low-latency networking: Where latency lurks and how to tame it. Proceedings of the IEEE, 107(2), 280-306.
Shi, Y., Ma, C., Wang, C., Wu, T., & Jiang, X. (2024, May). Harmonizing Emotions: An AI-Driven Sound Therapy System Design for Enhancing Mental Health of Older Adults. In International Conference on Human-Computer Interaction (pp. 439-455). Cham: Springer Nature Switzerland.
Lopez, C. D., Boddapati, V., Lee, N. J., Dyrszka, M. D., Sardar, Z. M., Lehman, R. A., & Lenke, L. G. (2021). Three-dimensional printing for preoperative planning and pedicle screw placement in adult spinal deformity: a systematic review. Global Spine Journal, 11(6), 936-949.
Soana, V., Shi, Y., & Lin, T. A Mobile, Shape-Changing Architectural System: Robotically-Actuated Bending-Active Tensile Hybrid Modules.
Zhong, Y., Liu, Y., Gao, E., Wei, C., Wang, Z., & Yan, C. (2024). Deep Learning Solutions for Pneumonia Detection: Performance Comparison of Custom and Transfer Learning Models. medRxiv, 2024-06.
Lian, J., & Chen, T. (2024). Research on Complex Data Mining Analysis and Pattern Recognition Based on Deep Learning. Journal of Computing and Electronic Information Management, 12(3), 37-41.
Chen, T., Lian, J., & Sun, B. (2024). An Exploration of the Development of Computerized Data Mining Techniques and Their Application. International Journal of Computer Science and Information Technology, 3(1), 206-212.
An, L., Song, C., Zhang, Q., & Wei, X. (2024). Methods for assessing spillover effects between concurrent green initiatives. MethodsX, 12, 102672.
Shih, H. C., Wei, X., An, L., Weeks, J., & Stow, D. (2024). Urban and Rural BMI Trajectories in Southeastern Ghana: A Space-Time Modeling Perspective on Spatial Autocorrelation. International Journal of Geospatial and Environmental Research, 11(1), 3.
Downloads
Published
Issue
Section
License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.