Interdisciplinary Perspectives on Intangible Cultural Heritage Digitization: A Review of Vision-Based Human Pose Estimation Research

Authors

  • Zixuan Lu
  • Mingyuan Guo

DOI:

https://doi.org/10.54097/sqjqd585

Keywords:

Intangible cultural heritage(ICH); Human Pose Estimation(HPE)

Abstract

The digitization of intangible cultural heritage (ICH) has emerged as a pivotal direction for cultural preservation and transmission. Among enabling technologies, human pose estimation offers critical support for digitally documenting and disseminating performance-based ICH, such as Tai Chi and traditional dances. This paper systematically reviews the applications, technical challenges, and future trends of computer vision-based human pose estimation in ICH digitization. First, we analyze the strategic significance of ICH digitization and the unique requirements for motion capture in performance-based ICH. Next, we trace the technological evolution from traditional sensor-based methods to deep learning-driven approaches, with a focus on the applicability of core techniques, including pose estimation algorithms and temporal modeling in ICH contexts. Subsequently, we identify persistent bottlenecks, such as occlusion by intricate costumes, low-quality archival footage processing, and cross-cultural semantic understanding of movements. Finally, we prospect future research directions, including lightweight models, immersive virtual-augmented teaching systems, and digital twins, while proposing a synergistic "technology-culture-industry" innovation framework to bolster the dynamic preservation and global dissemination of ICH. By bridging computer vision and cultural heritage studies, this work aims to advance interdisciplinary research and foster sustainable digital safeguarding of traditional culture.

References

[1] Directives, U. N. E. S. C. O. "UNESCO Operational Directives for the Implementation of the Convention for the Safeguarding of the Intangible Heritage." 2022.

[2] Wang, Zhuo Yu, Rong Qun Chen, and Jun Yu Zhao. "Research on the development of China’s cultural tourism Industry during the “14th Five-Year Plan” period." SHS Web of Conferences. Vol. 165. EDP Sciences, 2023.

[3] Wang, Xiaobing. "Preservation and transmission of traditional chinese martial arts in the digital era." Of Papers Presented at 2023 4th Asia Sport Science Conference (ASSC). 2023.

[4] Gao, Yang, et al. "Inheritors’ happiness and its relevant factors in intangible cultural heritage." Sustainability 14.21 (2022): 14084.

[5] Hou, Yumeng, et al. "Digitizing intangible cultural heritage embodied: State of the art." Journal on Computing and Cultural Heritage (JOCCH) 15.3 (2022): 1-20.

[6] Kico, Iris, et al. "Digitization and visualization of folk dances in cultural heritage: A review." Inventions 3.4 (2018): 72.

[7] Lan, Gongjin, et al. "Vision-based human pose estimation via deep learning: A survey." IEEE Transactions on Human-Machine Systems 53.1 (2022): 253-268.

[8] Colyer, Steffi L., et al. "A review of the evolution of vision-based motion analysis and the integration of advanced computer vision methods towards developing a markerless system." Sports medicine-open 4 (2018): 1-15.

[9] Ghafoor, Mehwish, and Arif Mahmood. "Quantification of occlusion handling capability of a 3D human pose estimation framework." IEEE Transactions on Multimedia 25 (2022): 3311-3318.

[10] Zheng, Ce, et al. "Deep learning-based human pose estimation: A survey." ACM Computing Surveys 56.1 (2023): 1-37.

[11] Stacchio, Lorenzo, et al. "Danxe: An extended artificial intelligence framework to analyze and promote dance heritage." Digital Applications in Archaeology and Cultural Heritage 33 (2024): e00343.

[12] Guigon, Emmanuel, et al. "Experimental and theoretical study of velocity fluctuations during slow movements in humans." Journal of neurophysiology 121.2 (2019): 715-727.

[13] Mei, Xiao, and Yang Xiao. "The “ICH Movement” in China." Music, Communities, Sustainability: Developing Policies and Practices (2022): 101.

[14] Qi, Fengmeng, et al. "Effects of taichi on physical and psychological health of college students: A systematic review." Frontiers in Physiology 13 (2022): 1008604.

[15] Rallis, Ioannis, et al. "Choreographic pattern analysis from heterogeneous motion capture systems using dynamic time warping." Technologies 7.3 (2019): 56.

[16] Bostanci, Erkan, Nadia Kanwal, and Adrian F. Clark. "Augmented reality applications for cultural heritage using Kinect." Human-centric Computing and Information Sciences 5 (2015): 1-18.

[17] Afyenni, Rita, et al. "Augmented Reality Applications for Cultural Heritage Using Kinect Sensor." International Journal of Advanced Science Computing and Engineering 6.3 (2024): 127-131.

[18] Goldfarb, Nathaniel, et al. "Open source Vicon Toolkit for motion capture and Gait Analysis." Computer methods and programs in biomedicine 212 (2021): 106414.

[19] Schlagenhauf, Franziska, Siddarth Sreeram, and William Singhose. "Comparison of kinect and vicon motion capture of upper-body joint angle tracking." 2018 IEEE 14th international conference on control and automation (ICCA). IEEE, 2018.

[20] Tölgyessy, Michal, et al. "Evaluation of the azure kinect and its comparison to kinect v1 and kinect v2." Sensors 21.2 (2021): 413.

[21] Qiao, Sen, Yilin Wang, and Jian Li. "Real-time human gesture grading based on OpenPose." 2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI). IEEE, 2017.

[22] Sun, Ke, et al. "Deep high-resolution representation learning for human pose estimation." Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2019.

[23] Yu, Changqian, et al. "Lite-hrnet: A lightweight high-resolution network." Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2021.

[24] Wang, Hao, Dingli Luo, and Takeshi Ikenaga. "Image information assistance neural network for videopose3d-based monocular 3d pose estimation." 2021 17th International Conference on Machine Vision and Applications (MVA). IEEE, 2021.

[25] Zhang, Ping, et al. "Model and algorithm of coordinated flow controlling with station-based constraints in a metro system." Transportation Research Part E: Logistics and Transportation Review 148 (2021): 102274.

[26] Zhou, Kangkang, et al. "Efficient hierarchical multi-view fusion transformer for 3D human pose estimation." Proceedings of the 31st ACM International Conference on Multimedia. 2023.

[27] Har, Austin Oting. "Integrating Live Computer Tools into the Creation, Adaptation, and Performance of Japanese Noh Theatre." Audio Engineering Society Convention 155. Audio Engineering Society, 2023.

[28] Shekhawat, Megha Yashwant. Reflecting changes in performing arts over time: Bharatanatyam as a case study. Diss. 2023.

[29] Hou, Jianfeng, et al. "The implementation of a Beijing Opera interactive display system based on motion recognition." 2021 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA). IEEE, 2021.

[30] Qin, Wei, and Junying Meng. "The research on dance motion quality evaluation based on spatiotemporal convolutional neural networks." Alexandria Engineering Journal 114 (2025): 46-54.

[31] Xia, Feng, Xin Lu, and Xingwei Si. "Taijiquan Auxiliary Training and Scoring Based on Motion Capture Technology and DTW Algorithm." International Journal of Ambient Computing and Intelligence 14.1 (2023): 1-15.

[32] Shen, Zijun, et al. "Application and Exploration of Computer Technology in Digital Protection of Bashu Intangible Cultural Heritage." 2024 5th International Conference on Information Science, Parallel and Distributed Systems (ISPDS). IEEE, 2024.

[33] He, Kaiming, et al. "Mask r-cnn." Proceedings of the IEEE international conference on computer vision. 2017.

[34] Zhang, Lei, et al. "EdgePose: Real-Time Human Pose Estimation Scheme for Industrial Scenes." IEEE Access (2024).

[35] Hewage, Pradeep, et al. "Temporal convolutional neural (TCN) network for an effective weather forecasting using time-series data from the local weather station." Soft Computing 24 (2020): 16453-16482.

[36] Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems 30 (2017).

[37] Ismail-Fawaz, Ali, et al. "Establishing a unified evaluation framework for human motion generation: a comparative analysis of metrics." Computer Vision and Image Understanding 254 (2025): 104337.

[38] Zhang, Xiao, et al. "Metaverse for cultural heritages." Electronics 11.22 (2022): 3730.

Downloads

Published

25-04-2025

Issue

Section

Articles

How to Cite

Lu, Z., & Guo, M. (2025). Interdisciplinary Perspectives on Intangible Cultural Heritage Digitization: A Review of Vision-Based Human Pose Estimation Research. Mathematical Modeling and Algorithm Application, 4(3), 16-21. https://doi.org/10.54097/sqjqd585