Patent Value Score Prediction Based on BERT-XGBoost-Stacking with Late Fusion

Authors

  • Wenjuan Li

DOI:

https://doi.org/10.54097/fryfts61

Keywords:

Patent Value Prediction; BERT; Xgboost; Stacking Ensemble; Late Fusion.

Abstract

Predicting the value scores of high-value patents is essential for evaluating technological innovation, managing intellectual property, and promoting industrial development. However, existing methods still face challenges in effectively fusing multimodal data and achieving high prediction accuracy. To address this issue, we propose a late fusion model based on BERT-XGBoost-Stacking to improve the accuracy and robustness of patent value assessment. This approach leverages BERT to extract deep semantic features from patent texts, employs XGBoost to model structured numerical features, and optimizes the fusion strategy through the Stacking framework. Experimental results on patents in China's new energy vehicle sector demonstrate that the proposed method outperforms single-modal models in terms of Mean Squared Error (MSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE), indicating higher prediction accuracy and stability. This study not only enriches quantitative methods for patent valuation but also provides new insights for the application of artificial intelligence in patent analysis, thereby supporting technological innovation and high-quality industrial development.

References

[1] Liu Yan. Review and Future Outlook of Patent Value Assessment Research [J]. Library and Information Service, 2022, 66(15): 127-139.

[2] Yu Xinmiao, Li Wenhong. Review and Outlook of Patent Quality Perception and Evaluation Systems [J]. Studies in Science of Science, 2024, 42(10): 2100-2109.

[3] Li Liming. Literature Review and Future Outlook on Patent Value Research [J]. Journal of Information Science, 2023, 42(02): 166-174.

[4] Hu Zewen, Zhou Xiji. Prediction of High-Value Patents and Analysis of Influencing Factors Based on BP Neural Network and MIV Algorithm [J]. Journal of Information Resource Management, 2023,13(6):144-155.

[5] Sun Xiaoming, Xiong Wang, Yuwen Lewei, et al. Value Assessment of Dormant Patents in Universities and Research Institutions [J/OL]. Science and Technology Progress and Countermeasures, 1-10.

[6] Sun Yaoran, An Yaolu, Li Yaogang. Patent Value Prediction Using Multi-Feature Fusion: A Case Study of 5G Technology [J]. Modern Intelligence, 2022, 42(11): 87-96.

[7] Zhang Yaobiao, Wu Yaohong, Gao Daobin, et al. Research on Identifying Transferable Patents in Universities Based on Feature Fusion [J]. Journal of Intelligence, 2022, 41(9): 159-165.

[8] Fu Yao-jiao, Liang Li-zhi. Early Prediction of Patent Value Integrating Claim Semantic Features [J]. Journal of Information Science, 2024, 43(2): 183-191.

[9] BREIMAN L. Stacked regressions [J]. Machine Learning, 1996, 24(1): 49-64.

[10] Chen Xi, Cheng Ge, Yin Zhibin. A Patent Value Evaluation Method Integrating Heterogeneous Graph Global Structure Information and Time Series [J/OL]. Journal of Intelligence, 2025-03-07.

[11] Feng Guohua, Li Lin, Liu Renhua, Deng Weiwei. Identifying High-Value Patents in Nanomedicine: Integration of Cross-Perspective and Multi-Method Approaches [J/OL]. Journal of Intelligence, 2025-05-15.

[12] VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need [C]// Advances in Neural Information Processing Systems. 2017: 5998-6008.

[13] Li Nian. Research on Transformer Model Improvements for Patent Value Assessment [D]. Nanjing: Nanjing Tech University, 2024.

[14] Friedman J.H. Greedy Function Approximation: A Gradient Boosting Machine [J]. Annals of Statistics, 2001, 29(5): 1189-1232.

Downloads

Published

23-03-2026

Issue

Section

Articles

How to Cite

Li, W. (2026). Patent Value Score Prediction Based on BERT-XGBoost-Stacking with Late Fusion. Mathematical Modeling and Algorithm Application, 8(3), 77-81. https://doi.org/10.54097/fryfts61