Research on Application of Financial Large Language Models

Authors

  • Mingting Du

DOI:

https://doi.org/10.54097/1z673097

Keywords:

Large Language Models, BloombergGPT, PIXIU, FinBERT, natural language processing.

Abstract

With the growing use of large language models such as ChatGPT, their capabilities can readily be applied to natural language processing research in the financial field, including but not limited to text extraction and sentiment analysis. This paper analyzes the design and applications of three financial large language models, BloombergGPT, PIXIU, and FinBERT, and concludes that applying large language models in the financial field is feasible, multi-faceted, and well suited to the domain, although shortcomings remain in ethics, data processing, and other areas. The application of large language models in finance remains promising. Through this study and the comparative examination of these models, we hope to provide useful modeling experience for practitioners in finance and computer science. We also hope that researchers can draw on the approaches of the teams behind these models to address shortcomings in their own models and improve their own financial large language models.

Published

24-12-2024

How to Cite

Du, M. (2024). Research on Application of Financial Large Language Models. Highlights in Business, Economics and Management, 45, 628-634. https://doi.org/10.54097/1z673097