Data Analysis of the Wordle Game: Insights and Predictive Models Based on Twitter Data
DOI:
https://doi.org/10.54097/hset.v68i.11969
Keywords:
XGBoost Model, ARIMA, Prediction Model, Big Data.
Abstract
In 2022, the game "Wordle" became popular worldwide: players have six attempts to guess a five-letter word and receive feedback after each guess. This paper analyzes "Wordle" using data mined from Twitter between January 7 and December 31, 2022, with the aim of characterizing the game's dynamics and player engagement. A time-series model was developed to track fluctuations in the number of players. The model shows an initial upward trend that peaks on February 3, followed by a gradual decline and eventual stabilization, reflecting the game's sustained appeal. An XGBoost model was also used to predict the distribution of player attempts; it was most accurate for attempts ranging from three to six. The research illustrates how data science can help decode game dynamics and player behavior, and it points to the combination of gaming, data analytics, and social media as a direction for future research. The findings offer insight into the gaming community's preferences and the mechanisms that drive user engagement on digital gaming platforms.
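As a rough illustration of the kind of time-series forecasting the abstract describes, the sketch below fits an ARIMA(1,1,0)-style model by hand: it takes first differences of a daily count series, estimates a single autoregressive coefficient by least squares (no intercept), and rolls the forecast forward. The keywords name ARIMA, but the abstract does not state the model order or fitting procedure, so the order, the function names `fit_ar1_on_differences` and `forecast`, and the toy counts are all assumptions made for illustration and are not the paper's model or data.

```python
from typing import List


def fit_ar1_on_differences(series: List[float]) -> float:
    """Estimate the AR(1) coefficient on first differences (least squares, no intercept)."""
    # First differencing corresponds to the d = 1 part of ARIMA(1, 1, 0).
    diffs = [b - a for a, b in zip(series, series[1:])]
    if len(diffs) < 2:
        raise ValueError("need at least three observations")
    # Regress each difference on the previous one: phi = sum(x_{t-1} x_t) / sum(x_{t-1}^2).
    num = sum(prev * cur for prev, cur in zip(diffs, diffs[1:]))
    den = sum(prev * prev for prev in diffs[:-1])
    return num / den if den else 0.0


def forecast(series: List[float], steps: int) -> List[float]:
    """Forecast `steps` future levels by propagating the last difference through the AR(1) term."""
    phi = fit_ar1_on_differences(series)
    last_level = series[-1]
    last_diff = series[-1] - series[-2]
    predictions = []
    for _ in range(steps):
        last_diff = phi * last_diff   # predicted next difference
        last_level += last_diff       # undo the differencing to get back to levels
        predictions.append(last_level)
    return predictions


# Hypothetical daily player counts (made up for illustration, not the paper's data).
toy_counts = [100.0, 140.0, 160.0, 170.0, 175.0]
print(forecast(toy_counts, 3))
```

On a series whose growth slows each day, the estimated coefficient lies between 0 and 1, so the forecast levels off rather than continuing to climb. In practice a library implementation with diagnostics and order selection would be preferable to this hand-rolled version.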

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







