The Application of Data Analysis in A Web-Based Word Game: Worldle

Authors

  • Zien Zhu

DOI:

https://doi.org/10.54097/hset.v70i.12175

Keywords:

EEMD-LSTM Model, Moving Average, GA-BP Neural Network, RSR, Decision Tree.

Abstract

We will analyze the game Worldle based on its data in 2022. First, we use the EEMD Model to analyze the trend of the number of players over time in this year.We need to forecast the number of players in the next 60 days and reduce the prediction error, we use the Moving Average and build Rolling Window to reduce the forecast error. We use both LSTM model and EEMD-LSTM for training and comparing the effects of the two and find EEMD-based LSTM model is better. Then, we come up with several properties of the solution words that we think would affect the difficulty of the puzzle, and after Correlation Analysis, we find that this is indeed the case and give some explanations. Second, we mainly use Deep Learning. To predict the associated percentages of (1, 2, 3, 4, 5, 6, X) for a given future solution word on a future date, we use a Genetic Algorithm and Back Propagation Neural Network (GA-BP Neural Network) to analyze the percentage distribution of given words. After that, the distribution of the number of answers for the example can be obtained by prediction as (1.1831,6.2839,21.3307,28.6500,24.7627,14.7354,3.8857). Third, how difficult is it to guess a word correctly? For this problem, we use a method belonging to Machine Learning - PSO Decision Trees. Before this, our group first used RSR and the Pearson Correlation Coefficient to divide the difficulty level of words.

Downloads

Download data is not yet available.

References

Lei Y, He Z, Zi Y. Application of the EEMD method to rotor fault diagnosis of rotating machinery [J]. Mechanical Systems and Signal Processing, 2009, 23(4): 1327-1338.

Myles A J, Feudale R N, Liu Y, et al. An introduction to decision tree modeling [J]. Journal of Chemometrics: A Journal of the Chemometrics Society, 2004, 18(6): 275-285.

Yu Y, Si X, Hu C, et al. A review of recurrent neural networks: LSTM cells and network architectures [J]. Neural computation, 2019, 31(7): 1235-1270.

Cohen I, Huang Y, Chen J, et al. Pearson correlation coefficient [J]. Noise reduction in speech processing, 2009: 1-4.

Cao W,Fan R. Performance evaluation analysis of city business firms based on rank-sum ratio comprehensive evaluation method [J]. Academic Journal of Business & Management,2023,5(11).

Hui C,Yulin W,Chuanwang S, et al. Prediction of Surface Subsidence Based on PSO-BP Neural Network [J]. Journal of Physics: Conference Series,2022,2400(1).

Downloads

Published

15-11-2023

How to Cite

Zhu, Z. (2023). The Application of Data Analysis in A Web-Based Word Game: Worldle. Highlights in Science, Engineering and Technology, 70, 151-156. https://doi.org/10.54097/hset.v70i.12175