Comparison of Machine Learning Strategies in Hazardous Asteroids Prediction

Authors

  • Yao Wang

DOI:

https://doi.org/10.54097/hset.v39i.6527

Keywords:

Machine Learning; Hazardous Asteroids; Voting Classifier; Asteroids Prediction.

Abstract

The purpose of this study is to use multiple classification algorithm from machine learning to predict hazardous asteroids that orbit Earth. Seven models are Logistic Regression, K-Nearest Neighbor Classifier, Random Forest Classifier, Decision Tree Classifier, Multinomial Naïve Bayes Classifier, Gradient Boosting Classifier, and Voting Classifier. Confusion matrix is used to evaluate those models. Evaluation metrics include accuracy, precision, recall, and f1-score. The result shows that random forest classifier has the greatest overall performance with highest accuracy. Decision Tree classifier, Gradient Boosting classifier, and Voting classifier also perform well. Gradient Boosting classifier is capable of greatly reducing the risk of hazardous asteroid, which is, reduce the number of hazardous asteroids that is predicted as non-hazardous. Because of assumptions of some models like Logistic Regression, data used in the experiment do not follow them, so the overall performance of those models are not well. It would be better to select data for fitting the model. The result shows that combined classifiers perform better. Voting Classifier can be used to assemble those accurate models and get a more accurate result by offsetting disadvantages of each model.

Downloads

Download data is not yet available.

References

Atkinson H, Tickell C, Williams D. Report of the task force on potentially hazardous near Earth objects. 2000.

Hartmann, W. K., The Impact That Wiped Out the Dinosaurs, 09/21/2022, URL: https:// www. psi. edu/ epo/ ktimpact/ktimpact.html.

Petrov N, Sokolov L, Polyakhova E, et al. Predictions of asteroid hazard to the Earth for the 21st century, In: AIP Conference Proceedings. 2018, 1959(1): 040012.

Ranaweera R N, Fernando T. Prediction of Potentially Hazardous Asteroids using Deep Learning, In: 2022 2nd International Conference on Advanced Research in Computing (ICARC). IEEE, 2022: 31-36.

Vani, Sameep, NASA-Nearest Earth Objects, 06/17/2022, URL: https://www. kaggle.com/ datasets/ sameepvani/nasa-nearest-earth-objects.

Park H A. An introduction to logistic regression: from basic concepts to interpretation with particular attention to nursing domain. Journal of Korean Academy of Nursing, 2013, 43(2): 154-164.

Safavian S R, Landgrebe D. A survey of decision tree classifier methodology. IEEE transactions on systems, man, and cybernetics, 1991, 21(3): 660-674.

Xu S, Li Y, Wang Z. Bayesian multinomial Naïve Bayes classifier to text classification, Advanced multimedia and ubiquitous engineering. Springer, Singapore, 2017: 347-352.

Nelson D. Gradient boosting classifiers in python with scikit-learn. Retrieved from Stack Abuse, 2019.

Tandel G S, Tiwari A, Kakde O G. Performance optimisation of deep learning models using majority voting algorithm for brain tumour classification. Computers in Biology and Medicine, 2021, 135: 104564.

Haghighi S, Jasemi M, Hessabi S, et al. PyCM: Multiclass confusion matrix library in Python. Journal of Open-Source Software, 2018, 3(25): 729.

Downloads

Published

01-04-2023

How to Cite

Wang, Y. (2023). Comparison of Machine Learning Strategies in Hazardous Asteroids Prediction. Highlights in Science, Engineering and Technology, 39, 201-208. https://doi.org/10.54097/hset.v39i.6527