Statistical Approaches to Machine Learning in a Big Data Environment
DOI:
https://doi.org/10.54097/ff7j0t18Keywords:
machine learning; statistical methods; big data environment; data analysis.Abstract
This thesis explores statistical approaches to machine learning in a big data environment. Firstly, the connections and differences between machine learning and statistics in a big data environment are introduced, as well as the statistical foundations in machine learning models. Secondly, the application of statistical methods in big data analysis is discussed, including the combination of traditional data analysis and machine learning. Then, the challenges and limitations of statistical methods in the big data environment, such as high dimensionality and huge amount of data, are discussed. Then, common statistical methods in the big data environment, including linear regression, decision trees, and support vector machines, are described in detail. Finally, the research findings are summarised and future directions and research trends are outlined. Through the research in this paper, a deeper understanding of statistical methods for machine learning in big data environment is provided, which provides an important reference for big data analysis and application.
Downloads
References
Mayhew M , Atighetchi M , Adler A ,et al.Use of machine learning in big data analytics for insider threat detection[C]//MILCOM 2015.IEEE, 2015.
Alam M , Amjad M .A precipitation forecasting model using machine learning on big data in clouds environment[J].Mausam: Journal of the Meteorological Department of India, 2021(4):72.
Hullman J , Kapoor S , Nanayakkara P ,et al.The worst of both worlds: A comparative analysis of errors in learning from data in psychology and machine learning[J].arXiv e-prints, 2022.
Iniesta R , Stahl D , Mcguffin P .Machine learning, statistical learning and the future of biological research in psychiatry[J].Psychological Medicine, 2016, 46(12):2455-2465.
Nakhaeizadeh G , Taylor C C .Machine learning and statistics : the interface[J].Journal of the American Statistical Association, 1997, 93(442).
Li-Pang C .Statistical Inference and Machine Learning for Big Data[J].Biometrics, 2023(4):4.
Franke B , Plante J F , Roscher R ,et al.Statistical Inference, Learning and Models in Big Data[J].International Statistical Review, 2016, 84.
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







