Using Abalone’s Physical Features to Predict its Age
DOI:
https://doi.org/10.54097/hset.v47i.8171Keywords:
Abalone, Linear Regression, Polynomial Regression.Abstract
Abalone is one of the most delicious and highly-prized seafood around the world, its deliciousness also makes the abalone industry a non-negligible part of the global economic circle, a lot of people and countries rely on abalone for their lives and economy. Therefore, it means a lot for us to study about abalone and it’s population. However, as a necessary step when studying about abalone, getting the age of abalone is a very complicated and time-consuming task. That’s why we need a model to help us predict the age of abalone according to it’s physical measurements which are easy to acquire. The project considered three models, linear regression model, polynomial regression model and Random Forest. 10-fold cross validation is used to compute the mean square error, multiple R square is also considered when evaluating the models. In the results, the polynomial model is the best model among three models, with lowest mean square error and largest R-square. The research provides us with a model to get the age of abalone in an easy and convenient way, which makes the study about abalone more convenient and thus be beneficial for the development of abalone industry.
Downloads
References
Warwick J. Nash, Tracy L. Sellers, Simon R. Talbot, Andrew J. Cawthorn and Wes B. Ford (1994) The Population Biology of Abalone (Haliotis species) in Tasmania.I. Blacklip Abalone (H. rubra) from the North Coast and the Islands of Bass Strait. Sea Fisheries Division. Marine Research Laboratories - Taroona, Department of Primary Industry and Fisheries, Tasmania
Cook, P. (2014) The Worldwide Abalone Industry. Modern Economy, 5, 1181-1186. doi: 10.4236/me.2014.513110.
Caihuan Ke. (2013). Current situation and prospect of abalone aquaculture industry in China. Chinese Fisheries (1), 4.
Taohua Liu, & Muzi Hou. (2016). Discriminant analysis and cluster analysis in the abalone age classification. Journal of Shaoyang University: Natural Science Edition, 13 (1), 5.
Hang XING. (2007). Establishment and reliability test of multiple linear regression model. Journal of Vocational University (4), 3.
Linghui Fu, & Wang Huiwen. (2004). A comparative study of modeling methods for polynomial regression. Journal of Mathematical Statistics and Management, 23(1), 5.
Xinhai LI. (2013). Application of random forest model in classification and regression analysis. Chinese Journal of Applied Entomology, 50(4), 8.
Miles, J. (2005). R-Squared, Adjusted R-Squared. In Encyclopedia of Statistics in Behavioral Science (eds B.S. Everitt and D.C. Howell). https://doi.org/10.1002/0470013192.bsa526
David M. Allen (1971) Mean Square Error of Prediction as a Criterion for Selecting Variables, Technometrics, 13:3, 469-475, DOI: 10.1080/00401706.1971.10488811
Berrar, D. (2019). Cross-Validation.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







