AN ENSEMBLE APPROACH FOR COFFEE CROP YIELD PREDICTION BASED ON AGRONOMIC FACTORS
Keywords: Machine Learning (ML), Ensemble, Coffee, Yield Prediction, Agronomic factors
Abstract: Coffee is the most widely consumed processed beverage after water and is said to be the most traded commodity worldwide after oil. Of the 103 species in the coffee genus, the two most important ones grown in India are Arabica and Robusta, both of which are traded commercially around the world. We therefore take coffee, a major plantation crop in India, as the subject of our research and develop a predictive model that helps coffee planters make accurate, timely decisions ahead of adverse situations. We propose a framework for coffee yield prediction that uses a machine learning ensemble approach to estimate the influence of agronomic factors on coffee yield. The study uses a historical dataset obtained from the Central Coffee Research Institute (CCRI), Karnataka, covering the years 2008-2019. The agronomic factors considered are age, soil nutrients (organic carbon (OC), phosphorus (P), potassium (K)), alkalinity (pH), zone, and the corresponding yield obtained in the Chikkamagaluru region of Karnataka state, India. Several classifiers are used for prediction, namely the Extra Tree Classifier, the Random Forest Classifier, Decision Tree, and boosting algorithms, and the performance of each is compared and analyzed. Our results show that the Extra Tree Classifier and the Random Forest (RF) classifier, each with a precision of 91% and good results on the performance metrics considered, are effective and versatile machine learning methods compared with the other algorithms used.
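The classifier comparison described in the abstract could be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the authors' pipeline: the CCRI data is not reproduced here, so a synthetic dataset stands in for it; the feature names, the binning of yield into three classes, the choice of Gradient Boosting and AdaBoost as the boosting algorithms, and the use of macro-averaged precision are all assumptions made for the example.

```python
# Illustrative sketch only: synthetic data stands in for the CCRI records.
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    AdaBoostClassifier,
    ExtraTreesClassifier,
    GradientBoostingClassifier,
    RandomForestClassifier,
)
from sklearn.metrics import precision_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Hypothetical column names for the six agronomic factors named in the abstract.
FEATURES = ["age", "organic_carbon", "phosphorus", "potassium", "ph", "zone"]


def compare_classifiers(n_samples=600, seed=0):
    """Fit each classifier on synthetic data and return macro precision per model."""
    # Assumption: yield is binned into three classes (e.g. low / medium / high).
    X, y = make_classification(
        n_samples=n_samples,
        n_features=len(FEATURES),
        n_informative=4,
        n_redundant=0,
        n_classes=3,
        n_clusters_per_class=1,
        random_state=seed,
    )
    # Hold out a stratified test set so every yield class is represented.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=seed
    )
    models = {
        "extra_trees": ExtraTreesClassifier(n_estimators=200, random_state=seed),
        "random_forest": RandomForestClassifier(n_estimators=200, random_state=seed),
        "decision_tree": DecisionTreeClassifier(random_state=seed),
        "gradient_boosting": GradientBoostingClassifier(random_state=seed),
        "adaboost": AdaBoostClassifier(random_state=seed),
    }
    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        y_pred = model.predict(X_test)
        # Macro averaging weights each yield class equally.
        scores[name] = precision_score(
            y_test, y_pred, average="macro", zero_division=0
        )
    return scores


if __name__ == "__main__":
    ranked = sorted(compare_classifiers().items(), key=lambda kv: -kv[1])
    for name, score in ranked:
        print(f"{name:>18}: macro precision = {score:.3f}")
```

Scores obtained on the synthetic data say nothing about the reported 91% precision; the sketch only shows how such a side-by-side comparison can be organized. With real records, a categorical factor such as zone would need to be encoded numerically before fitting.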