Advances in rice yield estimation using Neural Networks

Introduction

Basically, secondary data is used in the estimation of future trends of any commodity. History and the present situation showcases that, timely and reliable yield estimates of major crops more so rice, need not be overstated for the most populous country like India where the economy is principally based on agricultural production. Yield estimation of crops viz., rice, wheat, and corn is significant in economic programming in the global scene as it aids effective policy decision

Weather is a major factor affecting crop yield in the agriculture domain. There are many weather parameters contributing to the growth and development of rice crop. India, the largest rice-producing country, plants rice in an area of about 43 million hectares and produced about 125 million tons of rice during 2018 (http://www.fao.org/). Ranga Reddy being the major rice growing district of Telangana was selected for the present study. This study was undertaken with a view to developing an appropriate neural network model for estimating of rice yield. Stastny et al., 2011 described that the neural network model is composed of artificial neurons that are interconnected; and depending on the network topology, they exchange the actuation signals in the form of an activation transition function. Neural network models are simple mathematical models defining a function f: X→Y. Every type of model created by the artificial neural network corresponds to a class of such functions (Michie et al., 1994); (Zheng & Ishizuka, 1995); (Lim et al., 2000). Nwankpa et al., 2018 described machine learning is a field of study that uses statistics and computer science principles, to create statistical models, used to perform major tasks like estimations and inference. Gomes et al., 2011)said ANNs have been widely used in studies of complex time series forecasting, such as weather, energy consumption and financial series..

Shastry et al ., 2016 said artificial neural networks (ANN) and multiple linear regressions (MLR) are widely used on crop yield estimation. They designed and developed Customized-ANN (C-ANN) by changing the number of hidden layers, the number of neurons in the hidden layer and the learning rate. Ruß, 2009 used neural networks, multi-layer-perception, regression tree, support vector regression to estimate wheat yield from fertilizer and additional sensor input. They found that support vector regression can serve as a better reference model for yield estimation. Ghodsi et al., 2012 demonstrated the effects of climate factors on wheat yield using ANN model. They found that the ANN model is a suitable way of estimating wheat yield. Paswan & Begum, 2013 studied a complete review of the literature comparing feed-forward neural networks and regression analysis with respect to estimation of crop yield. The majority of the research works have used linear regression models for the estimation of crop yield. However the yield of a crop has a non-linear relationship with independent weather variables. Thus ANN is better suited for estimating crop yield. The specific objective of the present study was to explore the possibility of suggesting suitable back propagation neural network for estimating of rice yield in Ranga Reddy district of Telangana.

MATERIALS AND METHODS

The Matlab R2018a software was used to explore the possibility of estimating the yield of rice due to the combined effects of weekly weather parameters. Rice yield data for Ranga Reddy District of Telangana for the years 1988-89 to 2018-19 were extracted from the Directorate of Economics and Statistics, Government of Telangana, 2018-19 (https://ecostat.telangana.gov.in). The meteorological data set for the same periods of Ranga Reddy station were collected from the Agro Climate Research Centre, ARI, PJTSAU, Hyderabad for the present study. Weekly averaged data of weather variables viz., Bright Sunshine Hours, Maximum Temperature, Minimum Temperature, Morning Relative Humidity, Evening Relative Humidity, and weekly total Rainfall were collected for the period of the growing season of rice in Ranga Reddy district for the years under consideration. The details of weekly weather variables included in the study up to 18 weeks of crop period are given in Table 1.

In assessing joint influence of week-wise weather variables, the Back Propagation Neural Networks approach was considered. Here 108 factors were considered as the input variables and rice crop yield was taken as the target variable. As few input variables may be superfluous, it affects the estimation of yield. So, from 108 input variables; only 11 input variables (Table 3) have positive and strong correlation with the target variable were selected using Pearson’s correlation coefficients. It is the test statistics that measure the degree of association between input and target variables. It gives information about the magnitude of the association and the direction of the relationship. Normalization is a scaling procedure, where, we can find new range between 0 and 1 from an existing range of values of different variables and is used to reduce the large variation of estimation. Min-Max Normalization (Eq.1) technique was used to normalize the experimental dataset to minimize the Average Estimating Error Rate (Eq.2).

Min-Mix normalization: It is one of the most familiar ways to normalize data. It transforms the data from measured units to a new interval from New_MinX to New_MaxX for feature X.

(1)

Where, V’ is Min-Max Normalized data one

V is the respective value of the attribute

MinX is the respective Minimum value of the attribute

MaxX is the respective Maximum value of the attribute

Average Estimating Error Rate (AERR %): The per cent deviations of estimated yields and actual yields were worked out to evaluate the suitability of fitted neural networks. (Meena & Singh, 2013 , Stastny et al., 2011).

(2)

Where, n is number of instances

Back Propagation Neural Network: Back Propagation is a learning algorithm used by neural network with supervised learning. Back Propagation works by resembling the nonlinear relationship between the input and the output (target) by correcting the weight values within. It can further be generalized for the input that is not included in the training patterns (estimation capacities). The Fig. 1 demonstrates the simple architecture of nodes in a neural network and Fig.2 demonstrates the architecture of neural network for the estimation of rice crop yield. There are n number of inputs coming from nodes (1,2, 3,…n) with related inputs and weights values as In₁, In₂,…In_n and W₁,W₂…W_n respectively. The rice yield estimation was shown by the output. Two activation functions (Logistic Sigmoid and Linear) were applied to the input values flow in the network.

Number of Neurons (Hidden Layer): A Neural Network has one hidden layer. It was examined with 10 and 12 numbers of hidden neurons to achieve the best output value (Table 4).

Activation Functions: Activation functions are mathematical equations that resolve the output of a neural network. Logistic Sigmoid and Linear activation functions were employed in the hidden layer and output layer respectively (Table 2).

Weight: It is the learnable parameter within a neural network that transforms input data within the neural network’s hidden layers. It is initialized randomly with ranges [0, 1], which are further updated using the gradient descent rule (Eq.3).

(3)

Where, is learning rate and is first order derivative of the cost function C(n).

(4)

Where, Y is the estimated yield, Y’ is actual yield and n is the number of instances

Learning Rate: It calculates the speed of convergence of the system. Its value ranges as 0, 1. The learning rate was set to 0.0001 and increases as long as the error does not increase in order to avoid trapping in local optima.

Momentum Factor: It is a method that frequently improves both training speed and accuracy. The momentum factor was set to 0.9.

Stopping Condition: A generally fixed number of epochs (Iterations) were considered as the stooping criteria. The stopping condition was set to a minimum of 1000 epochs (Iterations) or error 1×e ^-10, whichever occurs earlier.

RESULTS AND DISCUSSION

Table 4 shows the comparison of the Average Estimating Error Rate for train data set. Out of 6 formations of neural networks in this research work (given notation as “A” to “F”), Neural Network “A” has achieved highest AEER with 6.95 % and “F” has achieved lowest AEER (2.60 %) followed by Neural Network “E” (3.70 %) as compared with other neural networks.

The Fig.3 and Fig. 4 explains the Mean Absolute Error of Neural Network “F” for training and testing data set, respectively. The measure of estimation accuracy is also called as Mean Absolute Error (MAE) and a low MAE suggests the neural network is good at estimation, while a sizable MAE suggests that the neural network may have problems in certain areas. Multiple Correlation Coefficient (R) is a measure of how well a target variable can be estimated using a linear function of a set of input variables. Usually, the R values range between 0.0 and 1.0, a higher R value indicates a better estimation of the target variable from the selected input variables. In case of train data set, MAE (63.32) and R (0.95) values were low and high respectively and thus indicated good job by a neural network. On the other hand, MAE (188.17) and R (0.99) values were moderate and high respectively, for test data set,

Estimated Rice Yield Error Rate of Neural Network “F” for train and test data set respectively as shown in the Fig.5 and Fig. 6 depicts that, the estimated yields were over and underestimated for different years. In case of train data set, the estimated yield was underestimated by 0.3 % ,3.8 %, 0.9 %, 0.8 %, 1.1 %, 3.3 %, 0.3 %, 1.3 %, 8.7 %, 0.2 %, 0.6 %, 1.0 % ,0.3 % ,3.2 % and 10.3 % for the years 1990-91, 1993-94 , 1996-97 , 1997-98 , 1998-99, 2000-01, 2001-02 , 2003-04 , 2005-06 , 2006-07 , 2007-08 , 2008-09 , 2009-10, 20011-12 and 2012-13, respectively But, rice yield were overestimated by 2.4 % , 8.3 %, 4.0 % , 2.3 %, 5.6 %, 2.8 %, and 2.2 % accordingly for the years 1989-90, 1991-92, 1992-93, 1994-95, 1995-96, 1999-2000 and 2002-03, respectively. In case of test data set only underestimation happened and it was by 7.5 %, 8.1 %, 5.9 %, 8.0 %, 3.8 % and 6.1 % for the years 2013-14, 2014-15, 2015-16, 2016-17, 2017-18 and 2018-19, respectively. The estimated yield error rates ranged from –10.3 % to 8.3 % for train data set and while it ranged from – 8.1 % to -3.8 % in case of test data set.

The estimated rice yields based on the train data set were presented in Table 5. The actual rice yields are also given for comparison. The same is demonstrated in Fig.7. It is noticed that except for erring years like 1991-92, 1995-96, 2005-06 and 2012-13, the actual yield and the estimated yield are very close to each other. The estimated rice yields showed deviations from actual yields ranging between –10.3 to 8.3 %.

The simulated estimation of rice yield based on the test data set is shown in Table 6 and Fig.8. The actual rice yields were also given for comparison. It is observed that the actual yield and the simulated estimation of rice yield were close to each other. The simulated estimation of rice yields showed deviations from observed yields ranging between -8.1 to -3.8 %. Crop yield forecasting using neural networks studied by Meena & Singh, 2013 and fuzzy logic for crop yield forecasting studied by Kumar & Kumar, 2012 and Narendra et al., 2010 also corroborated similar results.

The Table 7 shows the comparison of the Average Estimating Error Rate of the proposed Neural Network “F” with other researchers. The proposed neural network “F” has achieved lowest AEER (2.60 %) as compared with other methods.

CONCLUSION

Rice crop yield estimation was carried out by considering different weekly weather variables viz., bright sunshine hours, maximum temperature, minimum temperature, morning relative humidity, evening relative humidity, rainfall, and supplied in back propagation neural network models. The proposed neural network architecture and various computational parameters like number of neurons in hidden layer, weight, learning rate, momentum factor and stopping condition were selected by trial-and-error approach. The proposed neural network model “F” (Input Neurons =11, Hidden Neurons=12, Output Neuron=1, Train Data Size = 80 % and Test data Size=20%, AEER=2.60 %) has obtained better results with low MAE and AEER (%). All the estimated yields of respective years were close to the actual yields as the multiple correlation coefficient (R) values for train and test data were close to 1. The proposed neural network model may be further enhanced by including more factors like economic, physical and technological aspects for better estimation of rice yields.

REFERENCES

Food and Agriculture Organization Of the united states.2018. FAOSTAT Statistical Database. (http://www.fao.org/).
Agricultural Statistics at a glance. 2018-2019. Government of Telangana, Directorate of Economics and statistics. Directorate of Economics and Statistics, Government of Telangana (https://ecostat.telangana.gov.in).
Ghodsi, R., mirabdollah yani, R., Jalali, R., & Ruzbahman, M. (2012). Predicting Wheat Production in Iran Using an Artificial Neural Networks Approach. Int.J.Acad.Res.Bus.Soc.Sci., 02, 34- 47.
Gomes, G., Ludermir, T., & Lima, L. (2011). Comparison of new activation functions in neural network for forecasting financial time series. Neural Computing and Appli, 20, 417–439. https://doi.org/10.1007/s00521-010-0407-3
Kumar, S., & Kumar, N. (2012). Fuzzy Time Series based Method for Wheat production Forecasting. Int. J. Comput.Appl., 44, 5–10. https://doi.org/10.5120/6313-8651
Lim, T.-S., Loh, W.-Y., & Shih, Y.-S. (2000). A Comparison of Prediction Accuracy, Complexity, and Training Time of Thirty-Three Old and New Classification Algorithms. Mach. Learn., 40(3), 203–228. https://doi.org/10.1023/A:1007608224229
Meena, M., & Singh, P. (2013). “Crop Yield Forecasting Using Neural Networks”. 8298, 319–331. https://doi.org/10.1007/978-3-319-03756-1_29
Michie, D., Spiegelhalter, D. J., & Taylor, C. C. (1994). Machine Learning, Neural and Statistical Classification. Inst. Public Health, Cambridge.
Narendra, K., Ahuja, S., Shashank, B., & Vipin, K. (2010). Fuzzy time series forecasting of wheat production. Int.J. Comput. Sci. Engg., 2, 635-640.
Nwankpa, C., Ijomah, W., Gachagan, A., & Marshall, S. (2018). “Activation Functions: Comparison of trends in Practice and Research for Deep Learning”.
Paswan, R. P., & Begum, S. A. (2013). Regression and Neural Networks Models for Prediction of Crop Production. Int. J. Sci. Engg.Res., 04(9), 98-108.
Ruß, G. (2009). Data Mining of Agricultural Yield Data: A Comparison of Regression Models. Adv. in Data Mining. 5633, 24–37. https://doi.org/10.1007/978-3-642-03067-3_3
Shastry, K. A., Sanjay, H., & Deshmukh, A. (2016). A Parameter Based Customized Artificial Neural Network Model for Crop Yield Prediction. J. Artf. Int., 9, 23–32. https://doi.org/10.3923/jai.2016.23.32
Stastny, J., Konecny, V., & Trenz, O. (2011). Agricultural data prediction by means of neural network. Agric.econ., 57. https://doi.org/10.17221/108/2011-AGRICECON
Zheng, T., & Ishizuka, O. (1995). Design and implementations of a learning T-model neural network. Electricals and Electrical Engg., 4, 259–263.

Post Views: 1,195

Agriculture Association of Textile Chemical and Critical Reviews Journal (AATCC Review)

Blog

Abstract

About Journal

Tags

Article Processing Charges (APC)