Jump to content

Data mining in agriculture

From Wikipedia, the free encyclopedia

In agriculture, data mining is the use of data science techniques to analyze large volumes of agricultural data. Recent advancements in technology, such as sensors, drones, and satellite imagery, have enabled the collection of large amounts of data on soil health, weather patterns, crop growth, and pest activity. Data is analyzed to improve agricultural efficiency, identify patterns and trends, identify issues early on, and minimize potential losses. [1]

Applications

[edit]

Detection of fruit and vegetable defects

[edit]

Data mining can be used in agriculture to enhance algorithms used to detect defects in harvested fruit and vegetables. The development of enhanced visual data collection methods can be used, for example, to classify fruit and vegetables according to various surface defects.[2] Data such as these can also be used to investigate potential causes of defects to fruit and vegetables. For example, chemical spraying can cause various defects in different types of fruit. Currently, much of this knowledge is based on anecdotal evidence rather than qualitative and quantitative data collection methods, which is why efforts are being made to apply data mining practices to horticulture research.[3]

Before being sent to the market, apples are checked and the ones showing some defects are removed. However, there are also invisible defects that can spoil the apple's flavor and look. An example of an invisible defect is a water core, an internal apple disorder that can affect the longevity of the fruit. Apples with slight or mild water core are sweeter, but apples with moderate to severe degree of water core cannot be stored for as long as apples. Moreover, a few fruits with severe water cores could spoil an entire batch of apples. Because of this, a computational system is under study, which takes X-ray photographs of the fruit while the apples run on conveyor belts. The system is also able to analyze (by data mining techniques) the pictures taken and estimate the probability of the fruit containing water cores.[4]

Wine fermentation diagnosis

[edit]

The metabolic transformations of the fermentation of wine impacts the quality of wine and the productivity of wine-related industries. Data science techniques, such as the k-means algorithm,[5] and classification techniques based on the concept of blustering[6] have been used to study these metabolic processes, successfully predicting fermentation outcomes after as much as three days of fermentation. These methods classify wine according to the metabolite profile of the fermentations and is different from traditional wine classifications systems. See the wiki page Classification of Wine for more details. Based on experimental data, scientists propose this is a valuable tool to diagnose unwanted fermentation outcomes and thus plan for intervention at the early stages of the fermentation.[7]

Predicting metabolizable energy of poultry by-product meal

[edit]

A Group Method of Data Handling (GMDH)-type network combined with an evolutionary method of genetic algorithm was used to predict the metabolizable energy of feather meal and poultry offal meal based on its respective protein, fat, and ash content. Data samples from published literature were collected and used to train a GMDH-type network model. The novel approach of combining GMDH-type network with an evolutionary method of genetic algorithm can be used to predict the metabolizable energy of poultry feed samples based on their chemical content.[8] It is also reported that the GMDH-type network can accurately estimate the poultry performance from their dietary nutrients such as metabolizable energy, protein, and amino acids.[9]

Detection of diseases from animal sounds

[edit]

The early detection of animal diseases can benefit farm productivity by allowing the farmer to treat and isolate the affected animal as soon as the symptoms appear thus reducing the spread of disease to other animals. Sounds emitted by pigs, such as coughing, can be analyzed for disease detection. A computational system is currently being developed to monitor and differentiate pig sounds through microphones installed in the farm.[10]

Growth of sheep from genes polymorphism using artificial intelligence

[edit]

Polymerase Chain Reaction-Single Strand Conformation Polymorphism (PCR-SSCP) method was used to determine growth hormone (GH), leptin, calpain, and calpastatin polymorphism in Iranian Balochi male sheep. An artificial neural network (ANN) model was developed to predict average daily gain (ADG) in lambs using input parameters of GH, leptin, calpain, and calpastatin polymorphism, birth weight, and birth type. The results revealed that the ANN-model is an appropriate tool for identifying the patterns of data to predict lamb growth in terms of ADG given specific genes polymorphism, birth weight, and birth type. The platform of PCR-SSCP approach and ANN-based model analyses may be used in molecular marker-assisted selection and breeding programs to design a scheme for improving the efficacy of sheep production.[11]

Optimizing pesticide use

[edit]

Recent studies by agricultural researchers in Pakistan showed that pro-pesticide state policies have enabled dangerously high pesticide use in cotton crops. These studies have reported a negative correlation between pesticide use and crop yield in Pakistan. As a result, the excessive use (or abuse) of pesticides is causing the farmers adverse financial, environmental and social impacts. By data mining within the cotton industry, pest scouting data along with the meteorological recordings show how pesticide use can be optimized (reduced). Clustering of data revealed interesting patterns in farming practices along with pesticide use dynamics.[12]

Analyzing chicken performance data by neural network models

[edit]

A platform of artificial neural network (ANN)-based models combined with sensitivity analysis and optimization algorithms was successfully used to integrate published data on the responses of broiler chickens to threonine. Analyses of the ANN models for weight gain and feed efficiency suggested that the dietary protein concentration was more important than the threonine concentration. The results revealed that a diet containing 18.69% protein, and 0.73% threonine may lead to producing optimal weight gain, while the optimal feed efficiency may be achieved with a diet containing 18.71% protein and 0.75% threonine.[13]

Literature

[edit]

There are precision agriculture journals, such as Springer's Precision Agriculture or Elsevier's Computers and Electronics in Agriculture. However, these journals are not exclusively devoted to data mining in agriculture.

References

[edit]
  1. ^ Ait Issad, Hassina (October 2019). "A comprehensive review of Data Mining techniques in smart agriculture". Engineering in Agriculture, Environment and Food. 12 (4): 511–525. doi:10.1016/j.eaef.2019.11.003.
  2. ^ Firouz, Mahmoud Soltani (2022). "Defect Detection in Fruit and Vegetables by Using Machine Vision Systems and Image Processing". Springer Nature Link. 14 (3): 353–379. doi:10.1007/s12393-022-09307-1.
  3. ^ Hill, M. G.; Connolly, P. G.; Reutemann, P.; Fletcher, D. (2014-10-01). "The use of data mining to assist crop protection decisions on kiwifruit in New Zealand". Computers and Electronics in Agriculture. 108: 250–257. doi:10.1016/j.compag.2014.08.011.
  4. ^ Schatzki, T.F.; Haff, R.P.; Young, R.; Can, I.; Le, L-C.; Toyofuku, N. (1997). "Defect Detection in Apples by Means of X-ray Imaging". Transactions of the American Society of Agricultural Engineers. 40 (5): 1407–1415. doi:10.13031/2013.21367.
  5. ^ Urtubia, A.; Perez-Correa, J.R.; Meurens, M.; Agosin, E. (2004). "Monitoring Large Scale Wine Fermentations with Infrared Spectroscopy". Talanta. 64 (3): 778–784. doi:10.1016/j.talanta.2004.04.005. PMID 18969672.
  6. ^ Mucherino, A.; Urtubia, A. (2010). "Consistent Biclustering and Applications to Agriculture". IbaI Conference Proceedings, Proceedings of the Industrial Conference on Data Mining (ICDM10), Workshop Data Mining in Agriculture (DMA10), Springer: 105–113.
  7. ^ Urtubia, Alejandra; Pérez-Correa, J. Ricardo; Soto, Alvaro; Pszczólkowski, Philippo (2007-12-01). "Using data mining techniques to predict industrial wine problem fermentations". Food Control. 18 (12): 1512–1517. doi:10.1016/j.foodcont.2006.09.010. ISSN 0956-7135.
  8. ^ Ahmadi, H.; Golian, A.; Mottaghitalab, M.; Nariman-Zadeh, N. (2008-09-01). "Prediction Model for True Metabolizable Energy of Feather Meal and Poultry Offal Meal Using Group Method of Data Handling-Type Neural Network". Poultry Science. 87 (9): 1909–1912. doi:10.3382/ps.2007-00507. ISSN 0032-5791. PMID 18753461.
  9. ^ Ahmadi, Dr H.; Mottaghitalab, M.; Nariman-Zadeh, N.; Golian, A. (2008-05-01). "Predicting performance of broiler chickens from dietary nutrients using group method of data handling-type neural networks". British Poultry Science. 49 (3): 315–320. doi:10.1080/00071660802136908. ISSN 0007-1668. PMID 18568756. S2CID 205399055.
  10. ^ Chedad, A.; Moshou, D.; Aerts, J.M.; Van Hirtum, A.; Ramon, H.; Berckmans, D. (2001). "Recognition System for Pig Cough based on Probabilistic Neural Networks". Journal of Agricultural Engineering Research. 79 (4): 449–457. doi:10.1006/jaer.2001.0719.
  11. ^ Mojtaba, Tahmoorespur; Hamed, Ahmadi (2012-01-01). "neural network model to describe weight gain of sheep from genes polymorphism, birth weight and birth type". Livestock Science. ISSN 1871-1413.
  12. ^ Abdullah, Ahsan; Brobst, Stephen; Pervaiz, Ijaz; Umar, Muhammad; Nisar, Azhar (2004). Learning Dynamics of Pesticide Abuse through Data Mining (PDF). Australasian Workshop on Data Mining and Web Intelligence, Dunedin, New Zealand. Archived from the original (PDF) on 2011-08-14. Retrieved 2010-07-20.
  13. ^ Ahmadi, H.; Golian, A. (2010-11-01). "The integration of broiler chicken threonine responses data into neural network models". Poultry Science. 89 (11): 2535–2541. doi:10.3382/ps.2010-00884. ISSN 0032-5791. PMID 20952719.