Kauzalni modeli obrade podataka o kakvoći hrane pomoću umjetne inteligencije

Kurtanjek, Želimir

doi:10.17113/ftb.62.01.24.8301

Food Technology and Biotechnology, Vol. 62 No. 1, 2024.

Izvorni znanstveni članak

https://doi.org/10.17113/ftb.62.01.24.8301

Kauzalni modeli obrade podataka o kakvoći hrane pomoću umjetne inteligencije

Želimir Kurtanjek orcid.org/0000-0001-5453-6255 ; University of Zagreb Faculty of Food Technology and Biotechnology, Pierotijeva 6, 10000 Zagreb, Croatia

Puni tekst: engleski pdf 1.287 Kb

str. 102-109

preuzimanja: 247

citiraj

APA 6th Edition

Kurtanjek, Ž. (2024). Kauzalni modeli obrade podataka o kakvoći hrane pomoću umjetne inteligencije. Food Technology and Biotechnology, 62 (1), 102-109. https://doi.org/10.17113/ftb.62.01.24.8301

MLA 8th Edition

Kurtanjek, Želimir. "Kauzalni modeli obrade podataka o kakvoći hrane pomoću umjetne inteligencije." Food Technology and Biotechnology, vol. 62, br. 1, 2024, str. 102-109. https://doi.org/10.17113/ftb.62.01.24.8301. Citirano 27.12.2024.

Chicago 17th Edition

Kurtanjek, Želimir. "Kauzalni modeli obrade podataka o kakvoći hrane pomoću umjetne inteligencije." Food Technology and Biotechnology 62, br. 1 (2024): 102-109. https://doi.org/10.17113/ftb.62.01.24.8301

Harvard

Kurtanjek, Ž. (2024). 'Kauzalni modeli obrade podataka o kakvoći hrane pomoću umjetne inteligencije', Food Technology and Biotechnology, 62(1), str. 102-109. https://doi.org/10.17113/ftb.62.01.24.8301

Vancouver

Kurtanjek Ž. Kauzalni modeli obrade podataka o kakvoći hrane pomoću umjetne inteligencije. Food Technology and Biotechnology [Internet]. 2024 [pristupljeno 27.12.2024.];62(1):102-109. https://doi.org/10.17113/ftb.62.01.24.8301

IEEE

Ž. Kurtanjek, "Kauzalni modeli obrade podataka o kakvoći hrane pomoću umjetne inteligencije", Food Technology and Biotechnology, vol.62, br. 1, str. 102-109, 2024. [Online]. https://doi.org/10.17113/ftb.62.01.24.8301

Preuzmi JATS datoteku

Sažetak

Pozadina istraživanja. Svrha je ovog istraživanja bila naglasiti važnost korištenja umjetne inteligencije (AI) i modeliranja uzročnosti kakvoće hrane, te analize velike količine podataka. Umjetna inteligencija sa strukturnim uzročnim modeliranjem (SCM), temeljena na Bayesovim mrežama i dubokom učenju, omogućuje integraciju teorijskog znanja iz područja prehrambene tehnologije s podacima o proizvodnom procesu, fizikalno-kemijskim svojstvima te organoleptičkim ocjenama proizvoda. Prehrambeni proizvodi su složene naravi, a podaci vrlo dimenzionalni, sa zamršenim međuodnosima (korelacijama), koje je teško povezati sa osjetilnom percepcijom potrošača o kakvoći hrane. Standardne tehnike regresijskog modeliranja, kao što su višestruki obični najmanji kvadrati (OLS) i parcijalni najmanji kvadrati (PLS), učinkovito se primijenjuju za predviđanje učinaka pomoću linearnih interpolacija opaženih podataka pod stacionarnim uvjetima presjeka. Nadogradnja modela linearne regresije pomoću strojnog učenja (ML) uzima u obzir nelinearne odnose i otkriva funkcionalne obrasce, ali je sklona zbunjujućim i neuspjelim predviđanjima u neopaženim nestacionarnim uvjetima. Interferencija varijabli podataka glavna je prepreka primjeni regresijskih modela u prehrambenim inovacijama pod prethodno neuvježbanim uvjetima. Stoga se ovaj rad usredotočuje na primjenu kauzalnih grafičkih modela s Bayesovim mrežama za zaključivanje uzročno-posljedičnih odnosa i učinaka intervencije između procesnih varijabli i senzorske procjene kakvoće hrane.
Eksperimentalni pristup. Ovo se istraživanje temelji na podacima dostupnim u literaturi, o procesu pečenja kruha od pšeničnog brašna, potrošačkim ocjenama senzorske kakvoće fermentiranih mliječnih proizvoda, te rezultatima stručnog kušanja vina. Podaci o kakvoći pečenja pšenice regulirani su operatorom najmanjeg apsolutnog skupljanja i odabira (LASSO elastična mreža). Bayesova statistika primijenjena je za procjenu zajedničke funkcije vjerojatnosti modela za zaključivanje mrežne strukture i parametara. Dobiveni strukturni kauzalni modeli prikazani su kao usmjereni aciklički grafovi (DAG). Kriteriji D-odvajanja primijenjeni su za blokiranje interferirajućih učinaka pri procjeni izravnih i ukupnih uzročnih učinaka procesnih varijabli i percepcije potrošača o kakvoći hrane. Distribucije vjerojatnosti uzročnih učinaka intervencije pojedinih procesnih varijabli na kakvoću prikazane su kao dijagrami djelomične ovisnosti, određeni Bayesovim neuronskim mrežama. U slučaju uzročnosti kakvoće vina, ukupni uzročni učinci utvrđeni pomoću SCM potvrđeni su algoritmom dvostrukog strojnog učenja (DML).
Rezultati i zaključci. Ispitan je skup podataka od 45 kontinuiranih varijabli koje odgovaraju različitim varijablama kemijskih, fizikalnih i biokemijskih svojstava sedam hrvatskih kultivara pšenice prikupljenim tijekom dvije godine kontroliranog uzgoja. LASSO regulacija skupa podataka dala je deset ključnih prediktora, koji obuhvaćaju 98 % varijance podataka o kakvoći pečenja. Na osnovi ključnih varijabli izveden je prediktivni model slučajne šume sa 75 % točnosti unakrsne provjere. Uzročna analiza između kakvoće i ključnih prediktora temeljila se na Bayesovom modelu prikazanom kao DAG. Udjel proteina imao je najveći izravni uzročni učinak s koeficijentom puta od 0,71; udjel ukupnih podjedinica glutenina velike molekularne mase bio je neizravni uzrok s koeficijentom puta od 0,42; dok je prosječni uzročni učinak (ACE) ukupnog udjela proteina bio 0,65. Veliki skup podataka o kakvoći fermentiranih mliječnih proizvoda uključivao je binarne senzorske podatke (okus, miris, zamućenost), kontinuirane fizikalne varijable (temperatura, masnoća, pH, boja) i tri stupnja potrošačke ocjene proizvoda. Model slučajnih šuma izveden je radi predviđanja klasifikacije kakvoće s „out-of-bag“ (OOB) pogreškom od 0,28 %. Bayesov mrežni model predviđa da na klasifikaciju okusa izravno utječu temperatura, boja i udjel masti, dok na klasifikaciju kakvoće izravno utječu temperatura, zamućenost, miris i udjel masti. Procijenjeni su ključni ACE od −0,04 stupnja kakvoće/°C i 0,3 stupnja kakvoće/udjelu masti. Ovisnost ACE o temperaturi pokazuje nelinearni tip kao negativno zasićenje s točkom „prijeloma” na 60 °C, dok je ACE udjela masti imao pozitivan linearni trend. Uzročna analiza kakvoće crnog i bijelog vina temeljila se na velikom skupu podataka od jedanaest kontinuiranih varijabli fizikalnih i kemijskih svojstava i procjena kakvoće razvrstanih u deset klasa, od 1 do 10. Svaku je klasifikaciju u tri ponavljanja proveo panel profesionalnih kušača vina. Za procjenu ACE ukupne kakvoće primijenjen je algoritam nestrukturalnog dvostrukog strojnog učenja (DML). Udjel alkohola u crnom i bijelom vinu imao je ključni pozitivni ACE relativni faktor od 0,35 kakvoće/udjelu alkohola, dok je hlapljiva kiselost imala ključni negativni ACE od –0,2 kakvoće/kiselosti. Dobivena predviđanja ACE nestrukturiranim algoritmom DML uvelike odgovaraju onima dobivenim strukturnim SCM-om.
Novost i znanstveni doprinos. Prikazane su nove metodologije i rezultati primjene kauzalnih modela umjetne inteligencije u analizi potrošačke procjene kakvoće prehrambenih proizvoda. Primjena Bayesovih mrežnih strukturno kauzalnih modela (SCM) omogućuje d-odvajanje izraženih učinaka konfuzije između parametara u nekauzalnim regresijskim modelima. Na temelju SCM-a, zaključivanje ACE-a daje potkrijepljene i potvrđene istraživačke hipoteze za nove proizvode i podršku za odluke o mogućim intervencijama u svrhu poboljšanja dizajna proizvoda, uvođenja novih procesa, kontrolu procesa, upravljanja i marketinga.

Ključne riječi

Bayesova mreža; AI uzročnost; učinci intervencije; prosječni uzročni učinak (ACE); kakvoća hrane

Hrčak ID:

315459

URI

https://hrcak.srce.hr/315459

Datum izdavanja:

7.3.2024.

Podaci na drugim jezicima: engleski

Posjeta: 1.216 *

Podaci o članku

License (https://creativecommons.org/licenses/by/4.0/):

This is an open-access article distributed under the terms of the Creative Commons Attribution (CC BY) 4.0 License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Date received: 19 July 2023

Date accepted: 27 December 2023

Publication date: March 2024

Volume: 62

Issue: 1

Pages: 102-109

Publisher ID: FTB-62-102

DOI: 10.17113/ftb.62.01.24.8301

Article Information (continued)

Categories:

Subject: Original scientific papers

Keywords: :

Keywords:

Keyword: Bayesian network

Keyword: AI causality

Keyword: intervention effects

Keyword: ACE

Keyword: food quality

Causal Artificial Intelligence Models of Food Quality Data

[https://orcid.org/0000-0001-5453-6255] Želimir Kurtanjek[*]

University of Zagreb Faculty of Food Technology and Biotechnology, Pierotijeva 6, 10000 Zagreb, Croatia

Author notes:

Correspondence to: [*] E-mail: zelimir.kurtanjek@gmail.com

Contributed by:

AUTHOR CONTRIBUTION

This author is the only contributor.

SUMMARY

Research background

The aim of this study is to emphasize the importance of artificial intelligence (AI) and causality modelling of food quality and analysis with ’big data’. AI with structural causal modelling (SCM), based on Bayesian networks and deep learning, enables the integration of theoretical field knowledge in food technology with process production, physicochemical analytics and consumer organoleptic assessments. Food products have complex nature and data are highly dimensional, with intricate interrelations (correlations) that are difficult to relate to consumer sensory perception of food quality. Standard regression modelling techniques such as multiple ordinary least squares (OLS) and partial least squares (PLS) are effectively applied for the prediction by linear interpolations of observed data under cross-sectional stationary conditions. Upgrading linear regression models by machine learning (ML) accounts for nonlinear relations and reveals functional patterns, but is prone to confounding and failed predictions under unobserved nonstationary conditions. Confounding of data variables is the main obstacle to applications of the regression models in food innovations under previously untrained conditions. Hence, this manuscript focuses on applying causal graphical models with Bayesian networks to infer causal relationships and intervention effects between process variables and consumer sensory assessment of food quality.

Experimental approach

This study is based on the data available in the literature on the process of wheat bread baking quality, consumer sensory quality assessments of fermented milk products, and professional wine tasting data. The data for wheat baking quality were regularized by the least absolute shrinkage and selection operator (LASSO elastic net). Bayesian statistics was applied for the evaluation of the model joint probability function for inferring the network structure and parameters. The obtained SCMs are presented as directed acyclic graphs (DAG). D-separation criteria were applied to block confounding effects in estimating direct and total causal effects of process variables and consumer perception on food quality. Probability distributions of causal effects of the intervention of individual process variables on quality are presented as partial dependency plots determined by Bayesian neural networks. In the case of wine quality causality, the total causal effects determined by SCMs are positively validated by the double machine learning (DML) algorithm.

Results and conclusions

The data set of 45 continuous variables corresponding to different chemical, physical and biochemical variables of wheat properties from seven Croatian cultivars during two years of controlled cultivation were analysed. LASSO regularization of the data set yielded the ten key predictors, accounting for 98 % variance of the baking quality data. Based on the key variables, the quality predictive random forest model with 75 % cross-validation accuracy was derived. Causal analysis between the quality and key predictors was based on the Bayesian model shown as a DAG graph. Protein content shows the most important direct causal effect with the corresponding path coefficient of 0.71, and THMM (total high-molecular-mass glutenin subunits) content was an indirect cause with a path coefficient of 0.42, and protein total average causal effect (ACE) was 0.65. The large data set of the quality of fermented milk products included binary consumer sensory data (taste, odour, turbidity), continuous physical variables (temperature, fat, pH, colour) and three grade classes of products by consumer quality assessment. A random forest model was derived for the prediction of the quality classification with an out-of-bag (OOB) error of 0.28 %. The Bayesian network model predicts that the direct causes of the taste classification are temperature, colour and fat content, while the direct causes of the quality classification are temperature, turbidity, odour and fat content. The key quality grade ACE of temperature -0.04 grade/°C and 0.3 quality grade/fat content were estimated. The temperature ACE dependency shows a nonlinear type as negative saturation with the ’breaking’ point at 60 °C, while for fat ACE had a positive linear trend. Causal quality analysis of red and white wine was based on the large data set of eleven continuous variables of physical and chemical properties and quality assessments classified in ten classes, from 1 to 10. Each classification was obtained in triplicate by a panel of professional wine tasters. A non-structural double machine learning (DML) algorithm was applied for total ACE quality assessment. The alcohol content of red and white wine had the key positive ACE relative factor of 0.35 quality/alcohol, while volatile acidity had the key negative ACE of –0.2 quality/acidity. The obtained ACE predictions by the unstructured DML algorithm are in close agreement with the ACE obtained by the structural SCM.

Novelty and scientific contribution

Novel methodologies and results for the application of causal artificial intelligence models in the analysis of consumer assessment of the quality of food products are presented. The application of Bayesian network structural causal models (SCM) enables the d-separation of pronounced effects of confounding between parameters in noncausal regression models. Based on the SCM, inference of ACE provides substantiated and validated research hypotheses for new products and support for decisions of potential interventions for improvement in product design, new process introduction, process control, management and marketing.

INTRODUCTION

According to the EU Commission report by Knowledge Centre for Food Fraud and Quality (KC-FFQ) based on 30 000 respondents, 65 % of them perceived food quality as ’very important’ when deciding what to buy, compared to food price, which is important to 54 % of consumers (1). The concept of consumer-perceived food quality is very complex, it is an untaggable interaction of the objective measurable physicochemical properties and numerous subjective factors such as consumer population culture, ethical issues, economic and social status, tradition, personal preferences and expected nutritional benefits. It is a multi-dimensional concept which is influenced by a wide range of unmeasurable situational and contextual factors. To food producers, these complexities are difficult to rationalize for possible applications of statistical and mathematical decision-making algorithms. The objective characterization of food complexity can be greatly rationalized using ’big data‘ generated with high throughput analytical instrumentation. Automation of instrument-measurable sensory attributes has led to the development of systems such as electronic noses, e-tongue, near-infrared spectroscopy (NIR), infrared spectroscopy (IR), photoacoustic detectors and computer vision (2-6). They are applied for online production monitoring, process control and food safety. The fusion of physicochemical and electronic sensory data with computer vision enables the application of machine learning models for the detection of specific signal patterns. It helps food companies in recognizing patterns which drive consumer choice of specific products and improve the chances of continued purchase and potentially in the innovation of new products optimally adjusted to specific markets. Commonly applied statistical models are principal component analysis (PCA) and partial least square regression (PLS), and advanced machine learning (ML) algorithms such as artificial neural networks (ANN), convolution neural networks (CNN), decision trees (DT) and random forests (RF). They analyse large data sets of food quality parameters such as appearance, texture, taste and odour, and identify patterns that may be difficult for humans to detect. Importantly, they can help in identifying food contamination, spoilage and adulteration, which are crucial factors in maintaining food safety. The main benefit of ML models is the ability to provide an ’in-time’ assessment of the statistically significant status of food products (7-10). Integration of ML models with business knowledge in a food company on a production system level leads to industrial artificial intelligence (AI). It collaborates by supporting and enhancing the human thinking process, enables knowledge management and storing, and most importantly, it can learn. A decade of bibliometric studies on AI related to food science and technology show an exponential increase (11). Literature reports indicate that besides academic research, there is also a very strong interest in AI in major companies in the food industry. Dominant industry interest is in the application of intelligent robotics in specific process unit operations and their integration into a whole company AI-supported management. Besides standard engineering applications, AI is becoming a key support in the discovery and introduction of food innovations such as new components for taste, flavour and fragrance, especially aimed to reduce the content of sugar and salt in foods and beverages (12). Recent advancements are focused on the integration of strategic decisions of food company policymakers, business intelligence and AI systems in industrial production (13-16). The success of global integration of AI in food-producing companies depends on understanding the human subjective component of the present and potentially new markets (Fig. 1). Understanding the intricate dependencies of human subjective and objective physicochemical data requires higher levels of AI models as given in the scheme of ’knowledge ladder’ (17). Most of the present AI models fit the first knowledge ladder rung with potentially high flexibility and prediction accuracy under unchangeable model training conditions. In a food company, policy decisions of business management and production technologies need AI’s ability to act under new model untrained conditions. Most unforeseen new conditions are due to disruptions of supply chains, effects of climate change on the production of agricultural raw materials, competitor products and shifts in market preferences. The key upgrade of the present AI models is the application of causal data fusion of human subjective and instrumental objective data. Causal AI models are on the second and third rungs of the knowledge ladder. Causal relations are deduced from field knowledge (economy, engineering, physics, chemistry, nutrition) and from big data by statistical and knowledge models. The causal relations are integrated into AI models as Bayesian networks. They enable causal analysis elimination (blocking) of numerous confounding relations present in integrated big data training sets. On the second knowledge rung, causal AI models are applied for predicting average causal effect (ACE) of potential policy decisions and/or production interventions, labelled as do operator do(x) inFig. 1. On the third rung, causal AI models are applied for counterfactual reasoning approaching human imaginative intelligence (17). The aim of this manuscript is to apply causal AI modelling of food quality assessed by consumers and a professional panel of evaluators of wheat baking quality, fermented dairy products and wine quality.

Fig. 1 System view of causal AI model application for process and market decision making, management and innovations of food quality by do(x) inference

MATERIALS AND METHODS

Wheat quality

The baking quality of seven winter wheat cultivars from the Slavonia region in eastern Croatia was analysed. The volume of bread loaf under the standard baking protocol was used as the baking quality test. The cultivars were grown for a period of three years under controlled conditions at the experimental field of the Agricultural Institute Osijek, Croatia. Their quality properties were evaluated by 45 physical, chemical and biochemical variables. Each parameter was determined in triplicate during three consecutive years of cultivation. The measured variables were grouped as 6 indirect quality parameters, 7 farinographic parameters, 5 extensographic parameters and 25 pieces of information from reversed phase-high performance liquid chromatography (RP-HPLC) of gluten proteins. The experiment methodology and the data are available in the published manuscripts (18,19). All properties are listed as a table of continuous numerical variables. The data were highly correlated and the average absolute Pearson correlation was R=0.41. Principal component analysis of the total data set revealed that the cumulative effect in explaining the total data variance by the first three components was 76.45 % and the first four components accounted for 82.68 %.

Dairy quality

This dairy dataset contained 1059 samples of consumer quality assessments of fermented dairy products (20,21). The dataset consisted of 7 variables: pH, temperature, taste, odour, fat, turbidity and colour. Temperature, pH and colour were instrument-measured properties defined as continuous variables. The average and standard deviation for pH was 6.63±1.4, milk pre-treatment temperatures were in the range from 34 to 90 °C, with an average temperature of 44.2 °C. The colour data were determined spectroscopically with low variability of 1.7 % relative standard deviation. The samples of the physical variables have a non-Gaussian probability distribution. Spearman’s rank-order correlation coefficients between temperature and pH, colour and odour were significant with an average value of ρ=0.25, while the ρ correlation between colour and odour is insignificant. The consumer quality evaluation was the ordinal categorical variable with three levels: low, medium and high. Spearman’s rank-order quality grade correlation with temperature, colour and odour was significant, while pH was insignificant.

Wine quality

The wine quality was a large dataset, 1599 red and 4898 white samples of the Portuguese Vinho Verde wine, characterized by 12 physical and chemical composition data and quality assessments provided by a panel of professional wine tasters (22-24). The data file is available from the UCI Machine Learning Repository from the University of California at Irvine, USA. The variables were fixed acidity, volatile acidity, citric acid, residual sugar, chlorides, free sulphur dioxide, total sulphur dioxide, density, pH, sulphates and alcohol. The wine compositions were continuous numerical variables and the quality was an ordinal categorical variable with levels 1-10. The variable density was removed from the data set due to its very high variance inflation factor (VIF) since it is a common effect (causal collider) and hence cofounds modelling parameters (24). The probability distributions of the variables were approximately Gaussian. The data were highly correlated and the first three principal components for the red and white wines accounted for 99.7 and 99.8 % of the respective variances. Both red and white wines had maximum relative data variability for citric acid of 71 %, given as the ratio of standard deviation and mean value. The maximum Pearson’s correlations of the quality were with the content of alcohol, R=0.48 and 0.44 for the red and white wines respectively. The maximum negative correlation was with volatile acidity, R=-0.39 and -0.19 for the red and white wines respectively.

Methodology

The basic principles of causal AI modelling are based on the concepts of Bayesian statistics and networks (BN). Bayesian statistics combines prior knowledge (old model) upgraded with new experimental observations (data) in the prediction of a new model. The nature of prior knowledge in modelling includes deductive (known theoretical knowledge) and inductive (empirical structures and model parameters known from previous studies) processes studies. Knowledge of a causal AI model was expressed as a joint probability density function P of the model conditioned on new data. Causal AI modelling is a two-stage process in which the first objective is to determine the structure of a BN graph G, and in the second stage to determine functional causal dependencies between variables followed by estimation of the model parameters θ.

P(model|data)=P(G=graph,θ=parameter|X=data) /1/

The two-stage process of structural causal modelling (SCM) was expressed as a product of the corresponding probability density functions:

P(G,θ|X)=P(G|X) P(θ|G,X) /2/

With inferred causal structure G and parameters, θ model posterior distribution was expressed by the basic Bayesian relationship:

/3/

In case of a model with continuous random variables (Gaussian), it is explicitly expressed in a functional form as:

/4/

Extensive sampling by Monte Carlo Markov chain (MCMC) algorithm was applied for statistical inferences from the model multivariable posterior probability distribution π(θ|X).

Commonly, the basic modelling presumes that all considered causal effects are directional, i.e. recurrent causal effects are not considered. It results in model graphs without close loops, which are consequently named directed acyclic graphs (DAG). Markov property of DAG greatly simplifies modelling of complex multivariable stochastic systems (25). Complete causal directed acyclic graph G is a set of vertices V (corresponding to the random model variables x_i) connected with a set E of oriented edges (arrows), G={V,E}. It is a Bayesian network (BN) with Markov property enabling decomposition of a joint probability density function P as a product of individual node (variables x_k) probabilities p conditioned on their parent variables Pa. The parent variables are those variables x_i (vertices) pointing directly to x_k via a single edge.

/5/

Causal dependencies, direct and total, depend on a set of network paths between the cause-and-effect variables. To infer causality, confounding of interfering variables must be blocked by directed d-separation, which implies conditional independence in the probability distribution (17). Variables which block interfering interactions define adjustment sets that enable deconfounded (linear and/or nonlinear) estimation of average causal effect (ACE). For models with continuous variables, ACE is evaluated as the derivative of expected value of output variable (effect) Y with respect to the change of input (cause) X at constant covariates, called intervention of cause by do(x) (6). In case of a linear SCM, ACE is a value corresponding to average change of effect Y due to the intervention by changing cause X for a unit value. For nonlinear SCM, ACE is a function of the cause X defined by the partial derivative:

/6/

RESULTS AND DISCUSSION

Wheat baking quality

The wheat data were regularized by the application of a flexible net of least absolute shrinkage and selection operator (LASSO) as a combination of L1 and L2 norm penalty functions (26):

/7/

The initial space of 45 wheat chemical, physical and biochemical variables was reduced to the space of 10 features obtained by optimisation algorithm provided with glmnet software (27-29). The selected optimal features were: protein, wet gluten, falling number, water absorption, dough resistance, resistance/extensibility ratio, total glutenin, total high-molecular-mass glutenin, alpha-gliadin and degree of softening of the dough.

The model was the assembly of 500 trees, each obtained by random split of 3 variables. Validation of the prediction model showed that with the untrained out-of-bag samples it accounted for 75 % of variance (30).Fig. 2 shows the performance of the model predictions. Causal relations between the key variables were evaluated as a directed acyclic graph (DAG). The DAG was shown with the key variables as the nodes, associations between the variables as the edges and the causal dependences as arrows. In the process of causal structural learning, the graph edges and orientations of arrows were considered as random variables with statistical properties estimated by Monte Carlo Markov Chain (MCMC) sampling from Bayesian posterior distribution, provided as BNDAG software support (31,32). The result was structural causal model (SCM) shown as a graph inFig. 3. The causal strengths, with positive and negative effects, were given as the path coefficients, which were calculated from corresponding d-separated (directionally separated subgraph) adjustment sets by ordinary least squares (OLS) regression with normalized data (17,33).

Fig. 2 Prediction of the wheat baking quality as volume of product with 10 key features by the random forest model

Fig. 3 Causal Bayesian network model of the wheat key features and bread baking quality as volume. The path coefficients are the direct causal strengths evaluated with the standardized variables. P=w(protein)/%, WG=w(wet gluten)/%, FN=falling number/s, V=V(bread loaf)/cm3, WA=water absorption/%, R=dough resistance/min, R/Ext=resistance/extensibility ratio, TGT=w(glutenin)total/%, THMM=total high molecular mass/%, a=w(α-gliadin)/% and DS=high degree of softening

The causal inferences of the SCM were compared (validated) using unstructured causal model with double machine learning (DML) algorithm for estimation of the average causal effect (ACE) (34). The effects were estimated as the ratio of covariance and variance of the residuals of volume V and k-th variable x_k predicted by the corresponding random forest (RF) mode l:

/8/

The ACE estimates with standardized data are shown as a bar chart (Fig. 4). The SCM and the ACE estimates confirm the dominant positive effects on bread baking quality (V) of protein (P) and total high-molecular-mass (THMM) content.

Fig. 4 Direct average causal effects (ACE) of the wheat key features on bread baking quality. The ACE values were evaluated with the standardized variables. DS=high degree of softening, a=w(α-gliadin)/%, WG=w(wet gluten)/%, R.1=dough resistance/min, WA=water absorption/%, R/Ext=resistance/extensibility ratio, FN=falling number/s, THMM=total high molecular mass/%, P=w(protein)/% and TGT=w(glutenin)total/%

The main technological benefit is the application of the SCM to predict unconfounded effects of intervention action, i.e. doing effects (17). The do(x) operator was applied to redesign original DAG and accordingly modify the joint probability function by replacement of random variable X_k with preselected deterministic value x_k and d-separation of confounding variables which simultaneously interfere with the intervention (treatment) and effect (outcome). To account for nonlinearity and probability in uncertainty of do(x) effects, Bayesian neural networks (BNN) were developed (35). The intervention effects of the key causal variables P and THMM on bread baking quality V are shown inFig. 5. The distributions of the effect V indicate considerable uncertainty due to the covariates from the adjustment sets and modest saturation type nonlinearities.

Fig. 5 Distributions of a bread loaf volume (do(x)) caused by the intervention of do(x) on the content of: a) total high-molecular-mass (THMM) gliadins and b) protein

Dairy product quality

Causal analysis of the dairy product quality data was based on the SCM. Causal structure network learns by hill-climbing (HC) algorithm of greedy search of DAG space of association structures and causal directions to optimize Bayesian information criterion (BIC) (36). A relatively simple DAG network shown inFig. 6 was obtained. Temperature and fat content were identified as the exogeneous variables, which with product quality and taste are common effects as colliders. The endogenous variables were product pH, odour, turbidity and colour. The product taste and quality grade had common causal ancestors with maximum negative correlation between the grade and temperature of R=-0.45 and maximum positive correlation between the taste and fat content of R=0.32. Predictive power of the random forest model with 500 trees and 2 randomly selected variables at each had very high yield with the average out-of-bag classification error of <1 % (30). The maximum causal effect on the quality as negative ACE on temperature was -0.04 quality grade/°C in the temperature range 25–60 °C. The ACE of fat content on quality grade was 0.4. Functional dependences of ACE were obtained using the adjusted d-separated variables of the Bayesian neural network shown as partial dependent plots inFig. 7. The ACE of temperature was highly nonlinear with the saturation low point at about 60 °C, while the ACE of fat content was positive and linear in the full range.

Fig. 6 Directed acyclic graph (DAG) of causal effects of milk composition and process parameters on consumer assessment of dairy quality. Temp.=temperature, Turb.=turbidity

Fig. 7 Probability distribution of quality(do(x)) of consumer assessment of dairy quality caused by a change in: a) pretreatment temperature (°C) and b) relative fat content

Wine quality analysis

For the wine quality detailed description of SCM and causal analysis is given by Kurtanjek (24). Here the causal effects were determined by SCM validated by unstructured DML causality model (34). The model given in Eq. 8 was applied. The random forest modelling was applied to standardized data sets separately for red and white wines. The models with relative average prediction errors of 5.13 and 4.17 % were obtained for red and white wines respectively. The comparative ACE of the red and white wines are jointly presented inFig. 8. Alcohol content, predicted by the DML and SCM, had the highest positive ACE on quality of red and white wines. The content of sulphates and free sulphur dioxide had the second most important positive ACE on both red and white wines, while volatile acidity had the highest negative ACE. Although SCM and DML are based on different assumptions, the corresponding ACE estimates were qualitatively and numerically almost in agreement.

Fig. 8 The average causal effect (ACE) of wine quality caused by the change of the standardized values of physical and chemical parameters

CONCLUSIONS

This manuscript provides methodologies of causal AI modelling applied to complex problem of integration of objective (instrumental) and subjective (human) food quality data. The obtained causal network model helps food engineers with intervention decisions for the existing and innovation of new technologies. The methodologies are illustrated by the models of bread baking quality, fermented dairy products and wine.

Machine learning models of neural networks and random forest of decision trees were applied. The key research objective is discovery of the causal relations between the objective physicochemical data and consumer perception of quality. To find causal relationships between complex data of wheat biochemical and physical properties and bread baking quality, Bayesian statistical model with Monte Carlo Markov chain (MCMC) sampling of the posterior distribution was applied. Structural causal learning and analysis of dairy products was achieved by hill-climbing optimization of the Bayesian information criterion (BIC). Besides the structural causal models, the unstructured algorithm of double machine learning (DML) models with the random forest decision trees were applied to obtain the vine quality data.

The main technological application of the presented causal artificial models is to evaluate the effects of interventions (’do’, do(x) operator) as improvements of production process parameters and compositions of food ingredients. The causal models help find process control patterns and support technological decisions outside the available regression data. Here, for each presented model, average causal effects (ACE) were evaluated based on d-separation criteria and selection of the corresponding unconfounding adjustment sets. For the models to compare wine quality, the structural models based on ACE are in agreement with the estimates by the unstructured DML algorithm. The importance of nonlinear causal effects is modelled by Bayesian neural networks with d-separated minimal adjustment sets and shown as partial dependency plots.

Notes

[1] Financial disclosure FUNDING

This research did not receive any financial support.

[2] Conflicts of interest CONFLICT OF INTEREST

The author declares that there is no conflict of interest.

REFERENCES

Food quality. Knowledge Centre for Food Fraud and Quality. Brussels, Belgium: European Commission; 2021. Available from:https://knowledge4policy.ec.europa.eu/food-fraud-quality/topic/food-quality_en.

Beurteaux D. Next-generation sensory science. Food Technol. 2023;77(1). Available fromhttps://www.ift.org/news-and-publications/food-technology-magazine/issues/2023/february/features/4-next-generation-sensory-science

Aguilera T, Lozano J, Paredes JA, Alvarez FJ, Suarez JI. Electronic nose based on independent component analysis combined with partial least squares and artificial neural networks for wine prediction. Sensors (Basel). 2012;12(6):8055–72. https://doi.org/10.3390/s120608055 PubMed: http://www.ncbi.nlm.nih.gov/pubmed/22969387

Rodríguez-Méndez ML, De Saja JA, González-Antón R, García-Hernández C, Medina-Plaza C, García-Cabezón C, et al. Electronic noses and tongues in wine industry. Front Bioeng Biotechnol. 2016;4:81. https://doi.org/10.3389/fbioe.2016.00081 PubMed: http://www.ncbi.nlm.nih.gov/pubmed/27826547

Tan J, Xu J. Applications of electronic nose (e-nose) and electronic tongue (e-tongue) in food quality-related properties determination: A review. Artif Intell Agric. 2020;4:104–15. https://doi.org/10.1016/j.aiia.2020.06.003

Ma R, Shen H, Cheng H, Zhang G, Zheng J. Combining e-nose and e-tongue for improved recognition of instant starch noodles seasonings. Front Nutr. 2023;9:1074958. https://doi.org/10.3389/fnut.2022.1074958 PubMed: http://www.ncbi.nlm.nih.gov/pubmed/36698480

Han J, Li T, He Y, Gao Q. Using machine learning approaches for food quality detection. Math Probl Eng. 2022;2022:6852022. https://doi.org/10.1155/2022/6852022

Zhu L, Spachos P, Pensini E, Plataniotis KN. Deep learning and machine vision for food processing: A survey. Curr Res Food Sci. 2021;4:233–49. https://doi.org/10.1016/j.crfs.2021.03.009 PubMed: http://www.ncbi.nlm.nih.gov/pubmed/33937871

Boz Z. Machine learning powers better predictive modeling. Food Technol. 2022;76(5)

Peris M, Escuder-Gilabert L. Electronic noses and tongues to assess food authenticity and adulteration. Trends Food Sci Technol. 2016;58:40–54. https://doi.org/10.1016/j.tifs.2016.10.014

Liu Z, Wang S, Zhang Y, Feng Y, Liu J, Zhu H. Artificial intelligence in food safety: A decade review and bibliometric analysis. Foods. 2023;12(6):1242. https://doi.org/10.3390/foods12061242 PubMed: http://www.ncbi.nlm.nih.gov/pubmed/36981168

Starling S. Industry embraces AI's limitless potential. South Perth, Australia: Stardust Komms; 2023. Available from:https://stardustkomms.com/2023/01/14/industry-embraces-ais-limitless-potential/?utm_source=rss&utm_medium=rss&utm_campaign=industry-embraces-ais-limitless-potential

Mavani NR, Ali JM, Othman S, Hussain MA, Hashim H. Application of artificial intelligence in food industry – A guideline. Food Eng Rev. 2022;14:134–75. https://doi.org/10.1007/s12393-021-09290-z

Sahni V, Srivastava S, Khan R. Modelling techniques to improve the quality of food using artificial intelligence. J Food Qual. 2021;2021:2140010. https://doi.org/10.1155/2021/2140010

Irfan D, Tang X, Narayan V, Mall PK, Srivastava S, Saravanan VJ. Prediction of quality food sale in mart using the AI-based TOR method. J Food Qual. 2022;2022:Article ID 6877520. https://doi.org/10.1155/2022/6877520

Grunert KG. Food quality and safety: Consumer perception and demand. ERAE. 2005;32(3):369–91. https://doi.org/10.1093/eurrag/jbi011

Pearl J, Mackenzie D. The book of why: The new science of cause and effect. Harlow, UK: Penguin Books; 2019.

Kurtanjek Ž, Horvat D, Magdić D, Drezner G. Factor analysis and modelling for rapid quality assessment of Croatian wheat cultivars with different gluten characteristics. Food Technol Biotechnol. 2008;46(3):270–7.

Horvat D, Kurtanjek Ž, Drezner G, Šimić G, Magdić D. Effect of HMM glutenin subunits on wheat quality attributes. Food Technol Biotechnol. 2009;47(3):253–9.

Celik A. Using machine learning algorithms to detect milk quality. EJFST. 2022;6(2):76–87.

Rajendran S. Milk quality prediction. San Francisco, CA; USA: Kaggle Inc.; 2022. Available from:https://www.kaggle.com/datasets/cpluzshrijayan/milkquality

Cortez P, Cerdeira A, Almeida F, Matos T, Reis J. Wine quality. Irvine, CA, USA: UCI Machine Learning Repository; 2009.

Cortez P, Cerdeira A, Almeida F, Matos T, Reis J. Modelling wine preferences by data mining from physicochemical properties. Decis Support Syst. 2009;47(4):547–53. https://doi.org/10.1016/j.dss.2009.05.016

Kurtanjek Ž. Wine quality analysis by structural causal model (SCM). Croat J Food Sci Technol. 2023;15(2):173–84. https://doi.org/10.17508/CJFST.2023.15.2.05

Glymour C, Zhang K, Spirtes P. Review of causal discovery methods based on graphical models. Front Genet. 2019;10:524. https://doi.org/10.3389/fgene.2019.00524 PubMed: http://www.ncbi.nlm.nih.gov/pubmed/31214249

Efron B, Hastie T. Computer age statistical inference – Algorithms, evidence, and data science. New York, NY, USA: Cambridge University Press; 2016.

Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010;33(1):1–22. https://doi.org/10.18637/jss.v033.i01 PubMed: http://www.ncbi.nlm.nih.gov/pubmed/20808728

R Core Team. R: A language and environment for statistical computing. Vienna, Austria: Foundation for Statistical Computing; 2023. Available from:https://www.R-project.org.

Sharma A, Kiciman E. DoWhy: A Python package for causal inference; 2019. Available from:https://github.com/microsoft/dowhy.

Liaw A, Wiener M. R package “randomForest”. Available from:https://cran.rproject.org/web/packages/randomForest/index.html.

Castelletti F, Mascaro A. Structural learning and estimation of joint causal effects among network-dependent variables. Stat Methods Appl. 2021;30:1289–314. https://doi.org/10.1007/s10260-021-00579-1

Castelletti F, Mascaro A. BCDAG: Bayesian structure and causal learning of Gaussian directed graphs. R package v. 1.0.0, 2022. Available from:https://CRAN.R-project.org/package=BCDAG.

Textor J, van der Zander B, Gilthorpe MK, Liskiewicz M, Ellison GTH. Robust causal inference using directed acyclic graphs: The R package ‘dagitty’. Int J Epidemiol. 2016;45(6):1887–94. https://doi.org/10.1093/ije/dyw341 PubMed: http://www.ncbi.nlm.nih.gov/pubmed/28089956

Bach P, Kurz MS, Chernozhukov V, Spindler M, Klaassen S, Double ML. An object-oriented implementation of double machine learning in R. J Stat Softw. 2024;108(3):1–56. https://doi.org/10.18637/jss.v108.i03

Scott SL. BoomSpikeSlab: MCMC for spike and slab regression. R package v. 1.2.5; 2022. Available from:https://CRAN.R-project.org/package=BoomSpikeSlab.

Scutari M. Bayesian network constraint-based structure learning algorithms: Parallel and implementations in the bnlearn R package. J Stat Softw. 2017;77(2):1–20. https://doi.org/10.18637/jss.v077.i02

This display is generated from NISO JATS XML with jats-html.xsl. The XSLT engine is libxslt.

Prijava i registracija