Bayesian Additive Regression Trees (BART) in Food Authenticity: A Classification Approach to Food Fraud Detection
By: Mengxiang Zhu, Riccardo Rastelli
Potential Business Impact:
Finds fake olive oil using light's colors.
Feature engineering plays a critical role in handling hyperspectral data and is essential for identifying key wavelengths in food fraud detection. This study employs Bayesian Additive Regression Trees (BART), a flexible machine learning approach, to discriminate and classify samples of olive oil based on their level of purity. Leveraging its built-in variable selection mechanism, we employ BART to effectively identify the most representative spectral features and to capture the complex interactions among variables. We use network representation to illustrate our findings, highlighting the competitiveness of our proposed methodology. Results demonstrate that when principal component analysis is used for dimensionality reduction, BART outperforms state-of-the-art models, achieving a classification accuracy of 96.8\% under default settings, which further improves to 97.2\% after hyperparameter tuning. If we leverage a variable selection procedure within BART, the model achieves perfect classification performance on this dataset, improving upon previous optimal results both in terms of accuracy and interpretability. Our results demonstrate that three key wavelengths, 1160.71 nm, 1328.57 nm, and 1389.29 nm, play a central role in discriminating the olive oil samples, thus highlighting an application of our methodology in the context of food quality. Further analysis reveals that these variables do not function independently but rather interact synergistically to achieve accurate classification, and improved detection speed.
Similar Papers
Bayesian Additive Regression Trees for functional ANOVA model
Machine Learning (Stat)
Shows how different things affect results.
Bayesian Additive Regression Trees for functional ANOVA model
Machine Learning (Stat)
Explains how different things affect results.
An Infinite BART model
Computation
Lets computers learn better from different data.