Main content area

Optimizing genetic algorithm–partial least squares model of soluble solids content in Fukumoto navel orange based on visible–near‐infrared transmittance spectroscopy using discrete wavelet transform

Song, Jie, Li, Guanglin, Yang, Xiaodong
Journal of the science of food and agriculture 2019 v.99 no.11 pp. 4898-4903
Citrus sinensis, algorithms, least squares, models, prediction, spectral analysis, spectroscopy, total soluble solids, transmittance, wavelet
BACKGROUND: The thick rind of Fukumoto navel orange is a great barrier to light penetration, which makes it difficult to evaluate the internal quality of Fukumoto navel orange accurately by visible–near‐infrared (Vis‐NIR) transmittance spectroscopy. The information carried by the transmission spectrum is limited. Thus, the application of genetic algorithm (GA) for variable selection may not reach the expected results, and selected variables may contain redundancy. In this paper, we present the use of discrete wavelet transforms for optimizing a GA–partial least squares (PLS) model based on Vis‐NIR transmission spectra of Fukumoto navel orange. Haar, Db, Sym, Coif and Bior wavelets were used to compress the spectral data selected by GA. Then a PLS model was established based on the variables compressed by each wavelet function. RESULTS: The use of Db4, Sym4, Coif2 and Bior3.5 succeeded in further simplification of the GA‐PLS model by reducing the number of variables by 40–44% without decreasing the prediction accuracy. The application of Bior3.5 not only could reduce the number of variables in the GA‐PLS model by 40%, but also increase the value of correlation coefficient of prediction by 1% and decrease the value of root mean square error of prediction by 3%. CONCLUSIONS: The results indicated that the combination of GA and discrete wavelet transforms for variable selection in the internal quality assessment of Fukumoto navel orange by Vis‐NIR transmittance spectroscopy was feasible. © 2019 Society of Chemical Industry