Portable optical spectroscopy and machine learning techniques for quantification of the biochemical content of raw food materials

Background Accuracy in determining food authenticity, possible contamination, content analysis, and even geographical origin is of considerable scientific and economic value. The aim of this study is to facilitate quantitative evaluation of protein content in the seeds of cereals ( Triticum turgidum var. durum and Tritordeum genotypes) and rip-ening pomegranate fruits (Wonderful cultivar). Methods Two species of wheat were evaluated in this study: durum wheat, Triticum turgidum var. durum , and Tritor-deum ( durum wheat × wild barley ) together with pomegranate fruits of the variety Wonderful. Two different portable Near InfraRed (NIR) spectrometers have been used: a prototype developed in the PhasmaFood project and the commercial SCiO ™ molecular sensor. Results Considering the specific samples, the obtained results of the classification models indicate a validation mean absolute error of 0.8% (percentage of total protein content in dry matter) for two species of wheat using Convolu-tional Neural Network following normalization procedures and 0.32% using Partial Least Square (PLS) analysis applied to Tritordeum samples; visible reflectance spectra have been used to discriminate the two cereal species. A Root Mean Square Error (RMSE) of 1.25 was obtained for the determination of total soluble solids (TSS) over a 2-year period for pomegranate fresh fruits of Wonderful cultivar, which is commonly harvested with TSS values of 16–17. Conclusions The application of portable sensors using NIR spectroscopy can be a valid and rapid alternative to the use of destructive laboratory techniques for the assessment of protein content in intact wheat seeds and ripeness grade (TSS) in intact pomegranates.


Introduction
The development of rapid fingerprinting methods for the determination of crop quality is generating great interest in agricultural industries, as these are increasingly reliable tools in food analytics.For this purpose, several nondestructive detection techniques are currently available, including optical spectroscopic techniques such as visible and near-infrared (VNIR) spectroscopy, multi-and hyperspectral imaging, fluorescence spectroscopy (Bertani et al. 2020) and imaging (Bertani et al. 2024), visible imaging, and colorimetry (Žilić et al. 2011;Ferrara et al. 2022aFerrara et al. , 2022b;;Li et al. 2018).Near-infrared region (NIR) applications were demonstrated to be valid tools for the determination of vitamins, water, proteins and carbohydrates, molecules that contain a wide spectrum of NIR-active chemical groups (Osborne 2006).Recent studies have demonstrated the ability of NIR to estimate the quality of fruit in the field (Donis-González et al. 2020a) and post-harvest (McGlone et al. 2017).More generally, the development of portable sensors (Müller-Maatsch et al. 2021) has opened up the possibility of taking lab analysis directly in the field or to production lines to monitor food matrices in real time and/or optimize production processes.The challenge, in this framework, is to achieve high analytical accuracy in classification/ regression tasks starting with data that typically have lower spectral resolution and lower signal to noise ratios than data generated by expensive benchtop instrumentation.Moreover, seeds and single fruits are inherently less homogeneous than flour and juices thus increasing the variance in the measurements.
Analytical information is extracted from the sample spectra by application of advanced chemometrics models (Zhong and Wang 2019;Zhang et al. 2019a).In this field, a great deal of progress has been possible thanks to the deployment of machine learning (including deep learning, like in Nallan et al., 2022;Zhou et al. 2022) methods in the analysis workflow and the increasing availability of data to build reliable databases.In this study, two different food matrices and two different applications have been investigated to assess the power of the combination of portable spectroscopy and chemometrics: cereal seeds to evaluate total protein content, and pomegranate fruits to evaluate ripeness grade.
Among staple crops, cereals such as rice, wheat and maize play a key role in human nutrition and amongst cereals, durum wheat is the leading commercial crop as its consumption is the highest within Mediterranean countries.The current conflict in Ukraine is also putting pressure on the production of wheat since flour is consumed in many countries from Africa to South America.
In cereals, and more specifically in wheat, the nutritional quality depends on three main component classes: proteins, lipids, carbohydrates.These features, along with the color, determine the commercial value at the beginning of wheat distribution/processing.In fact, protein content, in terms of both quantity and quality (in terms of the gliadin and glutenin content), influences the probability of being processed and the characteristics of the end-product food.As an example, high protein content leads to increased water absorbance and nutritional value, increasing also the shelf-life of the products.Grain storage proteins include gliadin, glutenin, albumin, and globulin, for which accurate, economical, and rapid assessment could be beneficial for researchers and the industry.
Infrared measurements performed in the spectral region around 200 cm −1 are commonly used for determining the protein content of wheat flour and can be considered a standard industrial method.On the other hand, it has been shown that quantification of protein content using NIR instruments achieves an error in predicting protein content of 0.05% protein on dry matter (d.m.), comparable to benchtop IR instruments (Ye et al. 2018).This result was obtained by developing a Partial Least Square (PLS) model and data from benchtop NIR.
Recently, protein evaluation in intact seeds (barley) using NIR hyperspectral data (Caporaso et al. 2018) and a Convolutional Neural Network (CNN) has reached an error of 0.08% d.m (Singh et al. 2023).
Several studies based on near-infrared spectroscopy investigated the ripening of fruits in recent years.NIR spectroscopy has been used to determine Total Soluble Solids (TSS) concentration (TSS in %) in apples (Nturambirwe et al. 2019;Pourdarbani et al. 2020;Y. Zhang et al. 2019b) mandarins (McGlone et al. 2003), peaches (Kawano et al. 1992), mangoes (Saranwong et al. 2001), table grapes (Donis-González et al. 2020b), and wine grapes (Benelli et al. 2020).To the best of our knowledge, the pomegranate intact fruit has never been tested before with a portable miniaturized NIR device for the determination of its ripeness, and pomegranate fruits have received less attention compared to more common fruits such as apple or citrus (Opara et al. 2022).
Pomegranate (Punica granatum L.) is a member of the Punicaceae family, and it was one of the earliest fruit crops to be domesticated.It is native to the arid and semi-arid regions of Iran (old Persia) and northern India, and is nowadays cultivated throughout the world from the Mediterranean basin to subtropical and tropical areas of Asia and America.
The pomegranate has a natural appeal related both to its long history of domestication, cultivation, and symbolism (which goes back to the origin of agriculture) and to the health effects attributed to the flower, fruit and other parts of the tree (Sarkhosh et al. 2020).In recent years, several new plantings have been established mainly with the cultivar Wonderful for juice production.To harvest the fruit at the correct ripening time, a maturation index widely used is the total soluble content also known as Total Soluble Solids (TSS) expressed as % or °Brix.The decision on harvesting time is critical for obtaining the best fruit quality either for fresh consumption or for juice production or other processing activities.
A method such as the one proposed in this paper, which combines NIR spectroscopy and machine learning approaches, could therefore be of considerable interest either for decision making in the agro-industrial sector or to limit the food waste along the farm-to-fork chain.

Cereal seeds samples
Two species of wheat were evaluated in this study: namely durum wheat, Triticum turgidum var.durum (hereinafter referred to as Durum Wheat), and Tritordeum (durum wheat × wild barley).They differ in protein content covering a range from 8.1 to 19.88% d.m. calculated as: [Tot.GPC%/(100 − H%)] × 100; the samples derived from different varieties (see Additional file 1: Table S1) for both species and in total twelve wheat samples (grains) have been assayed; each sample is in technical triplicates.Grain protein content of samples was determined for a 3 g sample of wholemeal flour by NIRspectroscopy using a SpectraAnalyzer device (Basic model, Zeuton) spectral range 1400-2500 nm since this is currently used in the industry as reference method (Du et al. 2022).

Pomegranate samples
The pomegranate samples were collected in an orchard located in the Puglia region, Southern Italy, in the countryside of Castellaneta marina (Taranto province).The pomegranate cultivar tested in this trial was the Wonderful, trained to ypsilon trellising and the spacing was 3.3 × 5.7 m.
The trial was conducted over two seasons (2020 and 2021) and the harvested fruits were used to evaluate TSS (Total Soluble Solids), in the two years, whereas titratable acidity (TA), pH, polyphenols (as gallic acid) and vitamin C were analyzed for the 2021 samples only.In 2020, a total of 280 fruits were collected and analyzed from 11 July to 15 October on 7 sampling dates; 210 fruits were used for model calibration and 70 fruits for the external model validation.In 2021, 320 fruits were collected and analyzed from July 21 to December 21 (last sampling in post-harvest) on 8 sampling dates; 240 fruits were used for model calibration and 80 fruits for the external model validation.In both years, the fruits were collected at various development and ripening stages and were chosen to be representative of the conditions observed (color, size, phenological stage) in the orchard at each sampling time.Spectral data for each fruit were acquired under laboratory conditions using the SCiO ™ mini (v1.2), followed by destructive analysis for the determination of TSS by using a digital refractometer (model HI 96801, Hanna Instruments, Woonsocket, RI, USA); TA and pH were measured for each fruit in 2021 by using an automated pH-meter (PH-Burette 24, Crison Instruments, Barcelona, Spain).For each sample acquisition, two (in 2020) or three (in 2021) scans per fruit were performed at different locations around the fruit equator, approximately 180° and 120° apart respectively, at a distance < 0.5 cm from the device as indicated by the manufacturer.The juice for the destructive analysis was collected from each fruit.

Portable sensors Optical multi-spectral measurements using the 'PhasmaFood' sensor
Multi-sensor spectral data were acquired using a prototype developed in the framework of the PhasmaFood project (Fig. 1).The sensor is equipped with a miniaturized commercial UV-VIS spectrometer (C12880MA, Hamamatsu, Japan), a prototype miniaturized NIR sensor with a spectral range 950-1900 nm (Pügner et al. 2016), and a miniaturized RGB-camera (MU9PC-MH, CMOS, Ximea, Münster, Germany).The UV-VIS spectrometer is used both in fluorescence-emission spectroscopy (FLUO) and a diffuse-reflectance visible-spectroscopy (VIS) modality: a long-pass optical filter with cutoff at 400 nm ensures a block of fluorescence-excitation reflection.The NIR and UV-VIS spectrometers apertures and their respective illumination sources (white LED for visible reflectance OSRAM minitopled LCW MVSG.EC, 365 nm wavelength Nichia NSSU100DT for fluorescence excitation and halogen mini lamp for NIR reflectance) are positioned inside the case, pointing at the interface of the observation window (Fig. 1, bottom right), thereby ensuring that recorded spectra come from the same sample area.The PhasmaFood platform also includes a custom-designed Android application to control the sensor by a smartphone and an online open database: spectral measurements from the different spectrometers are sent to the mobile app and then directly to the cloud database (https:// dashb oard.phasm afood.eu/).
The settings for the individual sensors in terms of illumination power and acquisition times for VIS and FLUO signals were optimized for the current samples.Each sample replicate is acquired three times, and the acquisition routine is organized in three steps: VIS, NIR and FLUO.Visible diffuse-reflectance spectral acquisition consists of 10 consecutive acquisitions followed by 10 dark acquisitions; Near infrared (NIR) reflectance spectra acquisition consists of 255 averages and dark acquisitions; fluorescence emission spectra acquisition consists of 10 consecutive acquisitions followed by 10 dark acquisitions.The whole acquisition routine takes only 2 min.
The PhasmaFood sensor was calibrated by conducting a 99% diffuse reflectance (S white ) and dark acquisition (S dark ) measurement of a white standard material (Spectralex White Diffuser WDF-030-95, Lake Photonics, Uhldingen-Muehlhofen, Germany).Spectral measurements were pre-processed as follows.
The FLUO acquired spectral data were pre-processed by subtracting the dark acquisition sample spectrum from the raw sample spectrum for FLUO measurements (S FLUOraw ) VIS and NIR measurements were pre-processed by subtracting the S dark from the raw sample spectrum and then dividing it by the 99% diffuse reflectance white standard spectrum's difference with the dark acquisition of white standard spectrum.
Cereal seeds were placed in a 60 mm-diameter plastic petri dishes completely filled (10 mm thickness).The PhasmaFood' sensor was put on top of the Petri dish with its window in direct contact with grains allowing 50 mm between the sample surface and the illumination and sensors modules.A custom sample holder was designed to maintain optical alignment and contact between the sample and the instrument and to block environmental light.

SCiO sensor
The SCiO ™ molecular sensor is a spectrometer produced by Consumer Physics Inc. (Tel-Aviv, Israel) used in horticulture to evaluate the physical and chemical properties of fruit, corn, soybean, wheat and other products.It operates in the range of 740-1070 nm, with a sampling S FLUO = S FLUOraw − S dark S VIS or NIR = S VIS or NIRraw − S dark S white − S dark spectral resolution of 1 nm.It has been extensively described in a recent paper when it was used to test the ripeness parameters of a wine-grape cultivar (Ferrara et al. 2022b).The device can be managed by the proprietary software SCiO ™ Lab running on smartphones, tablets, and personal computers with a Bluetooth connection.The device was calibrated before every session for correcting data acquisition using the white tile located inside the sensor cover.

Spectral analysis Cereal spectral analysis and performance evaluation
NIR spectra acquired by PhasmaFood sensor were used for the data analysis and sample classification.Spectral data have been processed separately according to the species (Durum Wheat, Tritordeum) or together (Whole set).Different combinations of data, pre-processing methods, and analysis have been considered and evaluated with performance indicators.NIR spectra with different pre-processing methods are shown in the Additional file 1: Fig. S2).
The NIR cereal spectral-pre-processing step has included three different approaches: no pre-processing (Raw Dataset), Smoothing, and Normalization.In the Raw dataset, spectra have been analyzed without any treatment.Smoothed spectra have been obtained applying Savitzky-Golay filter with a second-order polynomial and an 18-point window to remove the high-frequency noise from the data.Different normalization procedures have been tested on the smoothed dataset: unit-value normalization (UVN) spectra have been obtained by applying x normalized = (x -x minimum )/(x maximum -x minimum ); Area normalization (AREANORM) have been obtained by applying x normalized = x/(integral of the spectra) and Standard Normal Variate (SNV) by x normalized = (x-mean of the spectra)/(standard deviation of the spectra).All datasets were used to evaluate the effects of pretreatment methods on the final regression results.
After the pre-processing step, the dataset was split into a training set and a test set, with an 80/20 split for the regression step.Scaling was conducted on the datasets using the training-set mean and standard deviation to transform both the training set and the test set.
Partial Least Square (PLS) and Convolutional Neural Network (CNN) (Bian et al. 2018) methods were then used to build a predictive model of protein content and their performances compared.
The CNN used for this approach is fully described in the Additional file 1: Section 1.The number of PLS latent variables was optimized by plotting the predictive performance of the PLS on a reduced dataset against the number of latent variables.
For the PLS and CNN evaluation, a fivefold random shuffle test-training split was used for cross validation.A random split was chosen instead of the k-fold to reduce the risk of overfitting.Data Analysis was conducted using Python v3.9.12 and Sklearn v1.0.2 (Pedregosa et al. 2011).
After a first run of data analysis and results evaluation, it was clear that the regression methods performed better when samples of different species (Durum Wheat and Tritordeum) were considered separately.
To identify Durum Wheat spectra from Tritordeum ones, Principal Component Analysis (PCA) on VIS spectra was then conducted.The PCA and Linear Discriminant Analysis (LDA) were implemented in Python v3.9.12 with Sklearn v1.0.2, the LDA was cross-validated with a 5 split stratified k-fold cross-validation.
The permutation feature importance of the CNN model was calculated using the ELI5 python package, which calculates the permutation importance of a feature by shuffling the values of a feature inside the training set and calculating the decrease of the Mean Absolute Error against the normal MAE calculated without shuffling.

Pomegranate NIR spectral analysis and performance evaluation
Data collected by the SCiO ™ sensor were analyzed using SCiO ™ Lab online interface models, whose predictive capability have already been validated in previous work (Li et al. 2018).
The pre-processing step was empirically designed by testing different combinations of techniques and choosing the one which maximized the performances metrics of the overall model.The combinations of the pre-processing techniques were obtained, taking either the first or the second derivative of the raw signal or the logtransformed signal.Finally, an SNV step was added.The two best combinations were considered for the prediction model construction.
For PLS in this trial, the number of latent variables was chosen by means of a feature of the SCiO ™ software that automatically allows users to select the number of parameters for each model.For the validation step, more detailed information on data analysis can be found in recent papers on table and wine grapes (Ferrara et al. 2022a).

Cereal spectral analysis
The predictive performances of CNN and PLS were evaluated in terms of: Root Mean Square Error (RMSE), Mean Absolute Error (MAE), R 2 , RPD and Ratio of Performance to the Interquartile (that is the interquartile range of the observed values divided by the Root Mean Square Error of the prediction) (Bellon-Maurel et al. 2010).All results are available in the Additional file 1: Table S4 while MAE and R 2 are presented and discussed in this paragraph as representative results of model performance.
Figure 2 shows the values of R 2 and MAE obtained with CNN and PLS models after different preprocessing treatments to raw data.In this case, samples of both species are considered together, and the results evidence that the CNN model generally outperforms the PLS model.
The importance of data pre-processing is highlighted by the fact that the CNN model cannot accurately predict protein content on the raw validation data and gives a negative R 2 .Regarding the normalization approaches, the unit value normalization improves the predictive performance and produces less overfitting compared to SNV and the area normalization, which have higher errors and a greater difference between Calibration and Validation R 2 that typically indicate overfitting.
Starting from these results, to go deeper into the chemical meaning of the spectral characteristics of the samples, it was necessary to identify the spectral regions of interest (Huan et al. 2021).For this purpose, Feature Permutation Importance analysis (Fig. 3) was used.From this 'black-box' method, we can see the features (wavelengths) that are most important for the CNN model.Notably, the results of Feature Permutation Importance from the CNN model of both species highlighted the same spectral areas as the PLS weights, as shown in the Additional file 1: Fig. S1, but with decreased width of peaks.It is also only a single set of importance values compared to the multiple weights of the PLS.This approach makes easier the identification of peaks related to chemical bonds already identified in NIR literature (Workman and Weyer 2012).
The H 2 O bands of the second and third overtone at 980 nm and 1430 nm have the greatest importance in the model, while the Feature Permutation Importance peaks at 1184 nm, 1510 nm and 1702 nm can be directly attributed to protein content (Workman and Weyer 2012).A detailed interpretation of the peaks is given in the Additional file 1: Table S3), while the combination of bands around 1300 nm is still to be unequivocally identified to our knowledge.
When data analysis is applied to single-species datasets, the predictive performance of CNN and PLS is improved (Fig. 4), and the PLS outperforms the CNN (which fails to predict the protein content in the RAW DW, RAW TD, SMOOTHED TD and UVN TD datasets as indicated by the low R 2 ).
Considered separately, the models achieve an error of less than 0.5% d.m. for both species after pre-processing.This result can be considered accurate enough for practical application.
A strategy to take advantage of the better performance of the species-specific model and its wider applicability is to determine the sample species with a discriminant model before predicting the protein content with the appropriate model.In agro-industrial conditions, and at any point of the farm-to-fork value chain, this methodology can be implemented in a fast and effective way.
To separate spectra belonging to different species using a spectral-analysis method, we considered visible spectra of Durum Wheat (DW) and Tritordeum (TD), acquired during the same acquisition routine for NIR spectra.Applying a PCA to the visible dataset in the score plot of the two principal components PC2 and PC3, which together account for 10.9% of the total variance, it is possible to see a separation between the two types of wheat.This natural separation of the species was observed only in the visible dataset: the mean spectra differ in the region between 525 and 700 nm.To evaluate the possibility of species discrimination, a Linear Discriminant Analysis was trained on the visible dataset.The result is shown in the Additional file 1: Fig. S2, and the accuracy obtained allows the discriminant algorithms to be used.
The use of VIS data in combination with NIR data is an example of a data-fusion approach, where different spectroscopic methods and relative-analysis algorithms related to the same sample are used in combination (sequentially in this case) to improve the prediction accuracy of a model based on spectroscopic data.
The measurable spectral difference in the visible range between Durum Wheat and Tritordeum support the hypothesis that the two cannot be used by the same model for an accurate protein prediction: the considered region of NIR spectra doesn't give a direct measure of the protein, but a combination of overtone emissions from a variety of molecular bonds that correlate with the protein content, and the two species have different biology and genes, and consequently proportions in the classes of proteins can differ between the two cereals.It can be hypothesized that the CNN performs better on both species dataset for its ability to isolate the protein signal as it is shown in Fig. 3. Nevertheless, the different behavior of the two species shows that a chemometric analysis should be optimized and should consider every aspect of the nature of the samples and the objective of the classification.Further studies considering both species can lead to improved predictions.
The performances of the CNN and PLS prediction models don't reach the error level of the models trained on flour samples with benchtop instruments (like the one used as reference in this study) but are accurate enough (98% of similarity between the two instruments) to allow the use of the portable instruments for fast non-destructive evaluation of crop quality in the field.

Pomegranate spectral analysis
The results of pomegranate fruits can be divided into three parts, in relation to the year of data collection and the different ripening parameter considered: 1. Data collected in 2020 (TSS).2. Data collected in 2021 (TSS, TA, pH, vitamin C, gallic acid).

Data collected in 2020 and 2021 (TSS).
In 2020 (Table 1), the model that has a higher R 2 and RPD and a lower RMSE is the one in which the data have been pre-treated using the logarithm, the second derivative and the SNV, which had an RMSE of 0.98, an R 2 of 0.87 and an RPD of 2.77; the second-best model, however, was obtained using the first derivative and the SNV.
It is possible to compare these values with those obtained by (Ferrara et al. 2022b) for sensor calibration on wine grapes (Primitivo cultivar) trained to Alberello and Guyot in 2019 and 2020.
Table 1 shows an R 2 around 0.81, an indication of a good accuracy in predicting the data of TSS on the validation dataset.In this case, the best model and the best combination of data pre-processing steps is the one that involves the logarithmic transformation, followed by the second derivative and SNV.The RPD of 2.34 and 2.27, being between 2.0 and 2.5, indicates a model capable of making quantitative predictions.
However, the R 2 values for the TSS of this study do not reach the values obtained for wine grape, ≈0.90 (Ferrara et al. 2022b), but pomegranate and grape are two very different fruit species.Only the RMSE, in our case, is better (1.06 and 1.09) than that obtained for wine grape (Ferrara et al. 2022b) which averaged 1.76.
In the case of table grape (Ferrara et al. 2022a), higher values of R 2 and RPD (0.93 and 12.9, respectively) and lower values of RMSE (0.61) were obtained, indicating an excellent predictive model, better than the one obtained in this study for pomegranate.
The difference between grapevine and pomegranate in the accuracy of prediction could be attributed to the thickness of the skin, because in the case of wine-and table grape, this is considerably thinner than that of the pomegranate (which instead has a considerably thicker skin and associated inner tissues for the Wonderful cultivar).
In 2021, however, various parameters were measured in addition to the TSS, such as TA, pH, vitamin C and gallic acid (used as a measure of polyphenols) and the respective predictive models were built for each parameter with the different combinations of pre-treatment, as reported in Table 2.
In 2021, the parameter with the lowest RMSE value is pH, with an average value of 0.10; the TSS is the parameter with the R 2 closest to 1, with an average value of 0.87, and with the highest average RPD, around 2.82.
In the case of TSS, the best model was obtained using raw spectra, while the second-best model was obtained using a combination of two pre-treatment techniques such as the second derivative and SNV.
The R 2 value for the TSS (0.88, Table 2) was clearly higher than values reported in previous studies carried out on other fruit species such as feijoa, apple and  González et al. 2020a).In other studies, using NIR sensors working in different wavelength ranges (780-1700 nm and 680-1000 nm), the correlation coefficients were 0.60 and 0.68 respectively (Nicolaï et al. 2008;Travers et al. 2014).
Higher R 2 values (> 0.9), on the other hand, have been reported for kiwifruits using a spectral range from 800 to 1100 nm (McGlone and Kawano 1998) and for grapes using a spectral range from 800 to 2500 nm (Jarén et al. 2001), but with lab devices.
With regards of the RMSE, the value of 2021 (1.09 on average) was higher than values reported in other experiments, such as for Royal Gala apple (0.72) reported in (McGlone et al. 2002), for pear (0.44) reported in (Nicolaï et al. 2008) or for Cotton Candy table grape in 2019 (0.68) and in 2020 (0.59) as detailed in (Ferrara et al. 2022a).Values of RPD of 3.90 and 7.42 were obtained for Cotton Candy table grape in 2019 and 2020, respectively (Ferrara et al. 2022a), which resulted higher than the one obtained for pomegranate in 2021 (2.93).
In the case of pH, both the best models were obtained using the first or second derivative and SNV, with an R 2 equal to 0.83, an extremely low RMSE (0.1) and a fairly high RPD (2.49).The RMSE obtained was very similar to that obtained for Cotton Candy in 2019 in the work of (Ferrara et al. 2022a), while R 2 and RPD resulted much lower.
In the case of titratable acidity (TA), all the models presented an R 2 that was generally low (around 0.64), while in the case of vitamin C and gallic acid the R 2 values observed were extremely low (around 0.27 and 0.08 for vitamin C and gallic acid, respectively) suggesting that the models were not accurate enough.
However, most of the best models seen in Table 2 were obtained by pre-treating the data first with the derivative (first or second) and then with SNV, as also reported by (Donis-González et al. 2020a) for several seedless table grape cultivars.
When testing the models on the validation set, the only parameter with a high R 2 is TSS, as also seen in 2020, with an average of 0.84 between the two best predictive models; the RMSE is around 1.16 on average.The poor estimation of acidity and pH, and of vitamin C and gallic acid, could be associated with the high thickness of the skin which prevents the penetration of radiation up to the arils, thus creating considerable interference in the determination of such molecules.Moreover, the presence of polyphenols (organic acids in the mesocarp beneath the skin) can be another factor affecting the accuracy of the NIR measurements with respect to soluble solids.Therefore, the only parameter for which a good predictive model was obtained resulted from TSS.
The depth of penetration of NIR applied on fruit depends on the wavelength (Lammertyn et al. 2000), and is about 2-3 mm in the range of 900-1900 nm but can also reach 4 mm at a shorter wavelength (about 700-900 nm).Consequently, knowing the wavelength associated with our SCiO sensor, we can assume that the depth reached in our study is approximately 3-4 mm, given the wavelength of 740-1070 nm.Thus, the spectra data obtained are probably related mainly to the properties of the inner part of the fruit skin and partly of the arils, and this could have greatly influenced the prediction accuracy and the model performance for the different molecules, in particular the ones at low concentration (vitamin C, polyphenols).
Therefore, the correlation between spectra data obtained with the sensor and the data obtained by destructive analysis have given rise to predictive models with a medium-high accuracy in the case of TSS, a medium-low accuracy in the case of TA and pH and a very low accuracy in the case of vitamin C and total polyphenols (as gallic acid).The latter are not usable for practical application considering the low accuracy.
The best combinations among the pre-processing techniques always included the logarithmic transformation (Log), first or second derivative (1st Der or 2nd Der) and SNV (Standard Normal Variate).The major peaks, as can be seen from Additional file 1: Fig. S8, are present at a wavelength of 950-1000 nm, corresponding to the main functional groups such as OH, NH 2 , H 2 O, ROH and ArOH (with OH bond on the aromatic) (Cen and He 2007;Johnson et al. 2020).
These groups are related to major constituents of juices such as sugars, vitamins, polyphenols and other biologically active compounds.TSS mainly represents sugars, but also, to a less extent, other organic molecules (vitamins, amino acids, hormones, etc.) containing C-C, C-O, O-H and C-H bonds, and a NIR spectroscopy can be used for a non-destructive measurement of sugars (Johnson et al. 2020;Ma et al., 2019).
The data of TSS collected both in 2020 and in 2021 were used to create a two-year model.For the creation of these models, two scans per fruit were performed using the SCiO Lab portal, in such a way as to take into consideration both the 2020 and 2021 samples.
The model with the higher R 2 and RPD (0.87 and 2.75, respectively) and a lower RMSE (1.07) was the one in which the data were processed using the logarithmic transformation, second derivative and SNV.
These values, not as high as in other studies previously reported, could be due to the different growing conditions that occurred in 2020 and 2021, thus leading to substantial differences in the creation of effective predictive models.
A study carried out for 4-5 years, in different growing and pedoclimatic conditions, could help in the construction of more accurate predictive models, with a high R 2 and RPD and a low RMSE.Additional file 1 shows PLS components weights, List of Samples, NIR and VIS Spectra, CNN architecture, peak identification, PCA score plot for wheat samples, classification performances.

Conclusions
In this paper, potential advantages of using NIR portable devices coupled to machine-learning analysis methods for the qualitative evaluation of intact raw food products have been investigated, together with specific limitations.
In particular, the total protein content in wheat seeds has been determined for Durum Wheat and Tritordeum with a minimum validation error of 0.4% and 0.32% d.m. respectively, and TSS in intact pomegranate fruits has been determined with a RMSE of 1.25.
The application of portable sensors using NIR spectroscopy has the potential be a valid and rapid alternative to destructive laboratory techniques for the assessment of protein content in intact wheat seeds and ripeness grade in intact pomegranates by evaluating TSS only.It must be always taken into account that NIR is sensitive to the overtones of molecular bond vibrations, that they overlap and are non-specific to a single analyte, so chemometric models trained on this type of data can be case specific and may fail (or significantly reduce regression/classification performance) when applied to different families of samples, as observed in the case of the two wheat species.In these cases, the use of data-fusion approaches, and, in particular, the combination of different spectroscopic data (in our work visible reflectance data beside NIR spectra) can lead to improved results and widen the range of potential applications and samples, as the collection of information from different techniques allows more complex scenarios to be considered in terms of variable parameters.
In the case of pomegranate, not all the parameters tested were successfully predicted by the models, while the same models proved very accurate when applied to different types of fruits (e.g., grapes) in previous studies.In this case, the limitation of the approach is due to the physically-limited penetration depth of NIR radiation inside pomegranate skin: TSS signal is detectable from the outer layers of pomegranate skin, whereas other molecules at low concentration are not accurately detected.Also in this case, the set of parameters available for accurate detection must be chosen considering the specificity of the samples.
For the possibility of rapid detection, portable spectroscopic devices could be used either in the field by agronomic consultants to quickly assess the ripeness of pomegranate fruits thus reducing costs, personnel labour and use of chemicals or by technicians/farmers on arrival of fruits at the farm to make a first selection of fruits based on ripeness for successive processing.Applications would be possible also in stores by retailers for the on-site monitoring of the ripening status of the fruits, always considering that near infrared light penetrates only a few millimetres so only properties that influence the surface can be measured.
A further miniaturization of devices and/or evolution of the whole platforms in the next future will probably allow the use of this technology for wider application in food safety, authenticity and quality along all the farmto-fork value chain.The possibility to implement complex chemometric models in cloud-based automatic classification procedures will also allow non-expert users to receive data and inform decision-making in a fast and reliable way.

Fig. 1
Fig.1The Phasmafood multisensor prototype (left side, top and bottom pictures) and SCiO spectrometer (right side) with the respective samples (cereals and pomegranate)

Fig. 2 Fig. 3
Fig. 2 Predictive Performance of % d.m. in cereal samples of CNN and PLS on the dataset when the samples from two species were considered together.Performance in terms of MAE (Mean Absolute Error) and R 2 according to different preprocessing strategies (Raw: no treatment, smoothed: SG smoothing, SNV Standard Normal Variate, Areanorm: area normalization, UVN Unit Value Normalization)

Fig. 4
Fig. 4 Predictive Performance of % d.m. in cereal samples of CNN and PLS on the dataset when the two species are considered separately (DW Durum Wheat, TD Tritordeum).Performance is shown in terms of MAE (Mean Absolute Error) and R 2 according to different preprocessing strategies (Raw: no treatment, smoothed: SG smoothing, SNV Standard Normal Variate, Areanorm: area normalization, UVN Unit Value Normalization)

Table 1
PLSR accuracy on pomegranate TSS by means of the best combinations of pre-processing techniques in the season 2020 (Li et al. 2018)al.2018)or on table grapes (Donis-

Table 2
PLSR accuracy on pomegranate TSS, TA, pH, vitamin C and gallic acid by means of the best combinations of pre-processing techniques in the season 2021 and 2020-2021 (TSS only)