A hybrid algorithmic approach to feature importance analysis in agro-industrial efficiency assessment using SHAP, gradient boosting, and PCA
Abstract
The increasing need for efficient resource management and sustainable production in the agro-industrial sector necessitates advanced analytical approaches capable of accurately identifying key influencing factors. This study proposes a hybrid algorithmic framework for feature importance analysis in agro-industrial efficiency assessment by integrating Shapley Additive Explanations (SHAP), Gradient Boosting, and Principal Component Analysis (PCA). The proposed methodology combines linear and nonlinear feature evaluation techniques to enhance interpretability and predictive performance. The approach was tested on data collected from agro-industrial enterprises in the North Kazakhstan region, covering production, climatic, and economic indicators from 2020 to 2022. The results revealed that crop area, yield per hectare, and climatic factors are the most significant contributors to key performance indicators, including yield increase, seasonal profit, and risk reduction. The hybrid analysis lowered prediction uncertainty by 28% and increased model accuracy by 15 to 20% compared to single-method approaches. Using SHAP made the model clearer and helped identify key features, which aided decision-making in agro-industrial management. The proposed framework has high potential for implementation in precision agriculture and strategic management and provides an effective tool for maximizing agricultural efficiency under varying environmental conditions.
Authors

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.