|
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
| Volume 187 - Issue 92 |
| Published: March 2026 |
| Authors: Lance Angelo P. Arcega, Jeremy D. Samonte, Jomar Benedict B. Balagtas, Ian Miguel M. Bautista, Melissa M. Pantig |
10.5120/ijca2026926593
|
Lance Angelo P. Arcega, Jeremy D. Samonte, Jomar Benedict B. Balagtas, Ian Miguel M. Bautista, Melissa M. Pantig . A-Rice: A Machine Learning-Based Approach Using Random Forest and Gaussian Process Regression for Forecasting Rice Production in Pampanga. International Journal of Computer Applications. 187, 92 (March 2026), 46-52. DOI=10.5120/ijca2026926593
@article{ 10.5120/ijca2026926593,
author = { Lance Angelo P. Arcega,Jeremy D. Samonte,Jomar Benedict B. Balagtas,Ian Miguel M. Bautista,Melissa M. Pantig },
title = { A-Rice: A Machine Learning-Based Approach Using Random Forest and Gaussian Process Regression for Forecasting Rice Production in Pampanga },
journal = { International Journal of Computer Applications },
year = { 2026 },
volume = { 187 },
number = { 92 },
pages = { 46-52 },
doi = { 10.5120/ijca2026926593 },
publisher = { Foundation of Computer Science (FCS), NY, USA }
}
%0 Journal Article
%D 2026
%A Lance Angelo P. Arcega
%A Jeremy D. Samonte
%A Jomar Benedict B. Balagtas
%A Ian Miguel M. Bautista
%A Melissa M. Pantig
%T A-Rice: A Machine Learning-Based Approach Using Random Forest and Gaussian Process Regression for Forecasting Rice Production in Pampanga%T
%J International Journal of Computer Applications
%V 187
%N 92
%P 46-52
%R 10.5120/ijca2026926593
%I Foundation of Computer Science (FCS), NY, USA
Rice production forecasting is critical for food security planning in the Philippines, yet existing estimation methods in provinces like Pampanga remain reliant on historical averages and manual assessments. This study presents A-Rice, a sequential hybrid machine learning model that integrates Random Forest (RF) and Gaussian Process Regression (GPR) for municipal-level rice production prediction across 20 municipalities over a 25-year period (2000–2025). The model was developed using historical agricultural data from the Philippine Statistics Authority and the Office of the Provincial Agriculturist, combined with six environmental variables from NASA POWER. Bayesian Small Area Estimation was applied to reconstruct missing municipal-level records for 2000–2016. In the hybrid framework, RF first generates baseline predictions and identifies key predictive features, after which GPR refines these predictions by modeling residual errors and quantifying uncertainty through calibrated confidence intervals. Evaluated against standalone RF and GPR models, the hybrid RF→GPR model achieved R² values exceeding 0.99 across all data splits, with an 80% reduction in RMSE on the test set (from 5,555 to 1,130 metric tons). Feature importance analysis revealed that municipality, cropping season, and relative humidity were the most influential predictors. The model was deployed as a web-based decision-support application providing production forecasts with confidence intervals, environmental diagnostics, and historical benchmarking. Results demonstrate the viability of sequential hybrid ML approaches for precision agriculture and data-driven agricultural planning in developing regions.