Borradores de Economía - Enhancing inflation nowcasting with online search data: a random forest application for Colombia
The series Working Papers on Economics is published by the Office for Economic Studies at the Banco de la República (Central Bank of Colombia). It contributes to the dissemination and promotion of the work by researchers from the institution. This series is indexed at Research Papers in Economics (RePEc).
These works are often the result of collaboration with individuals from other national or international institutions. The works published are provisional, and their authors are fully responsible for the opinions expressed in them, as well as for any errors. The opinions expressed herein are those of the authors and do not necessarily reflect the views of Banco de la República or its Board of Directors.
The results highlight the usefulness of combining machine learning techniques with alternative sources of information to generate timely inflation nowcasts whose accuracy is comparable to that of market analysts' forecasts.
This paper evaluates the predictive capacity of a machine learning model based on Random Forests (RF), combined with Google Trends (GT) data, for nowcasting monthly inflation in Colombia. The proposed RF-GT model is trained using historical inflation data, macroeconomic indicators, and internet search activity. After optimizing the model’s hyperparameters through time series cross-validation, we assess its out-of-sample performance over the period 2023–2024. The results are benchmarked against traditional approaches, including SARIMA, Ridge, and Lasso regressions, as well as professional forecasts from the Banco de la República’s monthly survey of financial analysts (MES). In terms of forecast accuracy, the RF-GT model consistently outperforms the statistical models and performs comparably to the analysts’ median forecast, while offering the additional advantage of producing predictions approximately one and a half weeks earlier. These findings highlight the practical value of integrating alternative data sources and machine learning techniques into the inflation monitoring toolkit of emerging economies.
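The tuning and evaluation workflow described in the abstract can be illustrated with a minimal sketch. It assumes scikit-learn and uses synthetic stand-in data, not the paper's dataset: the feature matrix `X`, the target `y`, the hyperparameter grid, and the 24-month hold-out are all hypothetical choices made for illustration. The sketch also uses a single fixed train/test split, whereas an actual nowcasting exercise might re-estimate the model each month.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import GridSearchCV, TimeSeriesSplit

rng = np.random.default_rng(0)

# Synthetic stand-in data (assumption): 120 months, 6 predictors that play
# the role of lagged inflation, macro indicators, and search-activity indices.
n_months, n_features = 120, 6
X = rng.normal(size=(n_months, n_features))
y = 0.4 + 0.3 * X[:, 0] - 0.2 * X[:, 1] + rng.normal(scale=0.1, size=n_months)

# Hold out the last 24 months as the out-of-sample evaluation window.
n_test = 24
X_train, y_train = X[:-n_test], y[:-n_test]
X_test, y_test = X[-n_test:], y[-n_test:]

# Time series cross-validation: every fold trains on earlier months and
# validates on later ones, so no future information leaks into tuning.
cv = TimeSeriesSplit(n_splits=5)

# Hypothetical grid; the paper's actual search space is not given here.
param_grid = {
    "n_estimators": [50, 100],
    "max_features": [2, 4],
    "min_samples_leaf": [1, 3],
}

search = GridSearchCV(
    RandomForestRegressor(random_state=0),
    param_grid,
    cv=cv,
    scoring="neg_root_mean_squared_error",
)
search.fit(X_train, y_train)

# Out-of-sample forecasts from the best configuration, refit on all
# training months, and the resulting root mean squared error.
rf_pred = search.predict(X_test)
rf_rmse = float(np.sqrt(mean_squared_error(y_test, rf_pred)))

print("Best hyperparameters:", search.best_params_)
print("Out-of-sample RMSE:", round(rf_rmse, 4))
```

The same hold-out window could be used to score benchmark models (for example SARIMA, Ridge, or Lasso) so that all errors are computed over identical months.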