Research on the effectiveness of ensemble machine learning algorithms in financial forecasting tasks

Authors

DOI:

https://doi.org/10.34185/1562-9945-3-164-2026-19

Keywords:

financial forecasting, machine learning, ensemble algorithms, Random Forest, Gradient Boosting, XGBoost, intelligent systems, decision support, financial indicators

Abstract

The relevance of the study is driven by the need to improve the accuracy of forecasting enterprises’ financial indicators under conditions of economic instability, growing data volumes, and increasing complexity of their structure. Traditional statistical methods often prove insufficiently effective due to their limited ability to capture nonlinear relationships and the multifactorial nature of financial processes. In this context, machine learning methods—particularly ensemble algorithms - are becoming increasingly important, as they provide enhanced accuracy and robustness of forecasts.

The research problem lies in the absence of a unified approach to selecting and evaluating the effectiveness of ensemble algorithms in forecasting financial indicators of enterprises, as well as insufficient consideration of the specific characteristics of financial data, including their temporal structure, presence of lagged dependencies, and the influence of external factors.

The aim of the study is to examine the effectiveness of ensemble machine learning algorithms and to identify the most appropriate models for forecasting enterprise financial indicators. The research employs machine learning methods, including Random Forest, Gradient Boosting, AdaBoost, Extra Trees, and XGBoost, as well as a baseline linear regression model for comparative analysis. The experiment is conducted using quarterly financial data and evaluated with MAE, RMSE, MAPE, and R² metrics. The methodology includes data preprocessing, formation of lagged features, and splitting the dataset into training and test sets without violating its temporal structure.

The results show that ensemble algorithms provide higher forecasting accuracy compared with baseline models. The best performance was demonstrated by the XGBoost model, which reduced forecast errors by 18–25% across key metrics. It was established that the use of lagged features and a comprehensive approach to factor formation significantly improves model quality.

The key conclusions indicate that ensemble algorithms are an effective tool for financial forecasting and are suitable for implementation in decision support systems, analytical platforms, and FinTech solutions. The obtained results can be used to enhance the validity of managerial decisions and improve enterprises’ adaptability to dynamic economic environments.

References

Breiman L. Random Forests // Machine Learning. 2001. Vol. 45, No. 1. P. 5–32. DOI: https://doi.org/10.1023/A:1010933404324

Friedman J. H. Greedy Function Approximation: A Gradient Boosting Machine // Annals of Statistics. 2001. Vol. 29, No. 5. P. 1189–1232. DOI: https://doi.org/10.1214/aos/1013203451

Freund Y., Schapire R. E. A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting // Journal of Computer and System Sciences. 1997. Vol. 55, No. 1. P. 119–139. DOI: https://doi.org/10.1006/jcss.1997.1504

Chen T., Guestrin C. XGBoost: A Scalable Tree Boosting System // Proceedings of the 22nd ACM SIGKDD International Conference. 2016. P. 785–794. DOI: https://doi.org/10.1145/2939672.2939785

Geurts P., Ernst D., Wehenkel L. Extremely Randomized Trees // Machine Learning. 2006. Vol. 63. P. 3–42. DOI: https://doi.org/10.1007/s10994-006-6226-1

Ho T. K. The Random Subspace Method for Constructing Decision Forests // IEEE Transactions on Pattern Analysis and Machine Intelligence. 1998. Vol. 20, No. 8. P. 832–844. DOI: https://doi.org/10.1109/34.709601

Scornet E., Biau G., Vert J.-P. Consistency of Random Forests // Annals of Statistics. 2015. Vol. 43, No. 4. P. 1716–1741. DOI: https://doi.org/10.1214/15-AOS1321

Hastie T., Tibshirani R., Friedman J. The Elements of Statistical Learning. New York : Springer, 2009. DOI: https://doi.org/10.1007/978-0-387-84858-7

James G., Witten D., Hastie T., Tibshirani R. An Introduction to Statistical Learning. New York : Springer, 2021. DOI: https://doi.org/10.1007/978-1-0716-1418-1

Géron A. Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow. 3rd ed. O’Reilly, 2022.

Molnar C. Interpretable Machine Learning. 2022. URL: https://christophm.github.io/interpretable-ml-book/

Tsay R. S. Analysis of Financial Time Series. 3rd ed. Wiley, 2010. DOI: https://doi.org/10.1002/9780470644560

Provost F., Fawcett T. Data Science for Business. O’Reilly Media, 2013.

Mnikh Є.V. Yekonomіchnii analіz dіyalnostі pіdpriєmstva: pіdruchnik. Kiїv: Kiїvs-kii natsіonalnii torgovelno-ekonomіchnii unіversitet, 2011. 513 s..

Vіtlіnskii V. V., Velikoіvanenko G. І. Rizikologіya v yekonomіtsі ta pіdpriєmnitstvі. Kiїv: KNEU, 2004.

Kuzmіn O. Є., Melnik O. G. Teoretichnі zasadi upravlіnnya fіnansovimi rezulta-tami pіdpriєmstva // Yekonomіka Ukraїni. 2019. № 3. S. 67–78.

Sitnik N.S., Yasіnovska І.F. Fіnansovii analіz : navch. posіb. Lvіv : LNU іmenі Іvana Franka. 2025. – 300 s. URL: https://lnk.ua/rMyGyZmES.

Published

2026-04-30