改进的NSGA-III-XGBoost算法在股票预测中的应用

doi:10.3778/j.issn.1002-8331.2206-0031

摘要/Abstract

摘要： 为提高股票预测的准确度和减少运行时间，提出了一种改进的非支配排序遗传算法与极致梯度提升树模型相结合（INSGA-III-XGBoost）的股票预测模型。该模型特征工程包括小波分解、扩展特征、数据清洗、归一化。模型采用两种过滤式特征选择的集成信息初始化种群优化NSGA-III算法，以最大化准确度和最小化解的解决方案大小作为优化方向，使用多染色体混合编码的方式同步进行特征选择和优化模型参数。将选择的特征子集和参数输入XGBoost训练预测并迭代优化。实验结果表明，INSGA-III-XGBoost算法与未改进的多目标特征选择算法和单目标特征选择算法相比，平均准确度最高、解方案最小、运行时间最短；与深度学习模型相比，不仅准确度更高、运行用时大幅减少，并且该模型具有可解释性。

关键词: 多目标优化, 特征工程, 特征选择, 股票预测

Abstract: To improve the accuracy of stock forecasting and reduce the running time, a stock forecasting model combining an improved non-dominated sorting genetic algorithm and extreme gradient boosting tree model（INSGA-III-XGBoost） is proposed. The model feature engineering includes wavelet decomposition, extended features, data cleaning, and normalization. The model uses the integrated information of two types of filtered feature selection to initialize the population optimization NSGA-III algorithm, maximize the accuracy and minimize the solution size of the solution as the optimization direction, and use the multi-chromosome hybrid encoding method to simultaneously perform feature selection and optimize model parameters. The selected feature subsets and parameters are input into XGBoost for training and forecasting, and iteratively optimizes according to the evaluation metrics. The experimental results show that compared with the unimproved multi-objective feature selection algorithm and single-objective feature selection algorithm, the INSGA-III-XGBoost algorithm has the highest average accuracy, the smallest solution scheme, and the shortest running time; compared with the deep learning model, it not only has higher accuracy, but the runtime is greatly reduced as well, and the model is interpretable.

Key words: multi-objective optimization, feature engineering, feature selection, stock forecasting

何泳, 李环. 改进的NSGA-III-XGBoost算法在股票预测中的应用[J]. 计算机工程与应用, 2023, 59(18): 293-300.

HE Yong, LI Huan. Application of Improved NSGA-III-XGBoost Algorithm in Stock Forecasting[J]. Computer Engineering and Applications, 2023, 59(18): 293-300.

参考文献

[1] JIANG W.Applications of deep learning in stock market prediction：recent progress[J].Expert Systems with Applications，2021，184：115537.
[2] LAM M.Neural network techniques for financial performance prediction：integrating fundamental and technical analysis[J].Decision Support Systems，2004，37（4）：567-581.
[3] BAO D，YANG Z.Intelligent stock trading system by turning point confirming and probabilistic reasoning[J].Expert Systems with Applications，2008，34（1）：620-627.
[4] SI Y W，YIN J.OBST-based segmentation approach to financial time series[J].Engineering Applications of Artificial Intelligence，2013，26（10）：2581-2596.
[5] DING G，QIN L.Study on the prediction of stock price based on the associated network model of LSTM[J].International Journal of Machine Learning and Cybernetics，2020，11（6）：1307-1317.
[6] YUN K K，YOON S W，WON D.Prediction of stock price direction using a hybrid GA-XGBoost algorithm with a three-stage feature engineering process[J].Expert Systems with Applications，2021，186：115716.
[7] CHOLLET F.Deep learning with Python[M].[S.l.]：Simon and Schuster，2021：98.
[8] MONTAVON G，SAMEK W，MüLLER K R.Methods for interpreting and understanding deep neural networks[J].Digital Signal Processing，2018，73：1-15.
[9] CHUNG H，SHIN K.Genetic algorithm-optimized long short-term memory network for stock market prediction[J].Sustainability，2018，10（10）：3765.
[10] BAEK Y，KIM H Y.ModAugNet：a new forecasting framework for stock market index value with an overfitting prevention LSTM module and a prediction LSTM module[J].Expert Systems with Applications，2018，113：457-480.
[11] LIU H，LONG Z.An improved deep learning model for predicting stock market price time series[J].Digital Signal Processing，2020，102：102741.
[12] AMPOMAH E K，QIN Z，NYAME G.Evaluation of tree-based ensemble machine learning models in predicting stock price direction of movement[J].Information，2020，11（6）：332.
[13] CHEN Y，HAO Y.A feature weighted support vector machine and K-nearest neighbor algorithm for stock market indices prediction[J].Expert Systems with Applications，2017，80：340-355.
[14] NAIK N，MOHAN B R.Stock price movements classification using machine and deep learning techniques-the case study of indian stock market[C]//International Conference on Engineering Applications of Neural Networks.Cham：Springer，2019：445-452.
[15] LI J，CHENG K，WANG S，et al.Feature selection：a data perspective[J].ACM Computing Surveys（CSUR），2017，50（6）：1-45.
[16] KUMAR B S，RAVI V，MIGLANI R.Predicting Indian stock market using the psycho-linguistic features of financial news[J].Annals of Data Science，2021，8（3）：517-558.
[17] XUE Y，XUE B，ZHANG M.Self-adaptive particle swarm optimization for large-scale feature selection in classification[J].ACM Transactions on Knowledge Discovery from Data（TKDD），2019，13（5）：1-27.
[18] Rharish101.Hyperopt[EB/OL].[2021-12-31].https：//github.com/hyperopt.
[19] DEB K，JAIN H.An evolutionary many-objective optimization algorithm using reference-point-based nondominated sorting approach，part I：solving problems with box constraints[J].IEEE Transactions on Evolutionary Computation，2013，18（4）：577-601.
[20] DEB K，PRATAP A，AGARWAL S，et al.A fast and elitist multiobjective genetic algorithm：NSGA-II[J].IEEE Transactions on Evolutionary Computation，2002，6（2）：182-197.
[21] CHEN T，GUESTRIN C.Xgboost：a scalable tree boosting system[C]//Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining，2016：785-794.
[22] NOBRE J，NEVES R F.Combining principal component analysis，discrete wavelet transform and XGBoost to trade in the financial markets[J].Expert Systems with Applications，2019，125：181-194.
[23] BAO W，YUE J，RAO Y.A deep learning framework for financial time series using stacked autoencoders and long-short term memory[J].PloS One，2017，12（7）：e0180944.
[24] JIN Z，YANG Y，LIU Y.Stock closing price prediction based on sentiment analysis and LSTM[J].Neural Computing and Applications，2020，32（13）：9713-9729.
[25] YANG C，ZHAI J，TAO G.Deep learning for price movement prediction using convolutional neural network and long short-term memory[J].Mathematical Problems in Engineering，2020（6）：1-13.
[26] LONG J，CHEN Z，HE W，et al.An integrated framework of deep learning and knowledge graph for prediction of stock price trend：an application in Chinese stock exchange market[J].Applied Soft Computing，2020，91：106205.
[27] QIU M，SONG Y.Predicting the direction of stock market index movement using an optimized artificial neural network model[J].PloS One，2016，11（5）：e0155133.
[28] CHUNG H，SHIN K.Genetic algorithm-optimized multi-channel convolutional neural network for stock market prediction[J].Neural Computing and Applications，2020，32（12）：7897-7914.
[29] SINGH R，SRIVASTAVA S.Stock prediction using deep learning[J].Multimedia Tools and Applications，2017，76（18）：18569-18584.
[30] AMPOMAH E K，QIN Z，NYAME G.Evaluation of tree-based ensemble machine learning models in predicting stock price direction of movement[J].Information，2020，11（6）：332.