Text Classification Model Based on GloVe and GRU

doi:10.3778/j.issn.1002-8331.2001-0272

Abstract

Abstract:

Text classification has a wide range of applications, and the research of its classification algorithm has been concerned. However, traditional text classification algorithms generally have some problems, such as too high dimension of text feature vectorization, not considering the semantic relationship between keywords, too many training parameters, which will affect the performance of classification accuracy and so on. In order to solve these problems, this paper proposes a text classification algorithm which combines word vectorization and GRU. First, it preprocesses the text. Then it extracts features through GloVe to contain as much semantic and grammatical information as possible, while reducing the vector space dimension. Finally, it uses GRU neural network model for training to retain the semantic association between long-distance words in the long text to the greatest extent. The experimental results show that the algorithm can improve the performance of text classification.

Key words: GloVe, Gated Recurrent Unit（GRU）, text classification

摘要：

文本分类有着广泛的应用，对其分类算法的研究也一直备受关注。但是，传统文本分类算法普遍存在文本特征向量化维度过高、没有考虑关键词之间语义关系、训练参数过多等问题，这些都将影响到分类准确率等性能。针对这些问题，提出了一种结合词向量化与GRU的文本分类算法。对文本进行预处理操作；通过GloVe进行词向量化，尽可能多地蕴含文本语义和语法信息，同时降低向量空间维度；再利用GRU神经网络模型进行训练，最大程度保留长文本中长距离词之间的语义关联。实验结果证明，该算法对提高文本分类性能有较明显的作用。

关键词: GloVe, 门控循环单元（GRU）, 文本分类

FANG Jiongkun, CHEN Pinghua, LIAO Wenxiong. Text Classification Model Based on GloVe and GRU[J]. Computer Engineering and Applications, 2020, 56(20): 98-103.

方炯焜，陈平华，廖文雄. 结合GloVe和GRU的文本分类模型[J]. 计算机工程与应用, 2020, 56(20): 98-103.

[1]	WANG Ru, LIU Daming, ZHANG Jian. Wear-YOLO：Research on Detection Methods of Safety Equipment for Power Personnel in Substations [J]. Computer Engineering and Applications, 2024, 60(9): 111-121.
[2]	SONG Jianping, WANG Yi, SUN Kaiwei, LIU Qilie. Short Text Classification Combined with Hyperbolic Graph Attention Networks and Labels [J]. Computer Engineering and Applications, 2024, 60(9): 188-195.
[3]	YANG Wentao, LEI Yuqi, LI Xingyue, ZHENG Tiancheng. Chinese Long Text Classification Model Based on BERT Fused Chinese Input Methods and BLCG [J]. Computer Engineering and Applications, 2024, 60(9): 196-202.
[4]	JIANG Jielin, ZHU Yongwei, XU Xiaolong, CUI Yan, ZHAO Yingnan. Chinese Short Text Classification with Hybrid Features and Multi-Head Attention [J]. Computer Engineering and Applications, 2024, 60(9): 237-243.
[5]	HU Zhiqiang, LI Pengjun, WANG Jinlong, XIONG Xiaoyun. Research on Policy Tools Classification Based on ChatGPT Augmentation and Supervised Contrastive Learning [J]. Computer Engineering and Applications, 2024, 60(7): 292-305.
[6]	CHEN Zhaohong, HONG Zhiyong, YU Wenhua, ZHANG Xin. Extreme Multi-Label Text Classification Based on Balance Function [J]. Computer Engineering and Applications, 2024, 60(4): 163-172.
[7]	WANG Xuyang, GENG Liuqing, ZHANG Xin. Multi-Label Text Classification Based on DistilBERT and Label Correlation [J]. Computer Engineering and Applications, 2024, 60(23): 168-175.
[8]	GUO Ruiqiang, YANG Shilong, JIA Xiaowen, WEI Qianqiang. Fine-Grained Text Classification Based on Label Augmentation [J]. Computer Engineering and Applications, 2024, 60(21): 134-141.
[9]	SU Yilei, LI Weijun, LIU Xueyang, DING Jianping, LIU Shixia, LI Haonan, LI Guanfeng. Review of Text Classification Methods Based on Graph Neural Networks [J]. Computer Engineering and Applications, 2024, 60(19): 1-17.
[10]	LI Jiandong, FU Jia, LI Jiaqi. Multi-Label Text Classification Combining Bidirectional Attention and Contrast Enhancement Mechanism [J]. Computer Engineering and Applications, 2024, 60(16): 105-115.
[11]	YANG Chunxia, HUANG Yukun, YAN Han, WU Yalei. Multi-Label Text Classification Model Integrating GAT and Head-Tail Label [J]. Computer Engineering and Applications, 2024, 60(15): 150-160.
[12]	DONG Xiaohui, GUO Tingfu, ZHU Haijiang, DANG Xiaochao, LI Fenfang. Construction and Application of Fault Knowledge Graph for Mine Hoist [J]. Computer Engineering and Applications, 2024, 60(14): 348-356.
[13]	GU Xunxun, LIU Jianping, XING Jialu, REN Haiyu. Text Classification：Comprehensive Review of Prompt Learning Methods [J]. Computer Engineering and Applications, 2024, 60(11): 50-61.
[14]	CAO Yukun, WEI Ziyue, TANG Yijia, JIN Chengkun, LI Yunfeng. Hierarchical Label Text Classification Method with Deep Label Assisted Classification Task [J]. Computer Engineering and Applications, 2024, 60(10): 105-112.
[15]	XIN Miaomiao, MA Li, HU Bofa. Research on Text Classification by Fusing Multi-Granularity Information [J]. Computer Engineering and Applications, 2023, 59(9): 104-111.

Text Classification Model Based on GloVe and GRU

结合GloVe和GRU的文本分类模型

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics