
Computer Engineering and Applications ›› 2024, Vol. 60 ›› Issue (3): 228-236. DOI: 10.3778/j.issn.1002-8331.2209-0043
• Graphics and Image Processing •
ZHOU Yan, LIAO Junwei, LIU Xiangyu, ZHOU Yuexia, ZENG Fanzhi
Online: 2024-02-01
Published: 2024-02-01
周燕,廖俊玮,刘翔宇,周月霞,曾凡智
ZHOU Yan, LIAO Junwei, LIU Xiangyu, ZHOU Yuexia, ZENG Fanzhi. Improved FCENet Algorithm for Natural Scene Text Detection[J]. Computer Engineering and Applications, 2024, 60(3): 228-236.
周燕, 廖俊玮, 刘翔宇, 周月霞, 曾凡智. 改进FCENet的自然场景文本检测算法[J]. 计算机工程与应用, 2024, 60(3): 228-236.
URL: http://cea.ceaj.org/EN/10.3778/j.issn.1002-8331.2209-0043
| [1] | SONG Yu, WANG Banghai, CAO Ganggang. Cross-Modality Person Re-identification Combined with Data Augmentation and Feature Fusion [J]. Computer Engineering and Applications, 2024, 60(4): 133-141. |
| [2] | ZHANG Duona, ZHAO Hongjia, LU Yuanyao, CUI Jian, ZHANG Baochang. Few-Shot Scene Classification with Attention Mechanism in Remote Sensing [J]. Computer Engineering and Applications, 2024, 60(4): 173-182. |
| [3] | LI Qing, LI Haitao, LI Hui, ZHANG Junhu. Photovoltaic Panel Segmentation Using Attention Mechanism and Global Convolution [J]. Computer Engineering and Applications, 2024, 60(4): 237-248. |
| [4] | GUAN Wenqing, ZHOU Shibin, ZHANG Guopeng. Aerial Image Object Detection with Feature Enhancement Using Hybrid Attention [J]. Computer Engineering and Applications, 2024, 60(4): 249-257. |
| [5] | CHEN Lifang, LUO Shiyong. Multi-Scale Liver Tumor Segmentation Algorithm by Fusing Convolution and Transformer [J]. Computer Engineering and Applications, 2024, 60(4): 270-279. |
| [6] | LI Xun, GAN Rundong, QIAN Junfeng, ZHANG Shiheng, ZHAO Wenbin, WANG Daolei. Improved YOLOv5 Mixed Sample Training for Detection of Insulator Umbrella Plate Falling Defects [J]. Computer Engineering and Applications, 2024, 60(4): 289-297. |
| [7] | LIU Bingkun, PI Jiatian, XU Jin. End-to-End Robotic Arm Vision Servo Research Combined with Bottleneck Attention Mechanism [J]. Computer Engineering and Applications, 2024, 60(4): 347-354. |
| [8] | ZHU Kai, LI Li, ZHANG Tong, JIANG Sheng, BIE Yiming. Survey of Vision Transformer in Low-Level Computer Vision [J]. Computer Engineering and Applications, 2024, 60(4): 39-56. |
| [9] | ZHANG Peng, XIE Li, YANG Hailin. Lightweight Network ICA-Res2Net for Cervical Cell Classification [J]. Computer Engineering and Applications, 2024, 60(3): 187-195. |
| [10] | TAN Guangpu, ZHU Guangli, WEI Siyu. Implicit Sentiment Classification Model Based on Enhancement of Sentiment Features Oriented to Chinese Text [J]. Computer Engineering and Applications, 2024, 60(3): 196-204. |
| [11] | JIN Haibo, MA Linlin, TIAN Guiyuan. Single Image Defogging Method Under Adaptive Transformer Network [J]. Computer Engineering and Applications, 2024, 60(3): 237-245. |
| [12] | WU Zeju, SONG Lijun, JI Yang. Tire X-Ray Image Defect Detection Based on Improved Feature Pyramid Network [J]. Computer Engineering and Applications, 2024, 60(3): 270-279. |
| [13] | TIAN Hao, ZHOU Qiang, HE Chenlong. Defect Detection of Photovoltaic Modules Based on Multi-Scale Feature Fusion [J]. Computer Engineering and Applications, 2024, 60(3): 340-347. |
| [14] | XIN Shi’ao, GE Haibo, YUAN Hao, YANG Yudi, YAO Yang. Improved Lightweight Underwater Target Detection Algorithm of YOLOv7 [J]. Computer Engineering and Applications, 2024, 60(3): 88-99. |
| [15] | NIU Xinyu, MAO Pengjun, DUAN Yuntao, LOU Xiaoheng. Research on Lightweight Improved Algorithm for Indoor Target Detection Based on YOLOv5s [J]. Computer Engineering and Applications, 2024, 60(3): 109-118. |