
Computer Engineering and Applications ›› 2025, Vol. 61 ›› Issue (5): 250-260.DOI: 10.3778/j.issn.1002-8331.2310-0243
• Graphics and Image Processing • Previous Articles Next Articles
LIAN Zhe, YIN Yanjun, MI Zeng, ZHI Min, XU Qiaozhi
Online:2025-03-01
Published:2025-03-01
连哲,殷雁君,米增,智敏,徐巧枝
LIAN Zhe, YIN Yanjun, MI Zeng, ZHI Min, XU Qiaozhi. Asymmetric Iterative Refinement Prediction Network for Scene Text Detection[J]. Computer Engineering and Applications, 2025, 61(5): 250-260.
连哲, 殷雁君, 米增, 智敏, 徐巧枝. 用于场景文本检测的非对称迭代细化预测网络[J]. 计算机工程与应用, 2025, 61(5): 250-260.
Add to citation manager EndNote|Ris|BibTeX
URL: http://cea.ceaj.org/EN/10.3778/j.issn.1002-8331.2310-0243
| [1] 连哲, 殷雁君, 云飞, 等. 基于深度学习的自然场景文本检测综述[J]. 计算机工程, 2024, 50(3): 16-27. LIAN Z, YIN Y J, YUN F, et al. Review of natural scene text detection based on deep learning[J]. Computer Engineering, 2024, 50(3): 16-27. [2] WANG J, CHEN Y, DONG Z, et al. Improved YOLOv5 network for real-time multi-scale traffic sign detection[J]. Neural Computing and Applications, 2022, 35: 7853-7865. [3] HONG T, KIM D, JI M, et al. BROS: a pre-trained language model focusing on text and layout for better key information extraction from documents[C]//Proceedings of the AAAI Conference on Artificial Intelligence, 2022: 10767-10775. [4] BANU J F, MUNEESHWARI P, RAJA K, et al. Ontology based image retrieval by utilizing model annotations and content[C]//Proceedings of the 2022 12th International Conference on Cloud Computing, Data Science & Engineering, 2022: 300-305. [5] NAIEMI F, GHODS V, KHALESI H. Scene text detection and recognition: a survey[J]. Multimedia Tools and Applications, 2022, 81(14): 20255-20290. [6] 刘平, 姜永峰, 张良. 基于高阶图卷积推理网络的任意形状文本检测[J]. 计算机工程与应用, 2024, 60(1): 263-270. LIU P, JIANG Y F, ZHANG L. Arbitrary shape text detection based on high-order graph convolution reaso-ning network[J]. Computer Engineering and Applications, 2024, 60(1): 263-270. [7] RAISI Z, NAIEL M A, YOUNES G, et al. Transformer-based text detection in the wild[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021: 3162-3171. [8] HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 770-778. [9] YANY X, YANG J, YAN J, et al. SCRDet: towards more robust detection for small, cluttered and rotated objects[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019: 8232-8241. [10] ZHANG H, WU C, ZHANG Z, et al. ResNeSt: split-attention networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022: 2736-2746. [11] LIU Z, MAO H, WU C Y, et al. A convNet for the 2020s[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022: 11976-11986. [12] TAN M, PANG R, LE Q V. EfficientDet: scalable and efficient object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020: 10781-10790. [13] QIAO S, CHEN L C, YUILLE A. DetectoRS: detecting objects with recursive feature pyramid and switchable atrous convolution[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021: 10213-10224. [14] WANG W, XIE E, LI X, et al. Shape robust text detection with progressive scale expansion network[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019: 9336-9345. [15] LIAO M, WAN Z, YAO C, et al. Real-time scene text detection with differentiable binarization[C]//Proceedings of the AAAI Conference on Artificial Intelligence, 2020: 11474-11481. [16] LIN T Y, DOLLáR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017: 2117-2125. [17] LIU B, JIN J. Text detection based on bidirectional feature fusion and sa attention mechanism[C]//Proceedings of the 2022 IEEE Asia-Pacific Conference on Image Processing, Electronics and Computers, 2022: 912-915. [18] ZHANG Q L, YANG Y B. SA-Net: Shuffle attention for deep convolutional neural networks[C]//Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021: 2235-2239. [19] GU S, ZHANG F. Applicable scene text detection based on semantic segmentation[J]. Journal of Physics: Conference Series, 2020: 12080. [20] DING X, GUO Y, DING G, et al. ACNet: strengthening the kernel skeletons for powerful cnn via asymmetric convolution blocks[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019: 1911-1920. [21] ZHANG X, DU B, WU Z, et al. LAANet: lightweight attention-guided asymmetric network for real-time semantic segmentation[J]. Neural Computing and Applications, 2022, 34(5): 3573-3587. [22] LI X, MA X. Image semantic space segmentation based on cascaded feature fusion and asymmetric convolution module[J]. Wireless Communications and Mobile Computing, 2022, 4: 1-9. [23] RUDER S. An overview of gradient descent optimization algorithms[J]. arXiv:1609.04747, 2016. [24] LIN W, ZHANG Z, XUE X. An agile and efficient neural network based on knowledge distillation for scene text detection[J]. Wireless Communications and Mobile Computing, 2022: 1-9. [25] ZHANG C, LIANG B, HUANG Z, et al. Look more than once: an accurate detector for text of arbitrary shapes[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019: 10552-10561. [26] ZHU X, HU H, LIN S, et al. Deformable convNets v2: more deformable, better results[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019: 9308-9316. [27] CHEN L C, PAPANDREOU G, KOKKINOS I, et al. DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 40(4): 834-848. [28] GUPTA A, VEDALDI A, ZISSERMAN A. Synthetic data for text localisation in natural images[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 2315-2324. [29] KARATZAS D, GOMEZ-BIGORDA L, NICOLAOU A, et al. ICDAR 2015 competition on robust reading[C]//Proceedings of the 2015 13th International Conference on Document Analysis and Recognition, 2015: 1156-1160. [30] YAO C, BAI X, LIU W, et al. Detecting texts of arbitrary orientations in natural images[C]//Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012: 1083-1090. [31] LIU Y, JIN L, ZHANG S, et al. Curved scene text detection via transverse and longitudinal sequence connection[J]. Pattern Recognition, 2019, 90: 337-345. [32] KINGMA D P, BA J. Adam: a method for stochastic optimization[J]. arXiv:1412.6980, 2014. [33] BAEK Y, LEE B, HAN D, et al. Character region awareness for text detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019: 9365-9374. [34] WANG W, XIE E, SONG X, et al. Efficient and accurate arbitrary-shaped text detection with pixel aggregation network[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019: 8440-8449. [35] WANG W, XIE E, LI X, et al. PAN++: towards efficient and accurate end-to-end spotting of arbitrarily-shaped text[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 44(9): 5349-5367. [36] LIAO M, ZOU Z, WAN Z, et al. Real-time scene text detection with differentiable binarization and adaptive scale fusion[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 45(1): 919-931. [37] CHEN Z, WANG J, WANG W, et al. FAST: faster arbitrarily-shaped text detector with minimalist kernel representation[J]. arXiv:2111.02394, 2021. [38] LIN J, YAN Y, WANG H. A dual-path transformer network for scene text detection[C]//Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing , 2023: 1-5. [39] WANG L, YAO X, SONG C. Text detection method based on HDBNet in natural scenes[J]. The Journal of Engineering, 2023, 2023(1): 12212. |
| [1] | DENG Shengjun, CHEN Niannian. Text Kernel Reconstruction and Expansion for Arbitrary Shape Text Detection [J]. Computer Engineering and Applications, 2024, 60(9): 228-236. |
| [2] | ZHOU Yan, LIAO Junwei, LIU Xiangyu, ZHOU Yuexia, ZENG Fanzhi. Improved FCENet Algorithm for Natural Scene Text Detection [J]. Computer Engineering and Applications, 2024, 60(3): 228-236. |
| [3] | WEI Zheliang, LI Yueyang, LUO Haichi. Scene Text Detection Based on Multi-Scale Pooling and Bidirectional Feature Fusion [J]. Computer Engineering and Applications, 2024, 60(2): 154-161. |
| [4] | LIU Ping, JIANG Yongfeng, ZHANG Liang. Arbitrary Shape Text Detection Based on High-Order Graph Convolution Reasoning Network [J]. Computer Engineering and Applications, 2024, 60(1): 263-270. |
| [5] | SHE Xiangyang, LIU Zhe, DONG Lihong. Text Detection Algorithm Based on Multi-Scale Attention Feature Fusion [J]. Computer Engineering and Applications, 2024, 60(1): 198-206. |
| [6] | MA Jing, XUE Hao, GUO Xiaoyu. TextRail:Irregular Text Detection Algorithm in Complicated Natural Scenarios [J]. Computer Engineering and Applications, 2023, 59(21): 112-122. |
| [7] | LIU Yanju, YI Xinhai, LI Yange, ZHANG Huiyu, LIU Yanzhong. Application of Scene Text Recognition Technology Based on Deep Learning:A Survey [J]. Computer Engineering and Applications, 2022, 58(4): 52-63. |
| [8] | SHI Yihan, TONG Minglei, ZHANG Kui, YAO Hongyang. Scene Text Detection Model Based on Double Tower Structure [J]. Computer Engineering and Applications, 2022, 58(3): 242-248. |
| [9] | ZHAO Jianghe, WANG Hairui, WU Lei. FPN-Centernet Helmet Wearing Detection Algorithm [J]. Computer Engineering and Applications, 2022, 58(14): 114-120. |
| [10] | YANG Youwei, ZHOU Gang. Improved NMS Algorithm for Text Detection in Natural Scenes [J]. Computer Engineering and Applications, 2022, 58(1): 204-208. |
| [11] | LI Yihong, CHEN Yuanyu. Review on Deep Learning Based Scene Text Detection [J]. Computer Engineering and Applications, 2021, 57(6): 42-48. |
| [12] | ZHAO Fan, ZHANG Lin, WEN Zhiquan, YANG Linlin, LIN Guangfeng. Direct and Efficient Natural Scene Chinese Character Approaching Spotting Method [J]. Computer Engineering and Applications, 2021, 57(6): 159-167. |
| [13] | ZHANG Cuijun, AN Ran, MA Li. Building Change Detection in Remote Sensing Image Based on Improved U-Net [J]. Computer Engineering and Applications, 2021, 57(3): 239-246. |
| [14] | YANG Siqi, YI Yaohua, TANG Ziwei, WANG Xinyu. Text Detection in Natural Scenes Embedded Attention Mechanism [J]. Computer Engineering and Applications, 2021, 57(24): 185-191. |
| [15] | FANG Chengzhi, HUO Xinglong, CHENG Youcheng. Natural Scene Text Detection Combined with Bounding Box Calibration [J]. Computer Engineering and Applications, 2021, 57(1): 161-167. |
| Viewed | ||||||
|
Full text |
|
|||||
|
Abstract |
|
|||||