Underwater Image Enhancement Based on Parallel Guidance of Transformer and CNN

doi:10.3778/j.issn.1002-8331.2302-0036

Abstract

Abstract: To overcome the problems of low contrast and color deviation in underwater images, a parallel guided underwater image enhancement algorithm based on Transformer and convolutional neural networks (CNN) is proposed. Using a 3D position embedding model to provide Transformer with relative position information, color deviation information, and global features of feature maps, using a CNN encoder to extract local features of the image, integrating the global features extracted by Transformer and the local features extracted by CNN through a feature modulation matrix, improving the resolution of the image through a CNN decoder, and inputting the feature maps output by the decoder into a feature enhancement network, enhance the network output with features to obtain the final result. Using the existing EUVP paired dataset for training, to verify the superiority of the algorithm, underwater images with varying degrees of color deviation are selected for qualitative and quantitative experiments. The results show that the enhanced underwater image peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) are higher than other comparison algorithms, and the subjective quality is significantly improved, the proposed algorithm can generate enhanced images with rich colors and high clarity.

Key words: underwater image enhancement, Transformer, convolutional neural networks (CNN), 3D position embedding model, characteristic modulation matrix

摘要： 为克服水下图像对比度低和色偏的问题，提出了基于Transformer与CNN并行引导的水下图像增强算法。利用3D位置嵌入模型为Transformer提供相对位置信息、色偏信息和特征图的全局特征，利用CNN编码器提取图像局部特征，将Transformer提取的全局特征和CNN提取的局部特征通过特征调制矩阵整合在一起，通过CNN解码器提高图像的分辨率，将解码器输出的特征图输入到特征加强网络中，由特征加强网络输出最终结果。采用现有的EUVP配对数据集进行训练，为验证该算法的优越性，选取具有不同程度色偏的水下图像进行定性比较和定量实验，结果显示，该算法增强后的水下图像峰值信噪比指标（peak signal-to-noise ratio,PSNR）和结构相似性指标（structural similarity index measure，SSIM）均高于其他对比算法，主观质量也得到显著提高，能够产生颜色丰富且清晰度较高的增强图像。

关键词: 水下图像增强, Transformer, 卷积神经网络（CNN）, 3D位置嵌入模型, 特征调制矩阵

CHANG Jian, CHEN Hongfu, WANG Bingbing. Underwater Image Enhancement Based on Parallel Guidance of Transformer and CNN[J]. Computer Engineering and Applications, 2024, 60(4): 280-288.

常戬, 陈洪福, 王冰冰. Transformer与CNN并行引导的水下图像增强[J]. 计算机工程与应用, 2024, 60(4): 280-288.

References

[1] PIZER S M, JOHNSTON R E, ERICKSEN J P, et al. Contrast-limited adaptive histogram equalization: speed and effectiveness[C]//Proceedings of the 1st Conference on Visualization in Biomedical Computing, Atlanta, 1990: 337-345.
[2] HE K M, SUN J, TANG X O. Single image haze removal using dark channel prior[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011, 33(12): 2341-2353.
[3] DREWS P L J, NASCIMENTO E R, CAMPOS M F M. Underwater depth estimation and image restoration based on single images[J]. IEEE Computer Graphics and Applications, 2016, 36(2): 24-35.
[4] SONG W, WANG Y, HUANG D, et al. A rapid scene depth estimation model based on underwater light attenuation prior for underwater image restoration[C]//LNCS 11164: Proceedings of the 19th Pacific-Rim Conference on Multimedia, Advances in Multimedia Information Processing, Hefei, Sep 21-22, 2018: 678-688.
[5] LI C Y, ANWAR S, PORIKLI F. Underwater scene prior inspired deep underwater image and video enhancement[J]. Pattern Recognition, 2020, 98: 107038.
[6] ISLAM M J, XIA Y Y, SATTAR J. Fast underwater image enhancement for improved visual perception[J]. IEEE Robotics and Automation Letters, 2020, 5(2): 3227-3234.
[7] WANG N, ZHOU Y B, HAN F L, et al. UWGAN: underwater GAN for real-world underwater color restoration and dehazing[J]. arXiv:1912.10269, 2019.
[8] WU S C, LUO T, JIANG G Y, et al. A two stage underwater enhancement network based on structure decomposition and characteristics of underwater imaging[J]. IEEE Journal of Oceanic Engineering, 2021, 46(4): 1213-1227.
[9] LI C Y, ANWAR S, HOU J H, et al. Underwater image enhancement via medium transmission-guided multi-color space embedding[J]. IEEE Transactions on Image Processing, 2021, 30: 4985-5000.
[10] JIANG Z Y, FAN X, LIU R, et al. Target oriented perceptual adversarial fusion network for underwater image enhancement[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32(10): 6584-6598.
[11] ZHANG W D, WANG Y D, LI C Y.Underwater image enhancement by attenuated color channel correction and detail preserved contrast enhancement[J]. IEEE Journal of Oceanic Engineering, 2022, 47(3): 718-735.
[12] MA Z Y, OH C. A wavelet-based dual-stream network for underwater image enhancement[C]//Proceedings of the 2022 IEEE International Conference on Acoustics, Speech and Signal Processing, 2022: 2769-2773.
[13] PENG L T, ZHU C L, BIAN L H. U-shape transformer for underwater image enhancement[J]. IEEE Transactions on Image Processing, 2023, 32: 3066-3079.
[14] ULYANOV D, VEDALDI A, LEMPITSKY V. Instance normalization: the missing ingredient for fast stylization[J]. arXiv:1607.08022, 2016.
[15] ZAMIR S W, ARORA A, KHAN S, et al. Learning enriched features for real image restoration and enhancement[C]//Proceedings of the 16th European Conference on Computer Vision, Glasgow, Aug 23-28, 2020: 492-511.