Computer Engineering and Applications ›› 2021, Vol. 57 ›› Issue (14): 176-180. DOI: 10.3778/j.issn.1002-8331.2005-0050


Text-Image Generative Adversarial Model Fusing Capsule Networks

HUANG Xiaoqi, WANG Li, LI Gang   

  1. College of Data Science, Taiyuan University of Technology, Jinzhong, Shanxi 030600, China
    2. College of Software, Taiyuan University of Technology, Jinzhong, Shanxi 030600, China
  • Online: 2021-07-15    Published: 2021-07-14


Abstract:

In traditional text-to-image adversarial models, the convolutional network in the discriminator is used to extract image features, but a convolutional network cannot capture the spatial relationships among low-level objects, which degrades the quality of the generated images; the capsule network is an effective remedy. This paper improves the traditional text-conditioned generative adversarial model with a capsule-network-based approach: the convolutional network in the discriminator is replaced by a capsule network to enhance its robustness to image size. Experimental results on the Oxford-102 and CUB datasets show that the new model effectively improves generation quality. The FID of generated flower images is reduced by 14.49%, and the FID of generated bird images is reduced by 7.18%. The Inception Scores of images generated on the Oxford-102 and CUB datasets increase by 22.60% and 26.28%, respectively, showing that the features of images generated by the improved model are richer and more meaningful.
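
The architectural change described above (replacing the discriminator's convolutional feature extractor with a capsule network) can be illustrated with a minimal sketch. The PyTorch code below is not the authors' implementation; the layer sizes, capsule dimensions, text-embedding width, and routing iteration count are illustrative assumptions. It only shows the general pattern: a convolutional stem feeding primary capsules, dynamic routing-by-agreement, and a text-conditioned real/fake score.

# Hypothetical sketch of a capsule-network discriminator for a text-conditioned GAN.
# Not the paper's code; all sizes and hyperparameters are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

def squash(s, dim=-1, eps=1e-8):
    # Capsule squashing non-linearity: keeps vector orientation, bounds length to (0, 1).
    norm_sq = (s ** 2).sum(dim=dim, keepdim=True)
    return (norm_sq / (1.0 + norm_sq)) * s / torch.sqrt(norm_sq + eps)

class CapsuleLayer(nn.Module):
    # Fully connected capsule layer with dynamic routing-by-agreement.
    def __init__(self, in_caps, in_dim, out_caps, out_dim, routing_iters=3):
        super().__init__()
        self.routing_iters = routing_iters
        # One transformation matrix per (input capsule, output capsule) pair.
        self.W = nn.Parameter(0.01 * torch.randn(1, in_caps, out_caps, out_dim, in_dim))

    def forward(self, u):                      # u: (B, in_caps, in_dim)
        u = u.unsqueeze(2).unsqueeze(-1)       # (B, in_caps, 1, in_dim, 1)
        u_hat = (self.W @ u).squeeze(-1)       # predictions: (B, in_caps, out_caps, out_dim)
        b = torch.zeros(*u_hat.shape[:3], 1, device=u.device)  # routing logits
        for _ in range(self.routing_iters):
            c = F.softmax(b, dim=2)                            # coupling coefficients
            v = squash((c * u_hat).sum(dim=1, keepdim=True))   # (B, 1, out_caps, out_dim)
            b = b + (u_hat * v).sum(dim=-1, keepdim=True)      # agreement update
        return v.squeeze(1)                                    # (B, out_caps, out_dim)

class CapsuleDiscriminator(nn.Module):
    # Conv stem -> primary capsules -> routed capsules -> text-conditioned real/fake logit.
    def __init__(self, text_dim=128, img_channels=3):
        super().__init__()
        self.conv = nn.Sequential(                 # 64x64 image -> 8x8 feature map
            nn.Conv2d(img_channels, 64, 4, 2, 1), nn.LeakyReLU(0.2),
            nn.Conv2d(64, 128, 4, 2, 1), nn.LeakyReLU(0.2),
            nn.Conv2d(128, 256, 4, 2, 1), nn.LeakyReLU(0.2),
        )
        self.primary = nn.Conv2d(256, 32 * 8, kernel_size=3, stride=2, padding=1)  # 32 capsule maps of dim 8
        self.caps = CapsuleLayer(in_caps=32 * 4 * 4, in_dim=8, out_caps=16, out_dim=16)
        self.score = nn.Linear(16 * 16 + text_dim, 1)

    def forward(self, image, text_embedding):
        h = self.conv(image)
        p = self.primary(h)                                    # (B, 256, 4, 4)
        p = p.view(p.size(0), 32, 8, 4 * 4).permute(0, 1, 3, 2).reshape(p.size(0), -1, 8)
        p = squash(p)                                          # primary capsule vectors
        v = self.caps(p).flatten(1)                            # routed capsule features
        return self.score(torch.cat([v, text_embedding], dim=1))  # raw logit

# Usage: score a batch of 64x64 images against their sentence embeddings.
d = CapsuleDiscriminator()
logits = d(torch.randn(4, 3, 64, 64), torch.randn(4, 128))
print(logits.shape)  # torch.Size([4, 1])

In this sketch the discriminator returns a raw logit intended for a standard GAN objective (for example BCEWithLogitsLoss); the routed capsule vectors take the place of the flat convolutional feature vector that a conventional discriminator would feed to its final classifier.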

Key words: image generation, capsule network, generative adversarial network, convolutional network, robustness
