Computer Engineering and Applications (计算机工程与应用) ›› 2020, Vol. 56 ›› Issue (7): 221-227. DOI: 10.3778/j.issn.1002-8331.1901-0087

• Graphics and Image Processing •

Wild Animal Video Object Detection Method Combining Multi-feature Map

CHEN Jiancu, WANG Yue, ZHU Xiaofei, LI Zhangyu, LIN Zhihang

  1. School of Computer Science and Engineering, Chongqing University of Technology, Chongqing 400054, China
  • Online: 2020-04-01  Published: 2020-03-28

Abstract:

Aiming at a shortcoming of YOLOv3 in wild animal video object detection, namely that the relationship between the same region in consecutive video frames is difficult to describe, a Context-aware YOLO model is proposed. The model uses mutual information entropy to quantify the image similarity of adjacent frames, fits a correlation factor for frame fusion from the quantified result, and uses this factor to linearly and iteratively fuse the feature maps of consecutive frames. A histogram-equalization-based similarity measure is introduced to detect "shot switching" and thereby determine the critical condition for feature-map fusion. Experimental results show that, compared with YOLOv3, the Context-aware YOLO model improves the F1 score by 2.4% and the mean average precision (mAP) by 4.71%.
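The two core operations described in the abstract, quantifying adjacent-frame similarity with mutual information entropy and linearly fusing consecutive feature maps with a correlation factor, can be sketched as follows. This is a minimal illustration, not the paper's implementation: the joint-histogram MI estimator, the bin count, and the simple weighted-average fusion (with a hypothetical weight `alpha` standing in for the fitted correlation factor, whose fitting procedure the abstract does not specify) are all assumptions.

```python
import numpy as np

def mutual_information(img_a, img_b, bins=32):
    """Estimate the mutual information (in bits) between two 8-bit
    grayscale frames from their joint gray-level histogram."""
    hist_2d, _, _ = np.histogram2d(img_a.ravel(), img_b.ravel(),
                                   bins=bins, range=[[0, 256], [0, 256]])
    pxy = hist_2d / hist_2d.sum()          # joint distribution p(x, y)
    px = pxy.sum(axis=1, keepdims=True)    # marginal p(x)
    py = pxy.sum(axis=0, keepdims=True)    # marginal p(y)
    nz = pxy > 0                           # avoid log(0) on empty cells
    return float((pxy[nz] * np.log2(pxy[nz] / (px @ py)[nz])).sum())

def fuse_feature_maps(prev_fmap, cur_fmap, alpha):
    """Linearly fuse the previous frame's feature map into the current
    one; alpha plays the role of the correlation factor derived from
    the frame-similarity measure."""
    return alpha * prev_fmap + (1.0 - alpha) * cur_fmap
```

Applied frame by frame, the fused map can be carried forward so that fusion is iterative: the output for frame t becomes `prev_fmap` for frame t+1. A higher mutual information between adjacent frames would justify a larger `alpha`; the exact MI-to-alpha mapping is the fitted relationship the paper describes and is not reproduced here.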

Key words: YOLOv3 model, video object detection, mutual information entropy, linear iteration, histogram equalization
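The abstract's "shot switching" criterion, used as the cutoff condition for feature-map fusion, can be sketched with a histogram-similarity test between consecutive frames. This sketch substitutes histogram intersection for the paper's histogram-equalization-based similarity (the abstract does not detail that computation), and the threshold value is illustrative, not taken from the paper.

```python
import numpy as np

def gray_histogram(img, bins=64):
    """Normalized gray-level histogram of an 8-bit grayscale frame."""
    hist, _ = np.histogram(img.ravel(), bins=bins, range=(0, 256))
    return hist / max(hist.sum(), 1)

def hist_similarity(img_a, img_b, bins=64):
    """Histogram-intersection similarity in [0, 1]; 1.0 means the two
    frames have identical gray-level distributions."""
    return float(np.minimum(gray_histogram(img_a, bins),
                            gray_histogram(img_b, bins)).sum())

def is_shot_change(prev_frame, cur_frame, threshold=0.6):
    """Flag a shot switch when similarity drops below the threshold;
    when True, the feature maps of the two frames would NOT be fused."""
    return hist_similarity(prev_frame, cur_frame) < threshold
```

In a detection loop, `is_shot_change` would gate the fusion step: across a detected cut, the previous frame's feature map is discarded rather than propagated, which is the critical condition the abstract refers to.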