计算机工程与应用 ›› 2026, Vol. 62 ›› Issue (6): 122-133.DOI: 10.3778/j.issn.1002-8331.2507-0218

• 模式识别与人工智能 • 上一篇    下一篇

YOLO-PD:轻量级实时行人检测算法

陈胜宝,施隆照+   

  1. 福州大学 物理与信息工程学院,福州 350000
    + 通信作者 E-mail:slz@fzu.edu.cn
  • 收稿日期:2025-07-16 修回日期:2025-09-30 在线发布日期:2026-03-15 出版日期:2026-03-15
  • 基金资助:
    福建省自然科学基金(2022J02015)。

YOLO-PD:Lightweight Real-Time Pedestrian Detection Algorithm

CHEN Shengbao, SHI Longzhao+   

  1. College of Physics and Information Engineering, Fuzhou University, Fuzhou 350000, China
    + Corresponding author E-mail:slz@fzu.edu.cn
  • Received:2025-07-16 Revised:2025-09-30 Online:2026-03-15 Published:2026-03-15

摘要: 针对边缘端设备资源受限和行人检测算法在小目标物体、多尺度物体和重叠遮挡造成的几何形变物体上检测精度低的问题,提出了一种适合部署在边缘端设备的基于YOLOv8的行人检测算法YOLO-PD。针对小目标物体,设计局部Transformer块(partial Transformer block,PTB),增强模型的特征提取能力,PTB的混合结构在保持高效特征提取的同时还能降低计算成本;针对多尺度物体,设计金字塔共享空洞卷积(pyramid shared dilation convolution,PSDC),利用共享权重参数的多尺度空洞卷积来提高模型多尺度特征提取能力的同时减小模块体积;针对几何形变物体,设计轻量级可变形动态检测头(light deformable dynamic head,LDDH),在检测头中通过动态调整加权因子来提升检测精度、通过可变形卷积来更精准地获取形变物体的特征。实验结果表明,与基线模型YOLOv8n相比,YOLO-PD在自建的行人检测数据集COCO-Person和VOC-Person上的mAP50分别提高了2.9和1.9个百分点,参数量减少了34.3%;在公开的行人检测数据集WidePerson上,mAP50和mAP50:95分别提高了1.8和1.1个百分点。在行人检测任务上该算法检测精度高、泛化能力强、极小的参数量让它适合部署在边缘端设备。

关键词: YOLOv8, 行人检测, WidePerson数据集

Abstract: To address the limited resources of edge devices and the low detection accuracy of pedestrian detection algorithms on small objects, multi-scale objects, and geometrically deformed objects caused by overlapping occlusions, YOLO-PD is proposed which is a YOLOv8-based pedestrian detection algorithm suitable for edge deployment. For small objects, a partial Transformer block (PTB) is designed to enhance feature extraction. Hybrid structure of PTB maintains efficient feature extraction while reducing computational costs. For multi-scale objects, a pyramid shared dilation convolution (PSDC) is introduced, leveraging shared-weight multi-scale dilation convolutions to improve multi-scale feature extraction while minimizing module size. For geometrically deformed objects, a light deformable dynamic head (LDDH) is developed. It dynamically adjusts weighting factors to improve detection accuracy and employs deformable convolutions to better capture features of deformed objects. Experimental results show that compared to the baseline model YOLOv8n, YOLO-PD achieves 2.9 and 1.9 percentage points higher mAP50 on the custom pedestrian detection datasets COCO-Person and VOC-Person, respectively, while reducing parameters by 34.3%. On the public dataset WidePerson, it improves mAP50 by 1.8 percentage points and mAP50:95 by 1.1 percentage points. The algorithm excels in pedestrian detection with high accuracy, strong generalization, and minimal parameters, making it ideal for edge devices.

Key words: YOLOv8, pedestrian detection, WidePerson dataset