Multi-Patch multi-frame incremental traffic video object detection method based on YOLO v4

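Cite this article:	WEN Nu, GUO Renzhong, HE Biao, WAN Yuan. Multi-Patch multi-frame incremental traffic video object detection method based on YOLO v4[J]. Bulletin of Surveying and Mapping, 2022, 0(5): 38-44.

Authors:	WEN Nu, GUO Renzhong, HE Biao, WAN Yuan

Affiliations:	1. School of Architecture & Urban Planning, Shenzhen University, Shenzhen 518061, Guangdong, China; 2. Research Institute for Smart Cities, Shenzhen University, Shenzhen 518061, Guangdong, China; 3. Guangdong-Hong Kong-Macau Joint Laboratory for Smart Cities, Shenzhen 518061, Guangdong, China; 4. Key Laboratory of Urban Land Resources Monitoring and Simulation, Shenzhen 518034, Guangdong, China; 5. College of Urban and Environmental Sciences, Hubei Normal University, Huangshi 435002, Hubei, China

Funding:	Open Fund of the Key Laboratory of Urban Land Resources Monitoring and Simulation, Ministry of Natural Resources; Guangdong Province Special Project for Science and Technology Innovation Strategy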

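Abstract:	Improving the generalization ability of object detection models is both a research hotspot and a key difficulty in computer vision. This paper proposes a Multi-Patch method and a multi-frame incremental prediction strategy that improve the robustness of traffic video object detection across scenes and effectively address the low recall caused by highly variable object scales. Based on the video resolution and the object size, the Multi-Patch method automatically splits each video frame into patches of the optimal input size. The YOLO v4 neural network is then applied, the context information of consecutive frames is associated, and an incremental prediction strategy is used to lower the miss rate of video object detection and to raise the detection confidence scores and recall across scenes. Traffic videos captured under different shooting conditions were collected to verify the method. The experimental results show that the proposed method achieves a recall above 80% and an average confidence score above 0.84.
Keywords:	video object detection; multi-frame fusion; YOLO v4; convolutional neural network
Received:	2021-06-07
Revised:	2022-02-25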
Multi-Patch multi-frame incremental traffic video object detection method based on YOLO v4

WEN Nu, GUO Renzhong, HE Biao, WAN Yuan. Multi-Patch multi-frame incremental traffic video object detection method based on YOLO v4[J]. Bulletin of Surveying and Mapping, 2022, 0(5): 38-44.

Authors:	WEN Nu, GUO Renzhong, HE Biao, WAN Yuan

Institution:	1. School of Architecture & Urban Planning, Shenzhen University, Shenzhen 518061, China; 2. Research Institute for Smart Cities, Shenzhen University, Shenzhen 518061, China; 3. Guangdong-Hong Kong-Macau Joint Laboratory for Smart Cities, Shenzhen 518061, China; 4. Key Laboratory of Urban Land Resources Monitoring and Simulation, Ministry of Natural Resources, Shenzhen 518034, China; 5. College of Urban and Environmental Sciences, Hubei Normal University, Huangshi 435002, China

Abstract:	Improving the generalization ability of object detection models is a research focus and a key challenge in computer vision. This paper proposes a Multi-Patch method and a multi-frame incremental prediction strategy that improve the robustness of traffic video object detection in different scenarios and effectively address the low object recall in videos caused by variable object scales. According to the video resolution and the object size, the Multi-Patch method automatically divides each video frame into patches of the best input size. The YOLO v4 neural network is applied, the context information of consecutive frames is associated, and the incremental prediction strategy is used to reduce the missed detection rate and to improve the detection confidence score and recall of video objects in different scenarios. Traffic videos collected under different shooting conditions are used to verify the effectiveness of the method. Experimental results show that the proposed object detection method achieves a recall of more than 80% and an average confidence score of more than 0.84.
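
The abstract does not state the rule by which Multi-Patch picks patch sizes from the video resolution and object size, so the following is only a minimal sketch of the general idea of splitting a frame into detector-sized patches. The function name `tile_frame`, the fixed patch size of 608 pixels, and the 64-pixel overlap are all illustrative assumptions, not values taken from the paper.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1) in frame pixels


def _starts(length: int, patch: int, overlap: int) -> List[int]:
    """Start offsets of patches of size `patch` that cover [0, length)."""
    if length <= patch:
        return [0]  # the whole axis fits in one patch
    stride = patch - overlap
    starts = list(range(0, length - patch, stride))
    starts.append(length - patch)  # last patch sits flush with the border
    return starts


def tile_frame(width: int, height: int,
               patch: int = 608, overlap: int = 64) -> List[Box]:
    """Hypothetical tiling: overlapping square patches covering the frame.

    `patch` and `overlap` are assumed values; the paper derives the input
    size from the video resolution and object size in a way not shown here.
    """
    if not 0 <= overlap < patch:
        raise ValueError("overlap must satisfy 0 <= overlap < patch")
    return [
        (x, y, min(x + patch, width), min(y + patch, height))
        for y in _starts(height, patch, overlap)
        for x in _starts(width, patch, overlap)
    ]


def to_frame_coords(box: Box, tile: Box) -> Box:
    """Shift a box detected inside `tile` back into frame coordinates."""
    dx, dy = tile[0], tile[1]
    return (box[0] + dx, box[1] + dy, box[2] + dx, box[3] + dy)
```

Under these assumptions, each patch would be passed to the detector separately and the resulting boxes shifted back with `to_frame_coords` before duplicates in the overlap regions are suppressed.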

Keywords:	video object detection; multi-frame fusion; YOLO v4; convolutional neural networks
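
The multi-frame incremental prediction strategy named in the abstract is not described in detail in this record. One plausible reading, sketched below purely as an assumption, is that detections from the previous frame with no overlapping match in the current frame are carried forward for a limited number of frames, which would reduce missed detections. The names `Detection` and `incremental_merge` and the thresholds `iou_thr` and `max_age` are hypothetical.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1)


@dataclass(frozen=True)
class Detection:
    box: Box
    score: float
    age: int = 0  # frames since the detector last reported this object


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0


def incremental_merge(previous: List[Detection],
                      current: List[Detection],
                      iou_thr: float = 0.5,
                      max_age: int = 2) -> List[Detection]:
    """Assumed strategy: keep current detections and carry forward
    unmatched previous ones until they are `max_age` frames stale."""
    merged = list(current)
    for old in previous:
        if old.age >= max_age:
            continue  # too stale, drop it
        if all(iou(old.box, new.box) < iou_thr for new in current):
            merged.append(replace(old, age=old.age + 1))
    return merged
```

Calling `incremental_merge` once per frame, with the previous call's result as `previous`, gives an incremental update in this sense; a real system would also need to decay or re-estimate the carried boxes, which this sketch omits.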
This article is indexed in Wanfang Data and other databases.