
Latest Object Detection Research Papers

The newest Object Detection papers from across the field (arXiv, NeurIPS, CVPR, Nature, and more), refreshed daily and ranked by relevance. Distill AI tracks Object Detection so you don’t have to: get the standout work delivered to your inbox every morning, with two-sentence summaries and the option to chat with any paper.


Recent papers

Ray Augmented Supervision for 3D Object Detection
Huy-Hoang Duong, Adrian Voicila, Guillaume Allibert · HAL (Le Centre pour la Comm... · Aug 17, 2026
Multispecies Weed Detection Using Unmanned Aerial Vehicles and Deep Learning Object Detection Models in Utah Forage Crop Corn Field
Utsav Bhandari · Utah State Research and Sch... · Aug 1, 2026
Weeds cost global agriculture over $32 billion annually and reduce crop yields by nearly one-third. Current weed control relies heavily on spraying herbicides uniformly across entire fields, leading to herbicide-resistant weeds, environment…
Synthetic data generation framework for quality control automation in gravure printing
Korota Arsène Coulibaly, Mohamed Hamlich, Khalid Hmali, Andrea Trombin · arXiv · Jul 23, 2026
Quality control in printing, particularly in rotogravure printing, still depends on slow, costly, and subjective manual inspection. Automated surface defect detection is critical for maintaining high-quality standards in rotogravure printin…
T-STAR: A Large-Scale Benchmark for Spatio-Temporal Panoptic Scene Graph Generation in Satellite Video
Linlin Wang, Xue Yang, Zhihuang Zhou, Zhenyu Zhong et al. · arXiv · Jul 23, 2026
Structured understanding of satellite video is essential for advancing dynamic geospatial scene analysis from low-level perception to high-level cognition. To move beyond object-centric perception, this paper introduces spatio-temporal pano…
Real-Time EEG Cap Electrode Detection for Guided Point-of-Care Placement
William Lehn-Schiøler, Mads Sverker Nilsson, Nicki Skafte Detlefsen · arXiv · Jul 22, 2026
We present a two-stage vision system that detects EEG cap electrodes in a live webcam stream and validates their anatomical placement in real time. A single-class YOLO detector localises electrodes; a geometric stage assigns each detection …
CoGoal3D: Collaborative 3D Object Detection with 3D-Aware Fusion and Refinement
Zhihao Yang, Zhiyu Xiang, Peng Xu, Tianyu Pu et al. · arXiv · Jul 21, 2026
V2X collaborative object detection overcomes the limitations of single-vehicle systems by aggregating environmental features from multiple collaborative agents. However, existing mainstream V2X perception methods mainly focus on 2…
Object Detection In Autonomous Vehicles
Rohan Chaudhary, M N Chauhan, Dr. Partap Singh · Zenodo (CERN European Organ... · Jul 17, 2026
In the context of autonomous vehicle (AV) perception, this study examines the implementation and performance assessment of the You Only Look Once version 5 (YOLOv5) model for object detection (OD). Safe and reliable navigation requires high…
Towards Hierarchical Structure Understanding of Newspaper Images
William Mocaër, Solène Tarride, Thomas Constum, Merveilles Agbeti-Messan et al. · arXiv · Jul 16, 2026
Understanding newspaper images remains a challenging task due to their complex, nested hierarchical structures and dense, heterogeneous layouts. In this paper, we explore two complementary approaches for newspaper structure understanding. F…
Weakly-Supervised RGB-D Salient Object Detection via SAM-driven Pseudo Annotation and State Space Interaction-based Diffusion
Wenqi Si, Gongyang Li, Shixiang Shi, Weisi Lin · arXiv · Jul 16, 2026
Weakly-supervised RGB-D Salient Object Detection (SOD) is explored to reduce the heavy burden of pixel-level annotations. But scribble annotations lack the structure and details of objects, resulting in inaccurate saliency maps. In this pap…
ViCo3D: Empowering LiDAR-based Collaborative 3D Object Detection with Vision Foundation Models
Haojie Ren, Songrui Luo, Lingfeng Wang, Yan Xia et al. · arXiv · Jul 14, 2026
LiDAR-based collaborative 3D perception in Vehicle-to-Everything (V2X) systems typically relies on fusing bird's-eye-view (BEV) features across agents. However, current BEV representations, typically extracted by LiDAR backbones trained fro…
Evidence-Backed Video Question Answering
Shijie Wang, Honglu Zhou, Ziyang Wang, Ran Xu et al. · ECCV 2026 · Jul 13, 2026
Current Video Large Language Models (Video LLMs) excel in question answering (QA) but largely operate as black boxes, providing textual answers without verifiable visual grounding. Existing explainability efforts rely on textual rationales …
GFR-SAM: Training-Free Referring Camouflaged Object Segmentation via Cross-Image Prompting
Yilong Yang, Jianxin Tian, Shengchuan Zhang, Liujuan Cao · arXiv · Jul 13, 2026
Referring Camouflaged Object Detection (Ref-COD) requires segmenting hidden targets guided by reference cues. While supervised methods are annotation-heavy and training-free approaches via sparse point-prompting are sensitive to localizatio…
Frequency-aware detection transformer for SAR object detection
Yunshan Tang, Yanbin Liu, Zhongjun Yu · Journal of Applied Remote S... · Jul 11, 2026
Synthetic aperture radar (SAR) imagery is widely used for maritime surveillance, infrastructure monitoring, and emergency response owing to its all-weather, day-and-night imaging capability. However, accurate object detection in SAR images …
Event Burst Trigger: An Availability Backdoor Attack on Event-Based SNN Object Detection
Jaesun Baek, Chanwook Lee, Eun-Kyu Lee · arXiv · Jul 10, 2026
Event-based vision and spiking neural networks (SNNs) are increasingly adopted for edge intelligence under strict latency and energy constraints. However, the vulnerability of event-based SNN object detection models to availability backdoor…
Toward Active Object Detection for UAVs in the Wild: A Large-Scale Dataset, Benchmark and Method
Tianpeng Liu, Xinhua Jiang, Li Liu, Qinmu Shen et al. · arXiv · Jul 10, 2026
Object detection is a fundamental component in numerous Unmanned Aerial Vehicle (UAV) applications, yet it has long been plagued by hindrances like occlusion or target pixel scarcity. Active Object Detection (AOD) provides a novel paradigm …
Optimization and Deployment of Real-Time On-Orbit Intelligent Interpretation Algorithms for Spaceborne Remote Sensing
Cankai Li, Haiming Jiang, Y P Li, Hongbo Xie et al. · Sensors · Jul 10, 2026
Orbital remote sensing platforms increasingly rely on CNN-based object detection for real-time situational awareness. However, deploying these models on spaceborne edge devices is challenging because of stringent Size, Weight, and Power (SW…
VocaDet: Sample-Driven Open-Vocabulary Object Detection and Segmentation via Visual Tokenization and Vector Database Retrieval
ZhiXin Sun · arXiv · Jul 9, 2026
Open-vocabulary object detection and segmentation aim to recognize arbitrary objects beyond predefined categories. Although recent vision-language and reference-based approaches have significantly advanced this field, they often rely on tex…
Dual-Correlation Hypergraph Network for Unaligned RGBT Video Object Detection and A Large-scale Benchmark
Qishun Wang, Yapeng Li, Bin Luo, Zhengzheng Tu et al. · arXiv · Jul 9, 2026
RGB-Thermal (RGBT) Video Object Detection (VOD) has gained significant traction due to its ability to overcome the limitations of conventional RGB-based VOD under challenging conditions. However, spatial misalignment commonly exists between…
LDFE: Laplacian Decoupled Feature Enhancement Block for Dual-Stream CNN-based RGB-IR Object Detection
Wenhao Dong, Xiaoyan Luo, Linlin Yang, Haodong Zhu et al. · arXiv · Jul 9, 2026
The complementary information between RGB and IR images can significantly enhance object detection performance under extreme conditions. Existing methods prefer dual-stream CNN backbones built upon YOLO for feature extraction and focus on t…
InfraQR: Edge-Placed QR-Inspired Structured Patch Attacks on Infrared Vision-Language Models
Xin Li, Jiaju Han, Ma Yaqi, Chengyin Hu et al. · arXiv · Jul 8, 2026
Infrared vision-language models are increasingly used for perception under low-light and adverse visual conditions, yet their robustness to localized structured perturbations remains underexplored. Existing infrared adversarial studies main…
Prototype-Anchored Generalized Manifold Regression for Unknown-Domain Object Detection
Zihao Zhang, Aming Wu, Yang Li, Yahong Han · arXiv · Jul 8, 2026
In this paper, we study Single-Domain Generalized Object Detection (Single-DGOD), which aims to transfer a detector trained on a single source domain to multiple unseen domains. Existing methods mainly rely on simulation-driven strategies, …
Dynamic Object Detection and Tracking in Construction: A Fisheye Camera and LiDAR Sensor Fusion Model
Yilong Chen, Huili Huang, Yong K. Cho · arXiv · Jul 8, 2026
Robust dynamic object detection and tracking are essential for enabling robots to operate safely and effectively alongside humans in complex environments such as construction sites. While LiDAR-based SLAM and occupancy grid methods offer vi…
Food Portion Weight Prediction and Nutritional Estimation from Images Using YOLOv8 Segmentation and XGBoost Regression
Ery Setiyawan Jullev Atmadji, Freda Adi Ferdana, Husin, Aji Seto Arifianto · Teknika · Jul 8, 2026
Understanding the nutritional content of food is essential for maintaining balanced dietary habits. However, most existing nutrition information sources rely on fixed portion sizes and do not reflect the actual amount of food consumed. This…
Comprehensive Robustness Analysis of LiDAR-based 3D Object Detection in Autonomous Driving
Adwait Chandorkar, Kai Krink, Yerdana Maulenbay, Hasan Tercan et al. · arXiv · Jul 2, 2026
Recent advancements in LiDAR-only 3D object detection have demonstrated improved detection accuracy over benchmark datasets. However, the adversarial robustness of these models remains untested. Very few adversarial robustness studies exist…
PS-MOT: Cultivating Instance Awareness from Point Seeds for Multi-Object Tracking
Kai Luo, Fei Teng, Mengfei Duan, Wanjun Jia et al. · arXiv · Jun 29, 2026
We introduce Point-supervised Multi-Object Tracking (PS-MOT) as a cost-effective alternative to traditional bounding box supervision, shifting the focus from spatial fitting to topological center-driven representation. However, PS-MOT faces…
FR-DETR: Frequency and Recurrent Feature Refinement for Robust Object Detection under Adverse Weather
Tuan-Duc Nguyen, Duc-Trong Le · arXiv · Jun 29, 2026
Object detection under adverse weather remains challenging due to severe visual degradations and domain shifts. Existing enhancer-based approaches attempt to improve detection by cascading an enhancer with a detector, but they introduce red…
Hippocampus-DETR: An Explicit Memory Object Detection Framework Based on Hippocampus Modeling
Zhaoning Shi, Bo Ma, Hao Xu, Zepeng Yang et al. · arXiv · Jun 26, 2026
This paper addresses the lack of explicit memory mechanisms in current object detection models and proposes Hippocampus-DETR, a novel detection framework based on biological hippocampal memory modeling. This framework integrates a hippocamp…
Liquid Fusion of Heterogeneous Representations Towards General Salient Object Detection
Ke Chen, Ling Zhou, Guangqi Jiang, Gengshen Wu et al. · arXiv · Jun 25, 2026
General Salient Object Detection (SOD) aims to identify and segment visually interesting objects from uni-modality or multi-modality scenes, recently advanced by cutting-edge State Space Models (SSMs). However, a critical limitation of curr…
Identifying the Unknown: Prompt-Free Open Vocabulary Anomaly Recognition for Robot-Object Interaction
Philipp Allgeuer, Jan-Gerrit Habekost, Stefan Wermter · arXiv · Jun 25, 2026
Robots operating in real-world environments must in general be able to recognize previously unseen objects. As robotic systems move toward open-world autonomy, there is a growing, yet largely unmet, need for open vocabulary object detectors…
Depth-Semantic Alignment and Affinity-Guided Fusion for Structured Radar Point Cloud Generation
Amjad Hussain, Xin Qiu, Fuyuan Ai, Yuchen Tan et al. · arXiv · Jun 25, 2026
Point clouds are an important carrier of three-dimensional spatial information, and their quality directly affects the performance of downstream perception tasks such as object detection and tracking. However, millimeter-wave radar point cl…

