Publication:
Test-time Correction: An Online 3D Detection System via Visual Prompting

dc.contributor.coauthorZhang, Hanxue
dc.contributor.coauthorYang, Zetong
dc.contributor.coauthorSun, Yanan
dc.contributor.coauthorChen, Li
dc.contributor.coauthorXia, Fei
dc.contributor.coauthorLi, Hongyang
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorGüney, Fatma
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.date.accessioned2026-01-16T08:47:09Z
dc.date.available2026-01-16
dc.date.issued2025
dc.description.abstractThis paper introduces Test-time Correction (TTC), an online 3D detection system designed to rectify test-time errors using various auxiliary feedback, aiming to enhance the safety of deployed autonomous driving systems. Unlike conventional offline 3D detectors that remain fixed during inference, TTC enables immediate online error correction without retraining, allowing autonomous vehicles to adapt to new scenarios and reduce deployment risks. To achieve this, we equip existing 3D detectors with an Online Adapter (OA) module-a prompt-driven query generator for real-time correction. At the core of OA module are visual prompts: image-based descriptions of objects of interest derived from auxiliary feedback such as mismatches with 2D detections, road descriptions, or user clicks. These visual prompts, collected from risky objects during inference, are maintained in a visual prompt buffer to enable continuous correction in future frames. By leveraging this mechanism, TTC consistently detects risky objects, achieving reliable, adaptive, and versatile driving autonomy. Extensive experiments show that TTC significantly improves instant error rectification over frozen 3D detectors, even under limited labels, zero-shot settings, and adverse conditions. We hope this work inspires future research on post-deployment online rectification systems for autonomous driving.
dc.description.fulltextNo
dc.description.harvestedfromManual
dc.description.indexedbyScopus
dc.description.publisherscopeInternational
dc.description.readpublishN/A
dc.description.sponsoredbyTubitakEuN/A
dc.description.sponsorshipThis work was supported by National Key R&D Program of China (2022ZD0160104), NSFC (62206172), and Shanghai Committee of Science and Technology (23YF1462000). We thank team members from OpenDriveLab for valuable feedback along the project. Special thanks to Chengen Xie for providing data support, and Jiahui Fu for sharing insights on MV2D.
dc.identifier.doi10.1109/TPAMI.2025.3642076
dc.identifier.embargoNo
dc.identifier.issn0162-8828
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-105024795294
dc.identifier.urihttps://doi.org/10.1109/TPAMI.2025.3642076
dc.identifier.urihttps://hdl.handle.net/20.500.14288/32131
dc.keywords3D object detection
dc.keywordsOnline detection system
dc.keywordsTest-time correction
dc.language.isoeng
dc.publisherIEEE Computer Society
dc.relation.affiliationKoç University
dc.relation.collectionKoç University Institutional Repository
dc.relation.ispartofIEEE Transactions on Pattern Analysis and Machine Intelligence
dc.relation.openaccessNo
dc.rightsCopyrighted
dc.subjectComputer engineering
dc.titleTest-time Correction: An Online 3D Detection System via Visual Prompting
dc.typeJournal Article
dspace.entity.typePublication
person.familyNameGüney
person.givenNameFatma
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files