Training-free framework that converts SAM3 into a real-time, multi-class, open-vocabulary detector. Achieves 55.8 AP on COCO val2017 (80 classes) and runs at 15.8 FPS (4 classes, 1008 px) on a single RTX 4080.
Abstract: Traffic surveillance is a key component of intelligent transportation systems (ITS), where accurate, real-time object detection improves road safety and traffic management. This paper advances a ...
Abstract: As high-resolution remote-sensing image-acquisition technologies continue to improve, image quality and resolution keep increasing, which greatly promotes the development ...