A Lightweight Hybrid Template-Matching–CNN Framework with Attention-Guided Fusion for Robust Small Object Detection

Zangana, Hewa Majeed and Omar, Marwan and Mirza, Mohammed Aquil and Cao, Xinwei and Wani, Sharyar (2026) A Lightweight Hybrid Template-Matching–CNN Framework with Attention-Guided Fusion for Robust Small Object Detection. Buletin Ilmiah Sarjana Teknik Elektro, 8 (1). pp. 258-271.

14751-Article Text-73121-1-10-20260222.pdf - Published Version

Abstract

Small object detection in aerial and surveillance imagery remains challenging due to low resolution, occlusion, and background clutter. This study introduces a hybrid detection framework that fuses template matching with a deep learning detector (Faster R-CNN) through an attention-guided decision fusion mechanism. Its contribution is twofold: (i) a dual-stage fusion pipeline that integrates precise structural cues from template matching with deep semantic features, and (ii) a custom scale-aware focal loss, adapted from Focal Loss, that emphasizes hard and small objects by dynamically increasing penalties for low-confidence predictions. Evaluated on a Pascal VOC subset (1000 images, 5 classes), the proposed system achieves an mAP improvement of 3.5% over the Faster R-CNN baseline and surpasses YOLO-Lite and R-CNN variants in precision and recall. The hybrid design adds only about 0.03 s of inference time per image (0.45 s vs. 0.42 s for Faster R-CNN, roughly 7%), a favorable efficiency–accuracy trade-off for scalable deployment. These findings highlight the framework’s robustness, particularly in scenes containing occlusion, clutter, or visually small targets. Limitations regarding template dependency are discussed, along with future directions for automatic template generation and real-time video adaptation.
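
The abstract does not give the formula for the scale-aware focal loss, so the following is only a minimal sketch of how such a loss might look, not the authors' implementation. It starts from the standard focal loss, FL(p_t) = -alpha * (1 - p_t)^gamma * log(p_t), and multiplies it by a size-dependent weight. The weighting rule and the names `scale_weight`, `ref_area`, `beta`, and `max_weight` are assumptions introduced here for illustration.

```python
import math


def focal_loss(p_true: float, gamma: float = 2.0, alpha: float = 0.25) -> float:
    """Standard focal loss for one prediction.

    p_true is the probability the model assigns to the correct class.
    The (1 - p_true) ** gamma factor shrinks the loss of confident,
    easy predictions so that hard ones dominate training.
    """
    p = min(max(p_true, 1e-7), 1.0 - 1e-7)  # clamp to keep log() finite
    return -alpha * (1.0 - p) ** gamma * math.log(p)


def scale_weight(box_area: float,
                 ref_area: float = 32.0 * 32.0,
                 beta: float = 0.5,
                 max_weight: float = 4.0) -> float:
    """Hypothetical size term (an assumption, not taken from the paper).

    Returns 1.0 for boxes at or above ref_area and grows as
    (ref_area / box_area) ** beta for smaller boxes, capped at max_weight.
    """
    if box_area <= 0:
        raise ValueError("box_area must be positive")
    raw = (ref_area / box_area) ** beta
    return min(max(raw, 1.0), max_weight)


def scale_aware_focal_loss(p_true: float, box_area: float,
                           gamma: float = 2.0, alpha: float = 0.25) -> float:
    """Focal loss scaled up for small ground-truth boxes (illustrative)."""
    return scale_weight(box_area) * focal_loss(p_true, gamma, alpha)


if __name__ == "__main__":
    # Same confidence, different object sizes (areas in square pixels).
    for area in (16 * 16, 32 * 32, 128 * 128):
        print(area, scale_aware_focal_loss(0.3, area))
```

Under these assumed defaults, a 16×16 box receives twice the loss of a 32×32 box at the same confidence, while boxes larger than the reference area fall back to the plain focal loss.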

Item Type: Article
Subjects: T Technology > TK Electrical engineering. Electronics. Nuclear engineering
Depositing User: BISTE UAD
Date Deposited: 15 May 2026 03:40
Last Modified: 15 May 2026 03:40
URI: https://alxiv.org/id/eprint/812