Anchor Retouching via Model Interaction for Robust

Object Detection in Aerial Images

Dong Liang, Qixiang Geng, Zongqi Wei, Dmitry A Vorontsov, Ekaterina L Kim, Mingqiang Wei, Huiyu Zhou

Abstract


Object detection has made tremendous strides in computer vision. Small object detection with appearance degradation is a prominent challenge, especially for aerial observations. To collect sufficient positive/negative samples for heuristic training, most object detectors preset region anchors in order to calculate intersection-over-union (IoU) against the ground-truth data. In this case, small objects are frequently abandoned or mislabeled. In this article, we present an effective dynamic enhancement anchor network (DEA-Net) to construct a novel training sample generator. Different from the other state-of-the-art (SOTA) techniques, the proposed network leverages a sample discriminator to realize interactive sample screening between an anchor-based unit and an anchor-free unit to generate eligible samples. Besides, multi-task joint training with a conservative anchor-based inference scheme enhances the performance of the proposed model while reducing computational complexity. The proposed scheme supports both oriented and horizontal object detection tasks. Extensive experiments on two challenging aerial benchmarks (i.e., Dataset of Object deTection in Aerial images (DOTA) and HRSC2016) indicate that our method achieves SOTA performance in accuracy with moderate inference speed and computational overhead for training. On DOTA, our DEA-Net which integrated with the baseline of RoI-transformer surpasses the advanced method by 0.40% mean-average-precision (mAP) for oriented object detection with a weaker backbone network (ResNet-101 vs. ResNet-152) and 3.08% mAP for horizontal object detection with the same backbone. Besides, our DEA-Net which integrated with the baseline of ReDet achieves the SOTA performance by 80.37%. On HRSC2016, it surpasses the previous best model by 1.1% using only three horizontal anchors. The source code and the training set are made publicly available at https://github.com/QxGeng/DEA-Net..

    Method

    DEA module to provide better positive samples of small objects.(left)

    Architecture of our proposed DEA-Net for oriented object detection.(right)

Contributions

  1. an effective sample generator based on DEA head to enhance the performance of detecting small objects by combining the advantages of anchor-based and anchor-free model.

  2. a novel and robust DEA-Net, which can achieve the start-of-the-art oriented and horizontal object detection performance in aerial images.

Results

IoU distributions of the baseline (faster R-CNN) and our DEAnet

The x-axis represents the IoU between the samples and the ground-truth boxes. The y-axis represents the average number of samples on each image.


Visualization results


Comparisons With State-of-the-Arts


Ablation Studies

Effectiveness and Efficiency on Different Backbones & Effectiveness of the Proposed Units

Efficiency of Our Proposed Units

Effectiveness of the Inference Schemes

Materials


PDF| Homepage| Code

Citation

@ARTICLE{9654160,
  author={Liang, Dong and Geng, Qixiang and Wei, Zongqi and Vorontsov, Dmitry A. and Kim, Ekaterina L. and Wei, Mingqiang and Zhou, Huiyu},
  journal={IEEE Transactions on Geoscience and Remote Sensing},
  title={Anchor Retouching via Model Interaction for Robust Object Detection in Aerial Images},
  year={2022},
  volume={60},
  number={},
  pages={1-13},
  doi={10.1109/TGRS.2021.3136350}}
                    

Contact

If you have any questions, please contact Dong Liang at liangdong@nuaa.edu.cn.