Pest-PVT: A model for multi-class and dense pest detection and counting in field-scale environments.
Saved in:
| Title: | Pest-PVT: A model for multi-class and dense pest detection and counting in field-scale environments. |
|---|---|
| Authors: | Chen, Hongrui1,2 (AUTHOR), Wen, Changji1,2 (AUTHOR) changjiw@jlau.edu.cn, Zhang, Long1,2 (AUTHOR), Ma, Zhenyu1,2 (AUTHOR), Liu, Tianyu1,2 (AUTHOR), Wang, Guangyao1,2 (AUTHOR), Yu, Helong1,2 (AUTHOR), Yang, Ce2,3 (AUTHOR), Yuan, Xiaohui2,4 (AUTHOR), Ren, Junfeng1,2 (AUTHOR) renjunfeng@mails.jlau.edu.cn |
| Source: | Computers & Electronics in Agriculture. Mar2025, Vol. 230, pN.PAG-N.PAG. 1p. |
| Subjects: | Object recognition (Computer vision), Transformer models, Feature extraction, Agriculture, Pests |
| Abstract: | • The Pest-PVT framework addresses challenges in the Pest24 dataset, including small pest targets, high similarity, and dense distribution. • Pest-PVT uses an anchor-free FCOS method to improve small object detection and integrates ATSS to reduce sample imbalance bias. • Dynamic Heads tackle pests of different scales and spatial changes, while Shunted Self-Attention boosts multi-scale feature extraction and lowers memory use. Field-scale pest monitoring is crucial for evaluating insect infestations in agricultural environments. The Pest24 dataset presents unique challenges due to the small pest target sizes, high target similarity, and dense pest distribution. To address these challenges, we propose Pest-PVT, a comprehensive framework based on the Pyramid Vision Transformer v2 (PVTv2) network model. Pest-PVT adopts an anchor-free approach using Fully Convolutional One-Stage Object Detection (FCOS) to enhance small object detection and incorporates Adaptive Training Sample Selection (ATSS) to mitigate sample imbalance bias. Dynamic Heads (DyHead) are employed to handle pests of varying scales and spatial changes, while Shunted Self-Attention (SSA) enhances multi-scale feature capture and reduces memory consumption. Compared to 23 other mainstream detection models and previously published works, our proposed Pest-PVT outperforms state-of-the-art detectors on pest detection datasets, achieving the outperformed scores in detection evaluation metrics such as mAP, Precision, Recall, and F1-Score, reaching 77.2 %, 78.42 %, 81.27 %, and 0.80, respectively. With a parameter size of only 24.74 M, Pest-PVT is suitable for integration into edge devices with limited resources. This work represents a significant contribution to the field of field-scale pest monitoring and addresses the specific challenges posed by the Pest24 dataset. Our code is made available at https://github.com/jlauwcj/pest-pvt. [ABSTRACT FROM AUTHOR] |
| Copyright of Computers & Electronics in Agriculture is the property of Elsevier B.V. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Engineering Source |
Be the first to leave a comment!