Pest-PVT: A model for multi-class and dense pest detection and counting in field-scale environments.
Saved in:
| Title: | Pest-PVT: A model for multi-class and dense pest detection and counting in field-scale environments. |
|---|---|
| Authors: | Chen, Hongrui1,2 (AUTHOR), Wen, Changji1,2 (AUTHOR) changjiw@jlau.edu.cn, Zhang, Long1,2 (AUTHOR), Ma, Zhenyu1,2 (AUTHOR), Liu, Tianyu1,2 (AUTHOR), Wang, Guangyao1,2 (AUTHOR), Yu, Helong1,2 (AUTHOR), Yang, Ce2,3 (AUTHOR), Yuan, Xiaohui2,4 (AUTHOR), Ren, Junfeng1,2 (AUTHOR) renjunfeng@mails.jlau.edu.cn |
| Source: | Computers & Electronics in Agriculture. Mar2025, Vol. 230, pN.PAG-N.PAG. 1p. |
| Subjects: | Object recognition (Computer vision), Transformer models, Feature extraction, Agriculture, Pests |
| Abstract: | • The Pest-PVT framework addresses challenges in the Pest24 dataset, including small pest targets, high similarity, and dense distribution. • Pest-PVT uses an anchor-free FCOS method to improve small object detection and integrates ATSS to reduce sample imbalance bias. • Dynamic Heads tackle pests of different scales and spatial changes, while Shunted Self-Attention boosts multi-scale feature extraction and lowers memory use. Field-scale pest monitoring is crucial for evaluating insect infestations in agricultural environments. The Pest24 dataset presents unique challenges due to the small pest target sizes, high target similarity, and dense pest distribution. To address these challenges, we propose Pest-PVT, a comprehensive framework based on the Pyramid Vision Transformer v2 (PVTv2) network model. Pest-PVT adopts an anchor-free approach using Fully Convolutional One-Stage Object Detection (FCOS) to enhance small object detection and incorporates Adaptive Training Sample Selection (ATSS) to mitigate sample imbalance bias. Dynamic Heads (DyHead) are employed to handle pests of varying scales and spatial changes, while Shunted Self-Attention (SSA) enhances multi-scale feature capture and reduces memory consumption. Compared to 23 other mainstream detection models and previously published works, our proposed Pest-PVT outperforms state-of-the-art detectors on pest detection datasets, achieving the outperformed scores in detection evaluation metrics such as mAP, Precision, Recall, and F1-Score, reaching 77.2 %, 78.42 %, 81.27 %, and 0.80, respectively. With a parameter size of only 24.74 M, Pest-PVT is suitable for integration into edge devices with limited resources. This work represents a significant contribution to the field of field-scale pest monitoring and addresses the specific challenges posed by the Pest24 dataset. Our code is made available at https://github.com/jlauwcj/pest-pvt. [ABSTRACT FROM AUTHOR] |
| Copyright of Computers & Electronics in Agriculture is the property of Elsevier B.V. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Engineering Source |
| FullText | Text: Availability: 0 |
|---|---|
| Header | DbId: egs DbLabel: Engineering Source An: 183392841 AccessLevel: 6 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 0 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: Pest-PVT: A model for multi-class and dense pest detection and counting in field-scale environments. – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Chen%2C+Hongrui%22">Chen, Hongrui</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Wen%2C+Changji%22">Wen, Changji</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<i> changjiw@jlau.edu.cn</i><br /><searchLink fieldCode="AR" term="%22Zhang%2C+Long%22">Zhang, Long</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Ma%2C+Zhenyu%22">Ma, Zhenyu</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Liu%2C+Tianyu%22">Liu, Tianyu</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Wang%2C+Guangyao%22">Wang, Guangyao</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Yu%2C+Helong%22">Yu, Helong</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Yang%2C+Ce%22">Yang, Ce</searchLink><relatesTo>2,3</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Yuan%2C+Xiaohui%22">Yuan, Xiaohui</searchLink><relatesTo>2,4</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Ren%2C+Junfeng%22">Ren, Junfeng</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<i> renjunfeng@mails.jlau.edu.cn</i> – Name: TitleSource Label: Source Group: Src Data: <searchLink fieldCode="JN" term="%22Computers+%26+Electronics+in+Agriculture%22">Computers & Electronics in Agriculture</searchLink>. Mar2025, Vol. 230, pN.PAG-N.PAG. 1p. – Name: Subject Label: Subjects Group: Su Data: <searchLink fieldCode="DE" term="%22Object+recognition+%28Computer+vision%29%22">Object recognition (Computer vision)</searchLink><br /><searchLink fieldCode="DE" term="%22Transformer+models%22">Transformer models</searchLink><br /><searchLink fieldCode="DE" term="%22Feature+extraction%22">Feature extraction</searchLink><br /><searchLink fieldCode="DE" term="%22Agriculture%22">Agriculture</searchLink><br /><searchLink fieldCode="DE" term="%22Pests%22">Pests</searchLink> – Name: Abstract Label: Abstract Group: Ab Data: • The Pest-PVT framework addresses challenges in the Pest24 dataset, including small pest targets, high similarity, and dense distribution. • Pest-PVT uses an anchor-free FCOS method to improve small object detection and integrates ATSS to reduce sample imbalance bias. • Dynamic Heads tackle pests of different scales and spatial changes, while Shunted Self-Attention boosts multi-scale feature extraction and lowers memory use. Field-scale pest monitoring is crucial for evaluating insect infestations in agricultural environments. The Pest24 dataset presents unique challenges due to the small pest target sizes, high target similarity, and dense pest distribution. To address these challenges, we propose Pest-PVT, a comprehensive framework based on the Pyramid Vision Transformer v2 (PVTv2) network model. Pest-PVT adopts an anchor-free approach using Fully Convolutional One-Stage Object Detection (FCOS) to enhance small object detection and incorporates Adaptive Training Sample Selection (ATSS) to mitigate sample imbalance bias. Dynamic Heads (DyHead) are employed to handle pests of varying scales and spatial changes, while Shunted Self-Attention (SSA) enhances multi-scale feature capture and reduces memory consumption. Compared to 23 other mainstream detection models and previously published works, our proposed Pest-PVT outperforms state-of-the-art detectors on pest detection datasets, achieving the outperformed scores in detection evaluation metrics such as mAP, Precision, Recall, and F1-Score, reaching 77.2 %, 78.42 %, 81.27 %, and 0.80, respectively. With a parameter size of only 24.74 M, Pest-PVT is suitable for integration into edge devices with limited resources. This work represents a significant contribution to the field of field-scale pest monitoring and addresses the specific challenges posed by the Pest24 dataset. Our code is made available at https://github.com/jlauwcj/pest-pvt. [ABSTRACT FROM AUTHOR] – Name: AbstractSuppliedCopyright Label: Group: Ab Data: <i>Copyright of Computers & Electronics in Agriculture is the property of Elsevier B.V. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.) |
| PLink | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=183392841 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1016/j.compag.2024.109864 Languages: – Code: eng Text: English PhysicalDescription: Pagination: PageCount: 1 StartPage: N.PAG Subjects: – SubjectFull: Object recognition (Computer vision) Type: general – SubjectFull: Transformer models Type: general – SubjectFull: Feature extraction Type: general – SubjectFull: Agriculture Type: general – SubjectFull: Pests Type: general Titles: – TitleFull: Pest-PVT: A model for multi-class and dense pest detection and counting in field-scale environments. Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Chen, Hongrui – PersonEntity: Name: NameFull: Wen, Changji – PersonEntity: Name: NameFull: Zhang, Long – PersonEntity: Name: NameFull: Ma, Zhenyu – PersonEntity: Name: NameFull: Liu, Tianyu – PersonEntity: Name: NameFull: Wang, Guangyao – PersonEntity: Name: NameFull: Yu, Helong – PersonEntity: Name: NameFull: Yang, Ce – PersonEntity: Name: NameFull: Yuan, Xiaohui – PersonEntity: Name: NameFull: Ren, Junfeng IsPartOfRelationships: – BibEntity: Dates: – D: 01 M: 03 Text: Mar2025 Type: published Y: 2025 Identifiers: – Type: issn-print Value: 01681699 Numbering: – Type: volume Value: 230 Titles: – TitleFull: Computers & Electronics in Agriculture Type: main |
| ResultId | 1 |