Generative diffusion models for agricultural AI: Plant image generation, indoor-to-outdoor translation, and expert preference alignment.
Saved in:
| Title: | Generative diffusion models for agricultural AI: Plant image generation, indoor-to-outdoor translation, and expert preference alignment. |
|---|---|
| Authors: | Tan, Da1 (AUTHOR) tand2@myumanitoba.ca, Beck, Michael2 (AUTHOR) m.beck@uwinnipeg.ca, Bidinosti, Christopher P.2 (AUTHOR) c.bidinosti@uwinnipeg.ca, Gulden, Robert H.1 (AUTHOR) Rob.Gulden@umanitoba.ca, Henry, Christopher J.1 (AUTHOR) christopher.henry@umanitoba.ca |
| Source: | Computers & Electronics in Agriculture. Jul2026, Vol. 249, pN.PAG-N.PAG. 1p. |
| Subjects: | Stable Diffusion, Data augmentation, Probabilistic generative models, Artificial intelligence, Image enhancement (Imaging systems), Weed science |
| Abstract: | Agricultural AI is often constrained by limited, imbalanced plant image datasets and pronounced domain shift when models trained on controlled indoor imagery are deployed in field conditions. To address these challenges, we propose an integrated diffusion-based framework with three components that can be used independently or as complementary stages: (1) text-conditioned plant image synthesis to expand labeled training data, (2) indoor-to-outdoor image translation to mitigate domain shift, and (3) expert preference-aligned fine-tuning to improve agronomic realism and output stability. Our implementation builds on a Stable Diffusion v1.4 backbone fine-tuned with our domain-specific image dataset, which is then served as the base model for the image-translation module using the DreamBooth strategy. The fine-tuned generative model is further optimized by a reward-weighted mechanism using expert scores to refine image quality. We evaluate the framework using standard generative metrics (IS, FID) and downstream agricultural tasks, including phenotype classification and weed detection with YOLOv8. Results indicate that the components are synergistic: the synthesis model provides a strong initialization for translation, translation improves field realism while retaining utility for data augmentation, and preference alignment further enhances consistency and expert-perceived quality. Overall, the proposed framework offers a practical, data-efficient, and expert-aware generative pipeline for real-world agricultural AI. • Fine-tuned Stable Diffusion enables text-conditioned crop image generation. • Synthetic data improves plant disease classification on two benchmarks. • Indoor-to-outdoor translation converts greenhouse plants to outdoor scenes. • Augmented datasets enhance YOLOv8 weed detection and classification accuracy. • Reward model-guided fine-tuning aligns AI output with expert preferences. [ABSTRACT FROM AUTHOR] |
| Copyright of Computers & Electronics in Agriculture is the property of Elsevier B.V. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Engineering Source |
| FullText | Text: Availability: 0 |
|---|---|
| Header | DbId: egs DbLabel: Engineering Source An: 194001979 AccessLevel: 6 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 0 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: Generative diffusion models for agricultural AI: Plant image generation, indoor-to-outdoor translation, and expert preference alignment. – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Tan%2C+Da%22">Tan, Da</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> tand2@myumanitoba.ca</i><br /><searchLink fieldCode="AR" term="%22Beck%2C+Michael%22">Beck, Michael</searchLink><relatesTo>2</relatesTo> (AUTHOR)<i> m.beck@uwinnipeg.ca</i><br /><searchLink fieldCode="AR" term="%22Bidinosti%2C+Christopher+P%2E%22">Bidinosti, Christopher P.</searchLink><relatesTo>2</relatesTo> (AUTHOR)<i> c.bidinosti@uwinnipeg.ca</i><br /><searchLink fieldCode="AR" term="%22Gulden%2C+Robert+H%2E%22">Gulden, Robert H.</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> Rob.Gulden@umanitoba.ca</i><br /><searchLink fieldCode="AR" term="%22Henry%2C+Christopher+J%2E%22">Henry, Christopher J.</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> christopher.henry@umanitoba.ca</i> – Name: TitleSource Label: Source Group: Src Data: <searchLink fieldCode="JN" term="%22Computers+%26+Electronics+in+Agriculture%22">Computers & Electronics in Agriculture</searchLink>. Jul2026, Vol. 249, pN.PAG-N.PAG. 1p. – Name: Subject Label: Subjects Group: Su Data: <searchLink fieldCode="DE" term="%22Stable+Diffusion%22">Stable Diffusion</searchLink><br /><searchLink fieldCode="DE" term="%22Data+augmentation%22">Data augmentation</searchLink><br /><searchLink fieldCode="DE" term="%22Probabilistic+generative+models%22">Probabilistic generative models</searchLink><br /><searchLink fieldCode="DE" term="%22Artificial+intelligence%22">Artificial intelligence</searchLink><br /><searchLink fieldCode="DE" term="%22Image+enhancement+%28Imaging+systems%29%22">Image enhancement (Imaging systems)</searchLink><br /><searchLink fieldCode="DE" term="%22Weed+science%22">Weed science</searchLink> – Name: Abstract Label: Abstract Group: Ab Data: Agricultural AI is often constrained by limited, imbalanced plant image datasets and pronounced domain shift when models trained on controlled indoor imagery are deployed in field conditions. To address these challenges, we propose an integrated diffusion-based framework with three components that can be used independently or as complementary stages: (1) text-conditioned plant image synthesis to expand labeled training data, (2) indoor-to-outdoor image translation to mitigate domain shift, and (3) expert preference-aligned fine-tuning to improve agronomic realism and output stability. Our implementation builds on a Stable Diffusion v1.4 backbone fine-tuned with our domain-specific image dataset, which is then served as the base model for the image-translation module using the DreamBooth strategy. The fine-tuned generative model is further optimized by a reward-weighted mechanism using expert scores to refine image quality. We evaluate the framework using standard generative metrics (IS, FID) and downstream agricultural tasks, including phenotype classification and weed detection with YOLOv8. Results indicate that the components are synergistic: the synthesis model provides a strong initialization for translation, translation improves field realism while retaining utility for data augmentation, and preference alignment further enhances consistency and expert-perceived quality. Overall, the proposed framework offers a practical, data-efficient, and expert-aware generative pipeline for real-world agricultural AI. • Fine-tuned Stable Diffusion enables text-conditioned crop image generation. • Synthetic data improves plant disease classification on two benchmarks. • Indoor-to-outdoor translation converts greenhouse plants to outdoor scenes. • Augmented datasets enhance YOLOv8 weed detection and classification accuracy. • Reward model-guided fine-tuning aligns AI output with expert preferences. [ABSTRACT FROM AUTHOR] – Name: AbstractSuppliedCopyright Label: Group: Ab Data: <i>Copyright of Computers & Electronics in Agriculture is the property of Elsevier B.V. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.) |
| PLink | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=194001979 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1016/j.compag.2026.111862 Languages: – Code: eng Text: English PhysicalDescription: Pagination: PageCount: 1 StartPage: N.PAG Subjects: – SubjectFull: Stable Diffusion Type: general – SubjectFull: Data augmentation Type: general – SubjectFull: Probabilistic generative models Type: general – SubjectFull: Artificial intelligence Type: general – SubjectFull: Image enhancement (Imaging systems) Type: general – SubjectFull: Weed science Type: general Titles: – TitleFull: Generative diffusion models for agricultural AI: Plant image generation, indoor-to-outdoor translation, and expert preference alignment. Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Tan, Da – PersonEntity: Name: NameFull: Beck, Michael – PersonEntity: Name: NameFull: Bidinosti, Christopher P. – PersonEntity: Name: NameFull: Gulden, Robert H. – PersonEntity: Name: NameFull: Henry, Christopher J. IsPartOfRelationships: – BibEntity: Dates: – D: 15 M: 07 Text: Jul2026 Type: published Y: 2026 Identifiers: – Type: issn-print Value: 01681699 Numbering: – Type: volume Value: 249 Titles: – TitleFull: Computers & Electronics in Agriculture Type: main |
| ResultId | 1 |