Bibliographic Details
| Title: |
Generative diffusion models for agricultural AI: Plant image generation, indoor-to-outdoor translation, and expert preference alignment. |
| Authors: |
Tan, Da1 (AUTHOR) tand2@myumanitoba.ca, Beck, Michael2 (AUTHOR) m.beck@uwinnipeg.ca, Bidinosti, Christopher P.2 (AUTHOR) c.bidinosti@uwinnipeg.ca, Gulden, Robert H.1 (AUTHOR) Rob.Gulden@umanitoba.ca, Henry, Christopher J.1 (AUTHOR) christopher.henry@umanitoba.ca |
| Source: |
Computers & Electronics in Agriculture. Jul2026, Vol. 249, pN.PAG-N.PAG. 1p. |
| Subjects: |
Stable Diffusion, Data augmentation, Probabilistic generative models, Artificial intelligence, Image enhancement (Imaging systems), Weed science |
| Abstract: |
Agricultural AI is often constrained by limited, imbalanced plant image datasets and pronounced domain shift when models trained on controlled indoor imagery are deployed in field conditions. To address these challenges, we propose an integrated diffusion-based framework with three components that can be used independently or as complementary stages: (1) text-conditioned plant image synthesis to expand labeled training data, (2) indoor-to-outdoor image translation to mitigate domain shift, and (3) expert preference-aligned fine-tuning to improve agronomic realism and output stability. Our implementation builds on a Stable Diffusion v1.4 backbone fine-tuned with our domain-specific image dataset, which is then served as the base model for the image-translation module using the DreamBooth strategy. The fine-tuned generative model is further optimized by a reward-weighted mechanism using expert scores to refine image quality. We evaluate the framework using standard generative metrics (IS, FID) and downstream agricultural tasks, including phenotype classification and weed detection with YOLOv8. Results indicate that the components are synergistic: the synthesis model provides a strong initialization for translation, translation improves field realism while retaining utility for data augmentation, and preference alignment further enhances consistency and expert-perceived quality. Overall, the proposed framework offers a practical, data-efficient, and expert-aware generative pipeline for real-world agricultural AI. • Fine-tuned Stable Diffusion enables text-conditioned crop image generation. • Synthetic data improves plant disease classification on two benchmarks. • Indoor-to-outdoor translation converts greenhouse plants to outdoor scenes. • Augmented datasets enhance YOLOv8 weed detection and classification accuracy. • Reward model-guided fine-tuning aligns AI output with expert preferences. [ABSTRACT FROM AUTHOR] |
|
Copyright of Computers & Electronics in Agriculture is the property of Elsevier B.V. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) |
| Database: |
Engineering Source |