Generative diffusion models for agricultural AI: Plant image generation, indoor-to-outdoor translation, and expert preference alignment.

Saved in:
Bibliographic Details
Title: Generative diffusion models for agricultural AI: Plant image generation, indoor-to-outdoor translation, and expert preference alignment.
Authors: Tan, Da1 (AUTHOR) tand2@myumanitoba.ca, Beck, Michael2 (AUTHOR) m.beck@uwinnipeg.ca, Bidinosti, Christopher P.2 (AUTHOR) c.bidinosti@uwinnipeg.ca, Gulden, Robert H.1 (AUTHOR) Rob.Gulden@umanitoba.ca, Henry, Christopher J.1 (AUTHOR) christopher.henry@umanitoba.ca
Source: Computers & Electronics in Agriculture. Jul2026, Vol. 249, pN.PAG-N.PAG. 1p.
Subjects: Stable Diffusion, Data augmentation, Probabilistic generative models, Artificial intelligence, Image enhancement (Imaging systems), Weed science
Abstract: Agricultural AI is often constrained by limited, imbalanced plant image datasets and pronounced domain shift when models trained on controlled indoor imagery are deployed in field conditions. To address these challenges, we propose an integrated diffusion-based framework with three components that can be used independently or as complementary stages: (1) text-conditioned plant image synthesis to expand labeled training data, (2) indoor-to-outdoor image translation to mitigate domain shift, and (3) expert preference-aligned fine-tuning to improve agronomic realism and output stability. Our implementation builds on a Stable Diffusion v1.4 backbone fine-tuned with our domain-specific image dataset, which is then served as the base model for the image-translation module using the DreamBooth strategy. The fine-tuned generative model is further optimized by a reward-weighted mechanism using expert scores to refine image quality. We evaluate the framework using standard generative metrics (IS, FID) and downstream agricultural tasks, including phenotype classification and weed detection with YOLOv8. Results indicate that the components are synergistic: the synthesis model provides a strong initialization for translation, translation improves field realism while retaining utility for data augmentation, and preference alignment further enhances consistency and expert-perceived quality. Overall, the proposed framework offers a practical, data-efficient, and expert-aware generative pipeline for real-world agricultural AI. • Fine-tuned Stable Diffusion enables text-conditioned crop image generation. • Synthetic data improves plant disease classification on two benchmarks. • Indoor-to-outdoor translation converts greenhouse plants to outdoor scenes. • Augmented datasets enhance YOLOv8 weed detection and classification accuracy. • Reward model-guided fine-tuning aligns AI output with expert preferences. [ABSTRACT FROM AUTHOR]
Copyright of Computers & Electronics in Agriculture is the property of Elsevier B.V. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Engineering Source
FullText Text:
  Availability: 0
Header DbId: egs
DbLabel: Engineering Source
An: 194001979
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Generative diffusion models for agricultural AI: Plant image generation, indoor-to-outdoor translation, and expert preference alignment.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Tan%2C+Da%22">Tan, Da</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> tand2@myumanitoba.ca</i><br /><searchLink fieldCode="AR" term="%22Beck%2C+Michael%22">Beck, Michael</searchLink><relatesTo>2</relatesTo> (AUTHOR)<i> m.beck@uwinnipeg.ca</i><br /><searchLink fieldCode="AR" term="%22Bidinosti%2C+Christopher+P%2E%22">Bidinosti, Christopher P.</searchLink><relatesTo>2</relatesTo> (AUTHOR)<i> c.bidinosti@uwinnipeg.ca</i><br /><searchLink fieldCode="AR" term="%22Gulden%2C+Robert+H%2E%22">Gulden, Robert H.</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> Rob.Gulden@umanitoba.ca</i><br /><searchLink fieldCode="AR" term="%22Henry%2C+Christopher+J%2E%22">Henry, Christopher J.</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> christopher.henry@umanitoba.ca</i>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <searchLink fieldCode="JN" term="%22Computers+%26+Electronics+in+Agriculture%22">Computers & Electronics in Agriculture</searchLink>. Jul2026, Vol. 249, pN.PAG-N.PAG. 1p.
– Name: Subject
  Label: Subjects
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Stable+Diffusion%22">Stable Diffusion</searchLink><br /><searchLink fieldCode="DE" term="%22Data+augmentation%22">Data augmentation</searchLink><br /><searchLink fieldCode="DE" term="%22Probabilistic+generative+models%22">Probabilistic generative models</searchLink><br /><searchLink fieldCode="DE" term="%22Artificial+intelligence%22">Artificial intelligence</searchLink><br /><searchLink fieldCode="DE" term="%22Image+enhancement+%28Imaging+systems%29%22">Image enhancement (Imaging systems)</searchLink><br /><searchLink fieldCode="DE" term="%22Weed+science%22">Weed science</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: Agricultural AI is often constrained by limited, imbalanced plant image datasets and pronounced domain shift when models trained on controlled indoor imagery are deployed in field conditions. To address these challenges, we propose an integrated diffusion-based framework with three components that can be used independently or as complementary stages: (1) text-conditioned plant image synthesis to expand labeled training data, (2) indoor-to-outdoor image translation to mitigate domain shift, and (3) expert preference-aligned fine-tuning to improve agronomic realism and output stability. Our implementation builds on a Stable Diffusion v1.4 backbone fine-tuned with our domain-specific image dataset, which is then served as the base model for the image-translation module using the DreamBooth strategy. The fine-tuned generative model is further optimized by a reward-weighted mechanism using expert scores to refine image quality. We evaluate the framework using standard generative metrics (IS, FID) and downstream agricultural tasks, including phenotype classification and weed detection with YOLOv8. Results indicate that the components are synergistic: the synthesis model provides a strong initialization for translation, translation improves field realism while retaining utility for data augmentation, and preference alignment further enhances consistency and expert-perceived quality. Overall, the proposed framework offers a practical, data-efficient, and expert-aware generative pipeline for real-world agricultural AI. • Fine-tuned Stable Diffusion enables text-conditioned crop image generation. • Synthetic data improves plant disease classification on two benchmarks. • Indoor-to-outdoor translation converts greenhouse plants to outdoor scenes. • Augmented datasets enhance YOLOv8 weed detection and classification accuracy. • Reward model-guided fine-tuning aligns AI output with expert preferences. [ABSTRACT FROM AUTHOR]
– Name: AbstractSuppliedCopyright
  Label:
  Group: Ab
  Data: <i>Copyright of Computers & Electronics in Agriculture is the property of Elsevier B.V. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=194001979
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1016/j.compag.2026.111862
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 1
        StartPage: N.PAG
    Subjects:
      – SubjectFull: Stable Diffusion
        Type: general
      – SubjectFull: Data augmentation
        Type: general
      – SubjectFull: Probabilistic generative models
        Type: general
      – SubjectFull: Artificial intelligence
        Type: general
      – SubjectFull: Image enhancement (Imaging systems)
        Type: general
      – SubjectFull: Weed science
        Type: general
    Titles:
      – TitleFull: Generative diffusion models for agricultural AI: Plant image generation, indoor-to-outdoor translation, and expert preference alignment.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Tan, Da
      – PersonEntity:
          Name:
            NameFull: Beck, Michael
      – PersonEntity:
          Name:
            NameFull: Bidinosti, Christopher P.
      – PersonEntity:
          Name:
            NameFull: Gulden, Robert H.
      – PersonEntity:
          Name:
            NameFull: Henry, Christopher J.
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 15
              M: 07
              Text: Jul2026
              Type: published
              Y: 2026
          Identifiers:
            – Type: issn-print
              Value: 01681699
          Numbering:
            – Type: volume
              Value: 249
          Titles:
            – TitleFull: Computers & Electronics in Agriculture
              Type: main
ResultId 1