Training of construction robots using imitation learning and environmental rewards.
Saved in:
| Title: | Training of construction robots using imitation learning and environmental rewards. |
|---|---|
| Authors: | Duan, Kangkang1 (AUTHOR), Zou, Zhengbo1 (AUTHOR), Yang, T. Y.1 (AUTHOR) yang@civil.ubc.ca |
| Source: | Computer-Aided Civil & Infrastructure Engineering. 4/4/2025, Vol. 40 Issue 9, p1150-1165. 16p. |
| Subjects: | Observational learning, Imitative behavior, Gesture controlled interfaces (Computer systems), Robot design & construction, Machine learning, Reward (Psychology), Virtual reality, Learning |
| Abstract: | Construction robots are challenging the paradigm of labor‐intensive construction tasks. Imitation learning (IL) offers a promising approach, enabling robots to mimic expert actions. However, obtaining high‐quality expert demonstrations is a major bottleneck in this process as teleoperated robot motions may not align with optimal kinematic behavior. In this paper, two innovations have been proposed. First, traditional control using controllers has been replaced with vision‐based hand gesture control for intuitive demonstration collection. Second, a novel method that integrates both demonstrations and simple environmental rewards is proposed to strike a balance between imitation and exploration. To achieve this goal, a two‐step training process is proposed. In the first step, an intuitive demonstration collection platform using virtual reality is utilized. Second, a learning algorithm is used to train a policy for construction tasks. Experimental results demonstrate that combining IL with environmental rewards can significantly accelerate the training, even with limited demonstration data. [ABSTRACT FROM AUTHOR] |
| Copyright of Computer-Aided Civil & Infrastructure Engineering is the property of Wiley-Blackwell and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Engineering Source |
|
Full text is not displayed to guests.
Login for full access.
|
|
| FullText | Links: – Type: pdflink Text: Availability: 1 |
|---|---|
| Header | DbId: egs DbLabel: Engineering Source An: 183953368 AccessLevel: 6 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 0 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: Training of construction robots using imitation learning and environmental rewards. – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Duan%2C+Kangkang%22">Duan, Kangkang</searchLink><relatesTo>1</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Zou%2C+Zhengbo%22">Zou, Zhengbo</searchLink><relatesTo>1</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Yang%2C+T%2E+Y%2E%22">Yang, T. Y.</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> yang@civil.ubc.ca</i> – Name: TitleSource Label: Source Group: Src Data: <searchLink fieldCode="JN" term="%22Computer-Aided+Civil+%26+Infrastructure+Engineering%22">Computer-Aided Civil & Infrastructure Engineering</searchLink>. 4/4/2025, Vol. 40 Issue 9, p1150-1165. 16p. – Name: Subject Label: Subjects Group: Su Data: <searchLink fieldCode="DE" term="%22Observational+learning%22">Observational learning</searchLink><br /><searchLink fieldCode="DE" term="%22Imitative+behavior%22">Imitative behavior</searchLink><br /><searchLink fieldCode="DE" term="%22Gesture+controlled+interfaces+%28Computer+systems%29%22">Gesture controlled interfaces (Computer systems)</searchLink><br /><searchLink fieldCode="DE" term="%22Robot+design+%26+construction%22">Robot design & construction</searchLink><br /><searchLink fieldCode="DE" term="%22Machine+learning%22">Machine learning</searchLink><br /><searchLink fieldCode="DE" term="%22Reward+%28Psychology%29%22">Reward (Psychology)</searchLink><br /><searchLink fieldCode="DE" term="%22Virtual+reality%22">Virtual reality</searchLink><br /><searchLink fieldCode="DE" term="%22Learning%22">Learning</searchLink> – Name: Abstract Label: Abstract Group: Ab Data: Construction robots are challenging the paradigm of labor‐intensive construction tasks. Imitation learning (IL) offers a promising approach, enabling robots to mimic expert actions. However, obtaining high‐quality expert demonstrations is a major bottleneck in this process as teleoperated robot motions may not align with optimal kinematic behavior. In this paper, two innovations have been proposed. First, traditional control using controllers has been replaced with vision‐based hand gesture control for intuitive demonstration collection. Second, a novel method that integrates both demonstrations and simple environmental rewards is proposed to strike a balance between imitation and exploration. To achieve this goal, a two‐step training process is proposed. In the first step, an intuitive demonstration collection platform using virtual reality is utilized. Second, a learning algorithm is used to train a policy for construction tasks. Experimental results demonstrate that combining IL with environmental rewards can significantly accelerate the training, even with limited demonstration data. [ABSTRACT FROM AUTHOR] – Name: AbstractSuppliedCopyright Label: Group: Ab Data: <i>Copyright of Computer-Aided Civil & Infrastructure Engineering is the property of Wiley-Blackwell and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.) |
| PLink | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=183953368 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1111/mice.13394 Languages: – Code: eng Text: English PhysicalDescription: Pagination: PageCount: 16 StartPage: 1150 Subjects: – SubjectFull: Observational learning Type: general – SubjectFull: Imitative behavior Type: general – SubjectFull: Gesture controlled interfaces (Computer systems) Type: general – SubjectFull: Robot design & construction Type: general – SubjectFull: Machine learning Type: general – SubjectFull: Reward (Psychology) Type: general – SubjectFull: Virtual reality Type: general – SubjectFull: Learning Type: general Titles: – TitleFull: Training of construction robots using imitation learning and environmental rewards. Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Duan, Kangkang – PersonEntity: Name: NameFull: Zou, Zhengbo – PersonEntity: Name: NameFull: Yang, T. Y. IsPartOfRelationships: – BibEntity: Dates: – D: 04 M: 04 Text: 4/4/2025 Type: published Y: 2025 Identifiers: – Type: issn-print Value: 10939687 Numbering: – Type: volume Value: 40 – Type: issue Value: 9 Titles: – TitleFull: Computer-Aided Civil & Infrastructure Engineering Type: main |
| ResultId | 1 |