Human-level performance in 3D multiplayer games with population-based reinforcement learning.
Saved in:
| Title: | Human-level performance in 3D multiplayer games with population-based reinforcement learning. |
|---|---|
| Authors: | Jaderberg, Max (AUTHOR), Czarnecki, Wojciech M. (AUTHOR), Dunning, Iain (AUTHOR), Marris, Luke (AUTHOR), Lever, Guy (AUTHOR), Castañeda, Antonio Garcia (AUTHOR), Beattie, Charles (AUTHOR), Rabinowitz, Neil C. (AUTHOR), Morcos, Ari S. (AUTHOR), Ruderman, Avraham (AUTHOR), Sonnerat, Nicolas (AUTHOR), Green, Tim (AUTHOR), Deason, Louise (AUTHOR), Leibo, Joel Z. (AUTHOR), Silver, David (AUTHOR), Hassabis, Demis (AUTHOR), Kavukcuoglu, Koray (AUTHOR), Graepel, Thore (AUTHOR) |
| Source: | Science (pre-March 2025). 5/31/2019, Vol. 364 Issue 6443, p859-865. 7p. 4 Diagrams. |
| Subjects: | Reinforcement learning, Multiplayer games, Agent (Philosophy), Pixels, Multiagent systems, Artificial intelligence research, Video games |
| Abstract: | Reinforcement learning (RL) has shown great success in increasingly complex single-agent environments and two-player turn-based games. However, the real world contains multiple agents, each learning and acting independently to cooperate and compete with other agents. We used a tournament-style evaluation to demonstrate that an agent can achieve human-level performance in a three-dimensional multiplayer first-person video game, Quake III Arena in Capture the Flag mode, using only pixels and game points scored as input. We used a two-tier optimization process in which a population of independent RL agents are trained concurrently from thousands of parallel matches on randomly generated environments. Each agent learns its own internal reward signal and rich representation of the world. These results indicate the great potential of multiagent reinforcement learning for artificial intelligence research. [ABSTRACT FROM AUTHOR] |
| Copyright of Science (pre-March 2025) is the property of American Association for the Advancement of Science and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Psychology and Behavioral Sciences Collection |
|
Full text is not displayed to guests.
Login for full access.
|
|
| FullText | Links: – Type: pdflink Text: Availability: 1 |
|---|---|
| Header | DbId: pbh DbLabel: Psychology and Behavioral Sciences Collection An: 136815122 AccessLevel: 6 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 0 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: Human-level performance in 3D multiplayer games with population-based reinforcement learning. – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Jaderberg%2C+Max%22">Jaderberg, Max</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Czarnecki%2C+Wojciech+M%2E%22">Czarnecki, Wojciech M.</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Dunning%2C+Iain%22">Dunning, Iain</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Marris%2C+Luke%22">Marris, Luke</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Lever%2C+Guy%22">Lever, Guy</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Castañeda%2C+Antonio+Garcia%22">Castañeda, Antonio Garcia</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Beattie%2C+Charles%22">Beattie, Charles</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Rabinowitz%2C+Neil+C%2E%22">Rabinowitz, Neil C.</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Morcos%2C+Ari+S%2E%22">Morcos, Ari S.</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Ruderman%2C+Avraham%22">Ruderman, Avraham</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Sonnerat%2C+Nicolas%22">Sonnerat, Nicolas</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Green%2C+Tim%22">Green, Tim</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Deason%2C+Louise%22">Deason, Louise</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Leibo%2C+Joel+Z%2E%22">Leibo, Joel Z.</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Silver%2C+David%22">Silver, David</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Hassabis%2C+Demis%22">Hassabis, Demis</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Kavukcuoglu%2C+Koray%22">Kavukcuoglu, Koray</searchLink> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Graepel%2C+Thore%22">Graepel, Thore</searchLink> (AUTHOR) – Name: TitleSource Label: Source Group: Src Data: <searchLink fieldCode="JN" term="%22Science+%28pre-March+2025%29%22">Science (pre-March 2025)</searchLink>. 5/31/2019, Vol. 364 Issue 6443, p859-865. 7p. 4 Diagrams. – Name: Subject Label: Subjects Group: Su Data: <searchLink fieldCode="DE" term="%22Reinforcement+learning%22">Reinforcement learning</searchLink><br /><searchLink fieldCode="DE" term="%22Multiplayer+games%22">Multiplayer games</searchLink><br /><searchLink fieldCode="DE" term="%22Agent+%28Philosophy%29%22">Agent (Philosophy)</searchLink><br /><searchLink fieldCode="DE" term="%22Pixels%22">Pixels</searchLink><br /><searchLink fieldCode="DE" term="%22Multiagent+systems%22">Multiagent systems</searchLink><br /><searchLink fieldCode="DE" term="%22Artificial+intelligence+research%22">Artificial intelligence research</searchLink><br /><searchLink fieldCode="DE" term="%22Video+games%22">Video games</searchLink> – Name: Abstract Label: Abstract Group: Ab Data: Reinforcement learning (RL) has shown great success in increasingly complex single-agent environments and two-player turn-based games. However, the real world contains multiple agents, each learning and acting independently to cooperate and compete with other agents. We used a tournament-style evaluation to demonstrate that an agent can achieve human-level performance in a three-dimensional multiplayer first-person video game, Quake III Arena in Capture the Flag mode, using only pixels and game points scored as input. We used a two-tier optimization process in which a population of independent RL agents are trained concurrently from thousands of parallel matches on randomly generated environments. Each agent learns its own internal reward signal and rich representation of the world. These results indicate the great potential of multiagent reinforcement learning for artificial intelligence research. [ABSTRACT FROM AUTHOR] – Name: AbstractSuppliedCopyright Label: Group: Ab Data: <i>Copyright of Science (pre-March 2025) is the property of American Association for the Advancement of Science and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.) |
| PLink | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=pbh&AN=136815122 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1126/science.aau6249 Languages: – Code: eng Text: English PhysicalDescription: Pagination: PageCount: 7 StartPage: 859 Subjects: – SubjectFull: Reinforcement learning Type: general – SubjectFull: Multiplayer games Type: general – SubjectFull: Agent (Philosophy) Type: general – SubjectFull: Pixels Type: general – SubjectFull: Multiagent systems Type: general – SubjectFull: Artificial intelligence research Type: general – SubjectFull: Video games Type: general Titles: – TitleFull: Human-level performance in 3D multiplayer games with population-based reinforcement learning. Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Jaderberg, Max – PersonEntity: Name: NameFull: Czarnecki, Wojciech M. – PersonEntity: Name: NameFull: Dunning, Iain – PersonEntity: Name: NameFull: Marris, Luke – PersonEntity: Name: NameFull: Lever, Guy – PersonEntity: Name: NameFull: Castañeda, Antonio Garcia – PersonEntity: Name: NameFull: Beattie, Charles – PersonEntity: Name: NameFull: Rabinowitz, Neil C. – PersonEntity: Name: NameFull: Morcos, Ari S. – PersonEntity: Name: NameFull: Ruderman, Avraham – PersonEntity: Name: NameFull: Sonnerat, Nicolas – PersonEntity: Name: NameFull: Green, Tim – PersonEntity: Name: NameFull: Deason, Louise – PersonEntity: Name: NameFull: Leibo, Joel Z. – PersonEntity: Name: NameFull: Silver, David – PersonEntity: Name: NameFull: Hassabis, Demis – PersonEntity: Name: NameFull: Kavukcuoglu, Koray – PersonEntity: Name: NameFull: Graepel, Thore IsPartOfRelationships: – BibEntity: Dates: – D: 31 M: 05 Text: 5/31/2019 Type: published Y: 2019 Identifiers: – Type: issn-print Value: 00368075 Numbering: – Type: volume Value: 364 – Type: issue Value: 6443 Titles: – TitleFull: Science (pre-March 2025) Type: main |
| ResultId | 1 |