Fast Deterministic Black-box Context-free Grammar Inference.
Saved in:
| Title: | Fast Deterministic Black-box Context-free Grammar Inference. |
|---|---|
| Authors: | Arefin, Mohammad Rifat1 mxa7262@mavs.uta.edu, Shetiya, Suraj1 suraj.shetiya@mavs.uta.edu, Wang, Zili2 ziliw1@iastate.edu, Csallner, Christoph1 csallner@uta.edu |
| Source: | ICSE: International Conference on Software Engineering. 2024, p1-12. 12p. |
| Subjects: | Generalization, Heuristic programming, Source code, Open source software, Deterministic algorithms |
| Abstract: | Black-box context-free grammar inference is a hard problem as in many practical settings it only has access to a limited number of example programs. The state-of-the-art approach Arvada heuristically generalizes grammar rules starting from flat parse trees and is non-deterministic to explore different generalization sequences. We observe that many of Arvada's generalization steps violate common language concept nesting rules. We thus propose to pre-structure input programs along these nesting rules, apply learnt rules recursively, and make black-box context-free grammar inference deterministic. The resulting TreeVada yielded faster runtime and higher-quality grammars in an empirical comparison. The TreeVada source code, scripts, evaluation parameters, and training data are open-source and publicly available (https://doi.org/10.6084/m9.figshare.23907738). [ABSTRACT FROM AUTHOR] |
| Copyright of ICSE: International Conference on Software Engineering is the property of Association for Computing Machinery and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Engineering Source |
|
Full text is not displayed to guests.
Login for full access.
|
|
| FullText | Links: – Type: pdflink Text: Availability: 1 |
|---|---|
| Header | DbId: egs DbLabel: Engineering Source An: 185196494 AccessLevel: 6 PubType: Conference PubTypeId: conference PreciseRelevancyScore: 0 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: Fast Deterministic Black-box Context-free Grammar Inference. – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Arefin%2C+Mohammad+Rifat%22">Arefin, Mohammad Rifat</searchLink><relatesTo>1</relatesTo><i> mxa7262@mavs.uta.edu</i><br /><searchLink fieldCode="AR" term="%22Shetiya%2C+Suraj%22">Shetiya, Suraj</searchLink><relatesTo>1</relatesTo><i> suraj.shetiya@mavs.uta.edu</i><br /><searchLink fieldCode="AR" term="%22Wang%2C+Zili%22">Wang, Zili</searchLink><relatesTo>2</relatesTo><i> ziliw1@iastate.edu</i><br /><searchLink fieldCode="AR" term="%22Csallner%2C+Christoph%22">Csallner, Christoph</searchLink><relatesTo>1</relatesTo><i> csallner@uta.edu</i> – Name: TitleSource Label: Source Group: Src Data: <searchLink fieldCode="JN" term="%22ICSE%3A+International+Conference+on+Software+Engineering%22">ICSE: International Conference on Software Engineering</searchLink>. 2024, p1-12. 12p. – Name: Subject Label: Subjects Group: Su Data: <searchLink fieldCode="DE" term="%22Generalization%22">Generalization</searchLink><br /><searchLink fieldCode="DE" term="%22Heuristic+programming%22">Heuristic programming</searchLink><br /><searchLink fieldCode="DE" term="%22Source+code%22">Source code</searchLink><br /><searchLink fieldCode="DE" term="%22Open+source+software%22">Open source software</searchLink><br /><searchLink fieldCode="DE" term="%22Deterministic+algorithms%22">Deterministic algorithms</searchLink> – Name: Abstract Label: Abstract Group: Ab Data: Black-box context-free grammar inference is a hard problem as in many practical settings it only has access to a limited number of example programs. The state-of-the-art approach Arvada heuristically generalizes grammar rules starting from flat parse trees and is non-deterministic to explore different generalization sequences. We observe that many of Arvada's generalization steps violate common language concept nesting rules. We thus propose to pre-structure input programs along these nesting rules, apply learnt rules recursively, and make black-box context-free grammar inference deterministic. The resulting TreeVada yielded faster runtime and higher-quality grammars in an empirical comparison. The TreeVada source code, scripts, evaluation parameters, and training data are open-source and publicly available (https://doi.org/10.6084/m9.figshare.23907738). [ABSTRACT FROM AUTHOR] – Name: AbstractSuppliedCopyright Label: Group: Ab Data: <i>Copyright of ICSE: International Conference on Software Engineering is the property of Association for Computing Machinery and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.) |
| PLink | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=185196494 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1145/3597503.3639214 Languages: – Code: eng Text: English PhysicalDescription: Pagination: PageCount: 12 StartPage: 1 Subjects: – SubjectFull: Generalization Type: general – SubjectFull: Heuristic programming Type: general – SubjectFull: Source code Type: general – SubjectFull: Open source software Type: general – SubjectFull: Deterministic algorithms Type: general Titles: – TitleFull: Fast Deterministic Black-box Context-free Grammar Inference. Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Arefin, Mohammad Rifat – PersonEntity: Name: NameFull: Shetiya, Suraj – PersonEntity: Name: NameFull: Wang, Zili – PersonEntity: Name: NameFull: Csallner, Christoph IsPartOfRelationships: – BibEntity: Dates: – D: 01 M: 05 Text: 2024 Type: published Y: 2024 Titles: – TitleFull: ICSE: International Conference on Software Engineering Type: main |
| ResultId | 1 |