Fast Deterministic Black-box Context-free Grammar Inference.

Saved in:
Bibliographic Details
Title: Fast Deterministic Black-box Context-free Grammar Inference.
Authors: Arefin, Mohammad Rifat1 mxa7262@mavs.uta.edu, Shetiya, Suraj1 suraj.shetiya@mavs.uta.edu, Wang, Zili2 ziliw1@iastate.edu, Csallner, Christoph1 csallner@uta.edu
Source: ICSE: International Conference on Software Engineering. 2024, p1-12. 12p.
Subjects: Generalization, Heuristic programming, Source code, Open source software, Deterministic algorithms
Abstract: Black-box context-free grammar inference is a hard problem as in many practical settings it only has access to a limited number of example programs. The state-of-the-art approach Arvada heuristically generalizes grammar rules starting from flat parse trees and is non-deterministic to explore different generalization sequences. We observe that many of Arvada's generalization steps violate common language concept nesting rules. We thus propose to pre-structure input programs along these nesting rules, apply learnt rules recursively, and make black-box context-free grammar inference deterministic. The resulting TreeVada yielded faster runtime and higher-quality grammars in an empirical comparison. The TreeVada source code, scripts, evaluation parameters, and training data are open-source and publicly available (https://doi.org/10.6084/m9.figshare.23907738). [ABSTRACT FROM AUTHOR]
Copyright of ICSE: International Conference on Software Engineering is the property of Association for Computing Machinery and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Engineering Source
Full text is not displayed to guests.
FullText Links:
  – Type: pdflink
Text:
  Availability: 1
Header DbId: egs
DbLabel: Engineering Source
An: 185196494
AccessLevel: 6
PubType: Conference
PubTypeId: conference
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Fast Deterministic Black-box Context-free Grammar Inference.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Arefin%2C+Mohammad+Rifat%22">Arefin, Mohammad Rifat</searchLink><relatesTo>1</relatesTo><i> mxa7262@mavs.uta.edu</i><br /><searchLink fieldCode="AR" term="%22Shetiya%2C+Suraj%22">Shetiya, Suraj</searchLink><relatesTo>1</relatesTo><i> suraj.shetiya@mavs.uta.edu</i><br /><searchLink fieldCode="AR" term="%22Wang%2C+Zili%22">Wang, Zili</searchLink><relatesTo>2</relatesTo><i> ziliw1@iastate.edu</i><br /><searchLink fieldCode="AR" term="%22Csallner%2C+Christoph%22">Csallner, Christoph</searchLink><relatesTo>1</relatesTo><i> csallner@uta.edu</i>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <searchLink fieldCode="JN" term="%22ICSE%3A+International+Conference+on+Software+Engineering%22">ICSE: International Conference on Software Engineering</searchLink>. 2024, p1-12. 12p.
– Name: Subject
  Label: Subjects
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Generalization%22">Generalization</searchLink><br /><searchLink fieldCode="DE" term="%22Heuristic+programming%22">Heuristic programming</searchLink><br /><searchLink fieldCode="DE" term="%22Source+code%22">Source code</searchLink><br /><searchLink fieldCode="DE" term="%22Open+source+software%22">Open source software</searchLink><br /><searchLink fieldCode="DE" term="%22Deterministic+algorithms%22">Deterministic algorithms</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: Black-box context-free grammar inference is a hard problem as in many practical settings it only has access to a limited number of example programs. The state-of-the-art approach Arvada heuristically generalizes grammar rules starting from flat parse trees and is non-deterministic to explore different generalization sequences. We observe that many of Arvada's generalization steps violate common language concept nesting rules. We thus propose to pre-structure input programs along these nesting rules, apply learnt rules recursively, and make black-box context-free grammar inference deterministic. The resulting TreeVada yielded faster runtime and higher-quality grammars in an empirical comparison. The TreeVada source code, scripts, evaluation parameters, and training data are open-source and publicly available (https://doi.org/10.6084/m9.figshare.23907738). [ABSTRACT FROM AUTHOR]
– Name: AbstractSuppliedCopyright
  Label:
  Group: Ab
  Data: <i>Copyright of ICSE: International Conference on Software Engineering is the property of Association for Computing Machinery and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=185196494
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1145/3597503.3639214
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 12
        StartPage: 1
    Subjects:
      – SubjectFull: Generalization
        Type: general
      – SubjectFull: Heuristic programming
        Type: general
      – SubjectFull: Source code
        Type: general
      – SubjectFull: Open source software
        Type: general
      – SubjectFull: Deterministic algorithms
        Type: general
    Titles:
      – TitleFull: Fast Deterministic Black-box Context-free Grammar Inference.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Arefin, Mohammad Rifat
      – PersonEntity:
          Name:
            NameFull: Shetiya, Suraj
      – PersonEntity:
          Name:
            NameFull: Wang, Zili
      – PersonEntity:
          Name:
            NameFull: Csallner, Christoph
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 05
              Text: 2024
              Type: published
              Y: 2024
          Titles:
            – TitleFull: ICSE: International Conference on Software Engineering
              Type: main
ResultId 1