Finding the right term: Retrieving and exploring semantic concepts in astronomical vocabularies

Bibliographic Details
Title: Finding the right term: Retrieving and exploring semantic concepts in astronomical vocabularies
Authors: Gray, Alasdair J.G. (1) agray@dcs.gla.ac.uk; Gray, Norman (2) norman@astro.gla.ac.uk; Hall, Christopher W. (1); Ounis, Iadh (1) ounis@dcs.gla.ac.uk
Source: Information Processing & Management. Jul 2010, Vol. 46, Issue 4, pp. 470-478 (9 pages).
Subjects: Information retrieval, Tags (Metadata), Astronomy, Vocabulary, Semantic Web, Terms & phrases, QUERY (Information retrieval system), Semantics, Web search engines
Abstract: Astronomy, like many domains, already has several sets of terminology in general use, referred to as controlled vocabularies; examples include the keywords used for tagging journal articles and the taxonomy of terms used to label image files. These existing vocabularies can be encoded in SKOS, a W3C proposed recommendation for representing vocabularies on the Semantic Web, so that computer systems can help users to search for and discover resources tagged with vocabulary concepts. However, this requires a search mechanism to go from a user-supplied string to a vocabulary concept. In this paper, we present our experiences in implementing the Vocabulary Explorer, a vocabulary search service based on the Terrier Information Retrieval Platform. We investigate the capabilities of existing document weighting models for identifying the correct vocabulary concept for a query. Because a SKOS-encoded vocabulary is highly structured, we also investigate the effects of term weighting (boosting the score of concepts that match on particular fields of a vocabulary concept) and query expansion. We found that the existing document weighting models provided very high quality results, but that these could be improved further with term weighting that makes use of the semantic evidence. [Copyright Elsevier]
Copyright of Information Processing & Management is the property of Pergamon Press - An Imprint of Elsevier Science and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Engineering Source
FullText Text:
  Availability: 0
Header DbId: egs
DbLabel: Engineering Source
An: 51296036
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Finding the right term: Retrieving and exploring semantic concepts in astronomical vocabularies
– Name: Author
  Label: Authors
  Group: Au
  Data: Gray, Alasdair J.G. (1) agray@dcs.gla.ac.uk; Gray, Norman (2) norman@astro.gla.ac.uk; Hall, Christopher W. (1); Ounis, Iadh (1) ounis@dcs.gla.ac.uk
– Name: TitleSource
  Label: Source
  Group: Src
  Data: Information Processing & Management. Jul 2010, Vol. 46, Issue 4, pp. 470-478 (9 pages).
– Name: Subject
  Label: Subjects
  Group: Su
  Data: Information retrieval, Tags (Metadata), Astronomy, Vocabulary, Semantic Web, Terms & phrases, QUERY (Information retrieval system), Semantics, Web search engines
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=51296036
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1016/j.ipm.2009.09.004
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 9
        StartPage: 470
    Subjects:
      – SubjectFull: Information retrieval
        Type: general
      – SubjectFull: Tags (Metadata)
        Type: general
      – SubjectFull: Astronomy
        Type: general
      – SubjectFull: Vocabulary
        Type: general
      – SubjectFull: Semantic Web
        Type: general
      – SubjectFull: Terms & phrases
        Type: general
      – SubjectFull: QUERY (Information retrieval system)
        Type: general
      – SubjectFull: Semantics
        Type: general
      – SubjectFull: Web search engines
        Type: general
    Titles:
      – TitleFull: Finding the right term: Retrieving and exploring semantic concepts in astronomical vocabularies
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Gray, Alasdair J.G.
      – PersonEntity:
          Name:
            NameFull: Gray, Norman
      – PersonEntity:
          Name:
            NameFull: Hall, Christopher W.
      – PersonEntity:
          Name:
            NameFull: Ounis, Iadh
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 07
              Text: Jul2010
              Type: published
              Y: 2010
          Identifiers:
            – Type: issn-print
              Value: 03064573
          Numbering:
            – Type: volume
              Value: 46
            – Type: issue
              Value: 4
          Titles:
            – TitleFull: Information Processing & Management
              Type: main