Structured Data on the Web.

Saved in:
Bibliographic Details
Title: Structured Data on the Web.
Authors: CAFARELLA, MICHAEL J.1 michjc@umich.edu, HALEVY, ALON2 halevy@google.com, MADHAVAN, JAYANT3 jayant@google.com
Source: Communications of the ACM. Feb2011, Vol. 54 Issue 2, p72-79. 8p. 3 Color Photographs, 2 Charts.
Subjects: Structured techniques of electronic data processing, Web search engines, Online databases, Data analysis, Database management, Google Inc.
Abstract: The article discusses the nature of structured data on the World Wide Web and the challenges of using it. Because data on the Web can be about any topic, does not conform to a standardized data design, and is embedded in pages of text, it poses significant problems for traditional data management techniques. Methods for data integration and data modeling using multiple internet sources and metadata are discussed. The examples provided are WebTables and Deep Web Crawler, both developed by the internet search firm Google.
Database: Engineering Source
FullText Links:
  – Type: pdflink
Text:
  Availability: 0
Header DbId: egs
DbLabel: Engineering Source
An: 59631584
AccessLevel: 6
PubType: Periodical
PubTypeId: serialPeriodical
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Structured Data on the Web.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22CAFARELLA%2C+MICHAEL+J%2E%22">CAFARELLA, MICHAEL J.</searchLink><relatesTo>1</relatesTo><i> michjc@umich.edu</i><br /><searchLink fieldCode="AR" term="%22HALEVY%2C+ALON%22">HALEVY, ALON</searchLink><relatesTo>2</relatesTo><i> halevy@google.com</i><br /><searchLink fieldCode="AR" term="%22MADHAVAN%2C+JAYANT%22">MADHAVAN, JAYANT</searchLink><relatesTo>3</relatesTo><i> jayant@google.com</i>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <searchLink fieldCode="JN" term="%22Communications+of+the+ACM%22">Communications of the ACM</searchLink>. Feb2011, Vol. 54 Issue 2, p72-79. 8p. 3 Color Photographs, 2 Charts.
– Name: Subject
  Label: Subjects
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Structured+techniques+of+electronic+data+processing%22">Structured techniques of electronic data processing</searchLink><br /><searchLink fieldCode="DE" term="%22Web+search+engines%22">Web search engines</searchLink><br /><searchLink fieldCode="DE" term="%22Online+databases%22">Online databases</searchLink><br /><searchLink fieldCode="DE" term="%22Data+analysis%22">Data analysis</searchLink><br /><searchLink fieldCode="DE" term="%22Database+management%22">Database management</searchLink><br /><searchLink fieldCode="DE" term="%22Google+Inc%2E%22">Google Inc.</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: The article discusses the nature of structured data on the World Wide Web and the challenges of using it. Because data on the Web can be about any topic, does not conform to a standardized data design, and is embedded in pages of text, it poses significant problems for traditional data management techniques. Methods for data integration and data modeling using multiple internet sources and metadata are discussed. The examples provided are WebTables and Deep Web Crawler, both developed by the internet search firm Google.
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=59631584
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1145/1897816.1897839
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 8
        StartPage: 72
    Subjects:
      – SubjectFull: Structured techniques of electronic data processing
        Type: general
      – SubjectFull: Web search engines
        Type: general
      – SubjectFull: Online databases
        Type: general
      – SubjectFull: Data analysis
        Type: general
      – SubjectFull: Database management
        Type: general
      – SubjectFull: Google Inc.
        Type: general
    Titles:
      – TitleFull: Structured Data on the Web.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: CAFARELLA, MICHAEL J.
      – PersonEntity:
          Name:
            NameFull: HALEVY, ALON
      – PersonEntity:
          Name:
            NameFull: MADHAVAN, JAYANT
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 02
              Text: Feb2011
              Type: published
              Y: 2011
          Identifiers:
            – Type: issn-print
              Value: 00010782
          Numbering:
            – Type: volume
              Value: 54
            – Type: issue
              Value: 2
          Titles:
            – TitleFull: Communications of the ACM
              Type: main
ResultId 1