Bibliographic Details
| Title: |
Structured Data on the Web. |
| Authors: |
CAFARELLA, MICHAEL J. (michjc@umich.edu); HALEVY, ALON (halevy@google.com); MADHAVAN, JAYANT (jayant@google.com) |
| Source: |
Communications of the ACM. Feb. 2011, Vol. 54, Issue 2, pp. 72-79. 8p. 3 Color Photographs, 2 Charts. |
| Subjects: |
Structured techniques of electronic data processing, Web search engines, Online databases, Data analysis, Database management, Google Inc. |
| Abstract: |
The article discusses the nature of structured data on the World Wide Web and the challenges of using it. Because data on the Web can be about any topic, does not conform to a standardized data design, and is embedded in pages of text, it poses significant problems for traditional data management techniques. Methods for data integration and data modeling using multiple internet sources and metadata are discussed. The examples provided are WebTables and Deep Web Crawler, both developed by the internet search firm Google. |
| Database: |
Engineering Source |