An Overview of Schema Extraction and Matching Techniques

Muhammad Umair Hassan, Kamran Shankat, Dongmie Niu, Sundas Mahreen, Yingjun Ma, Fatima Haider, Muhammad Mubashir, Xiuyang Zhao

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Scopus citations

Abstract

W383 contains huge amount of data both in structured and unstructured form. One of the forms for the structured data is HTML tables for which metadata are not explicitly stored/available. As a result, data in such tables cannot be queried accurately and users cannot get exact results to their queries through search engines. Schema extraction establishes schema for data found in different form of web tables. Once the schema of web tables has been extracted, the tables can be created and populated against this schema which can be queried using SQL resulting much better results than traditional search engines. Schema Matching determines number of correspondences which identifies the similar elements from two different schemas. Columns and data values are compared one after the other to match schema. In this paper, different ways of extracting data from tables are mentioned and different tools used for schema extraction are named. Two techniques for Schema Matching are briefly explained.
Original languageEnglish (US)
Title of host publicationProceedings of 2018 2nd IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference, IMCEC 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1290-1294
Number of pages5
ISBN (Print)9781538618035
DOIs
StatePublished - Sep 20 2018
Externally publishedYes

Bibliographical note

Generated from Scopus record by KAUST IRTS on 2023-09-20

Fingerprint

Dive into the research topics of 'An Overview of Schema Extraction and Matching Techniques'. Together they form a unique fingerprint.

Cite this