Enriching Context Information for Entity Linking with Web Data

Yi Ting Wang, Jie Shen, Zhi Xu Li, Qiang Yang, An Liu, Peng Peng Zhao, Jia Jie Xu, Lei Zhao, Xun Jie Yang

Research output: Contribution to journal, Article, peer-review

2 Scopus citations


Entity linking (EL) is the task of determining the identity of textual entity mentions with respect to a predefined knowledge base (KB). Many existing approaches rely on either “local” information (the context of the mention in the text) or “global” information (relations among candidate entities). However, local or global information alone may be insufficient, especially when the given text is short. To obtain richer local and global information for entity linking, we propose to enrich the context of mentions with extra contexts retrieved from the web through web search engines (WSE). Based on this intuition, we make two novel attempts. The first adds web search results to an embedding-based method to expand the mention’s local information; to generate high-quality web contexts, we try two different methods, one applying an attention mechanism and the other using abstract extraction. The second uses the web contexts to extend the global information, i.e., it finds and utilizes additional relevant mentions from the web contexts with a graph-based model. Finally, we combine the two proposed models to use both the extended local and the extended global information from the extra web contexts. Our empirical study on six real-world datasets shows that using extra web contexts to extend the local and global information can effectively improve the F1 score of entity linking.
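The idea of enriching a short mention context with web text can be illustrated with a toy sketch. This is not the authors' model: it swaps the embedding-based and graph-based methods for a plain bag-of-words cosine similarity, and the candidate descriptions, the web snippet, and the function names (`bag_of_words`, `cosine`, `link`) are all hypothetical stand-ins chosen for illustration. No real search engine is called; the snippet is hand-written.

```python
import math
import re
from collections import Counter

# Hypothetical KB: candidate entity name -> short description (illustrative only).
CANDIDATES = {
    "Michael Jordan": "basketball player for the Chicago Bulls",
    "Jordan (country)": "country in the Middle East with capital Amman",
}


def bag_of_words(text):
    """Lowercase the text and count its alphanumeric tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two token-count vectors (0.0 if either is empty)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def link(context, candidates, web_snippets=()):
    """Pick the candidate whose description best matches the context.

    web_snippets is assumed to hold text already retrieved for the mention;
    it is simply appended to the original context before scoring.
    """
    enriched = bag_of_words(" ".join([context, *web_snippets]))
    return max(candidates, key=lambda name: cosine(enriched, bag_of_words(candidates[name])))


short_text = "Jordan is great"
# Hand-written stand-in for a web search result about the mention.
snippets = ["Amman is the capital of Jordan, a country in the Middle East"]

print(link(short_text, CANDIDATES))            # short context only
print(link(short_text, CANDIDATES, snippets))  # context enriched with web text
```

With the short text alone, neither description shares a token with the context, so the scores tie and the choice is arbitrary. Once the snippet is appended, the country description overlaps far more with the enriched context, which is the effect of added local information that the abstract describes.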
Original language: English (US)
Pages (from-to): 724-738
Number of pages: 15
Journal: Journal of Computer Science and Technology
Issue number: 4
State: Published - Jul 27 2020

Bibliographical note

KAUST Repository Item: Exported on 2020-10-01

