TY - JOUR
T1 - Enriching Context Information for Entity Linking with Web Data
AU - Wang, Yi Ting
AU - Shen, Jie
AU - Li, Zhi Xu
AU - Yang, Qiang
AU - Liu, An
AU - Zhao, Peng Peng
AU - Xu, Jia Jie
AU - Zhao, Lei
AU - Yang, Xun Jie
N1 - KAUST Repository Item: Exported on 2020-10-01
PY - 2020/7/27
Y1 - 2020/7/27
N2 - Entity linking (EL) is the task of determining the identity of textual entity mentions given a predefined knowledge base (KB). Plenty of existing efforts have been made on this task using either “local” information (contextual information of the mention in the text), or “global” information (relations among candidate entities). However, either local or global information might be insufficient especially when the given text is short. To get richer local and global information for entity linking, we propose to enrich the context information for mentions by getting extra contexts from the web through web search engines (WSE). Based on the intuition above, two novel attempts are made. The first one adds web-searched results into an embedding-based method to expand the mention’s local information, where we try two different methods to help generate high-quality web contexts: one is to apply the attention mechanism and the other is to use the abstract extraction method. The second one uses the web contexts to extend the global information, i.e., finding and utilizing more extra relevant mentions from the web contexts with a graph-based model. Finally, we combine the two models we propose to use both extended local and global information from the extra web contexts. Our empirical study based on six real-world datasets shows that using extra web contexts to extend the local and the global information could effectively improve the F1 score of entity linking.
AB - Entity linking (EL) is the task of determining the identity of textual entity mentions given a predefined knowledge base (KB). Plenty of existing efforts have been made on this task using either “local” information (contextual information of the mention in the text), or “global” information (relations among candidate entities). However, either local or global information might be insufficient especially when the given text is short. To get richer local and global information for entity linking, we propose to enrich the context information for mentions by getting extra contexts from the web through web search engines (WSE). Based on the intuition above, two novel attempts are made. The first one adds web-searched results into an embedding-based method to expand the mention’s local information, where we try two different methods to help generate high-quality web contexts: one is to apply the attention mechanism and the other is to use the abstract extraction method. The second one uses the web contexts to extend the global information, i.e., finding and utilizing more extra relevant mentions from the web contexts with a graph-based model. Finally, we combine the two models we propose to use both extended local and global information from the extra web contexts. Our empirical study based on six real-world datasets shows that using extra web contexts to extend the local and the global information could effectively improve the F1 score of entity linking.
UR - http://hdl.handle.net/10754/664536
UR - http://link.springer.com/10.1007/s11390-020-0280-1
UR - http://www.scopus.com/inward/record.url?scp=85088641361&partnerID=8YFLogxK
U2 - 10.1007/s11390-020-0280-1
DO - 10.1007/s11390-020-0280-1
M3 - Article
SN - 1860-4749
VL - 35
SP - 724
EP - 738
JO - Journal of Computer Science and Technology
JF - Journal of Computer Science and Technology
IS - 4
ER -