TY - GEN
T1 - Quick Mining of Isomorphic Exact Large Patterns from Large Graphs
AU - Almasri, Islam
AU - Gao, Xin
AU - Fedoroff, Nina V.
N1 - KAUST Repository Item: Exported on 2020-10-01
PY - 2014/12
Y1 - 2014/12
N2 - The applications of the sub graph isomorphism search are growing with the growing number of areas that model their systems using graphs or networks. Specifically, many biological systems, such as protein interaction networks, molecular structures and protein contact maps, are modeled as graphs. The sub graph isomorphism search is concerned with finding all sub graphs that are isomorphic to a relevant query graph, the existence of such sub graphs can reflect on the characteristics of the modeled system. The most computationally expensive step in the search for isomorphic sub graphs is the backtracking algorithm that traverses the nodes of the target graph. In this paper, we propose a pruning approach that is inspired by the minimum remaining value heuristic that achieves greater scalability over large query and target graphs. Our testing on various biological networks shows that performance enhancement of our approach over existing state-of-the-art approaches varies between 6x and 53x. © 2014 IEEE.
AB - The applications of the sub graph isomorphism search are growing with the growing number of areas that model their systems using graphs or networks. Specifically, many biological systems, such as protein interaction networks, molecular structures and protein contact maps, are modeled as graphs. The sub graph isomorphism search is concerned with finding all sub graphs that are isomorphic to a relevant query graph, the existence of such sub graphs can reflect on the characteristics of the modeled system. The most computationally expensive step in the search for isomorphic sub graphs is the backtracking algorithm that traverses the nodes of the target graph. In this paper, we propose a pruning approach that is inspired by the minimum remaining value heuristic that achieves greater scalability over large query and target graphs. Our testing on various biological networks shows that performance enhancement of our approach over existing state-of-the-art approaches varies between 6x and 53x. © 2014 IEEE.
UR - http://hdl.handle.net/10754/565845
UR - http://ieeexplore.ieee.org/document/7022640/
UR - http://www.scopus.com/inward/record.url?scp=84936872255&partnerID=8YFLogxK
U2 - 10.1109/ICDMW.2014.65
DO - 10.1109/ICDMW.2014.65
M3 - Conference contribution
SN - 9781479942749
SP - 517
EP - 524
BT - 2014 IEEE International Conference on Data Mining Workshop
PB - Institute of Electrical and Electronics Engineers (IEEE)
ER -