Abstract
Betweenness centrality quantifies the importance of graph nodes in a variety of applications including social, biological and communication networks. Its computation is very costly for large graphs; therefore, many approximate methods have been proposed. Given the lack of a golden standard, the accuracy of most approximate methods is evaluated on tiny graphs and is not guaranteed to be representative of realistic datasets that are orders of magnitude larger. In this paper, we develop BeBeCA, a benchmark for betweenness centrality approximation methods on large graphs. Specifically: (i) We generate a golden standard by deploying a parallel implementation of Brandes algorithm using 96,000 CPU cores on a supercomputer to compute exact betweenness centrality values for several large graphs with up to 126M edges. (ii) We propose an evaluation methodology to assess various aspects of approximation accuracy, such as average error and quality of node ranking. (iii) We survey a large number of existing approximation methods and compare their performance and accuracy using our benchmark. (iv) We publicly share our benchmark, which includes the golden standard exact betweenness centrality values together with the scripts that implement our evaluation methodology; for researchers to compare their own algorithms and practitioners to select the appropriate algorithm for their application and data.
Original language | English (US) |
---|---|
Title of host publication | SSDBM 2017 |
Subtitle of host publication | 29th International Conference on Scientific and Statistical Database Management |
Publisher | Association for Computing Machinery |
ISBN (Electronic) | 9781450352826 |
DOIs | |
State | Published - Jun 27 2017 |
Event | 29th International Conference on Scientific and Statistical Database Management, SSDBM 2017 - Chicago, United States Duration: Jun 27 2017 → Jun 29 2017 |
Publication series
Name | ACM International Conference Proceeding Series |
---|---|
Volume | Part F128636 |
Conference
Conference | 29th International Conference on Scientific and Statistical Database Management, SSDBM 2017 |
---|---|
Country/Territory | United States |
City | Chicago |
Period | 06/27/17 → 06/29/17 |
Bibliographical note
Publisher Copyright:© 2017 ACM.
Keywords
- Approximation algorithms
- Betweenness centrality
- Experimental evaluation
- Social networks
ASJC Scopus subject areas
- Software
- Human-Computer Interaction
- Computer Vision and Pattern Recognition
- Computer Networks and Communications