TY - GEN
T1 - A comparison of algorithms and humans for mitosis detection
AU - Giusti, Alessandro
AU - Caccia, Claudio
AU - Cireşan, Dan C.
AU - Schmidhuber, Jürgen
AU - Gambardella, Luca M.
N1 - Generated from Scopus record by KAUST IRTS on 2022-09-14
PY - 2014/7/29
Y1 - 2014/7/29
N2 - We consider the problem of detecting mitotic figures in breast cancer histology slides. We investigate whether the performance of state-of-the-art detection algorithms is comparable to the performance of humans, when they are compared under fair conditions: our test subjects were not previously exposed to the task, and were required to learn their own classification criteria solely by studying the same training set available to algorithms. We designed and implemented a standardized web-based test based on the publicly-available MITOS dataset, and compared results with the performance of the 6 top-scoring algorithms in the ICPR 2012 Mitosis Detection Contest. The problem is presented as a classification task on a balanced dataset. 45 different test subjects produced a total of 3009 classifications. The best individual (accuracy = 0.859 ± 0.012), is outperformed by the most accurate algorithm (accuracy = 0.873 ± 0.004). This suggests that state-of-the-art detection algorithms are likely limited by the size of the training set, rather than by lack of generalization ability.
AB - We consider the problem of detecting mitotic figures in breast cancer histology slides. We investigate whether the performance of state-of-the-art detection algorithms is comparable to the performance of humans, when they are compared under fair conditions: our test subjects were not previously exposed to the task, and were required to learn their own classification criteria solely by studying the same training set available to algorithms. We designed and implemented a standardized web-based test based on the publicly-available MITOS dataset, and compared results with the performance of the 6 top-scoring algorithms in the ICPR 2012 Mitosis Detection Contest. The problem is presented as a classification task on a balanced dataset. 45 different test subjects produced a total of 3009 classifications. The best individual (accuracy = 0.859 ± 0.012), is outperformed by the most accurate algorithm (accuracy = 0.873 ± 0.004). This suggests that state-of-the-art detection algorithms are likely limited by the size of the training set, rather than by lack of generalization ability.
UR - https://ieeexplore.ieee.org/document/6868130/
UR - http://www.scopus.com/inward/record.url?scp=84927939729&partnerID=8YFLogxK
U2 - 10.1109/isbi.2014.6868130
DO - 10.1109/isbi.2014.6868130
M3 - Conference contribution
SN - 9781467319591
SP - 1360
EP - 1363
BT - 2014 IEEE 11th International Symposium on Biomedical Imaging, ISBI 2014
PB - Institute of Electrical and Electronics Engineers Inc.
ER -