SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos

Anthony Cioppa, Silvio Giancola, Adrien Deliège, Le Kang, Xin Zhou, Zhiyu Cheng, Bernard Ghanem, Marc Van Droogenbroeck

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

28 Scopus citations


Tracking objects in soccer videos is extremely important to gather both player and team statistics, whether it is to estimate the total distance run, the ball possession or the team formation. Video processing can help automate the extraction of this information without the need for any invasive sensor, making it applicable to any team in any stadium. Yet, the availability of datasets to train learnable models and of benchmarks to evaluate methods on a common testbed is very limited. In this work, we propose a novel dataset for multiple object tracking composed of 200 sequences of 30 s each, representative of challenging soccer scenarios, and a complete 45-minute half for long-term tracking. The dataset is fully annotated with bounding boxes and tracklet IDs, enabling the training of MOT baselines in the soccer domain and a full benchmarking of those methods on our segregated challenge sets. Our analysis shows that multiple player, referee and ball tracking in soccer videos is far from being solved, with several improvements required in cases of fast motion or severe occlusion.
Original language: English (US)
Title of host publication: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
ISBN (Print): 978-1-6654-8740-5
State: Published - Aug 23 2022

Bibliographical note

KAUST Repository Item: Exported on 2022-09-14
Acknowledged KAUST grant number(s): CRG2017-3405
Acknowledgements: This work was supported by the Service Public de Wallonie (SPW) Recherche, under Grant No. 2010235 – ARIAC by, and KAUST Office of Sponsored Research (CRG2017-3405).

