Home

Awesome

SkillCorner Open Data

About this repo

Description

This repo contains 9 matches of broadcast tracking data collected by SkillCorner.

The matches included are the 2019/2020 league matches between the champions and runners up in English Premier League, French L1, Spanish LaLiga, Italian Serie A and German Bundesliga.

Broadcast tracking data is tracking data collected through computer vision and machine learning out of the broadcast video.

To find out more about broadcast tracking data and its use cases, read this Medium article.

Motivation

This data has been open sourced in a joint initiative between SkillCorner and Friends Of Tracking. The goals are multiple:

Thus, if you use the data, we kindly ask that you credit SkillCorner and hope you'll notify us on Twitter so we can follow the great work being done with this data.

Documentation

Data Structure

The data directory contains:

For each match, there is two files:

Tracking Data Description

The tracking data is a list. Each element of the list is the result of the tracking for a frame, it's a dictionary with keys:

Each element of the data list is an "object" (referee, ball or player) found at this frame. It's a dictionary with keys:

Note that trackable_object is included when the player has been identified with a high degree of certainty. group_name is not included in this case. Otherwise, only group_name is included.

For the spatial coordinates, the unit of the field modelization is the meter, the center of the coordinates is at the center of the pitch.

The x axis is the long side and the y axis in the short side.

Here is an illustration for a field of size 105mx68m. Field modelization for a pitch of size 105x68

Limitation

The data has been processed as SkillCorner produced matches from over 20 leagues (more than 8000 matches this season). The data has been collected automatically from the broadcast and has not received any manual correction. What it means for user:

Future works

Contact us