Home

Awesome

Data Analysis: Do figure skating judges score their home countries more favorably?

A closer look at the data, code, and methodologies supporting the BuzzFeed News article, "The Edge," published February 8, 2018. Please read that article, which contains important context, before proceeding.

Glossary

This repository uses the following definitions:

Data

Raw Scoring Data

The raw data for this project was collected from the protocol PDFs for every major international competition from October 2016 through December 2017. You can find a list of those 17 competitions below.

The raw data is contained in the following three "tidy" CSV files:

You can find a list of all the fields and their definitions, as well as the scripts used to extract the data from the PDFs, in github.com/BuzzFeedNews/figure-skating-scores.

Competitions Included

2016–17 season:

2017–18 season:

Processed Data

Translations of "Grades of Execution"

The protocol PDFs include a rating for each element, between -3 and +3, given by each judge. This Grade of Execution is translated by the Scale of Values to the actual number of points awarded for that element. (More difficult elements receive more points for the same Grade of Execution.) The translate-goe notebook includes the code BuzzFeed News used to translate each Grade of Execution for each element. That notebook produces the judge-goe.csv file, which contains the translated values for each score.

Each season of figure skating and ice dancing uses a slightly different Scale of Values, which are documented in a series of PDFs published by the ISU. For easier analysis, BuzzFeed News converted these PDFs to CSV files:

Judge Names and Home Countries

The ISU posts the names of each judge (for each program) on a series of HTML pages on its website. BuzzFeed News collected all of the judge names from those pages and standardized that data. You can find the results in judges.csv.

On the ISU's website, judges are often labeled simply as members of the ISU instead of their home country. BuzzFeed News used the ISU's lists of officials for the 2017-18 season and the 2016-17 season to identify each judge's home country. You can find the results in judge-country.csv.

Analyses

The analysis of home-country preference can be found in this Jupyter Notebook. The notebook takes a reader through the process of:

In developing the methodology, BuzzFeed News consulted with three statisticians. Two — Jay Emerson (Yale) and Eric Zizewitz (Dartmouth) — have worked closely with figure skating data in the past. The third, Abraham Wyner (Penn), has extensive experience applying similar analyses to data from other sports.

In addition to the home-country preference analysis, BuzzFeed News also recreated the ISU's "Deviation Points" system for identifying outlier judgments, the rules for which can be found in ISU Communication 2098. That code can be found in the isu-deviation-points notebook.

A fourth notebook — progressive-skate-america-2016-example — provides details about Maira Abasova's scores at the ISU GP Progressive Skate America 2016.

Update, February 18, 2018: This repository now also includes a fifth notebook — ice-dance-team-scoring — supporting a second BuzzFeed News article, "In Bitter Ice Dancing Rivalry Judges Favor Their Own." That notebook uses the same data, and much of the same code, as the analyses above.

Technical Notes

All of the analyses above are coded in Python 3, using the libraries listed in requirements.txt.

The individual-judge analysis in the home-country-preference-analysis notebook uses bootstrapping to test for statistical significance. Depending on your computer's processing power, this step could take at least a few hours to run. If you would like to run it more quickly, you can change the number of simulations in find_judge_prob, but doing so will decrease the accuracy of those calculations.

Licensing

All code in this repository is available under the MIT License. All data files are available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.

Questions / Feedback

Contact John Templon at john.templon@buzzfeed.com.

Looking for more from BuzzFeed News? Click here for a list of our open-sourced projects, data, and code.