Home

Awesome

A Network Analysis of the 2018 FIFA World Cup

Group 9: Maxence DRAGUET, Robert INJAC, Yannick KLOSE & Manana LORTKIPANIDZE

<b>In this Readme</b>, you will find in the following order:

Description of notebooks:

A <i>report</i> PDF gathers the result obtained throughout this project. A more detailed reasoning behind each task follows.

THE TASKS

Why the World Cup?

This playful and light event is an incredible opportunity to extract information from the Wikipedia pages using networks. The first of these is obviously the one formed by the hyperlinks on each Wikipedia Page of interest to this subject: mainly the players, the countries and the national teams. Note that other nodes could have been added such as the stadiums, the referees, .... We however believe that this subsample offer the most interesting opportunities to explore connectivity between these different famlies, clustering aspect of the natural teams and signal analysis. The advantage of such a short time-scaled highly hyped event is that correlation and number of visits will be tightly linked to real-world connections.

TASK 0. Get the data:

Note: some of our first ideas needed to sample the number of visits on given Wikipedia pages <b>per hour</b>. The API unfortunately does not offer this possibility.

TASK 1. Network analysis:

TASK 2. Finding the matches:

TASK 3. Identifying teams:

Summary of what is available in Data:

Adjacencies available:

With Numbers of Visit

Numbers of visit in absolute scale

Numbers of visit in Normalised scale

Same files as above but the names end with "_Normalised"

Without Number of Visits

[UPDATED]

Added file:

Supplement:

These should not be necessary as they only contain the node and its category in seperated files for each category Separeted Info:

And in extra, with more entries than just that World Cup:

Thank You For Reading !