Home

Awesome

Wikipedia Recommender System

Welcome to our project repository for the Network Tour of Data Science course at EPFL !

We implemented a query-based search engine for Wikipedia articles related to various Machine Learning topics.

In other words, given a query our system will retrieve and suggest articles with similar semantic contents. Moreover, we provide a graph visualisation tool to interact with the query engine.

More details about this ML system can be found in the project [report](Team 02 - Project report.pdf).

How to reproduce results:

Note that 'wd' is the directory containing the run.sh script (in the project folder).

NOTE: if you want to use a virtual environment, run the following:

From wd, run the following:

Interactive Visualisation:

After having done the previous part, run the command: python3 visualization/app.py 8888

NOTE: if you want to put the app online like on the following link, you have to do all the above installs in "sudo" mode, and run the following command instead: sudo PYTHONPATH=wd python3 visualization/app.py 80. Another option is that you enable port 80 for current user.

You can choose any of the three methods to perform a query.

For multiple concepts, please separate by a comma, e.g. machine learning,text processing The port 80 must be opened for external access if you use a server.

This web app has been only tested on Chrome for Linux (78.0.3904.70).

Files breakdown:

run.sh : shell script executing the acquisition, exploitation and visualisation tasks.

Acquisition:

Exploration:

Exploitation:

Visualization:

Helpers:

Data:

Authors

License

Wikipedia Recommender System - Network Tour of Data Science EE-558 - EPFL - Fall 2019 - Team 2

Copyright (c) 2019 EPFL

This program is licensed under the terms of the GPL.