Home

Awesome

How to Invest in Movies

Authors: Sheng ZhaoXia, Yinan Zhang, Yu-Ting Huang, Zhantao Deng

Report: How to Invest in Movies

Abstract

1. Prerequisites

This project is based on Python and we use Spyder as our python programming environment. We also provide .ipynb files for Jupyter notebook. These softwares can be installed together with Anaconda. In addition, We install all packages through the Anaconda Prompt. These packages have been tested in Window 10 Home and macOS Mojave, but it should be easy to implement in other platforms.

Anaconda

This project is based on anaconda and jupiter notebook. Download and install instructions can be found at: https://www.anaconda.com/download/. After installing Anaconda, pip and conda can be used to install Python packages. Spyder will also be installed together with

Scipy

We use the scientific computing and visualization functionalities of scipy, especially the numpy, pandas and matplotlib package. These packages can be installed by typing the following command in your Anaconda Prompt.

python -m pip install --user numpy scipy matplotlib ipython jupyter pandas sympy nose

Networkx

We use networkx to visualize signals and graph structure. The package can be installed by typing the following command in your Anaconda Prompt.

pip install networkx

NLTK

We use NLTK to analyze the keywords of each movie. The package can be installed by typing the following command in your Anaconda Prompt.

pip install -U nltk

2. Research questions

In this project we have figured out:

3. Dataset

Original Dataset:

We use the dataset on Kaggle. It contains two csv files:

The dataset contains the cast and crew data of 4803 movies. The cast feature mainly depicts what actors appear in the movie and what characters they star. So as the crew data, which descibles the names and jobs of specific crews.

The dataset contains 4803 movies data with 20 distinct features. The important features that we are interested: budget, genres ,keywords, title, popularity, production_companies, revenue, vote_average.

4. What we have done

Auxiliary modules:

Working modules:

5. Contributions of group members

Sheng ZhaoXia:

Yinan Zhang:

Yuting Huang:

Zhantao Deng: