IMDB Ratings Visualization

2022-08-12

GitHub App

banner

*See my other GitHub Repo for the chrome extension I built off this data.

Summary

This application takes a chosen film as an input and creates a plot and table describing the rank of the film for the specific genres it is categorized in.

The intent of this project is to compare the average imdb ratings between genres and visualize the distribution of ratings for each genre in an effort to translate a film rating to a genre specific ranking.

app

Data Sources

2 datasets, 2 APIs, and a web crawler comprise the backend data handling for this project:

Methods

The tables 'title.basics.tsv', 'title.ratings.tsv' and 'IMDB movies.csv' were loaded into Microsoft SQL Server as TitleBasics, TitleRatings and TitleExtras respectively. These tables were used for exploratory data analysis and subsequent creation of the tables necessary for visualization (OMDb API was used to source the Rotten Tomatoes and Metacritic ratings). The database schema is shown below (created in dbdiagram.io)

app

The python package pyodbc was then used to load the tables from this database into pandas for visualization in matplotlib and plotly. Dash was used for the frontend of the web application.