All Projects and Blog Posts
Projects & Applications I've Created
Personal Blog Website
NextJS
2022-10-07
Wondering how I made this website?
This project uses Next.js and React to scale my previous static website into a programmatically generated static site. The content is built from React components, but because every page is statically generated, the website stays fast and has great SEO.
IMDB Ratings Visualization
Dash & Python
2022-08-12
This application takes a chosen film as input and creates a plot and table showing the film's rank within each genre it is categorized in.
Two datasets, two APIs, and a web crawler make up the backend data handling for this project, and the data visualization framework `Dash` was used for the application frontend.
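As a rough illustration of the per-genre ranking idea, here is a minimal sketch. The sample films, ratings, and the `genre_ranks` helper are hypothetical and are not taken from the project's datasets or code; the real application may rank differently.

```python
from typing import Dict, List, Tuple

# Hypothetical sample data: title -> (rating, genres).
FILMS: Dict[str, Tuple[float, List[str]]] = {
    "Film A": (8.5, ["Drama", "Crime"]),
    "Film B": (9.0, ["Drama"]),
    "Film C": (7.9, ["Crime", "Thriller"]),
    "Film D": (8.1, ["Crime"]),
}


def genre_ranks(title: str, films: Dict[str, Tuple[float, List[str]]]) -> Dict[str, int]:
    """Return the 1-based rank of `title` inside each of its genres."""
    rating, genres = films[title]
    ranks = {}
    for genre in genres:
        # Rank = 1 + number of films in the same genre with a strictly higher rating.
        higher = sum(
            1
            for other_rating, other_genres in films.values()
            if genre in other_genres and other_rating > rating
        )
        ranks[genre] = higher + 1
    return ranks


print(genre_ranks("Film A", FILMS))  # {'Drama': 2, 'Crime': 1}
```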
YouTube Analytics on AWS
AWS
2022-06-18
Built storage and pipeline infrastructure with the AWS CLI, then loaded structured and semi-structured YouTube data into an S3 data lake. Transformed .csv and .json data into Apache Parquet with Python scripts on AWS Lambda and Glue ETL. Built data catalogs with Glue Crawler to form the schema of the data lake and built a Glue ETL job to produce an analytic table. Visualized results in a dashboard using AWS QuickSight.
Spotify Recently Played Songs
Python & MS SQL
2022-04-20
Extracted personal data from the Spotify API using the OAuth 2.0 authorization code flow, which suits long-running apps. Transformed the raw data into a data frame using `pandas` and loaded the resulting data frame into a SQL Server database, using `pyodbc` and `SQLAlchemy` to append new data. Scheduled this batch ETL process with Apache Airflow.
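The transform-and-append step can be sketched roughly as follows. This is not the project's code: to stay self-contained it uses the standard-library `sqlite3` module as a stand-in for SQL Server (`pyodbc`/`SQLAlchemy`), plain tuples instead of a `pandas` data frame, and a hypothetical `played_tracks` table. The payload shape is modeled on a recently-played response and the sample values are made up.

```python
import sqlite3
from typing import Dict, List, Tuple

Row = Tuple[str, str, str]


def transform(payload: Dict) -> List[Row]:
    """Flatten the nested API payload into (played_at, track, artist) rows."""
    rows = []
    for item in payload.get("items", []):
        track = item["track"]
        rows.append((item["played_at"], track["name"], track["artists"][0]["name"]))
    return rows


def load(conn: sqlite3.Connection, rows: List[Row]) -> int:
    """Append only rows not already stored; return how many were added."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS played_tracks ("
        "played_at TEXT PRIMARY KEY, track TEXT, artist TEXT)"
    )
    before = conn.execute("SELECT COUNT(*) FROM played_tracks").fetchone()[0]
    # played_at is the primary key, so re-running the job skips duplicates.
    conn.executemany("INSERT OR IGNORE INTO played_tracks VALUES (?, ?, ?)", rows)
    conn.commit()
    after = conn.execute("SELECT COUNT(*) FROM played_tracks").fetchone()[0]
    return after - before


# Made-up sample payload for illustration only.
SAMPLE = {
    "items": [
        {"played_at": "2022-04-20T10:00:00Z",
         "track": {"name": "Song One", "artists": [{"name": "Artist X"}]}},
        {"played_at": "2022-04-20T10:04:00Z",
         "track": {"name": "Song Two", "artists": [{"name": "Artist Y"}]}},
    ]
}

conn = sqlite3.connect(":memory:")
print(load(conn, transform(SAMPLE)))  # 2 rows added on the first run
print(load(conn, transform(SAMPLE)))  # 0 rows added when the same batch is replayed
```

Keying on the play timestamp makes the batch safe to re-run, which matters when a scheduler retries a failed job.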
Google Analytics Dashboard
GCP & Tableau
2022-03-30
ETL pipeline: Google Analytics data (1.4 million records) ➜ BigQuery ➜ Google Cloud Storage ➜ Tableau Server Connection.
BigQuery was used to calculate the metrics (page views, time on page, bounces, sessions) that serve as KPIs in Tableau. Created filters and parameters for segmentation and comparison analysis.
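To make the four metrics concrete, here is a small Python sketch of one way to derive them from raw hit records. The project computed them in BigQuery; the `(session_id, page, timestamp)` record shape, the `kpis` helper, and the definitions used here (a bounce is a session with exactly one page view, and time on page is the gap until the next hit in the same session) are assumptions for illustration.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Hit = Tuple[str, str, int]  # (session_id, page, timestamp in seconds)


def kpis(hits: List[Hit]) -> Dict:
    """Aggregate page views, sessions, bounces, and time on page."""
    by_session = defaultdict(list)
    for session_id, page, ts in hits:
        by_session[session_id].append((ts, page))

    time_on_page = defaultdict(int)
    bounces = 0
    for views in by_session.values():
        views.sort()  # chronological order within the session
        if len(views) == 1:
            bounces += 1
        # The last page of a session has no following hit, so it gets no time.
        for (ts, page), (next_ts, _) in zip(views, views[1:]):
            time_on_page[page] += next_ts - ts

    return {
        "page_views": len(hits),
        "sessions": len(by_session),
        "bounces": bounces,
        "time_on_page": dict(time_on_page),
    }


sample_hits = [("s1", "/home", 0), ("s1", "/about", 30), ("s2", "/home", 5)]
print(kpis(sample_hits))
# {'page_views': 3, 'sessions': 2, 'bounces': 1, 'time_on_page': {'/home': 30}}
```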
Covid-19 Data Dashboard
MS SQL & Tableau
2021-09-05
Statistics and research data were sourced from Our World in Data to visualize how COVID-19 has spread over time and between countries. The data was explored and cleaned with Microsoft SQL Server and visualized in Tableau.
Auto Backup & File Management
Python
2021-04-29
An interactive Python tool for keeping track of videos, pictures, documents, etc. that have been backed up. The `tqdm` package is used to monitor file transfer progress. Records of previous backups are stored in a .json file.
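A minimal sketch of the record-keeping idea is shown below. The function names, the record layout (file name mapped to last-modified time), and the skip rule are assumptions rather than the tool's actual design, and the `tqdm` progress bar is left out so the example runs with the standard library alone.

```python
import json
import shutil
from pathlib import Path
from typing import Dict, List


def load_records(log_path: Path) -> Dict[str, float]:
    """Read previous backup records, or start empty if no log exists yet."""
    return json.loads(log_path.read_text()) if log_path.exists() else {}


def backup(src_dir: Path, dst_dir: Path, log_path: Path) -> List[str]:
    """Copy files that are new or changed since the last recorded backup."""
    records = load_records(log_path)
    dst_dir.mkdir(parents=True, exist_ok=True)
    copied = []
    for src in sorted(src_dir.iterdir()):
        if not src.is_file():
            continue
        mtime = src.stat().st_mtime
        # Skip files whose modification time matches the stored record.
        if records.get(src.name) == mtime:
            continue
        shutil.copy2(src, dst_dir / src.name)
        records[src.name] = mtime
        copied.append(src.name)
    log_path.write_text(json.dumps(records, indent=2))
    return copied
```

Calling `backup(Path("photos"), Path("/mnt/backup/photos"), Path("backups.json"))` (hypothetical paths) a second time without changing any files returns an empty list, because every file already matches its record.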
Word Prediction Model
R & Shiny
2021-04-10
Developed an application in `shiny` which accepts a string of words and returns a table of recommendations sorted by probability. The `tidytext` package in R was used for processing, cleaning and analyzing the data, while `ggplot2` was used for visualizations.
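The idea of returning recommendations sorted by probability can be sketched with a simple bigram model. This is written in Python rather than R, uses a toy corpus, and is not the project's model; the `train_bigrams` and `predict` helpers are hypothetical.

```python
from collections import Counter, defaultdict
from typing import Dict, List, Tuple


def train_bigrams(corpus: List[str]) -> Dict[str, Counter]:
    """Count how often each word follows each other word."""
    counts: Dict[str, Counter] = defaultdict(Counter)
    for sentence in corpus:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            counts[prev][nxt] += 1
    return counts


def predict(counts: Dict[str, Counter], text: str, top_n: int = 3) -> List[Tuple[str, float]]:
    """Return up to top_n (word, probability) pairs for the word after `text`."""
    words = text.lower().split()
    if not words:
        return []
    followers = counts.get(words[-1])
    if not followers:
        return []  # last word never seen in training
    total = sum(followers.values())
    # most_common sorts by count, so the result is ordered by probability.
    return [(word, count / total) for word, count in followers.most_common(top_n)]


model = train_bigrams(["the cat sat", "the cat ran", "the dog sat"])
print(predict(model, "I saw the"))  # "cat" first (2 of 3), then "dog" (1 of 3)
```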
Canada Population Dashboard
R & Shiny
2021-03-14
An interactive map showing division population changes based on user-chosen dates. The packages `dplyr` and `data.table` in R were used to process and clean the raw population and shape data, and the package `shiny` was used to build the application UI. Data was sourced from Statistics Canada.
Nvidia Graphics Card Scraper
Python
2020-12-05
A Python web scraper that searches a number of retailers to check for stock. The package `twilio` is used to send a text when stock becomes available. This app was used to acquire a personal graphics card during the "Great Graphics Card Shortage of 2020".
Human Activity Prediction Using Machine Learning
R
2020-08-25
This project employs the `caret` package in R to build machine learning models that can predict which exercise an individual is performing from wearable sensor data. The data for this project was sourced from the UCI Machine Learning Repository.
Weather Data Cleaning and Visualization in R
R
2020-05-15
Storm data spanning 1950 to 2011 was acquired from the National Weather Service. This data was processed and analyzed to determine which weather-related events were the most costly to human life as well as to property and crops.
TIFF Data Scraping with Organization in Excel
Python
2019-09-07
Sometimes it is difficult to tell which movies are worth seeing at a film festival since they have never been seen before. In 2019, TIFF decided to allow attendees to "heart" the films they wished to see. I decided to use this as a metric for "popularity" so I could better decide which films were worth attending.