I am a results-driven data professional with a strong foundation in data analytics, data science and data engineering. Experienced in leading teams and implementing data-driven solutions to transform complex data into actionable insights. Proficient in Python, R, SQL, and cloud platforms (AWS, GCP, Azure), and skilled at optimizing processes for efficient and impactful operations.
M.S. in Data Science, 2020
University of British Columbia
B.Eng. in Civil Engineering, 2019
University of Victoria
Data Wrangling, Statistical Modelling, Data Visualization, ggplot2
NumPy, Matplotlib, Plotly, Pandas
Classification, Clustering, Regression, Feature Selection, NLP
MySQL, PostgreSQL, Google BigQuery, Database Design, Query Optimization
dbt, Dataflow
AWS (S3, EMR), Hadoop (MapReduce), Spark (Cassandra, PySpark), RESTful API, Web Scraping, Azure, Google Cloud (GCP)
Retrieve and Manipulate Streaming Data
Basic Programming
Component-based UI development, State management, Hooks, Routing
Server-side development, Express.js, RESTful APIs, Asynchronous programming
Python & R Package Development, Scripting, Bash, Unit Testing, Travis Integration
Collaborative Software Development, Git Version Control
Tableau, PowerBI, Looker
Interactive reports, DAX functions, calculated measures, custom paginated reports, Power Automate integration
PivotTables, Filters, Conditional Formatting, Visualizations, VBA Macros
Jira, Slack, Shortcut
Responsibilities included:
Responsibilities included:
Data Analysis:
Responsibilities included:
Responsibilities included:
Deployed a Python program to classify live-streaming equipment end-uses using hierarchical clustering with 95% accuracy:
Consulted with technical and non-technical stakeholders:
Collaborative coding with the data science team:
Responsibilities included:
Responsibilities included:
Responsibilities included:
Relevant Courses:
Relevant Courses:
A personal dog dictionary React App
that allows users to add pictures and basic information about dogs they meet at the dog park.
For my Capstone project I helped create a python program that queries live streaming sensor data from the UDL SkySpark database, cleans and uses appropriate Machine Learning methods to apply NRCan Secondary End-Use Classifications to the data
A convolutional neural network (CNN
) using the CIFAR-10 dataset to classify 32x32 colour images in 10 classes.
An App
that uses flight data to show the worst airport connections for delayed and cancelled flights in the USA geographically.
A Mars_API
to retrieve and visualize weather data from the last 7 Sols (Martian days) as recorded and updated daily by NASA’s InSight Mars lander. InSight is located at Elysium Planitia, a flat surface near the equator of Mars.
An interactive Tableau Dashboard
created using a Urban Social Disorder dataset from the Peace Research Institute Oslo.
Created 3D models and collaborative maps using a drone and software platform.