Stephen Lang Resume

Email: Website: https://stevelang.xyz

GitHub: https://github.com/Kaizen91 LinkedIn: https://www.linkedin.com/in/stephen-lang-canada/

Software Development Data Engineering Tools DevOps Tools Soft Skills
-SQL -ETL / ELT -Google Cloud Platform -Spanish / English
-Python -OLAP / OLTP -Terraform -Project Management (Kanban, Agile)
-R -Spark / Hadoop -Linux -Ability to train both technical and non technical audiences
-Continuous Integration / Continuous Deployment -BigQuery, Cloud Storage, Pub/Sub, Dataproc -Cloud Computing generally
-Git -DBT -Docker

Work Experience

Data Engineer Manager Scotiabank 11.2020 - 10.2023

I was the team lead for a team of data engineers building ETL pipelines to produce standardized data sets on a project spanning 4 countries. We worked using Pyspark to build an ETL pipeline to standardize the bank’s data. I handled several Data Quality and remediation initiatives. I provided training to several teams on GCP and cloud computing in general. I handled Data Governance activities for dozens of systems.

Professional Services Consultant Delbridge Solutions 03.2019 - 11.2020

Built ETL processes using SQL stored procedures, Bash Scripts, APIs, and mapping tools.
Consolidations, Capex planning, HR planning, OPEX, COGS, and Revenue Planning
Implementing and integrating VENA Financial Budgeting Software.
Training power users on OLAP technologies.
Designing reporting and budgeting workflows, and designing Solution Architecture Addressing security and data permissions.
Wrote Python scripts to automate repetitive processes and increase efficiency
Leveraging data base connections and Excel to build custom Business Intelligence reports

Application Specialist Ceridian 06.2017 - 03.2019

Complex problem solving related to both payroll law and data analysis
Running SQL queries and scripts to investigate and solve issues
Developing BI reporting
Bridge between the client and product management team ensuring any product enhancements are built into software
Solving technical issues related to processing payroll for multimillion dollar clients
Creating training documentation
Configuring taxation for clients with employees in multiple states

Certifications

Projects

Canadian Housing Market Pipeline

https://github.com/Kaizen91/spark-housing-market-canada

An ETL project using canadian housing data to demonstrate knowledge of Spark, Terraform, and GCP (Dataproc, Cloud Storage, BigQuery). The main.tf terraform file will create all the infrastructure needed for this pipeline: a Google Cloud Storage bucket, a Dataproc cluster, a Dataproc job, and a BigQuery dataset. It will also upload the source csv file and the transform.py script to the Google Cloud Storage bucket, so that they can be accessed by the Dataproc Pyspark Job running on the Dataproc cluster.

GCP Streaming Pipeline with Cloud Run

https://github.com/Kaizen91/gcp-stream-cloud-run

This is an ETL project using GCP (Pub/Sub, Docker, Cloud Run, and BigQuery) to stream simulated telemetry data. This set up would be good for any pipeline where you need to do light weight transformations on messages before storing them.

Education

Carleton University 2011 - 2015