- 👋 Hi, I’m @mtholahan
- 👀 I’m interested in learning new technologies.
- 🌱 I’m currently learning data-engineering topics.
- 💞️ I’m looking to collaborate on Python projects.
- 📫 How to reach me: markholahan@proton.me
- ⚡ Fun fact: the double paradiddle, a six-note sticking (RLRLRR, LRLRLL), regroups into four groups of three (RLR LRR LRL RLL), which makes it the perfect rudiment for shuffles.
Pinned
- springboard-projects (Public, Shell): A meta-repo for my Springboard data engineering boot camp projects.
- unguided-capstone-project (Public, Python): My unguided capstone project, exploring the impact of soundtrack genre diversity on movie popularity using TMDb & Discogs.
- guided-capstone-project (Public, Jupyter Notebook): Built an end-to-end pipeline for high-frequency equity market data. Designed database schemas, ingested daily trade and quote records from CSV/JSON into Spark, implemented EOD batch loads with dedu…
- apache-spark-optimization-mini-project (Public, Python): Optimized PySpark jobs by analyzing query execution plans and rewriting transformations for efficiency. Applied techniques such as reducing shuffles, tuning partitions, selecting efficient operator…
- kafka-mini-project (Public, Python): Built a streaming fraud detection system with Apache Kafka and Python. Deployed a Kafka cluster via Docker Compose, implemented a transaction generator and fraud detector using kafka-python, and ro…
- mysql-python-data-pipeline-mini-project (Public, Python): Developed a Python and SQL data pipeline for an event ticketing system. Designed a MySQL table schema, ingested CSV sales data via Python connectors, and implemented queries to analyze ticket popul…