Ross AI Portfolio

Analytics & Data-Quality Mini-Portfolio

Author: Ross Stafford • PMP® | Data & Product PM
Live repo: https://github.com/RStaff/https-github.com-rossstafford-ross-ai-portfolio

License: MIT CI


📈 Project 1: GA4 Foot-Traffic Forecast MVP

A small experiment built in 2024 to see whether store-visit data can predict sales (“alpha” = trading signal). I merged Google Analytics 4 visit counts (people near each store) with each store’s point-of-sale revenue for the same weeks. I used PySpark to engineer and store model features (e.g., 7-day lag, rolling averages) in a scalable way. I then trained a Random-Forest model to predict next week’s sales. The model’s mean absolute error (MAE) was 8 % lower than that of a very simple benchmark (“next week = this week”), so it is meaningfully more accurate. The predictions flagged stores whose sales were likely to significantly beat or miss expectations; a statistical test indicates only a 3 % chance that the signal is random.
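The lag and rolling-average features, and the "next week = this week" benchmark, can be illustrated with a minimal standard-library sketch. This is not the repo's PySpark code: the helper names (`lag`, `rolling_mean`, `mae`, `pct_improvement`) and the sales figures are made up for illustration, and the example uses a one-week lag on weekly data.

```python
from statistics import mean


def lag(values, k):
    """Shift a series forward by k steps; the first k slots have no history."""
    if k <= 0:
        return list(values)
    return [None] * k + list(values[:-k])


def rolling_mean(values, window):
    """Trailing mean over `window` points; None until the window is full."""
    out = []
    for i in range(len(values)):
        if i + 1 < window:
            out.append(None)
        else:
            out.append(mean(values[i + 1 - window : i + 1]))
    return out


def mae(actual, predicted):
    """Mean absolute error, skipping rows where no prediction exists."""
    errors = [abs(a - p) for a, p in zip(actual, predicted) if p is not None]
    return mean(errors)


def pct_improvement(model_mae, baseline_mae):
    """How much lower the model's MAE is than the baseline's, in percent."""
    return 100 * (baseline_mae - model_mae) / baseline_mae


# Hypothetical weekly sales for one store (made-up numbers).
weekly_sales = [100, 110, 105, 120, 130, 125]

# Naive benchmark: predict each week with the previous week's value.
naive_forecast = lag(weekly_sales, 1)
baseline_mae = mae(weekly_sales, naive_forecast)

# Example feature column: 3-week trailing average of sales.
sales_avg_3w = rolling_mean(weekly_sales, 3)

print("baseline MAE:", baseline_mae)
print("3-week rolling mean:", sales_avg_3w)
```

In pandas, the same features are usually built with `Series.shift` and `Series.rolling(...).mean()`. A model is then judged by calling `pct_improvement` with its MAE and the baseline MAE.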

| Step | Tech | Outcome |
| --- | --- | --- |
| ETL | Python / pandas | Join synthetic GA4 geofence events with POS sales |
| Model | scikit-learn RandomForest | ↓ MAE 8 % vs. naïve baseline |
| Viz | Looker Studio | Dual-axis chart for planners (sales vs. visits) |
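The ETL step joins visit counts to sales for the same store and week. The sketch below shows that kind of join in plain Python. The field names (`store_id`, `week`, `visits`, `revenue`) and the sample rows are assumptions for illustration, not the repo's actual schema.

```python
def join_visits_with_sales(visits, sales):
    """Inner-join two lists of dict rows on (store_id, week)."""
    # Index sales by the join key for constant-time lookups.
    sales_by_key = {(row["store_id"], row["week"]): row["revenue"] for row in sales}

    joined = []
    for row in visits:
        key = (row["store_id"], row["week"])
        if key in sales_by_key:  # keep only store-weeks present in both inputs
            joined.append(
                {
                    "store_id": row["store_id"],
                    "week": row["week"],
                    "visits": row["visits"],
                    "revenue": sales_by_key[key],
                }
            )
    return joined


# Hypothetical sample rows (made-up values).
ga4_visits = [
    {"store_id": "A", "week": "2024-W01", "visits": 320},
    {"store_id": "A", "week": "2024-W02", "visits": 355},
    {"store_id": "B", "week": "2024-W01", "visits": 210},
]
pos_sales = [
    {"store_id": "A", "week": "2024-W01", "revenue": 12500.0},
    {"store_id": "A", "week": "2024-W02", "revenue": 13100.0},
]

merged = join_visits_with_sales(ga4_visits, pos_sales)
for row in merged:
    print(row)
```

With pandas, the equivalent is `DataFrame.merge(..., on=["store_id", "week"], how="inner")`. Store B is dropped here because it has no matching sales row.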

Looker screenshot

Reproduce locally

cd ga4_dashboard
jupyter nbconvert --to notebook --execute foot_traffic.ipynb  # or open in JupyterLab
open chart.png              # exported Looker chart (macOS; use xdg-open on Linux)

**๐Ÿ›ก๏ธ Project 2 โ€” Airflow Data-Quality Guard Rails**
Check	Logic	Alert
Row count	Fail if < 700 rows	Airflow task โ†’ red
Null ratio	Fail if any visits NULLs	Upstream-failed
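The two checks in the table can be written as plain Python callables that raise when a rule is violated. This is a sketch, not the repo's DAG code: the names `DataQualityError`, `check_row_count`, and `check_no_null_visits` are hypothetical, and the rows are assumed to be dicts with a `visits` key.

```python
MIN_ROWS = 700  # threshold taken from the table above


class DataQualityError(Exception):
    """Raised when a data-quality rule is violated."""


def check_row_count(rows, min_rows=MIN_ROWS):
    """Fail if the dataset has fewer than `min_rows` rows."""
    if len(rows) < min_rows:
        raise DataQualityError(f"expected at least {min_rows} rows, got {len(rows)}")


def check_no_null_visits(rows, column="visits"):
    """Fail if any row has a missing value in the visits column."""
    null_count = sum(1 for row in rows if row.get(column) is None)
    if null_count:
        raise DataQualityError(f"{null_count} row(s) have NULL {column}")


# Hypothetical dataset that passes both checks (made-up values).
good_rows = [{"store_id": "A", "visits": 100 + i} for i in range(700)]
check_row_count(good_rows)
check_no_null_visits(good_rows)
print("all checks passed")
```

When such a callable runs in an Airflow `PythonOperator`, an uncaught exception marks the task as failed (red in the UI), and downstream tasks with the default trigger rule are set to `upstream_failed`.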

๐Ÿ—๏ธ Architecture
          GA4 events           POS csv
                โ”‚                 โ”‚
                โ””โ”€โ”€โ”€โ–บ pandas โ€” RandomForest โ€”โ–บ predictions.csv
                                         โ”‚
  Looker Studio dashboard โ—„โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                                         โ–ผ
           Airflow (row & null checks) โ€” SQLite


๐Ÿ” Tech Stack
Python 3.11, pandas, scikit-learn

Docker Compose (Airflow 2.9, Postgres, Redis)

Looker Studio • SQLite • GitHub Actions (CI / Trivy / Bandit)

๐Ÿ“ License
MIT โ€” free to fork & remix. Attribution appreciated!
