$ whoami
[joseph@data:~]$ 6 years transforming complex datasets into clean pipelines, actionable dashboards, and business insights. Specialized in scalable ETL, real-time analytics, and data storytelling.
Kafka · Flink · PostgreSQL · Redis
Built a low-latency pipeline processing 120k events/sec, reducing fraud alert latency from 5 minutes to 8 seconds. Saved $2.3M annually in potential fraud losses.
Airflow · dbt · BigQuery · Fivetran
Unified 14 disparate source systems into a single customer view, powering analytics for 5 departments. Reduced reporting time from weeks to minutes.
AWS S3 · Glue · Athena · Terraform
Led the migration of an 8 PB on-prem Hadoop cluster to an AWS data lake, reducing storage costs by 58% and enabling SQL-on-anything analytics.
Great Expectations · dbt · GitHub Actions
Implemented CI/CD data testing with 800+ expectations, catching 96% of data issues before production. Reduced data incidents by 82%.
Tableau · 8 sources · refresh hourly
Hourly-refreshed KPI tracker for C-level executives: revenue, churn, CAC, LTV. Used in weekly board meetings.
Power BI · DAX · SQL
Self-serve funnel with cohort retention, product affinity, and drop-off analysis. Led to a 15% conversion lift.
Looker · Python · SQL
Cohort-based retention dashboard identifying key levers for increasing LTV by 27%.
$ cat toolchain.txt
$ # certifications: AWS Data Analytics, dbt, Databricks
Lead data platform team · designed real-time fraud detection pipeline · optimized warehouse costs by 41% · mentor 3 juniors.
Built supply chain dashboards used by 50+ operations managers · automated ETL with Python, reducing manual work by 25 hours/week.
Developed and maintained ETL pipelines · supported BI team with SQL optimization and data modeling.
$ cat education --certifications
M.Sc. Data Science — University of Lagos, 2019
B.Sc. Computer Engineering — Obafemi Awolowo University, 2016
» AWS Certified Data Analytics · dbt Fundamentals · Databricks Lakehouse
available for freelance, consulting, or data engineering conversations.
[ PGP available · signal: @joseph.42 ]