$ whoami

Joseph Edet

>> data engineer + analyst

[joseph@data:~]$ 6 years transforming complex datasets into clean pipelines, actionable dashboards, and business insights. Specialized in scalable ETL, real-time analytics, and data storytelling.

4.8B records processed/month
99.95% pipeline reliability
18 production dbs
27 dashboards deployed
⛓️

[ data engineering ]

📦 real-time

fraud detection pipeline

Kafka · Flink · PostgreSQL · Redis

Built low-latency pipeline processing 120k events/sec, reducing fraud alert latency from 5min to 8 seconds. Saved $2.3M in potential fraud annually.

throughput: 120k/s 99.99% uptime
🔄 batch

centralized customer 360

Airflow · dbt · BigQuery · Fivetran

Unified 14 disparate source systems into single customer view, powering analytics for 5 departments. Reduced reporting time from weeks to minutes.

sources: 14 time saved: 85%
☁️ cloud infra

data lake migration

AWS S3 · Glue · Athena · Terraform

Led migration of 8 PB on-prem Hadoop cluster to AWS data lake, reducing storage costs by 58% and enabling SQL-on-anything analytics.

cost reduction: 58% 8 PB migrated
🧪 data quality

automated data quality suite

Great Expectations · dbt · GitHub Actions

Implemented CI/CD data testing with 800+ expectations, catching 96% of data issues before production. Reduced data incidents by 82%.

tests: 800+ incidents ↓ 82%
📊

[ analytics & dashboards ]

📈

executive growth dashboard

Tableau · 8 sources · refresh hourly

Real-time KPI tracker for C-level: revenue, churn, CAC, LTV. Used in weekly board meetings.

daily users: 30+ execs
🛍️

e-commerce funnel analyzer

Power BI · DAX · SQL

Self-serve funnel with cohort retention, product affinity, and drop-off analysis. Led to 15% conversion lift.

queries saved: 180+/mo
👥

user retention & LTV

Looker · Python · SQL

Cohort-based retention dashboard identifying key levers for increasing LTV by 27%.

LTV increase: 27%

$ cat toolchain.txt

Python (pandas, numpy)
SQL (BigQuery, Postgres)
Spark / PySpark
Airflow / Dagster
dbt / Dataform
AWS (Redshift, S3, Glue)
Tableau / Power BI
Kafka / Kinesis
Terraform / Docker
Great Expectations
Git / CI/CD
Looker / LookML

$ # certifications: AWS Data Analytics, dbt, Databricks

⌨️

[ experience ]

2022 – present

senior data engineer · credo bank

Lead data platform team · designed real-time fraud detection pipeline · optimized warehouse costs by 41% · mentor 3 juniors.

2019 – 2022

data analyst · maven logistics

Built supply chain dashboards used by 50+ operations managers · automated ETL with Python, reducing manual work by 25h/week.

2017 – 2019

junior data engineer · quickcom

Developed and maintained ETL pipelines · supported BI team with SQL optimization and data modeling.

$ cat education --certifications

M.Sc. Data Science — University of Lagos, 2019

B.Sc. Computer Engineering — Obafemi Awolowo University, 2016

» AWS Certified Data Analytics · dbt Fundamentals · Databricks Lakehouse

📡

let's connect

available for freelance, consulting, or data engineering conversations.

[ PGP available · signal: @joseph.42 ]