Skip to main content
Projects
Case studyData engineering / DuckDB / Analytics

Urban Mobility Data Platform

Local-first data engineering and analytics platform for urban mobility datasets.

Project type
Data Engineering / Analytics Platform
Core stack
Python, DuckDB, dbt-style SQL
Delivery
Case study

Case Study

The problem, implementation decisions, measured evidence, and next improvements.

Overview

A local-first data platform connecting ingestion, validation, analytical modeling, APIs, and dashboard evidence.

Problem

Mobility data projects need a reproducible path from raw inputs to trustworthy analytical outputs without requiring reviewers to provision cloud infrastructure.

Solution

Built a deterministic sample pipeline with data validation, DuckDB and dbt-style analytical modeling, an API, and a reviewer-facing dashboard.

Technical Decisions

  • Local-first defaults avoid cloud accounts and private infrastructure in the review path.
  • Deterministic fixtures make pipeline behavior repeatable in development and CI.

Outcome

The repository provides a local review path with pipeline guardrails, analytical outputs, API and frontend surfaces, tests, and CI checks.

What It Proves

Data pipelines, SQL modeling, reproducibility, API delivery, frontend integration, and CI-safe engineering.

Key Features

  • Deterministic sample pipeline keeps the reviewer path reproducible.
  • Validation and guardrails protect the analytical model inputs.
  • API and dashboard surfaces expose modeled data for inspection.

Architecture

  1. 01

    Sample mobility data

  2. 02

    Validation and ingestion

  3. 03

    DuckDB storage

  4. 04

    dbt-style SQL models

  5. 05

    FastAPI service

  6. 06

    React dashboard

  7. 07

    CI guardrails

Tech Stack

  • Python
  • DuckDB
  • dbt-style SQL
  • FastAPI
  • React
  • CI
  • Data validation

Verification

  • Deterministic sample pipeline
  • Automated tests and CI
  • Repository readiness guardrails

Security & Privacy

  • No real private mobility data is required for the default workflow.
  • Local artifacts and environment files remain outside version control.

Limitations

  • The portfolio links to the repository rather than claiming a public live deployment.
  • Sample data demonstrates the pipeline contract, not production scale.

Future Improvements

  • Add approved production-scale sources when available.
  • Expand data-quality monitoring for larger datasets.

Claims limited to the public repository and its documented local verification path.