01 · Big Data

Distributed ETL pipeline - Air traffic

A Big Data pipeline for real-time air-traffic analysis. Data from the OpenAIP API is ingested and processed continuously, then displayed on a map-based dashboard.

OpenAIP · NiFi · Kafka · Spark · PostgreSQL · Power BI
Real-time streaming · distributed ETL
FIG. 01 · Real-time Big Data architecture
End-to-end overview: continuous ingestion from the OpenAIP API, a Kafka message bus, distributed Spark processing, and PostgreSQL storage.
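As a rough illustration of the processing stage that sits between the message bus and the database, here is a minimal sketch in plain Python. It is not the project's code: in the pipeline this logic would run inside a Spark job, and the `FlightRow` shape and the JSON field names (`id`, `lat`, `lon`, `alt`) are hypothetical, chosen only to show the parse, validate, and emit steps.

```python
import json
from dataclasses import dataclass
from typing import Iterable, Iterator, Optional


@dataclass(frozen=True)
class FlightRow:
    """Hypothetical row shape for the storage table (not the project's schema)."""
    flight_id: str
    latitude: float
    longitude: float
    altitude_m: float


def parse_message(raw: bytes) -> Optional[FlightRow]:
    """Decode one raw message; return None if it is malformed or out of range."""
    try:
        payload = json.loads(raw)
        row = FlightRow(
            flight_id=str(payload["id"]),      # assumed field name
            latitude=float(payload["lat"]),    # assumed field name
            longitude=float(payload["lon"]),   # assumed field name
            altitude_m=float(payload["alt"]),  # assumed field name
        )
    except (ValueError, KeyError, TypeError):
        # Invalid JSON, missing keys, or non-numeric values.
        return None
    # Reject coordinates that cannot be plotted on the map.
    if not (-90.0 <= row.latitude <= 90.0 and -180.0 <= row.longitude <= 180.0):
        return None
    return row


def transform(messages: Iterable[bytes]) -> Iterator[FlightRow]:
    """Yield only the valid rows, silently skipping bad messages."""
    for raw in messages:
        row = parse_message(raw)
        if row is not None:
            yield row


if __name__ == "__main__":
    sample = [
        b'{"id": "AF123", "lat": 48.85, "lon": 2.35, "alt": 10000}',
        b"not json",
    ]
    for row in transform(sample):
        print(row)
```

Dropping malformed records at this stage keeps one bad message from stalling a continuous stream; a production job would more likely route rejected records to a separate topic or table for inspection.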
FIG. 02 · Air-traffic monitoring dashboard
Map-based view of the tracked flights in near real time.
FIG. 03 · Apache NiFi ingestion pipeline
The NiFi flow orchestrates the collection and routing of data into Kafka.