TECH & SPACE ENTHUSIAST

The Engineering Behind AuroraLens

AuroraLens began as a personal challenge — could a student with an interest in space weather build a genuinely accurate, location-specific aurora forecast using nothing but open NASA data and applied machine learning?

The answer required learning FastAPI for the inference backend, building a dual-stage XGBoost pipeline from scratch on the NASA OMNI2 dataset, integrating real-time DSCOVR telemetry, and designing a frontend that made complex geomagnetic data feel intuitive for non-scientists.

The 81.0% weighted F1-score across 1.2 million hours of test data is not the end goal — it is a baseline. The model is open source, documented, and designed to be improved by anyone who wants to contribute.

Mosin Mushtaq

B.Tech AI/ML Engineering

NEXT PROJECT
IN DEVELOPMENT

Space Weather Intelligence Platform

A dedicated infrastructure-focused monitoring system for geomagnetic storm detection, CME tracking, and real-time risk assessment for power grids, GPS networks, and satellite operators.

Technical Architecture

From deep-space telemetry to local forecasts—how the AuroraLens engine processes magnetic flux into predictive insights.

rocket_launch

NASA/NOAA Data

Live DSCOVR Telemetry (L1 Point)

STP_01
settings_input_component

Data Pipeline

Preprocessing & Feature Engineering

STP_02
psychology

XGBoost Engine

Dual-Stage Gradient Boosting

STP_03
dashboard_customize

Forecaster HUD

Real-time UX Visualization

STP_04
terminal

Inference Engine

XGBoost Dual-Stage

The engine utilizes two distinct Gradient Boosting models trained on the NASA OMNI dataset. Stage 1 determines geomagnetic activity probability, while Stage 2 classifies intensity.— Stage 1 output (Activity Probability) feeds Stage 2 as a primary classification feature.

IMF Bz (GSM) Sensitivity92% weight
Solar Wind Speed Impact84% weight
Cloud Cover Density89% weight
clinical_notes

Empirical Validation

Validated against 1.2M hours of NASA OMNI telemetry records, achieving an 81.0% weighted F1-score across KP classification bins.

View Project Artifactsarrow_forward
Model Intelligence

Engine Specifications

Schema

Dual-Stage XGBoost

Two-stage Gradient Boosting pipeline. Stage 1 predicts geomagnetic state probability; Stage 2 classifies intensity (KP-index).

database

Feature Engineering

Inputs include IMF Bz (GSM), Solar Wind Speed/Density, and local tropospheric cloud density updated every 60s.

Verified

81% F1-Score

Validated against 1.2M hours of NASA OMNI telemetry. Optimized for recall on high-intensity (G1-G5) storm events.

© 2026 • AuroraLens Project Artifact