Workflow underneath the interface

Pipeline

Astrolyte is built around repeatable ingestion, indexing, processing, validation, and delivery. The interface is only the visible surface of the workflow underneath.

Stages

Pipeline stages

Astrolyte favors repeatable operations over hidden automation. Each stage leaves behind artifacts that can be inspected, checked, and reused.

1

IRIS + Rubin Sampling

Ingest

Retrieve public observational data from archives or survey providers without altering or merging the original source records.

  • remote archive query and download
  • live survey ingest
  • raw storage discipline
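
As an illustration of what raw storage discipline could look like, the sketch below writes each downloaded payload once, names it by its checksum, and appends a retrieval manifest entry. It is a minimal sketch, not Astrolyte's actual code: the function name `store_raw`, the `manifest.jsonl` file, and the manifest fields are all assumptions made for this example.

```python
import hashlib
import json
from datetime import datetime, timezone
from pathlib import Path


def store_raw(payload: bytes, source_url: str, raw_dir: Path) -> dict:
    """Write payload once under raw_dir and record how it was retrieved.

    Hypothetical helper: the layout and field names are illustrative only.
    """
    raw_dir.mkdir(parents=True, exist_ok=True)
    digest = hashlib.sha256(payload).hexdigest()
    target = raw_dir / f"{digest}.bin"
    if not target.exists():
        # Raw files are write-once: an existing file is never overwritten.
        target.write_bytes(payload)
    entry = {
        "source_url": source_url,
        "sha256": digest,
        "bytes": len(payload),
        "retrieved_at": datetime.now(timezone.utc).isoformat(),
        "path": target.name,
    }
    # Append-only manifest: every retrieval is logged, even a repeated one.
    with (raw_dir / "manifest.jsonl").open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(entry) + "\n")
    return entry
```

Naming files by content hash means a repeated download of identical bytes cannot silently replace an earlier file, while the manifest still records that the second retrieval happened.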

2

IRIS

Index

Extract metadata and organize files or records cleanly so later stages can operate on a stable structure.

  • metadata index generation
  • obs_dir-aware naming
  • duplicate-obsid hardening
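
One way to read "duplicate-obsid hardening" together with "obs_dir-aware naming" is that an index must not silently overwrite one record with another that shares the same observation ID. The sketch below shows that idea under stated assumptions: the record fields `obsid` and `obs_dir`, the `__` separator, and the function `build_index` are hypothetical and are not taken from Astrolyte.

```python
from collections import defaultdict


def build_index(records):
    """Index metadata records by obsid without dropping duplicates.

    Hypothetical sketch: each record is a dict with "obsid" and "obs_dir".
    """
    by_obsid = defaultdict(list)
    for rec in records:
        by_obsid[rec["obsid"]].append(rec)

    index = {}
    for obsid, group in by_obsid.items():
        for rec in group:
            # Qualify the key with obs_dir only when the obsid is ambiguous.
            key = f"{rec['obs_dir']}__{obsid}" if len(group) > 1 else obsid
            if key in index:
                # Same obsid in the same directory: fail loudly, never overwrite.
                raise ValueError(f"unresolvable duplicate: {key}")
            index[key] = rec
    return index
```

Raising on an unresolvable duplicate keeps the failure visible instead of letting the last record win.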

3

Rubin Sampling + T CrB

Process

Clean, standardize, and transform records into tables, features, figures, and analysis-facing artifacts.

  • schema standardization
  • clean product generation
  • feature and figure production
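
Schema standardization can be pictured as mapping each source's column names onto one shared set of fields. The following is a minimal sketch: the source labels, the column names in `COLUMN_MAPS`, and the common fields are invented for illustration and do not describe the real archive formats or Astrolyte's schema.

```python
# Hypothetical common schema shared by all processed tables.
COMMON_FIELDS = ("time", "mag", "band")

# Hypothetical per-source column names mapped onto the common schema.
COLUMN_MAPS = {
    "source_a": {"JD": "time", "Magnitude": "mag", "Band": "band"},
    "source_b": {"hjd": "time", "mag": "mag", "filter": "band"},
}


def standardize(row: dict, source: str) -> dict:
    """Rename known columns, drop unknown ones, and coerce numeric fields."""
    mapping = COLUMN_MAPS[source]
    out = {mapping[k]: v for k, v in row.items() if k in mapping}
    missing = [f for f in COMMON_FIELDS if f not in out]
    if missing:
        raise KeyError(f"{source} row is missing {missing}")
    out["time"] = float(out["time"])
    out["mag"] = float(out["mag"])
    out["source"] = source  # keep provenance on every standardized row
    return out
```

For example, `standardize({"JD": "2460000.5", "Magnitude": "9.8", "Band": "V"}, "source_a")` yields a row with `time`, `mag`, `band`, and `source` keys, so later stages never need to know which provider a record came from in order to read it.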

4

IRIS + Rubin Sampling + T CrB

Validate

Run overlap checks, audit comparisons, and baseline evaluation so failure modes stay visible.

  • duplicate-window pair audit
  • period recovery evaluation
  • AAVSO vs ASAS-SN overlap validation
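
An overlap check between two light-curve sources can be sketched as pairing observations that are close in time and summarizing the magnitude differences. This is an illustrative sketch only: the pairing rule, the `max_dt` tolerance of 0.5 days, and the function names are assumptions, not the validation method the project actually uses.

```python
from statistics import median


def overlap_offsets(a, b, max_dt=0.5):
    """Pair each (time, mag) point in a with the nearest-in-time point in b.

    Points are kept only when the time gap is at most max_dt (days, assumed).
    Returns the list of magnitude differences a - b for the kept pairs.
    """
    offsets = []
    for t_a, m_a in a:
        nearest = min(b, key=lambda p: abs(p[0] - t_a), default=None)
        if nearest is not None and abs(nearest[0] - t_a) <= max_dt:
            offsets.append(m_a - nearest[1])
    return offsets


def summarize(offsets):
    """Report how many pairs overlapped and their median offset."""
    if not offsets:
        return {"n_pairs": 0, "median_offset": None}
    return {"n_pairs": len(offsets), "median_offset": median(offsets)}
```

Reporting the pair count alongside the median matters: a small offset computed from very few overlapping points is weak evidence, and keeping `n_pairs` in the output leaves that failure mode visible.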

5

All three lineages

Serve

Expose raw, processed, and curated products for inspection and reuse without breaking provenance.

  • downloadable artifacts
  • inspection-friendly lane pages
  • raw-to-curated lineage visibility
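
Raw-to-curated lineage visibility implies that any served product can be traced back to the raw files it came from. A minimal sketch of that idea follows; the `Artifact` class, its fields, and the `raw_sources` function are hypothetical names chosen for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Artifact:
    """Hypothetical artifact record: a name, a layer, and its direct inputs."""
    name: str
    layer: str  # "raw", "processed", or "curated"
    parents: tuple = ()


def raw_sources(artifact: Artifact) -> set:
    """Return the names of every raw artifact this artifact derives from."""
    if artifact.layer == "raw":
        return {artifact.name}
    found = set()
    for parent in artifact.parents:
        found |= raw_sources(parent)
    return found
```

With two raw files feeding one processed table and that table feeding a curated figure, `raw_sources(figure)` returns the names of both raw files, which is the kind of answer a lineage view needs to give.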

Layers

Raw, processed, curated

The pipeline stays legible because the storage layers never collapse into one undifferentiated output surface.

Layer 1

Raw

Original archive files, untouched source records, and retrieval context remain intact.

  • FITS references
  • archive responses
  • retrieval manifests

Layer 2

Processed

Cleaned tables, standardized schemas, indexed metadata, and validated observation records are generated downstream of the raw layer.

  • metadata index
  • parquet tables
  • quality checks

Layer 3

Curated

Derived features, summaries, figures, and analysis-ready exports become the reusable public surface.

  • figure bundles
  • event catalogs
  • download exports