A self-improving evaluation system that scores orchestration quality across five weighted metrics. Like observing stars through a telescope, we measure the brilliance of AI orchestration.
Drop in your .md, .txt, or .jsonl file. The parser extracts orchestration signals: skills invoked, agents spawned, decision points, error recoveries, learning signals.
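A minimal sketch of how such a parser could count signals in a .jsonl session log. The event schema (a `type` field per line) and the signal-type names are assumptions, not the real SessionStellar format.

```python
import json
from collections import Counter

# Hypothetical signal taxonomy mirroring the five signal kinds listed above.
SIGNAL_TYPES = {"skill_invoked", "agent_spawned", "decision_point",
                "error_recovery", "learning_signal"}

def extract_signals(lines):
    """Count orchestration signals from an iterable of JSONL lines."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        event = json.loads(line)
        if event.get("type") in SIGNAL_TYPES:
            counts[event["type"]] += 1
    return dict(counts)

sample = [
    '{"type": "skill_invoked", "skill": "search"}',
    '{"type": "agent_spawned", "agent": "planner"}',
    '{"type": "skill_invoked", "skill": "summarize"}',
    '{"type": "error_recovery"}',
]
print(extract_signals(sample))
# {'skill_invoked': 2, 'agent_spawned': 1, 'error_recovery': 1}
```

For .md or .txt uploads the same counting step would sit behind a format-specific front end that first recovers events from prose.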
The scoring engine evaluates across five weighted metrics. Each metric is scored 0-100, then combined into a composite score via a weighted sum.
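The weighted-sum composite can be sketched as follows. The metric names and weights here are illustrative placeholders; the real five metrics and their weights are not stated in this section.

```python
# Hypothetical metric names and weights (must sum to 1.0).
WEIGHTS = {
    "skill_coverage": 0.30,
    "agent_coordination": 0.25,
    "decision_quality": 0.20,
    "error_recovery": 0.15,
    "learning_signals": 0.10,
}

def composite_score(metrics):
    """Combine five 0-100 metric scores into one weighted composite."""
    assert set(metrics) == set(WEIGHTS), "all five metrics required"
    assert all(0 <= v <= 100 for v in metrics.values()), "scores must be in [0, 100]"
    return sum(WEIGHTS[name] * score for name, score in metrics.items())

scores = {"skill_coverage": 90, "agent_coordination": 80,
          "decision_quality": 70, "error_recovery": 60, "learning_signals": 50}
print(composite_score(scores))  # 75.0
```

Because the weights sum to 1, the composite stays on the same 0-100 scale as the individual metrics.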
Each session appears on the public leaderboard with a full breakdown. Filter by score range, project type, or skill category.
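The three filters could compose like this. The session record shape (`score`, `project_type`, `skills`) is an assumption for the sketch, not the actual leaderboard API.

```python
def filter_sessions(sessions, min_score=0, max_score=100,
                    project_type=None, skill=None):
    """Return sessions within a score range, optionally narrowed
    by project type or skill category."""
    return [
        s for s in sessions
        if min_score <= s["score"] <= max_score
        and (project_type is None or s["project_type"] == project_type)
        and (skill is None or skill in s["skills"])
    ]

sessions = [
    {"id": 1, "score": 92, "project_type": "web", "skills": ["search"]},
    {"id": 2, "score": 71, "project_type": "cli", "skills": ["refactor"]},
    {"id": 3, "score": 55, "project_type": "web", "skills": ["refactor"]},
]
print([s["id"] for s in filter_sessions(sessions, min_score=60)])      # [1, 2]
print([s["id"] for s in filter_sessions(sessions, skill="refactor")])  # [2, 3]
```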
Top sessions are analyzed for winning patterns, and the system adjusts its own scoring weights from what it finds, closing a compound learning loop.
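One plausible reading of that loop, sketched below: after each leaderboard refresh, nudge each metric's weight toward the normalized average score that top-ranked sessions achieved on it, then renormalize so the weights still sum to 1. The learning rate and all names are assumptions for illustration.

```python
def update_weights(weights, top_sessions, lr=0.1):
    """Shift weights toward the metrics top sessions score highest on."""
    n = len(top_sessions)
    # Mean metric score of top sessions, scaled from 0-100 down to 0-1.
    means = {m: sum(s[m] for s in top_sessions) / n / 100.0 for m in weights}
    # Move each weight a fraction lr toward that mean, then renormalize.
    nudged = {m: w + lr * (means[m] - w) for m, w in weights.items()}
    total = sum(nudged.values())
    return {m: w / total for m, w in nudged.items()}

weights = {"skill_coverage": 0.30, "agent_coordination": 0.25,
           "decision_quality": 0.20, "error_recovery": 0.15,
           "learning_signals": 0.10}
top = [{"skill_coverage": 95, "agent_coordination": 90, "decision_quality": 85,
        "error_recovery": 40, "learning_signals": 60}]
new_weights = update_weights(weights, top)
print(round(sum(new_weights.values()), 6))  # 1.0
```

A small learning rate keeps any single leaderboard refresh from swinging the scoring rubric, which matters when the scores themselves decide which sessions count as "top".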
This system was built in a single AI session and will evaluate itself. The orchestration patterns used to build SessionStellar become the first entries in its own star chart—a recursive validation of its scoring algorithm.