← Decision Ecosystem

evaluation-calibration-core

Source: GitHub repo (ARCHITECTURE.md). Evidence artifacts from PacketV2 traces.

Data flow

PacketV2 traces -> read_packets() -> compute_metrics() -> build_report() -> Report

Metrics computation

From PacketV2 traces: action distribution, guard trigger rates, latency percentiles, invariant pass rates.

Invariant checks

Contracts

Components

1. Packet Reader (eval_calibration_core/io/packet_reader.py)

Class: PacketReader — Reads PacketV2 from JSONL; validates schema compatibility.

2. Metrics Computation (eval_calibration_core/metrics/compute.py)

Function: compute_metrics(packets: Iterable[PacketV2]) -> dict — Action distribution, guard trigger rates, latency percentiles, invariant verification.

3. Report Generation (eval_calibration_core/report.py)

Function: build_report(packets: Iterable[PacketV2]) -> Report — JSON + Markdown report with metrics and invariant checks.

Safety invariants

Import

from eval_calibration_core.report.builder import build_report