Dashboard

Welcome back, Anna.

Here is what is happening across your behavior packages today.

Create package

Total packages

12

Private packages

4

Eval runs (30d)

87

Green badges

9

Recent packages

View all
NameTypeVersionStatusUpdated
customer-support-classifier

Fine-tune recipe for triaging customer-support tickets into priority and topic queues.

FineTuneSpecv0.4.2Green badge2026-04-28
dod-memo-formatter

PromptSpec for generating DoD-format memorandums with correct headers, subject lines, and reference blocks.

PromptSpecv1.2.0Green badge2026-04-15
export-control-reviewer

SafetySpec policy that flags content covered by EAR/ITAR before generation completes.

Safety Policyv0.6.0Review2026-04-02
sec-risk-disclosure-drafter

Drafts Item 1A risk-factor sections for 10-K and 10-Q filings, grounded in the registrant's filings.

PromptSpecv0.9.1Green badge2026-03-25

Recent eval runs

RunPackageScoreStatusStartedDuration
run_8a91customer-support-classifier94.0%passed2026-05-06 18:42Z3m 4s
run_8a90dod-memo-formatter96.0%passed2026-05-06 15:11Z1m 2s
run_8a8fexport-control-reviewer74.0%failed2026-05-06 11:28Z4m 1s
run_8a8eprompt-injection-eval-suite88.0%passed2026-05-05 21:02Z10m 12s
run_8a8dpolicy-brief-generator91.0%passed2026-05-05 16:34Z5m 5s