anthropic ·claude-sonnet-4-5May 18, 06:01 PM
Asset 1XIUcA9
Score
0.97
Latency
3.86s
Cost
$0.012
Workflow Eval Detail
Automatically segments long-form video content into navigable chapters with timestamps and titles—enabling viewers to jump to key moments instantly.
Chapters workflow shows consistently high quality and low cost across providers, with google gemini-3.1-flash-lite and its preview variant emerging as the leading options on this 14-run sample.
Each eval run captures efficacy, efficiency, and expense. We use this data to compare providers and track regressions over time.
We evaluate chapter segmentation quality, timestamp accuracy, and title relevance alongside latency and cost metrics.
| Provider | Model | Cases | Avg Score | Avg Latency | Avg Tokens | Avg Cost | Avg Cost / Min |
|---|---|---|---|---|---|---|---|
| anthropic | claude-sonnet-4-5 | 5 | 0.99 | 4.57s | 3,837 | $0.0131 | $0.0015/min |
| gemini-2.5-flash | 5 | 0.98 | 7.21s | 5,051 | $0.0047 | $0.0005/min | |
| gemini-3-flash-preview | 5 | 0.96 | 9.96s | 5,622 | $0.0073 | $0.0009/min | |
| gemini-3.1-flash-lite | 5 | 0.99 | 1.48s | 3,789 | $0.0012 | $0.0001/min | |
| gemini-3.1-flash-lite-preview | 5 | 0.99 | 1.32s | 3,782 | $0.0012 | $0.0001/min | |
| openai | gpt-5-mini | 4 | 0.91 | 20.3s | 4,596 | $0.0032 | $0.0004/min |
| openai | gpt-5.1 | 3 | 0.97 | 3.73s | 3,349 | $0.0017 | $0.0002/min |