openai ·gpt-5.1Feb 18, 09:12 PM
Asset 1XIUcA9
Score
0.97
Latency
2.49s
Cost
$0.0017
Workflow Eval Detail
Automatically segments long-form video content into navigable chapters with timestamps and titles—enabling viewers to jump to key moments instantly.
Chapters workflow performs with high structural accuracy and low cost across providers, with OpenAI gpt-5.1 currently the best quality/latency/cost tradeoff, though results are based on a small sample of runs.
Each eval run captures efficacy, efficiency, and expense. We use this data to compare providers and track regressions over time.
We evaluate chapter segmentation quality, timestamp accuracy, and title relevance alongside latency and cost metrics.
| Provider | Model | Cases | Avg Score | Avg Latency | Avg Tokens | Avg Cost | Avg Cost / Min |
|---|---|---|---|---|---|---|---|
| anthropic | claude-sonnet-4-5 | 2 | 0.97 | 4.27s | 3,173 | $0.0105 | $0.0012/min |
| gemini-2.5-flash | 2 | 0.96 | 6.64s | 4,470 | $0.0045 | $0.0005/min | |
| gemini-3-flash-preview | 2 | 0.96 | 8.47s | 4,357 | $0.0046 | $0.0005/min | |
| openai | gpt-5-mini | 2 | 0.92 | 20.01s | 3,782 | $0.0025 | $0.0003/min |
| openai | gpt-5.1 | 2 | 0.97 | 2.2s | 2,771 | $0.0014 | $0.0002/min |