anthropic ·claude-sonnet-4-5May 18, 06:01 PM
Asset 88Lb01q
Score
1
Latency
6.96s
Cost
$0.0119
Workflow Eval Detail
Converts captions into multiple languages, helping you reach global audiences without manual translation work.
Caption Translation workflow shows consistently high quality across providers, with Google gemini-3.1-flash-lite(-preview) emerging as the best balance of quality, latency, and cost, though per-model results are based on only 3 cases each.
Each eval run captures efficacy, efficiency, and expense. We use this data to compare providers and track regressions over time.
We validate VTT structure, translation faithfulness, and language code integrity, plus performance and budget targets.
| Provider | Model | Cases | Avg Score | Avg Latency | Avg Tokens | Avg Cost | Avg Cost / Min |
|---|---|---|---|---|---|---|---|
| anthropic | claude-sonnet-4-5 | 3 | 1 | 7.69s | 1,896 | $0.012 | $0.021/min |
| gemini-2.5-flash | 3 | 1 | 3.4s | 1,946 | $0.0021 | $0.0037/min | |
| gemini-3-flash-preview | 3 | 0.83 | 20.14s | 6,225 | $0.0155 | $0.0271/min | |
| gemini-3.1-flash-lite | 3 | 1 | 2.33s | 1,855 | $0.0012 | $0.0021/min | |
| gemini-3.1-flash-lite-preview | 3 | 1 | 2.35s | 1,859 | $0.0012 | $0.0021/min | |
| openai | gpt-5-mini | 3 | 0.91 | 21.7s | 3,211 | $0.0045 | $0.0078/min |
| openai | gpt-5.1 | 3 | 1 | 6.18s | 1,554 | $0.0047 | $0.0082/min |