Phase 3

LLM Cost & Speed Comparison

Visualize tradeoffs between model size, speed, and cost

Input: 1000 tokens • Output: 500 tokens

Model	Parameters	Time (s)	Cost ($)
GPT‑3.5 Turbo	175B	12.50	0.0030
GPT‑4	1000B	33.33	0.0900
Llama 2 (70B)	70B	16.67	0.0015
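
The page does not say how the Time and Cost columns are computed. One reading that reproduces the table above is sketched below, assuming the input and output figures are token counts, that cost is a single flat price per 1,000 tokens applied to input plus output, and that time is the output length divided by a fixed generation speed. The per-model rates, the Model class, and the function names are hypothetical: the rates were back-calculated from the table rows and are not taken from any provider's price list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    price_per_1k_tokens: float   # USD; ASSUMED flat rate for input and output
    output_tokens_per_s: float   # ASSUMED generation speed


# Hypothetical rates, back-calculated from the table rows (not from the source).
MODELS = [
    Model("GPT-3.5 Turbo", 0.002, 40.0),
    Model("GPT-4", 0.06, 15.0),
    Model("Llama 2 (70B)", 0.001, 30.0),
]


def estimate(model: Model, input_tokens: int, output_tokens: int) -> tuple[float, float]:
    """Return (time in seconds, cost in USD) under the assumptions above."""
    # Time: only generated tokens are counted; prompt processing is ignored.
    time_s = output_tokens / model.output_tokens_per_s
    # Cost: every token, input or output, is billed at the same rate.
    cost_usd = (input_tokens + output_tokens) / 1000 * model.price_per_1k_tokens
    return time_s, cost_usd


def comparison_rows(input_tokens: int = 1000, output_tokens: int = 500):
    """Build (name, time, cost) rows rounded the same way as the table."""
    rows = []
    for model in MODELS:
        time_s, cost_usd = estimate(model, input_tokens, output_tokens)
        rows.append((model.name, round(time_s, 2), round(cost_usd, 4)))
    return rows


if __name__ == "__main__":
    for name, time_s, cost_usd in comparison_rows():
        print(f"{name}\t{time_s:.2f}\t{cost_usd:.4f}")
```

Changing input_tokens or output_tokens in comparison_rows shows how the gap between models moves: under these assumptions cost scales with the total token count, while time depends only on the output length.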

Selecting a model is about balancing…

Smaller models can be better when…