DeepInfra

DeepInfra hosts 12 active AI models, with input pricing from $0.06 per 1M tokens, averaging 69 tok/s output throughput, with up to 1.0M context window. Compare DeepInfra's API pricing, latency, and feature support against other LLM providers.

12 models hosted
~69 tok/s output
Input $0.06/1M
Output $0.06/1M

Model catalog

Models available via DeepInfra's API.

← All API providers · Benchmark scores