DeepInfra

DeepInfra hosts 12 active AI models, with input pricing from $0.06 per 1M tokens, averaging 69 tok/s output throughput, with up to 1.0M context window. Compare DeepInfra's API pricing, latency, and feature support against other LLM providers.

12 models hosted

~69 tok/s output