API providers

Where to run frontier models — pricing, throughput, and full model catalogs.

9 providers

Anthropic

Anthropic hosts 5 active AI models, with input pricing from $1.00 per 1M tokens, an average output throughput of 50 tok/s, and context windows of up to 1.0M tokens. Compare Anthropic's API pricing, latency, and feature support against other LLM providers.

5 models

50 tok/s avg

from $1.00/1M in

Amazon Bedrock

This catalog lists 0 active AI models for Amazon Bedrock. Compare Amazon Bedrock's API pricing, latency, and feature support against other LLM providers.

0 models

DeepInfra

DeepInfra hosts 12 active AI models, with input pricing from $0.06 per 1M tokens, an average output throughput of 69 tok/s, and context windows of up to 1.0M tokens. Compare DeepInfra's API pricing, latency, and feature support against other LLM providers.

12 models

69 tok/s avg

from $0.06/1M in

Fal

Fal hosts 29 active AI models, with context windows of up to 8K tokens. Compare Fal's API pricing, latency, and feature support against other LLM providers.

29 models

Google

Google hosts 14 active AI models, with input pricing from $0.25 per 1M tokens, an average output throughput of 88 tok/s, and context windows of up to 1.0M tokens. Compare Google's API pricing, latency, and feature support against other LLM providers.

14 models

88 tok/s avg

from $0.25/1M in

Novita

Novita hosts 20 active AI models, with input pricing from $0.08 per 1M tokens, an average output throughput of 53 tok/s, and context windows of up to 262K tokens. Compare Novita's API pricing, latency, and feature support against other LLM providers.

20 models

53 tok/s avg

from $0.08/1M in

OpenAI

OpenAI hosts 34 active AI models, with input pricing from $0.10 per 1M tokens, an average output throughput of 103 tok/s, and context windows of up to 1.1M tokens. Compare OpenAI's API pricing, latency, and feature support against other LLM providers.

34 models

103 tok/s avg

from $0.10/1M in

Replicate

Replicate hosts 35 active AI models, with context windows of up to 10K tokens. Compare Replicate's API pricing, latency, and feature support against other LLM providers.

35 models

xAI

xAI hosts 10 active AI models, with input pricing from $0.20 per 1M tokens, an average output throughput of 90 tok/s, and context windows of up to 2.0M tokens. Compare xAI's API pricing, latency, and feature support against other LLM providers.

10 models

90 tok/s avg

from $0.20/1M in
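
The per-provider figures above can be compared with a little arithmetic. The sketch below is only an illustration: it copies the listed "from" input prices and average throughputs for the six providers that report both, and the names Provider, input_cost_usd, and generation_seconds are hypothetical helpers, not part of any provider's API. Note the assumptions: the "from" price is the lowest listed for a provider and the throughput is an average across its models, so the two numbers may not describe the same model, and output-token pricing is not listed here, so only input cost is estimated.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Provider:
    name: str
    input_usd_per_million: float  # lowest listed input price, USD per 1M tokens
    avg_tok_per_s: float          # listed average output throughput


# Figures copied from the listing; providers without price/throughput are omitted.
PROVIDERS = [
    Provider("Anthropic", 1.00, 50),
    Provider("DeepInfra", 0.06, 69),
    Provider("Google", 0.25, 88),
    Provider("Novita", 0.08, 53),
    Provider("OpenAI", 0.10, 103),
    Provider("xAI", 0.20, 90),
]


def input_cost_usd(provider: Provider, input_tokens: int) -> float:
    """Lower-bound input cost: tokens times the 'from' price per 1M tokens."""
    return provider.input_usd_per_million * input_tokens / 1_000_000


def generation_seconds(provider: Provider, output_tokens: int) -> float:
    """Rough time to stream output_tokens at the listed average throughput."""
    return output_tokens / provider.avg_tok_per_s


def cheapest(providers: list[Provider]) -> Provider:
    """Provider with the lowest listed input price."""
    return min(providers, key=lambda p: p.input_usd_per_million)


def fastest(providers: list[Provider]) -> Provider:
    """Provider with the highest listed average throughput."""
    return max(providers, key=lambda p: p.avg_tok_per_s)


if __name__ == "__main__":
    # Example workload (assumed): 2M input tokens, 1,000 output tokens.
    for p in sorted(PROVIDERS, key=lambda p: p.input_usd_per_million):
        cost = input_cost_usd(p, 2_000_000)
        secs = generation_seconds(p, 1_000)
        print(f"{p.name:<10} ${cost:6.2f} input  ~{secs:5.1f} s for 1,000 output tokens")
```

Treat the results as rough bounds rather than quotes: actual cost and speed depend on the specific model chosen within each provider's catalog.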