No papers today (arXiv publishes Mon-Fri).
Inference Providers
10 platforms for AI model serving, structured for quick comparison.
OpenAI
openai.com/api/pricing/Best for: General purpose production apps
Source: pricing page · Status: error · Checked: 2026-05-03
Anthropic
www.anthropic.com/pricingBest for: Long-context analysis, agents, research writing
Source: pricing page · Status: ok · Checked: 2026-05-03
Together AI
www.together.ai/pricingBest for: Cost-sensitive open-source model inference
Source: pricing page · Status: ok · Checked: 2026-05-03
Groq
groq.com/pricingBest for: Very low latency chat and real-time UX
Source: pricing page · Status: ok · Checked: 2026-05-03
Cohere
cohere.com/pricingBest for: Enterprise RAG, retrieval, embeddings
Source: pricing page · Status: ok · Checked: 2026-05-03
Replicate
replicate.com/pricingBest for: Trying many open models quickly
Source: pricing page · Status: changed · Checked: 2026-05-03
AWS Bedrock
aws.amazon.com/bedrock/pricing/Best for: AWS-integrated enterprise workloads
Source: pricing page · Status: changed · Checked: 2026-05-03
Google Vertex AI
cloud.google.com/vertex-ai/generative-ai/pricingBest for: Gemini/GCP-integrated workloads
Source: pricing page · Status: changed · Checked: 2026-05-03
Hugging Face
huggingface.co/pricingBest for: Custom models, endpoints, open ML ecosystem
Source: pricing page · Status: changed · Checked: 2026-05-03
Mistral AI
mistral.ai/pricingBest for: European AI stack and efficient model APIs
Source: pricing page · Status: ok · Checked: 2026-05-03