Groq
Status: Operational
Fastest inference on custom LPU hardware
Lightning-fast inference powered by custom Language Processing Units.
Company
- Headquarters: Mountain View, CA
- Founded: 2016
Capabilities
- Models Hosted: Curated selection
- Inference Speed: 500+ tokens/sec
- Specialties: Ultra-low latency
- Unique Features: Custom LPU hardware
- API Style: OpenAI-compatible (see the Python sketch after this list)
- Compute Location: Global
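Because the API is OpenAI-compatible, existing OpenAI client code can typically be pointed at Groq just by swapping the base URL and key. Below is a minimal sketch using the openai Python client; the endpoint URL and model id are assumptions to verify against Groq's current documentation:

```python
# Minimal sketch: calling Groq through its OpenAI-compatible API.
# The base URL and model id are assumptions; confirm them in Groq's docs.
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://api.groq.com/openai/v1",  # assumed OpenAI-compatible endpoint
    api_key=os.environ["GROQ_API_KEY"],         # export your key before running
)

response = client.chat.completions.create(
    model="llama-3.1-8b-instant",  # placeholder model id
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.choices[0].message.content)
```

Everything else, including the request shape and response parsing, matches a standard OpenAI call, which is the practical payoff of an OpenAI-compatible API.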
Models
Coming soon
We are standardizing model listings across providers.
Why Use Groq
Blazing Fast
500+ tokens/second inference speed; a rough way to verify this yourself is sketched after this list.
Free Tier
Generous free tier for testing.
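A throughput figure like 500+ tokens/sec is straightforward to sanity-check. The rough sketch below streams a completion and counts output chunks per second as a proxy for tokens per second; it reuses the assumed endpoint and placeholder model id from the earlier example, so treat the result as approximate:

```python
# Rough throughput check: stream a completion and estimate tokens/sec.
# Chunk count is used as a proxy for token count, so the figure is approximate.
import os
import time

from openai import OpenAI

client = OpenAI(
    base_url="https://api.groq.com/openai/v1",  # assumed endpoint, as above
    api_key=os.environ["GROQ_API_KEY"],
)

start = time.perf_counter()
chunks = 0
stream = client.chat.completions.create(
    model="llama-3.1-8b-instant",  # placeholder model id
    messages=[{"role": "user", "content": "Write a 200-word summary of LPUs."}],
    stream=True,
)
for chunk in stream:
    # Each streamed chunk usually carries roughly one token's worth of text.
    if chunk.choices and chunk.choices[0].delta.content:
        chunks += 1
elapsed = time.perf_counter() - start
print(f"~{chunks / elapsed:.0f} chunks/sec (rough proxy for tokens/sec)")
```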
Details
About Groq
Groq developed the Language Processing Unit (LPU), custom silicon optimized for low-latency AI inference.