Together AI

Llama 3.3 70B

Available open-source

Meta's most efficient open-source model with flagship performance

Released December 6, 2024

Context Window
128K
TTFT
N/A
Speed
N/A
Max Output
4K
Training Cutoff
December 2023

Last tested: 2026-01-15

About

Llama 3.3 70B delivers performance comparable to the much larger 405B model while being significantly more cost-effective. Built on Meta's latest architecture improvements, it excels at reasoning, coding, and multilingual tasks while remaining fully open-source under the Llama 3.3 Community License.

Capabilities

function-calling streaming json-mode multilingual code-generation

Pricing

Input
$0.88/M
per 1M tokens
Output
$0.88/M
per 1M tokens

Static pricing

Details

Overview

Llama 3.3 70B represents Meta’s commitment to democratizing AI by delivering frontier-level performance in an open-source package. This model achieves comparable results to Llama 3.1 405B on many benchmarks while requiring significantly fewer compute resources.

Key Strengths

Efficiency Leadership: The model delivers exceptional performance-per-dollar, making it ideal for production workloads where cost matters. With 128K context window support, it handles long documents and complex conversations with ease.

Code Generation: Llama 3.3 70B excels at programming tasks, supporting multiple languages and demonstrating strong reasoning about code structure and debugging.

Multilingual Support: Built-in support for 8 languages including English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai makes it suitable for global applications.

Use Cases

  • Enterprise Chat Applications: Cost-effective alternative to proprietary models for customer service and internal tools
  • Code Assistance: IDE integrations, code review, and documentation generation
  • Document Analysis: Legal document review, contract analysis, and research synthesis
  • Multilingual Applications: Global customer support and content localization

Open Source Advantage

As a fully open-source model, Llama 3.3 70B can be self-hosted, fine-tuned, and customized for specific use cases. Organizations retain full control over their AI infrastructure while benefiting from Meta’s research investment.

Newsletter

Get the signal, skip the noise.

Weekly digest of new models and provider updates across 41+ compute providers. Curated for AI builders who ship.

New model releases
Capability updates
Provider status
bots.so
The AI Inference Model Index
© bots.so — The AI Inference Model Index

bots.so aggregates publicly available model deployment information from official provider sources. We are not affiliated with any model provider. Model availability changes rapidly; always verify on official sites.