Llama 3.3 70B

Summary: Llama 3.3 70B is a breakthrough medium-sized language model that delivers flagship-level quality traditionally associated with 405B models while maintaining significantly improved efficiency. This advanced model excels in complex reasoning, nuanced language understanding, and sophisticated problem-solving, making it ideal for enterprise applications, advanced chatbots, content generation, and professional AI assistants that demand high-quality responses without the computational overhead of larger models.

Intelligence

Speed

Sovereignty

Input

Output

High

Medium

Low

Text

Text

Central parameters

Description: Latest text-only model from Meta with 70B parameters, benchmarked to achieve 405B-level quality at 70B inference speeds.

Model identifier: meta-llama/Llama-3.3-70B-Instruct

IONOS AI Model Hub Lifecycle and Alternatives

IONOS Launch

End of Life

Alternative

Successor

March 15, 2025

N/A

Origin

Provider

Country

License

Flavor

Release

USA

Instruct

December 9, 2024

Technology

Context window

Parameters

Quantization

Multilingual

128k

70.6B

fp8

Yes

Modalities

Text

Image

Audio

Input and output

Not supported

Not supported

Endpoints

Chat Completions

Embeddings

Image generation

v1/chat/completions

Not supported

Not supported

Features

Streaming

Tool calling

Supported

Supported

Rate limits

Rate limits ensure fair usage and reliable access to the AI Model Hub. In addition to the contract-wide rate limits, no model-specific limits apply.

Last updated

Was this helpful?