Llama 3.1 405B
AI Model Hub for Free: From December 1, 2024, to June 30, 2025, IONOS is offering all foundation models in the AI Model Hub for free. Create your contract today and kickstart your AI journey!
Summary: Large US model with high response quality but slow inference speed introduced by Meta.
Intelligence
Speed
Sovereignty
Input
Output
High
Low
Low
Text
Text
Central parameters
Description: Meta Llama 3.1 405B is the largest pure open-source large language model trained by Meta. It is especially suited for use cases with irrelevant response time, but crucial answer quality. It natively supports various European languages but focuses on English.
Model identifier: meta-llama/Meta-Llama-3.1-405B-Instruct-FP8
IONOS AI Model Hub Lifecycle and Alternatives
IONOS Launch
End of Life
Alternative
Successor
Origin
Technology
Context window
Parameters
Quantization
Multilingual
128k
406B
int4
Yes
Modalities
Text
Image
Audio
Input and output
Not supported
Not supported
Endpoints
Chat Completions
Embeddings
Image generation
v1/chat/completions
Not supported
Not supported
Features
Streaming
Tool calling
Supported
Supported
Rate limits
Rate limits ensure fair usage and reliable access to the AI Model Hub. In addition to the contract-wide rate limits, no model-specific limits apply.
Last updated
Was this helpful?