Llama 3.1 405B

Summary: Llama 3.1 405B is Meta's flagship large language model representing the pinnacle of open-source AI capabilities with exceptional reasoning abilities and comprehensive knowledge coverage. This massive model excels in the most demanding AI applications including advanced research, complex problem-solving, sophisticated content creation, and enterprise-grade AI solutions where maximum intelligence and accuracy are paramount, despite longer inference times inherent to its large-scale architecture.

Intelligence

Speed

Sovereignty

Input

Output

High

Low

Low

Text

Text

Central parameters

Description: Largest open-source model from Meta with 405B parameters, optimized with FP8 quantization for maximum intelligence and knowledge coverage.

Model identifier: meta-llama/Meta-Llama-3.1-405B-Instruct-FP8

IONOS AI Model Hub Lifecycle and Alternatives

IONOS Launch

End of Life

Alternative

Successor

August 1, 2024

N/A

Origin

Provider

Country

License

Flavor

Release

USA

Instruct

July 23, 2024

Technology

Context window

Parameters

Quantization

Multilingual

128k

406B

int4

Yes

Modalities

Text

Image

Audio

Input and output

Not supported

Not supported

Endpoints

Chat Completions

Embeddings

Image generation

v1/chat/completions

Not supported

Not supported

Features

Streaming

Tool calling

Supported

Supported

Rate limits

Rate limits ensure fair usage and reliable access to the AI Model Hub. In addition to the contract-wide rate limits, no model-specific limits apply.

Last updated

Was this helpful?