For the complete documentation index, see llms.txt. This page is also available as Markdown.

Cloud GPU VM Overview

The powerful Graphics Processing Unit (GPU) VMs excel at massive parallel processing. This specialized hardware best accelerates compute-intensive workloads, such as training complex AI models, machine learning inference, and high-speed 3D rendering.

Cloud GPU VMs use predefined templates that cannot be modified. Use the DCD or IONOS CLOUD API with template read access and volume management to deploy Cloud GPU VMs. Plan storage requirements carefully to select the appropriate template, as insufficient internal storage will require the use of additional volumes.

Cloud GPU VM configuration

Dedicated instance specifications and configuration templates

The server sizing model allocates Cloud GPU VMs with corresponding dedicated CPU cores and RAM based on available host capacity. The architecture uses PCIe passthrough for direct hardware access and optimal performance.

Template specifications

You may choose between the following four template sizes. The templates can only be used with the Cloud GPU VMs. CPU and RAM allocate proportionally to the number of GPUs. Resources use dedicated cores with limited flexibility.

The breakdown of resources is as follows:

Template

GPU Model

GPU Type

Number of GPUs

Dedicated CPUs

RAM (GiB)

Storage (GB)

S

NVIDIA H200

H200 PCIe

1

15

267

1024

M

NVIDIA H200

H200 PCIe

2

30

534

1536

L

NVIDIA H200

H200 PCIe

4

60

1068

2048

XL

NVIDIA H200

H200 PCIe

8

127

2136

4096

Resource limits: By default, you can deploy only 1 Cloud GPU VM using the H200–S template. To deploy templates sized M, L, or XL, or to run multiple S instances, you must first request a resource limit increase through the IONOS Cloud Support.

Dedicated resource model

  • CPU Cores: Dedicated AMD EPYC Turin (non-shared) allocation, with a fixed ratio proportional to the number of GPUs.

  • Memory: Fixed ratio based on host specifications.

  • Flexibility: Static resource allocation without dynamic scaling. For more information, see Known Constraints.

Storage selection

Note: The platform supports only IONOS CLOUD Linux images at start.

The first connected volume serves as the storage volume, containing the operating system and required system files. Provision storage volumes with adequate capacity at the initial Cloud GPU VM provisioning, because they use fixed sizing and cannot be detached or upscaled after deployment. You can also add additional storage for datasets requiring expansion. For more information, see Block Storage.

GPU specifications

The following table provides the specifications:

Specification

Details

Hardware architecture

Uses "PCIe passthrough" architecture to provide direct hardware access for optimal performance, simplified deployment, and accelerated production readiness. For more information, see GPU architecture and performance.

GPU model

Primary Offering: High-End NVIDIA H200 GPUs

Maximum GPUs per template

8x GPU units per server

Deployment density

Optimized for high-performance inference workloads

Data security

IONOS CLOUD provides SSD premium as the default attached volume for Cloud GPU VMs.

Last updated

Was this helpful?