# Cloud GPU VM Overview

The powerful Graphics Processing Unit (GPU) VMs excel at massive parallel processing. This specialized hardware best accelerates compute-intensive workloads, such as training complex AI models, machine learning inference, and high-speed 3D rendering.

Cloud GPU VMs use predefined templates that cannot be modified. Use the DCD or Cloud API with template read access and volume management to deploy Cloud GPU VMs. Plan storage requirements carefully to select the appropriate template, as insufficient internal storage will require the use of additional volumes.

## Cloud GPU VM configuration

### Dedicated instance specifications and configuration templates

The server sizing model allocates Cloud GPU VMs with corresponding dedicated CPU cores and RAM based on available host capacity. The architecture uses PCIe passthrough for direct hardware access and optimal performance.

#### Template specifications

You may choose between the following four template sizes. The templates can only be used with the Cloud GPU VMs. CPU and RAM allocate proportionally to the number of GPUs. Resources use dedicated cores with limited flexibility.

{% hint style="warning" %}
**Warning:** Configuration templates are created during provisioning and cannot be changed later.
{% endhint %}

The breakdown of resources is as follows:

| **Template** | **GPU Model** | **GPU Type** | **Number of GPUs** | **Dedicated CPUs** | **RAM (GiB)** | **Storage (GB)** |
| ------------ | ------------- | ------------ | ------------------ | ------------------ | ------------- | ---------------- |
| **S**        | NVIDIA H200   | H200 PCIe    | 1                  | 15                 | 267           | 1024             |
| **M**        | NVIDIA H200   | H200 PCIe    | 2                  | 30                 | 534           | 1536             |
| **L**        | NVIDIA H200   | H200 PCIe    | 4                  | 60                 | 1068          | 2048             |
| **XL**       | NVIDIA H200   | H200 PCIe    | 8                  | 127                | 2136          | 4096             |

{% hint style="info" %}
**Resource limits:** By default, you can deploy only **1 Cloud GPU VM** using the **H200–S** template. To deploy templates sized **M**, **L**, or **XL**, or to run multiple **S** instances, you must first request a resource limit increase through the [<mark style="color:blue;">IONOS Cloud Support</mark>](https://docs.ionos.com/cloud/support/general-information/contact-information).
{% endhint %}

### Dedicated resource model

* **CPU Cores:** Dedicated AMD EPYC Turin (non-shared) allocation, with a fixed ratio proportional to the number of GPUs.
* **Memory:** Fixed ratio based on host specifications.
* **Flexibility:** Static resource allocation without dynamic scaling. For more information, see [<mark style="color:blue;">Known Constraints</mark>](https://docs.ionos.com/cloud/compute-services/compute-engine/cloud-gpu-vm/overview/limitations).

## Storage selection

{% hint style="info" %}
**Note:** The platform supports only [<mark style="color:blue;">IONOS Cloud Linux images</mark>](https://docs.ionos.com/cloud/storage-and-backup/images-snapshots) at launch.
{% endhint %}

The first connected volume serves as the storage volume, containing the operating system and required system files. Provision storage volumes with adequate capacity at the initial Cloud GPU VM provisioning, because they use fixed sizing and cannot be detached or upscaled after deployment. You can also add additional storage for datasets requiring expansion. For more information, see [<mark style="color:blue;">Block Storage</mark>](https://docs.ionos.com/cloud/storage-and-backup/block-storage).

## GPU specifications

The following table provides the specifications:

| **Specification**             | **Details**                                                                                                                                                                                                                                                                                                                                                                 |
| ----------------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| **Hardware architecture**     | Uses "PCIe passthrough" architecture to provide direct hardware access for optimal performance, simplified deployment, and accelerated production readiness. For more information, see [<mark style="color:blue;">GPU architecture and performance</mark>](https://docs.ionos.com/cloud/compute-services/compute-engine/cloud-gpu-vm-faq#gpu-architecture-and-performance). |
| **GPU model**                 | **Primary Offering:** High-End NVIDIA H200 GPUs                                                                                                                                                                                                                                                                                                                             |
| **Maximum GPUs per template** | 8x GPU units per server                                                                                                                                                                                                                                                                                                                                                     |
| **Deployment density**        | Optimized for high-performance inference workloads                                                                                                                                                                                                                                                                                                                          |

## Data security

IONOS Cloud provides SSD premium as the default attached volume for Cloud GPU VMs.
