Overview
AI Model Hub for free: From December 1, 2024 until March 31, 2025, IONOS offers all foundation models of the AI Model Hub for free. Create your contract now and get your AI journey started today!
The IONOS AI Model Hub is designed to simplify the deployment and management of advanced machine learning models, eliminating the complexities associated with hardware and infrastructure. This inference service serves a range of powerful AI models that enable developers to implement sophisticated AI solutions without concerns about underlying hardware and operational overhead.
IONOS' AI Model Hub supports various use cases, including:
Text Generation: Utilize pre-trained Large Language Models (LLMs) to generate text and answer queries using textual descriptions.
Image Generation: Utilize pre-trained text-to-image models to create images based on textual descriptions.
Document Collections: Store and query extensive document collections based on semantic similarity.
Retrieval Augmented Generation (RAG): Enhance responses by combining Large Language Models with contextually relevant documents stored in a vector database.
Features
The IONOS AI Model Hub Service offers a wide array of features tailored to meet the needs of modern developers:
Managed Hosting: Utilize AI models without needing to maintain the underlying infrastructure.
Security and Compliance: Keep your data secure and compliant with regulations, as data processing is confined within Germany. Your input data is not used for training purposes in any way.
Scalability: Scale your AI deployments seamlessly to meet your needs.
Integration Options: Easily integrate with your applications using REST APIs that are fully OpenAI-compatible, with support for popular programming languages like Python and Bash.
Diverse Model Offerings: Choose from various foundation models, including Large Language Models and text-to-image models, each capable of generating innovative and sophisticated AI outputs.
Document Collections: Store and manage document collections and perform semantic similarity searches to extract contextually relevant information.
Retrieval Augmented Generation: Combine vector databases and Large Language Models to generate enhanced outputs that are contextually aware, providing more accurate and helpful responses.
Token-based Billing: Pay for the services based on the number of tokens used, enabling cost-efficient usage and transparency in billing.
Concepts
Understanding the foundational concepts of the IONOS AI Model Hub will help you leverage its full potential:
Foundation Models
Foundation models are pre-trained on massive datasets to perform a wide range of language and image processing tasks. They can generate text, answer questions, and create images based on textual descriptions. With IONOS, you can access these models via APIs, simplifying the process of integrating advanced AI capabilities into your applications.
Key Points:
Access various open-source Large Language Models for text generation and text-to-image models for image generation.
Use models without managing underlying hardware.
Maintain data privacy and comply with German data protection regulations.
Document Collections
Vector databases provide a way to store and manage document collections, enabling semantic similarity searches. Documents are converted to embeddings (vector representations), allowing the discovery of related content through similarity searches.
Key Points:
Persist documents and search for semantically similar content.
Use API endpoints to manage document collections and perform searches.
Ensure document storage and processing stays within Germany.
Retrieval Augmented Generation (RAG)
Retrieval Augmented Generation enhances the performance of Large Language Models by combining their inherent capabilities with contextually relevant information retrieved from document collections stored in vector databases. This approach allows the model to produce highly accurate and detailed responses tailored to specific queries.
Key Points:
Use Large Language Models together with document collections from vector databases.
Improve response accuracy and relevance by incorporating additional context.
Implement sophisticated AI solutions using a combination of querying and generation.
Components
API Endpoints
Use dedicated REST API endpoints to interact with various models and services. These endpoints are designed to facilitate the quick and easy integration of AI capabilities into your applications. The IONOS AI Model Hub provides two API options for maximum flexibility: its native IONOS AI Model Hub API and an OpenAI-compatible API, making it easy to work with tools that support OpenAI endpoints.
OpenAI-Compatible Endpoints:
These endpoints mirror OpenAI’s API structure, allowing for seamless integration with tools and platforms already designed for OpenAI:
Models: Retrieve the list of available models and their details.
Chat Completions: Generate conversational responses using supported Large Language Models.
Image Generations: Generate high-quality images based on text prompts.
Embeddings: Generate text embeddings as numerical vectors for semantic search, text similarity, and clustering.
Native IONOS AI Model Hub Endpoints:
Model Management: Endpoints for retrieving model lists, querying models, and managing predictions.
Document Management: Endpoints for creating, modifying, retrieving, and deleting document collections and individual documents.
Querying and Generating: Endpoints for combining semantic searches with Large Language Models to implement Retrieval Augmented Generation scenarios.
Authentication and Authorization
Security is paramount, and IONOS provides robust mechanisms to authenticate and authorize API requests. You must generate and use API tokens to access the AI services securely. For more information about generating a corresponding token, see the Access Management tutorial.
Data Privacy and Compliance
IONOS ensures that all data processing complies with German and European data protection regulations. Your data is processed within Germany, providing an additional layer of security and compliance.
Technical Support
IONOS offers expert technical support to help you troubleshoot and optimize your AI deployments. Whether you need assistance with API integration or model performance, the support and Professional Service team is available to ensure your success during German business hours.
Backup of Collections in Vector Database
IONOS does not back up the data saved to collections in the vector database. Ensure you have a backup strategy to restore your collections in case of accidental deletion.
Last updated