Models overview

Available models for Berget AI serverless inference

All models run on Berget AI's European infrastructure and are accessed via an OpenAI-compatible API. Pricing is per million tokens.

Text

General-purpose instruction-following models for chat, reasoning, and structured output.

Embedding

Vector representations of text for semantic search, clustering, and retrieval-augmented generation.

Reranking

Scores and reorders a list of documents by relevance to a query, useful for improving retrieval quality in RAG pipelines.

Speech-to-text

Transcribes audio to text, with specialised models for Swedish and Norwegian alongside a multilingual option.

Next steps

On this page