Models

The language models available in meinGPT, their strengths, and recommendations for usage

meinGPT brings together leading language models (LLMs) from multiple providers under a single interface. For each chat, you choose the model that fits the task — based on the work to be done, the speed you need, and your data protection requirements.

Indicators in the model picker

The model picker shows several indicators to help you decide:

  • EU flag — the model is hosted in the EU and is GDPR-compliant without further measures.
  • US flag / globe — the model runs outside the EU.
  • Quality and speed indicators — visible on hover, each rated 1–5.
  • Reasoning tag — marks models that use an explicit thinking step before answering.
  • Preview tag — the model is still in a preview phase (functionality and behavior may change).
  • Free indicator — the model is not billed and is always available (currently GPT-5 Nano).
  • Best model — heuristic auto-selection; defaults to EU-hosted models only.

Hinweis

Data protection: EU-hosted models process all data within the EU. For assistants, you must set the model explicitly — an EU model is not chosen automatically.

Recommendations for everyday use

Use caseRecommendation
Standard chat (default)GPT-5.1, GPT-5.5, Claude Sonnet 4.6
Deep reasoning, strategy, auditsGPT-5 Thinking, o3, Gemini 2.5 Pro
Coding and refactoringClaude Sonnet 4.6, Claude Opus 4.8, GPT-5.4
Very long documents / 1M contextGPT-5.4, GPT-5.5, Claude Opus 4.8, Gemini 2.5 Pro
High volume, high throughput, low costGPT-5 mini, GPT-5 Nano (free), Gemini 2.5 Flash
Web research with source citationsPerplexity Online, Perplexity Deep Research
Default for assistants with tool callso3, GPT-5 Thinking

Tipp

When in doubt, start with a regular (non-reasoning) model. Only switch to a reasoning model if the answers aren't good enough — reasoning consumes noticeably more credits.

Reasoning models vs. standard LLMs

Reasoning models perform a visible thinking step before producing the actual answer. This delivers significantly better results on multi-step or logic-heavy tasks — at the cost of speed and credits.

Task typeStandard LLMReasoning model
Simple text, emails, summariesYesNo (too expensive)
Creative writingYesNo
Complex logic, mathematics, formal analysisNoYes
Multi-step planningSometimesYes
Coding (simple)YesNo
Coding (complex / architectural)SometimesYes
Compliance analysis, auditsNoYes

OpenAI

OpenAI models mostly run via Azure OpenAI Service — for EU hosting, exclusively in EU data centers (Sweden Central). Individual US variants go directly via OpenAI. Data is not used for training.

ModelHostingReasoningContextGood for
GPT-4o MiniEU128kClassification, tagging, simple summaries, FAQ bots
GPT-4oEU128kGeneral assistant, coding, multimodal, multilingual
GPT-4.1EU1MKnowledge work, consistent output, tool use, long documents
o4-miniEU200kEfficient reasoning tasks, many requests with a logic component
o3EU200kStructured reasoning, multi-step workflows, tool calls in assistants
o3-proUS200kHighly demanding logic, formal analysis, compliance, audits
GPT-5EU400kHigh-quality text, code, agents, long contexts
GPT-5 ThinkingEU400kDifficult logic, planning, mathematics, deep-dive analysis
GPT-5 miniEU400kMany requests, simple tasks, classification, pipelines
GPT-5 NanoEU400kFree for basic conversations and highly cost-sensitive bulk processing
GPT-5.1EU400kStandard model for business and development, agents, planning
GPT-5.2EU✓ Coding400kCoding-heavy workflows that need reasoning (EU)
GPT-5.2 (US)US✓ Coding400kSame class as GPT-5.2, US region
GPT-5.3 ChatUS✓ Coding128kChat-oriented variant with reasoning and coding focus
GPT-5.4EU✓ Coding1MVery long contexts, heavy reasoning and coding tasks (EU)
GPT-5.4 ProUS✓ Coding1MMaximum reasoning + 1M context for the most demanding workloads
GPT-5.5EU✓ Coding1MCurrent EU flagship: reasoning, coding, long contexts

Anthropic (Claude)

Claude models run via Google Vertex. Anthropic pioneered the Artifacts concept — accordingly, Claude models produce especially clean artifact output (interactive documents, diagrams, code snippets).

ModelHostingReasoningContextGood for
Claude Haiku 4.5EU200kFast, cheap Anthropic responses for simple tasks
Claude Sonnet 4EU200kAdvanced coding, good balance of intelligence and speed
Claude Sonnet 4.5EU200kProduction coding, complex agents, multi-tool workflows
Claude Sonnet 4.6EU200kEfficient Sonnet all-rounder with high speed
Claude Opus 4.1US200kAgentic tasks, large refactorings, software engineering
Claude Opus 4.5EU200kOpus class with EU hosting for high-quality requirements
Claude Opus 4.6EU200kCurrent EU Opus for the most demanding tasks
Claude Opus 4.7EU1MOpus with 1M context for very long documents and codebases
Claude Opus 4.8EU1MCurrent Opus with 1M context, sharper agentic coding and stronger reasoning

Google (Gemini)

Gemini models run via Google Vertex and stand out with the largest context windows (1M+ tokens). Ideal for long documents, entire knowledge bases, or extensive file uploads.

ModelHostingReasoningContextGood for
Gemini 2.5 FlashEU1MBulk processing, fast APIs, image analysis
Gemini 2.5 ProEU1MComplex code projects, STEM, large datasets, agentic workflows
Gemini 3 Flash (Preview)US1MNext-generation Flash — preview
Gemini 3.5 FlashEU1MLatest Flash, combines reasoning at Pro level with high speed, multimodal input (text, image, audio, video)
Gemini 3.1 Pro (Preview)US1MMultimodal reasoning, long contexts, agentic workflows — preview

Mistral

Mistral models are hosted in European data centers and offer a strong EU-native alternative for business applications.

ModelHostingReasoningContextGood for
Mistral Medium 3EU128kEnterprise deployments, multimodal analysis, long documents, coding
Mistral Medium 3.5EU256kLatest Mistral Medium generation with larger context
Magistral Medium 1.2EU128kEU reasoning alternative for logic and analysis tasks
CodeStralEU— Coding256kIDE code completion, fill-in-the-middle, code tests

DeepSeek

DeepSeek models run via Azure AI Foundry and offer cost-efficient reasoning and coding options.

ModelHostingReasoningContextGood for
DeepSeek-R1Global128kMathematical proofs, chain-of-thought, logic puzzles
DeepSeek-V3.2Global— Coding128kCost-efficient batch processing, coding (Python/C++), multilingual

Other models

ModelProviderHostingReasoningContextGood for
Llama 3.3Meta (hosted on Nebius)EU128kEnterprise RAG, tool use, JSON output, on-premise scenarios
GPT-OSS 120BOpenAI open weights (hosted on Nebius)EU128kSelf-hosting, RAG, on-prem agents
Kimi K2.5Moonshot (hosted on Azure AI Foundry)Global✓ Coding131kReasoning and coding tasks with large context
Kimi K2.6Moonshot (hosted on Azure AI Foundry Sweden)Global✓ Coding256kSuccessor to K2.5 with doubled context window
Perplexity OnlinePerplexityUS128kWeb research, source citations, fact-checking
Perplexity Deep ResearchPerplexityUS128kDeeper web research with multiple search steps and summaries

Image generation

Image models (FLUX family, GPT Image, Gemini image models, Recraft) are documented separately under Image model management — including hosting region, premium status, and recommendations.

Enabling or disabling models

Which models your users can choose from in the workspace is controlled by you as an admin in the settings. Details: Workspace configuration.

Was this page helpful?