Cookie Settings

    We use cookies to improve your experience on our website. You can choose which cookie categories you want to accept. Learn more

    Responsible Party
    Contact Form
    uNaice
    As of June 2026 · 25+ models · tokens & subscriptions compared

    Which AI model
    fits your task — and what does it really cost?

    Tell us what you want to do — we'll show you the right model, the honest token bill and why ChatGPT Plus, Claude Pro & co. are not directly comparable.

    This comparison updates automatically once a week · next update: Mon, 06/07/2026 · last updated: 29/06/2026

    Get your data right first

    DataNaicer bundles, cleans and enriches your data — the prerequisite for any AI model.

    Need support introducing AI in your company?

    Book a 30-min web meeting with our experts — concrete next steps, no slideware.

    Interactive model finder

    Find the right AI model in just a few clicks

    Pick your use case and three requirements — we suggest the best models and compare the costs for you.

    1

    Pick the use case that fits best.

    2

    Tells us whether to prefer cheap or top-tier models.

    Multi-select. For sensitive data usually EU (GDPR).

    The “context window” — the AI's short-term memory.

    Auto-matched to your selected use case – you can override anytime.

    Prices & context sizes automatically once a week updated · next update: Mon, 06/07/2026
    Your result

    Your personal recommendation

    Out of 40+ models, freshly calculated — this fits you best:

    Costs estimated for 500k input + 200k output tokens / month (≈ a few thousand requests). Need an exact calculation? Let's run the numbers together →

    ByteDance Doubao 🇨🇳

    Doubao Pro 1.5

    Top pick

    Extremely cheap for mass content, social tagging and content moderation.

    Input: $0.11/1MOutput: $0.28/1MContext: 256k
    Cost / month (API)$0.11

    Google

    Gemini 2.5 Flash-Lite

    Tagging product data, bulk translations, simple classification — when every cent per call matters.

    Input: $0.1/1MOutput: $0.4/1MContext: 1M
    Cost / month (API)$0.13

    Meta (Llama)

    Llama 4 Maverick

    Open-weights flagship for on-prem and fine-tuning when you want to host or adapt the weights yourself.

    Input: $0.5/1MOutput: $1.5/1MContext: 1M
    Cost / month (API)$0.55

    OpenAI

    GPT-5 mini

    The workhorse for production apps: chat assistants, content drafts, RAG answers, mid-tier tool calling.

    Input: $0.4/1MOutput: $1.6/1MContext: 400k
    Cost / month (API)$0.52

    Don't want to decide yourself?

    We build you multi-model routing that automatically picks the cheapest good-enough model per task — and integrate it into your ERP, CRM or PIM. Why “chat only” isn't enough →

    Free intro consultation

    Immer up-to-date bei neuen KI-Modellen oder Preisen

    Kurze Mail, sobald ein neues Modell erscheint oder sich Preise ändern. Kein Spam, jederzeit abbestellbar.

    Mit dem Absenden bestätigst du, dass wir dir Updates per E-Mail senden dürfen. Abmeldung jederzeit per Link in der Mail.

    Picking the model is just the start

    Turning a model into real business value takes clean data, clear processes and the right distribution. That's where we come in.

    Need clean data as input?

    No model in the world will rescue bad input data. We bundle, clean and enrich your data so every token counts.

    Go to DataNaicer

    Want to turn data into content directly?

    Product descriptions, SEO copy and variants at scale — generated from data instead of typed laboriously into ChatGPT.

    Go to ContentNaicer

    Need continuous reach?

    News Stream takes over LinkedIn, blog and newsletter fully automatically — we pick the right AI model per format for you.

    Go to News Stream

    Publisher or media house?

    Built for publishers: News Stream produces editorial content, SEO articles and social posts at scale — with transparent tiered pricing.

    See publisher pricing

    All AI models – pricing in detail

    List prices per 1 M tokens (USD) plus a benefit recommendation per model.

    OpenAI

    ModelInput / 1MOutput / 1MContextWhat you'd use it for
    GPT-5$5.00$15.00400kWhen you need a real second opinion: strategy papers, legal analysis, hard code refactors, agents that orchestrate multiple tools cleanly.
    Subscription equivalent: ChatGPT Plus 20 $ / Pro 200 $ pro Monat
    GPT-5 mini$0.40$1.60400kThe workhorse for production apps: chat assistants, content drafts, RAG answers, mid-tier tool calling.
    GPT-4o$2.50$10.00128kVoice assistants, image description and OCR — anywhere you need to mix text, image and audio.
    o4-mini$1.10$4.40200kMaths, logic, unit-test generation and coding agents when you need reasoning but don't want to pay GPT-5 prices.

    Anthropic

    ModelInput / 1MOutput / 1MContextWhat you'd use it for
    Claude Fable 5$10.00$50.001MAnthropic's new top tier (Mythos class). For the truly hard problems: multi-step research agents, autonomous coding sessions running for hours, deep legal analysis.
    Subscription equivalent: Nur über Claude Max 200 $ oder API
    Claude Opus 4.8$5.00$25.001MCurrently the first choice for long-running coding agents (Claude Code, Cursor) and whole codebases or document stacks in context. Writes at a top level with style and consistency.
    Subscription equivalent: Claude Pro 20 $ / Max 100–200 $ pro Monat
    Claude Sonnet 4.6$3.00$15.001MThe workhorse for coding agents and long-context RAG — cheaper than Opus, almost as good for most tasks.
    Claude Haiku 4.5$1.00$5.00200kCustomer-support bots, ticket routing, sentiment and intent classification at high volume.

    Google

    ModelInput / 1MOutput / 1MContextWhat you'd use it for
    Gemini 2.5 Pro$1.25$10.002MWhen you need to understand entire codebases, videos or hundreds of PDFs at once — the king of long context.
    Subscription equivalent: Gemini Advanced 21,99 $ / AI Ultra 250 $ pro Monat
    Gemini 2.5 Flash$0.30$2.501MRAG, translations, image recognition at scale — excellent price/performance with long context.
    Gemini 2.5 Flash-Lite$0.10$0.401MTagging product data, bulk translations, simple classification — when every cent per call matters.

    Meta (Llama)

    ModelInput / 1MOutput / 1MContextWhat you'd use it for
    Llama 4 Maverick$0.50$1.501MOpen-weights flagship for on-prem and fine-tuning when you want to host or adapt the weights yourself.
    Llama 4 Scout$0.15$0.6010MLoad huge knowledge bases fully into context without having to build a RAG pipeline.

    Mistral

    ModelInput / 1MOutput / 1MContextWhat you'd use it for
    Mistral Large 2$2.00$6.00128kFirst choice for GDPR requirements and EU data residency. Multilingual, solid function calling.
    Subscription equivalent: Le Chat Pro 14,99 € pro Monat
    Mistral Small 3$0.20$0.6032kEU-compliant low-latency apps, edge deployments, fast internal tools.

    xAI

    ModelInput / 1MOutput / 1MContextWhat you'd use it for
    Grok 4$3.00$15.00256kWhen real-time web and X data have to be part of the answer — e.g. trend research or social monitoring.
    Subscription equivalent: X Premium+ 40 $ / SuperGrok 30–300 $ pro Monat
    Grok 4 mini$0.30$1.50128kFast, cheap answers with up-to-date web knowledge.

    DeepSeek 🇨🇳

    ModelInput / 1MOutput / 1MContextWhat you'd use it for
    DeepSeek-V3.2$0.27$1.10128kVery cheap general-purpose model with surprisingly strong coding performance — the price breaker for volume.
    DeepSeek-R1$0.55$2.19128kFrontier-level reasoning at a fraction of the cost of GPT-5 or Opus 4.5.

    Alibaba Qwen 🇨🇳

    ModelInput / 1MOutput / 1MContextWhat you'd use it for
    Qwen3-Max$1.20$6.001MTop all-rounder from China, very strong in coding and multilingual tasks (CN, EN, DE).
    Qwen3-Coder$0.30$1.201MSearch huge repos, refactor, generate tests — cheap coding agents.

    Moonshot Kimi 🇨🇳

    ModelInput / 1MOutput / 1MContextWhat you'd use it for
    Kimi K2$0.60$2.502MLong documents, research agents, "read this book and answer my questions" scenarios.

    Zhipu GLM 🇨🇳

    ModelInput / 1MOutput / 1MContextWhat you'd use it for
    GLM-4.6$0.60$2.20200kSolid all-rounder with good tool use and CN/EN strength — happy to use as a backup router target.

    Baidu Ernie 🇨🇳

    ModelInput / 1MOutput / 1MContextWhat you'd use it for
    Ernie 4.5 Turbo$0.55$2.20128kChinese-language customer communication, marketing content for the CN market.

    ByteDance Doubao 🇨🇳

    ModelInput / 1MOutput / 1MContextWhat you'd use it for
    Doubao Pro 1.5$0.11$0.28256kExtremely cheap for mass content, social tagging and content moderation.

    MiniMax 🇨🇳

    ModelInput / 1MOutput / 1MContextWhat you'd use it for
    MiniMax M2$0.30$1.201MMultimodal apps (text + image) at a low price, popular in Asia.

    Note: Prices are list prices for the providers' APIs (as of June 2026) and change regularly. Discounts via batch, cache or volume contracts and regional hosting surcharges (Azure, Vertex AI, Bedrock) are not included.

    Turn your data into content at scale

    ContentNaicer generates product descriptions, SEO copy and variants — data-driven, not typed.

    Distribute content across all channels — automatically

    News Stream publishes to LinkedIn, blog and newsletter on autopilot. Talk to an expert.

    Common questions about AI model costs

    5 things nobody says out loud — and that aren't on any price list.

    • Are output tokens really more expensive than input tokens?

      Yes, usually by a factor of 3–5×. If you generate long answers you pay disproportionately. Trick: keep system prompts short, cap answer length.

    • Is a large context window automatically more expensive?

      Long prompts cost linearly with token count. A 1M context only pays off if you actually need it — RAG is often cheaper.

    • What are reasoning or thinking tokens?

      The GPT-o series, DeepSeek-R1 and Claude Opus “think” internally. These thinking tokens get billed too, which is why reasoning models often come out higher than expected.

    • What does multi-model routing actually give you?

      Route simple tasks to Flash-Lite or DeepSeek and hard ones to Sonnet or GPT-5 — in our projects this typically saves 60–80 % of the cost.

    • Which AI models can I use under GDPR?

      For GDPR-critical workloads, Mistral (EU hosting) or self-hosted Llama are usually the only clean choice. Chinese models have no place in sales or personal data.

    Not sure which model fits your use case?

    We pick for you — data-driven, vendor-neutral and with an eye on total cost of ownership.