Cost calculator

What does a conversational AI agent cost per month?

Estimate the raw LLM spend of running a conversational shopping agent. Adjust your traffic, engagement, retrieval, tools and model; the defaults are pre-filled to match how a real WisWes agent runs.

Pick your industry, B2B/B2C and current conversion rate to auto-tune retrieval, tools and conversation depth below. Spec- and fitment-heavy verticals like electronics, auto parts and furniture need more RAG and tools than fast-moving ones like grocery or apparel, so they cost more per conversation. You can still fine-tune every value afterwards.

Audience

Unique users = your monthly unique visitors (GA4 “Users”, or Shopify “Online store visitors”). Engagement rate = visitors who open the chat ÷ total visitors; start at 20% if you don’t track it yet. Conversations you’ll pay for = users × engagement rate.
Engaged conversations / month: 400
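
As a minimal sketch of the arithmetic above, the snippet below multiplies unique users by engagement rate. The function name and the 2,000-visitor figure are hypothetical: the traffic number is simply one input that, at the suggested 20% starting rate, yields the 400 conversations shown here.

```python
def engaged_conversations(unique_users: int, engagement_rate: float) -> int:
    """Conversations per month = unique users x engagement rate."""
    if not 0.0 <= engagement_rate <= 1.0:
        raise ValueError("engagement_rate must be a fraction between 0 and 1")
    return round(unique_users * engagement_rate)


# Hypothetical traffic: 2,000 monthly visitors at the suggested 20% starting rate.
print(engaged_conversations(2_000, 0.20))  # 400
```

Passing the rate as a fraction (0.20) rather than a percentage (20) keeps the formula identical to the one in the panel.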

Retrieval (RAG)

RAG context ≈ (results returned per answer) × (tokens per result). WisWes returns ~8 product matches (~60 tok each) or ~3 FAQ answers (~150 tok each) per turn → ~1,200 tok. Paste one retrieved snippet into the Token estimator, then multiply by how many you show.
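
The results × tokens-per-result formula can be sketched as below. The function name is an assumption made for illustration; the counts and token sizes are the approximate ones quoted above. Both products come out below the ~1,200 tok figure, so treat that number as a separate default rather than something this sketch reproduces.

```python
def rag_context_tokens(results_per_answer: int, tokens_per_result: int) -> int:
    """RAG context per turn ~= results returned x tokens per result."""
    return results_per_answer * tokens_per_result


product_context = rag_context_tokens(8, 60)  # ~8 product matches, ~60 tok each
faq_context = rag_context_tokens(3, 150)     # ~3 FAQ answers, ~150 tok each
print(product_context, faq_context)  # 480 450
```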

Tools

Count the actions your agent can take (search, recommend, add-to-cart, track order, hand off…). Each tool’s JSON definition is ~150–250 tokens and is re-sent on every call, so tools × tokens-per-tool is added to every turn. Paste one tool’s schema into the Token estimator to measure yours precisely.
Tool definitions per call: 3,230 tok
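
To see how quickly tool definitions add up, here is a rough sketch of the tools × tokens-per-tool rule. The function name and the 10-tool agent are hypothetical; only the ~150–250 tokens-per-tool range comes from the text above.

```python
def tool_definition_tokens(tool_count: int, tokens_per_tool: int) -> int:
    """Tokens added to every call by tool definitions: tools x tokens-per-tool."""
    return tool_count * tokens_per_tool


# Hypothetical agent with 10 tools, bracketed by the ~150-250 tok range per tool.
low = tool_definition_tokens(10, 150)
high = tool_definition_tokens(10, 250)
print(low, high)  # 1500 2500
```

Because the definitions are re-sent on every call, this overhead is paid once per turn, not once per conversation.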
Paste a real sample (a product snippet, your system prompt, or one tool’s JSON schema) to measure it, then apply it to a field. Rule of thumb: ~4 characters ≈ 1 token.
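
If you want a quick estimate outside the page, the ~4 characters per token rule of thumb can be sketched as follows. This is an approximation only, not how the Token estimator or any real tokenizer works; the function name and the sample schema are made up for illustration.

```python
import math


def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Rough token count using the ~4 characters per token rule of thumb."""
    return math.ceil(len(text) / chars_per_token)


# Hypothetical one-line tool schema used purely as sample input.
sample = '{"name": "track_order", "description": "Look up an order by its ID"}'
print(len(sample), "characters ->", estimate_tokens(sample), "tokens (approx.)")
```

Real tokenizers vary by model and by content (JSON and non-English text often tokenize less efficiently), so measure a real sample before relying on the estimate.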

Model

Prices are per 1M tokens, split into input (what you send: prompt + tools + RAG + history) and output (what the model writes). Output is usually 4–10× the input rate, but you send far more input, so input often dominates. Use a “flash/mini” model for high-volume FAQ and search; reserve premium models for complex reasoning.
Input: $0.300 / 1M
Output: $2.50 / 1M

Gemini 3 Flash price is estimated — not yet in the rate card.
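
To illustrate why input tends to dominate despite the cheaper rate, the sketch below prices a single conversation using the per-conversation token counts from the summary on this page (76,440 input, 3,000 output) and the rates shown above. The function and constant names are assumptions for illustration.

```python
INPUT_PRICE_PER_M = 0.30   # USD per 1M input tokens (rate shown above)
OUTPUT_PRICE_PER_M = 2.50  # USD per 1M output tokens (rate shown above)


def cost_per_conversation(input_tokens: int, output_tokens: int) -> tuple[float, float]:
    """Return (input cost, output cost) in USD for one conversation."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost, output_cost


input_cost, output_cost = cost_per_conversation(76_440, 3_000)
print(f"price ratio (output/input): {OUTPUT_PRICE_PER_M / INPUT_PRICE_PER_M:.1f}x")  # 8.3x
print(f"input ${input_cost:.4f} vs output ${output_cost:.4f}")  # input $0.0229 vs output $0.0075
```

Output costs about 8.3× more per token here, but roughly 25× more input tokens are sent, so input ends up at about three times the output spend.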

Estimated monthly model spend

$12.17/mo

Conversations: 400
Cost / conversation: $0.0304
Input tokens: 31M
Output tokens: 1.2M
Input: $9.17, Output: $3.00
Input tokens / call: 5,770
Input tokens / conversation: 76,440
Output tokens / conversation: 3,000
Model: Gemini 3 Flash

Raw LLM provider spend only — excludes the WisWes subscription, infrastructure and embedding costs. Assumes no prompt-cache discount, matching current WisWes behaviour.

How the estimate works

Everything you need to fill in each field — and how WisWes turns it into a monthly number. Each settings panel above links here.

Using this calculator

The monthly cost is (input tokens ÷ 1,000,000 × the model’s input price) + (output tokens ÷ 1,000,000 × its output price). WisWes derives the token counts by multiplying your unique monthly users by the engagement rate to get conversations, then estimating input and output tokens per conversation from the system prompt, tool definitions, retrieved RAG context, accumulated history and the model’s answers.
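
The formula above can be sketched in a few lines. This is an illustrative reconstruction, not WisWes’s actual code: the function name is an assumption, and the inputs are the figures from the summary on this page (400 conversations, 76,440 input and 3,000 output tokens per conversation, $0.30 and $2.50 per 1M tokens).

```python
def monthly_cost(
    conversations: int,
    input_tokens_per_conversation: int,
    output_tokens_per_conversation: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> tuple[float, float, float]:
    """Return (input cost, output cost, total) in USD per month."""
    input_tokens = conversations * input_tokens_per_conversation
    output_tokens = conversations * output_tokens_per_conversation
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost, output_cost, input_cost + output_cost


input_cost, output_cost, total = monthly_cost(400, 76_440, 3_000, 0.30, 2.50)
print(f"${input_cost:.2f} + ${output_cost:.2f} = ${total:.2f}/mo")  # $9.17 + $3.00 = $12.17/mo
print(f"${total / 400:.4f} per conversation")  # $0.0304 per conversation
```

With these inputs the sketch reproduces the summary: about 30.6M input tokens and 1.2M output tokens, for $12.17 per month.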

E-commerce profile

Audience: users, engagement & conversations

Retrieval (RAG)

Tools

Tokens & the token estimator

Choosing a model

Advanced assumptions

Scope & WisWes billing

Stop guessing your AI bill — ship a predictable plan.

WisWes runs frontier models with usage included and pay-per-result overages. Start a 14-day free trial — no credit card.