Plainbase Ask — Self-hosted support chatbot

02 — What's inside

Six surfaces. One job each.

The admin is laid out the way you actually work: ingest, instruct, embed, review, escalate, tune. Nothing else.

A — Knowledge Base

Upload a file or crawl a whole help center.

Every answer is grounded in what's indexed. Drop in PDFs and Markdown for one-off content; point the crawler at a URL and it walks the same-domain links, and stays within the page limit you set.

PDF & Markdown uploads — chunked and embedded the moment they arrive.
Site crawler with a hard page cap (default 50, max 200) and live "Stop" button.
Scheduled rechecking — daily, weekly, biweekly, monthly. Or never.
Per-source embed cost shown next to every document, so spend is never a surprise.

ask.acme.co/admin/knowledge-base admin

KNOWLEDGE BASE

Sources

Documents

142

+12 this week

Chunks

3,821

avg 27 / doc

Corpus

2.1M

characters

Embed spend

$1.84

all-time

SOURCE

TYPE

CHUNKS

COST

CRAWLED

docs.acme.co

WEB

2,940

$1.21

2h ago

2026-pricing-guide.pdf

PDF

$0.04

—

status.acme.co

WEB

412

$0.18

9h ago

refund-policy.md

$0.01

—

B — Instructions

Preset guardrails. With plenty of flexibility.

Plainbase Ask assembles its system prompt from four layers, stacked. You edit one of them — the part that actually changes between deployments. Tone, scope, when to escalate, anything time-sensitive.

Tone & persona — give the bot a name and a voice in 1–3 sentences.
Scope & guardrails — what to answer, what to politely decline.
Escalation hints — when to proactively offer the ticket button.
Trigger phrases — exact strings that surface the ticket button on contact.

ask.acme.co/admin/instructions admin

INSTRUCTIONS · SYSTEM PROMPT

How the prompt is assembled

~ 1,240 tokens

System Rules

Hardcoded platform rules: KB search, citation format, ticket tool, markdown allowlist.

LOCKED

Your instructions

Tone & persona · scope & guardrails · escalation hints · additional context · trigger phrases.

YOU EDIT

Knowledge Base

Top matching chunks retrieved per turn from your vector store.

PER-TURN

Conversation memory

Last 10 messages (configurable) so context survives across turns.

ROLLING

C — Widget

One script tag. Themed and translated.

Configure the chat bubble in the admin; copy the embed snippet; paste it into your site. Multiple languages get a built-in picker. Allowed domains keep the widget locked to your sites and nobody else's.

Brand color & logo to match the rest of your site.
Multi-language strings — title, starter message, ticket form copy, office hours.
Domain allowlist so the widget only loads where you allow it.
Live preview next to the form — see the change before you deploy it.

Widget status

Active · accepting requests

Closed button label

Need help? Chat with us

Brand color

#161616

Swatches

Languages

🇬🇧 English 🇫🇷 Français 🇩🇪 Deutsch

Allowed domains

acme.co
app.acme.co
docs.acme.co

D — Conversations

Full conversation history. Including AI input-output.

Read what visitors asked, what the bot answered, and exactly what the model saw on each call — system prompt, retrieved chunks, tool calls, response. Useful when the bot says something it shouldn't have, and you want to know why.

Search and filter by visitor email, message content, or ticket status.
AI Logs — full system prompt, KB chunks, tool calls, response, thinking tokens.
Per-conversation cost — input tokens, output tokens, estimated spend.

ask.acme.co/admin/conversations admin

marc@…5m

How do I cancel my plan and get…

anon11m

Does the team plan include SSO?

lila@…1h

My export isn't downloading…TICKET

anon2h

What languages do you support?

jens@…4h

Pricing for 50 seats annual?

anonyest

Is there an iOS app?

Messages AI Logs · 3 calls · $0.0021

USER How do I cancel my plan and get a prorated refund?

BOT Go to Settings → Billing → Cancel plan. Cancellations made before the renewal date are prorated automatically. [docs.acme.co/billing/cancel]

USER Can I talk to someone? My charge looks wrong.

BOT Of course — let me get a human on this.TICKET TRIGGER

E — Ticketing

Escalation by email. Send it to any tool.

When the bot can't help — or someone asks for a human — the widget captures their email and submits a ticket. The ticket is recorded in the database and emailed to your inbox with the full transcript attached. Forward it into Zendesk, HubSpot, Linear, or just answer it from your support inbox.

One ticket per conversation, rate-limited to one per IP per hour.
Full conversation transcript in the body of the email — context, not summaries.
Any SMTP provider — Postmark, Resend, Brevo, Gmail, or your own server.
Email-to-ticket compatible with Zendesk, HubSpot, Crisp, Freshdesk, Linear, and more.

								
								SENT · 10:42 · plainbase ask →
									support@acme.co

[Plainbase Ask] New support request from marc@example.com

From: bot@acme.co · To: support@acme.co

Subject: Cancellation charge looks wrong — conversation #a7c3b9

User <marc@example.com>
How do I cancel my plan and get a prorated refund?

Maya (bot)
Go to Settings → Billing → Cancel plan. Cancellations
before renewal are prorated automatically.

User
Can I talk to someone? My charge looks wrong.

Maya (bot)
Of course — let me get a human on this with the full thread.

— end of transcript —
conversation_id: a7c3b9d2 · ip: 86.62.x.x · widget: en

F — Config

Guardrails you'll actually want.

Five rate limits with sensible defaults protect your model budget. Cost tracking lets you mirror provider pricing so the spend numbers in the admin reflect what you're actually paying.

Per-IP throttling — messages-per-second and active-conversation caps.
Hard limits per conversation — message count and max response tokens.
Memory window — pick how many recent messages the model sees each call.

ask.acme.co/admin/config admin

CONFIG · RATE LIMITS

Guardrails

Reset to defaults

Messages / sec / IP

0.2 msg·s⁻¹

Requests above this rate receive 429.

Active conv. / IP

5 sessions

Simultaneous chat sessions per IP.

Messages / conv.

50 msg

Sealed past the cap; visitor opens a new one.

Memory window

10 turns

Higher = better memory, higher cost.

Max response tokens

1,000 tok

Hard cap on each AI reply.

Input rate (USD / 1M)

$0.40

Used for spend estimates only.

06 — Roadmap

What's coming next.

Planned in no particular order. Want something sooner, or want to build it yourself? Open an issue or a PR on GitHub.

PLANNED

Webhook escalation

Instead of an email, Ask POSTs the full conversation payload to a webhook URL — plugs into n8n, Make, Zapier, or any internal system without a mail relay.

PLANNED

Knowledge base gaps

When the bot can't find enough context to answer, it tags the conversation and records which knowledge was missing — so you know exactly what to add to your docs.

PLANNED

Export conversations to CSV

Download your full conversation history as a CSV for offline analysis, CRM imports, or compliance archiving.

PLANNED

Conversation summaries

The LLM generates a one-paragraph summary when a conversation is escalated, or on demand from the admin — so your support team sees the gist before reading the thread.

PLANNED

Robots.txt adherence

A per-source toggle to make the crawler respect the site's robots.txt rules. Off by default for your own domains; on for anything third-party.

PLANNED

Automatic data deletion

Set a retention period and Ask deletes personal data automatically after that window — making conversations GDPR-proof without manual housekeeping.

Have a feature in mind? Request it, or contribute it directly — it's AGPLv3 and PRs are welcome.

Request a feature Contribute on GitHub

08 — FAQ

Questions we get every other week.

Still stuck?

Check out the docs, or shoot us an email at hello@plainbase.dev.

github.com/plainbase-dev/plainbase-ask

01 Is Plainbase Ask really free?

Yes. The code is AGPLv3-licensed and free forever. The only money that ever leaves your account goes directly to your AI provider — Mistral, OpenAI, Anthropic, or Google — at their usual per-token rates. We never sit in the middle of that bill.

If you'd rather not run the server, a managed hosted option is on the roadmap; sign up for updates from the footer.

03 Which AI provider should I pick?

For most teams, Mistral is the recommended default: cheap embeddings, fast chat, EU-based. OpenAI and Google are good if you already have credits. Anthropic doesn't ship embedding models, so you'll pair it with one of the others for the knowledge base.

You can switch providers from the admin without re-indexing as long as you keep the same embedding model.

04 What happens when the bot doesn't know the answer?

It says so, instead of inventing. The system prompt is locked to "answer only from the indexed sources," and the retrieval threshold is tunable per deployment. When confidence is below the threshold — or when the visitor asks for a human — the widget surfaces a Get help from a human button that opens a ticket and emails the full thread to the address you set in 'Escalation' config.

05 Can I run it without Docker?

Yes. But it's not ideal. The Docker container bundles the Node server and Litestream together, and the SQLite files are stored in a volume on disk. If you run the Node server without Docker, you need to set up Litestream separately to sync the SQLite files to your S3 bucket for backups. It's possible, but Docker Compose is the recommended way to run it for simplicity and ease of management.

06 How do I keep the knowledge base fresh?

Each crawl source has a recheck schedule — daily, weekly, biweekly, monthly, or never. The crawler re-fetches the page, diffs the content, and only re-embeds chunks that actually changed, so the bill stays small.

You can also trigger a recrawl manually from the admin.

07 Does the widget work in other languages?

It does. All labels are translateable from within the admin. Visitors can choose their preferred language when opening the widget, the AI sticks to that language for the duration of the chat.

08 Can I restyle the widget to match my site?

Yes. Add your logo and primary color from the admin. For deeper styling, adjust the base widget files.

Answers from your docs. On your website.

Acme Support

Three moving parts. 5 minutes work.

Feed the knowledge base

Shape the answer

Embed & forget

Six surfaces. One job each.

Upload a file or crawl a whole help center.

Sources

Preset guardrails. With plenty of flexibility.

How the prompt is assembled

System Rules

Your instructions

Knowledge Base

Conversation memory

One script tag. Themed and translated.

Full conversation history. Including AI input-output.

Escalation by email. Send it to any tool.

[Plainbase Ask] New support request from marc@example.com

Guardrails you'll actually want.

Guardrails

Messages / sec / IP

Active conv. / IP

Messages / conv.

Memory window

Max response tokens

Input rate (USD / 1M)

Bring your own key. Switch anytime.

Two SQLite files. Emphasis on 'lite'.

One container. One script tag.

What's coming next.

Five minutes from clone to chat bubble. Or less.

Questions we get every other week.

Still stuck?