AI models in InsertChat

GPT-5.2, Claude Sonnet 4.5, Gemini 3.0, Llama 4, Grok 4.1, and DeepSeek V3.2 — all in one workspace. Switch models per conversation, use fast models for FAQs and premium models for complex reasoning. All models share the same knowledge base, tools, and history. BYOK supported.

10+ Models BYOK Support Switch Per Chat
Try all models free

7-day free trial · Cancel anytime · No commitment

Browse models
available today

Pick the model per chat and keep your agent configuration stable.

OpenAI GPT models in InsertChat

Deploy GPT-5.2 and GPT-4.1 agents grounded in your knowledge base. Fast responses, advanced reasoning, and multi-channel deployment without coding.

Claude models in InsertChat

Create AI agents with Anthropic's Claude Sonnet 4.5, Haiku 4.5, and Opus 4.6. Superior reasoning, extended context, and nuanced responses grounded in your documents.

Gemini models in InsertChat

Deploy Gemini 3.0 Flash and Gemini 3.0 Pro agents with multimodal capabilities. Process images, PDFs, and text with cost-efficient, fast AI grounded in your data.

Llama models in InsertChat

Build AI agents with Meta's Llama 4 Maverick and Llama 4 Scout. Open-source models with full data privacy, BYOK support, and enterprise-ready performance for your knowledge base.

Grok models in InsertChat

Deploy xAI's Grok 4.1 Fast Instant and Grok 4.1 Fast Thinking for AI agents with real-time knowledge and 2M token context. Get up-to-date responses grounded in your documents.

Gemini 3.0 Flash in InsertChat

Build agents with Google's Gemini 3.0 Flash. Get fast, cost-efficient multimodal responses grounded in your knowledge base. No coding required.

Gemini 3.0 Pro in InsertChat

Deploy Gemini 3.0 Pro agents with advanced reasoning and multimodal understanding. Process images, documents, and text grounded in your data.

Nano Banana in InsertChat

Generate images directly in your AI agent with Nano Banana. Create visuals on the fly during conversations, grounded in your brand context.

Nano Banana Pro in InsertChat

Generate high-fidelity images with Nano Banana Pro. Premium quality visuals in your AI agent for product imagery, marketing assets, and creative workflows.

Claude Sonnet 4.5 in InsertChat

Deploy Claude Sonnet 4.5 agents with balanced speed and intelligence. Ideal for everyday tasks, document analysis, and customer-facing conversations.

Claude Haiku 4.5 in InsertChat

Build agents with Claude Haiku 4.5 for fast, lightweight AI. Ideal for high-volume support, quick lookups, and cost-sensitive deployments.

Claude Opus 4.6 in InsertChat

Deploy Claude Opus 4.6 for maximum reasoning power. Ideal for complex analysis, research workflows, and tasks requiring deep understanding.

GPT-5.2 Instant Chat in InsertChat

Build real-time conversational AI with GPT-5.2 Instant Chat. Fast, fluent responses for customer support, sales, and internal workflows.

GPT-5.2 Reasoning in InsertChat

Deploy GPT-5.2 Reasoning for multi-step logic, analysis, and complex problem-solving. Ideal for research, planning, and data-heavy workflows.

GPT-5.2 Pro in InsertChat

Deploy GPT-5.2 Pro for premium AI performance. The most capable OpenAI model for enterprise tasks, deep research, and mission-critical workflows.

GPT-4o in InsertChat

Build agents with GPT-4o for fast, multimodal AI. Process text, images, and documents with reliable outputs grounded in your knowledge base.

GPT-4o Mini in InsertChat

Deploy GPT-4o Mini for lightweight, affordable AI agents. Ideal for high-volume support, simple queries, and cost-sensitive use cases.

GPT-4.1 in InsertChat

Build agents with GPT-4.1 for reliable instruction-following and strong coding capabilities. Ideal for structured workflows and tool-heavy agents.

GPT-4.1 Mini in InsertChat

Deploy GPT-4.1 Mini for compact, efficient AI agents. Good instruction-following at lower cost for everyday support and workflow automation.

GPT-4.1 Nano in InsertChat

Build ultra-lightweight agents with GPT-4.1 Nano. The smallest OpenAI model for simple lookups, FAQs, and high-volume, low-cost deployments.

GPT-OSS in InsertChat

Deploy OpenAI's open-source GPT-OSS models (20B and 120B parameters) in InsertChat. Transparent, inspectable AI for teams that value openness.

Codex 5.1 in InsertChat

Build code-generating AI agents with OpenAI Codex 5.1. Produce clean code, explain technical concepts, and automate development workflows in your agent.

Codex 5.1 Max in InsertChat

Deploy Codex 5.1 Max for the most capable code generation. Handle complex multi-file tasks, architecture decisions, and advanced programming challenges.

Codex 5.1 Mini in InsertChat

Build lightweight code-generating agents with Codex 5.1 Mini. Fast code snippets, explanations, and simple automations at lower cost.

DeepSeek V3.2 in InsertChat

Build AI agents with DeepSeek V3.2 for cost-efficient reasoning. Strong performance on coding, math, and analysis tasks at competitive pricing.

DeepSeek V3.2 Thinking in InsertChat

Deploy DeepSeek V3.2 Thinking for extended reasoning with visible thought chains. Ideal for complex problems requiring step-by-step deliberation.

Llama 4 Maverick in InsertChat

Deploy Llama 4 Maverick for open-source AI with strong multilingual and reasoning capabilities. Full data privacy with enterprise-grade performance.

Llama 4 Scout in InsertChat

Build agents with Llama 4 Scout for lightweight, open-source AI. Fast responses and strong efficiency for everyday tasks with full transparency.

Grok 4.1 Fast Instant in InsertChat

Deploy Grok 4.1 Fast Instant for ultra-fast, non-reasoning conversations. 2M token context with real-time speed for support and Q&A.

Grok 4.1 Fast Thinking in InsertChat

Deploy Grok 4.1 Fast Thinking for reasoning capabilities with fast response times. 2M token context with step-by-step problem solving.

Kimi K2 in InsertChat

Deploy Kimi K2 from Moonshot AI for strong multilingual performance and competitive reasoning. A versatile model for global teams and diverse use cases.

Kimi K2 Thinking in InsertChat

Deploy Kimi K2 Thinking for extended reasoning with visible thought chains. Ideal for complex multilingual analysis and deliberate problem-solving.

MiniMax M2.1 in InsertChat

Build AI agents with MiniMax M2.1 for versatile performance across text, reasoning, and creative tasks. Strong multilingual support.

Qwen3 235B in InsertChat

Deploy Qwen3 235B for powerful multilingual AI. Strong performance in Chinese, English, and 20+ languages with competitive reasoning and coding.

Qwen3 235B Thinking in InsertChat

Deploy Qwen3 235B Thinking for extended reasoning in 20+ languages. Visible thought chains for complex multilingual analysis and problem-solving.

GLM 4.7 in InsertChat

Deploy GLM 4.7 for strong Chinese-English bilingual AI. Competitive reasoning and generation for teams serving Chinese-speaking audiences.

GLM 4.6 Visual in InsertChat

Deploy GLM 4.6 Visual for image understanding and visual Q&A. Analyze screenshots, documents, and images in your AI agent conversations.

Mistral Nemo in InsertChat

Build agents with Mistral Nemo for European open-source AI. Lightweight, fast, and privacy-friendly with strong multilingual capabilities.

What teams get
with multi-model access

Balance quality, speed, and budget without rebuilding setup.

  • badge 13Your agent behaves the same regardless of which model powers it — same knowledge base, same tools, same guardrails
  • badge 13Route simple questions to a fast, cheap model and complex ones to a premium model — you control the cost per conversation
  • badge 13Every model answers from your actual content, not its general training data — reducing hallucinations across the board
  • badge 13One workspace replaces separate subscriptions to OpenAI, Anthropic, and Google — your team accesses everything in one place

Related Pages

Questions & Answers

Frequently asked questions

Tap any question to see how InsertChat would respond.

Contact support

InsertChat

AI Support

Hey! 👋 Browsing Product questions. Tap any to get instant answers.

Just now

What is InsertChat?

An AI agent workspace that lets you build agents grounded in your knowledge and deploy them to web, app, or API. Connect tools and integrations to complete workflows.

What's the difference between an agent and an InsertChat agent?

A basic agent is prompt-only. InsertChat agents are grounded in your sources, configurable per use case, and able to use tools and integrations.

How do agents stay accurate and avoid hallucinations?

Ground your agent in a knowledge base your team controls and keep it fresh. Use analytics to find gaps and improve coverage over time.

What can I connect as knowledge?

URLs, sitemaps, documents (PDF and office files), media like YouTube and audio, and structured data. The goal is a clear source of truth for answers.

Do sources stay up to date?

Yes. Refresh sources on demand or set up scheduled refresh depending on the source type.

Which AI models can I use?

GPT-5.2, Claude Sonnet 4.5, Gemini 3.0, Llama 4, Grok 4.1, DeepSeek V3.2, and more. Choose the model per chat, or use BYOK to manage provider access yourself.

Can I pick different models for different workflows?

Yes. Use a faster model for common questions and a stronger model for complex reasoning. InsertChat supports that balance per conversation.

Where can I deploy an agent?

Website widget, in-app embed, or API. Keep one agent setup and reuse it across channels.

Do I need coding skills?

No. Build and deploy AI agents using our visual builder. The embed code is one line of JavaScript.

Can I customize the branding and UI?

Yes. Customize the widget to match your brand. White-label options are available for a fully branded experience.

Does InsertChat support voice?

Yes. Voice dictation and text-to-speech let users speak instead of type.

Does InsertChat support vision?

Yes. Enable vision for agents when images help clarify a request or context.

Can the agent hand off to a human?

Yes. Configure human handoff so the agent escalates when needed. Full conversation history is passed along.

Do you provide analytics?

Yes. Track chats, leads, feedback, and credits used. Find gaps in coverage and prioritize fixes.

Is it mobile friendly?

Yes. The widget and embeds work well on desktop and mobile with no separate experience needed.

What is the fastest way to get started?

Create an account, upload one document, and ask your first question. Most teams go live in under 5 minutes.

0 of 16 questions explored Instant replies

Product FAQ

What is InsertChat?

An AI agent workspace that lets you build agents grounded in your knowledge and deploy them to web, app, or API. Connect tools and integrations to complete workflows.

What's the difference between an agent and an InsertChat agent?

A basic agent is prompt-only. InsertChat agents are grounded in your sources, configurable per use case, and able to use tools and integrations.

How do agents stay accurate and avoid hallucinations?

Ground your agent in a knowledge base your team controls and keep it fresh. Use analytics to find gaps and improve coverage over time.

What can I connect as knowledge?

URLs, sitemaps, documents (PDF and office files), media like YouTube and audio, and structured data. The goal is a clear source of truth for answers.

Do sources stay up to date?

Yes. Refresh sources on demand or set up scheduled refresh depending on the source type.

Which AI models can I use?

GPT-5.2, Claude Sonnet 4.5, Gemini 3.0, Llama 4, Grok 4.1, DeepSeek V3.2, and more. Choose the model per chat, or use BYOK to manage provider access yourself.

Can I pick different models for different workflows?

Yes. Use a faster model for common questions and a stronger model for complex reasoning. InsertChat supports that balance per conversation.

Where can I deploy an agent?

Website widget, in-app embed, or API. Keep one agent setup and reuse it across channels.

Do I need coding skills?

No. Build and deploy AI agents using our visual builder. The embed code is one line of JavaScript.

Can I customize the branding and UI?

Yes. Customize the widget to match your brand. White-label options are available for a fully branded experience.

Does InsertChat support voice?

Yes. Voice dictation and text-to-speech let users speak instead of type.

Does InsertChat support vision?

Yes. Enable vision for agents when images help clarify a request or context.

Can the agent hand off to a human?

Yes. Configure human handoff so the agent escalates when needed. Full conversation history is passed along.

Do you provide analytics?

Yes. Track chats, leads, feedback, and credits used. Find gaps in coverage and prioritize fixes.

Is it mobile friendly?

Yes. The widget and embeds work well on desktop and mobile with no separate experience needed.

What is the fastest way to get started?

Create an account, upload one document, and ask your first question. Most teams go live in under 5 minutes.

Pricing FAQ

What do I pay for with InsertChat?

Pricing is based on how many agents you run, what knowledge sources you connect, and how much conversation usage you drive. Check the pricing page for current tiers.

How much does InsertChat cost?

Plans start at $29/month. Verify the latest pricing and included limits on the pricing page.

Is pricing per seat or per teammate?

Pricing is oriented around agents, sources, and usage rather than seats. Enterprise plans are available for larger teams.

Can I start small and upgrade later?

Yes. Start with self-serve, validate your use case, then scale up as needed.

Can I cancel anytime?

Yes. Cancel anytime with no long-term contract. Your data remains available for 30 days after cancellation.

Do you offer enterprise pricing?

Yes. Enterprise plans cover larger orgs, advanced requirements, and custom deployment needs.

Do you support annual billing?

Yes. Toggle to annual billing on the pricing page and save 20%. For invoicing or procurement workflows, contact us.

What happens if we hit limits?

You will get a notification before you hit a limit. Upgrade your plan anytime with one click, or reduce usage. Nothing stops working without warning.

What counts as a source?

A source is any connected item your agent learns from: a URL, document, YouTube link, or other knowledge input. Your plan determines how many you can connect.

What are credits?

Credits budget your usage across conversations, sources, and tools. They keep costs predictable as you scale.

Can I control which models we use so costs do not spike?

Yes. Choose the model per chat to balance quality, speed, and budget for different workflows.

What is BYOK?

Bring Your Own Key. Use your own provider API key for model access to consolidate billing or apply your own setup.

Can I use my own logo and domain on the $29 plan?

Yes. The $29 plan includes your own logo and custom domain.

Can I test before I commit?

Start self-serve to validate your workflow. For guided proof-of-concept or enterprise requirements, contact us.

Do you have discounts for startups or nonprofits?

If pricing is a blocker, contact us with your context.

How do I start?

Sign up for a 7-day free trial with full access. Pick your plan after you see it working with your own content.

Security FAQ

Where is my data stored?

European servers. GDPR compliant, never used for training, and deletable at any time.

What gets sent to AI model providers?

Your prompt and relevant context excerpts from connected sources are sent to the selected model provider to generate an answer.

Do you use our data to train models?

No. InsertChat never uses your data to train models.

Is my data isolated from other customers?

Yes. Data is scoped to your workspace and agents. Sources and conversations remain isolated.

Can I delete data?

Yes. Delete sources, conversation history, leads, and feedback at any time.

What data does InsertChat store?

Agent configuration, connected knowledge sources, and conversation data needed for the experience and analytics.

Can I keep an agent private?

Yes. Choose public or private agents depending on whether anyone or only authenticated users can access the embed.

Do you have role-based access controls?

Yes. Control who can manage agents and data with role-based access.

Can I restrict what the agent can do?

Yes. Control tool enablement per agent to limit actions to only what is necessary.

Do you support GDPR?

Yes. Full GDPR compliance with Data Processing Addendum (DPA) available on request.

Can you provide a DPA?

Yes. Our DPA covers processing obligations, subprocessors, and deletion/return terms. Contact us to request it.

Do you list subprocessors?

Yes. Subprocessors are documented in the DPA. Request it or contact us for details.

How do you handle security questionnaires?

Contact us and we provide the right documentation for your team's review process.

Is InsertChat safe to embed on a public website?

Yes, when configured correctly. Ground answers in approved sources and keep tool access controlled.

Do you support self-hosting?

Yes. Enterprise plans include self-hosting and bring-your-own-LLM options.

How do I evaluate InsertChat?

Start a free trial with non-sensitive data. When ready, request our security questionnaire and DPA.

Integrations FAQ

What integrations are available?

600+ integrations including Slack, Notion, Google Workspace, Salesforce, HubSpot, Zendesk, Shopify, WooCommerce, and Zapier. Our REST API allows custom integrations with any system.

Can I connect to Slack?

Yes. Deploy your agent directly to Slack so your team can interact in channels or DMs.

Do you integrate with HubSpot?

Yes. Sync leads, contacts, and conversation data directly into HubSpot.

Can I use InsertChat with Zendesk?

Yes. Ticket creation, handoffs, and syncing support conversations are all supported.

Do you support Shopify?

Yes. Your agent can answer product questions, check order status, and assist with common e-commerce queries.

What about WooCommerce?

Yes. WooCommerce works similarly to Shopify with access to product catalogs and order information.

Can I connect Google Workspace?

Yes. Connect Google Drive, Docs, and other Workspace tools as knowledge sources.

Do you have a Zapier integration?

Yes. Connect InsertChat with thousands of apps via Zapier to automate workflows and sync data.

Can the agent search the web?

Yes. Enable web search so the agent can find current information beyond your knowledge base.

Do you support calendar booking?

Yes. The agent can schedule meetings directly during conversations.

Can I use webhooks?

Yes. Send events to your own systems for custom integrations and real-time notifications.

Do you have an API?

Yes. Full REST API for creating agents, managing sources, and interacting with conversations programmatically.

Can I install it with Google Tag Manager?

Yes. Install via script embed or Google Tag Manager.

Can I embed it in my product?

Yes. Use in-app embeds for a native feel, or the API to build a custom interface.

Do you support custom SMTP?

Yes. Custom domain and SMTP options are available so outbound messaging aligns with your infrastructure.

How do I connect my first integration?

Start your trial, go to Settings > Integrations, and connect in one click. 600+ apps available.