Models

Pick the model behind the branded assistant

Browse 274 models. Switch for speed, cost, or depth without changing the assistant.

274+ model pagesBYOK supportPer-assistant choice

Try all models free

7-day free trial · No card required

Search model pages

274 model pages match your filters.

Model family

Model library

Browse models

Pick models without changing content, branding, or deployment.

Build with o3

Use o3 in InsertChat for deliberate reasoning, 200K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with o4-mini

Use o4-mini in InsertChat for high-throughput traffic, 200K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Pixtral 12B 2409

Use Pixtral 12B 2409 in InsertChat for balanced production work, 128K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Pixtral Large

Use Pixtral Large in InsertChat for balanced production work, 128K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen3-14B

Use Qwen3-14B in InsertChat for balanced production work, 41.0K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen3 235B A22b Instruct 2507

Use Qwen3 235B A22b Instruct 2507 in InsertChat for balanced production work, 131K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen3-30B-A3B

Use Qwen3-30B-A3B in InsertChat for balanced production work, 41.0K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen 3 32B

Use Qwen 3 32B in InsertChat for balanced production work, 128K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen 3.6 Max Preview

Use Qwen 3.6 Max Preview in InsertChat for flagship capability, 240K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen3 235B A22B Thinking 2507

Use Qwen3 235B A22B Thinking 2507 in InsertChat for deliberate reasoning, 262.1K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen3 235B Thinking

Deploy Qwen3 235B Thinking for extended reasoning in 20+ languages. Visible thought chains for complex multilingual analysis and problem-solving.

Open page

Build with Qwen3 235B

Deploy Qwen3 235B for powerful multilingual AI. Strong performance in Chinese, English, and 20+ languages with competitive reasoning and coding.

Open page

Build with Qwen 3.5 Flash

Use Qwen 3.5 Flash in InsertChat for high-throughput traffic, 1M-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen 3.5 Plus

Use Qwen 3.5 Plus in InsertChat for balanced production work, 1M-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen 3.6 Plus

Use Qwen 3.6 Plus in InsertChat for balanced production work, 1M-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen 3 Coder 30B A3B Instruct

Use Qwen 3 Coder 30B A3B Instruct in InsertChat for coding-heavy work, 262.1K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen3 Coder Next

Use Qwen3 Coder Next in InsertChat for coding-heavy work, 256K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen3 Coder Plus

Use Qwen3 Coder Plus in InsertChat for coding-heavy work, 1M-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen3 Coder 480B A35B Instruct

Use Qwen3 Coder 480B A35B Instruct in InsertChat for coding-heavy work, 262.1K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen3 Max Preview

Use Qwen3 Max Preview in InsertChat for flagship capability, 262.1K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen 3 Max Thinking

Use Qwen 3 Max Thinking in InsertChat for deliberate reasoning, 256K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen3 Max

Use Qwen3 Max in InsertChat for flagship capability, 262.1K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen3 Next 80B A3B Instruct

Use Qwen3 Next 80B A3B Instruct in InsertChat for balanced production work, 262.1K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Build with Qwen3 Next 80B A3B Thinking

Use Qwen3 Next 80B A3B Thinking in InsertChat for deliberate reasoning, 131.1K-token context window, and a grounded route that keeps setup, comparison, and review in one place.

Open page

Page 10 of 12. Showing 24 of 274 matching model pages.

Outcomes

What you get

Balance quality, speed, and budget without rebuilding setup.

Keep the same sources, tools, and guardrails across models
Route simple questions to faster, cheaper models
Ground every model in your owned content
Keep the visitor experience branded while teams choose the model

More to explore

Multiple AI Models Multi-Model Feature

Interactive FAQ

Try the FAQ like a visitor.

Open product, pricing, security, integration, and free-tool questions in the same chat your visitors use.

InsertChat

Interactive FAQ

Hey. Pick a question below and see how InsertChat turns FAQs into clear, source-backed answers.

Just now

0 of 21 questions explored Instant FAQ answers

Product FAQ

What is InsertChat?

InsertChat is a white-label AI assistant for your website. Train it, brand it, publish it, and learn from visitor questions.

How does InsertChat use my website content?

Connect approved pages, docs, videos, FAQs, policies, and other sources. InsertChat turns them into source-backed answers and next steps.

Can I control the assistant's tone and sources?

Yes. Choose its sources, tone, welcome message, and prompts so it stays on brand.

How does InsertChat stay accurate?

Answers use approved content and source links. Analytics show unclear or missing answers so you can improve coverage.

Can it collect leads or route support questions?

Yes. InsertChat can collect details, qualify intent, add context, and send chats to the right inbox, CRM, workflow, or person.

Can I control how the assistant behaves?

Yes. Control prompts, model choice, tool access, and the branded assistant experience so behavior stays consistent.

Which AI models can I use?

InsertChat supports multiple model providers. Choose each assistant's model for quality, speed, and cost, or use BYOK.

Can I pick different models for different workflows?

Yes. Use a faster model for common questions and a stronger model for complex reasoning. InsertChat supports that balance per conversation.

Where can I deploy an assistant?

Use a widget, embed, full-page assistant, custom domain, in-app embed, or API. Reuse one setup across surfaces.

Do I need coding skills?

No. Build and deploy AI assistants using our visual builder. The embed code is one line of JavaScript.

Can I customize the branding and UI?

Yes. Customize the assistant name, logo, colors, welcome message, suggested prompts, tone, domain, and white-label presentation.

Can I use my own domain?

Yes. Custom domains are supported, typically via enterprise options.

Does InsertChat support voice?

Yes. Voice dictation and text-to-speech let users speak instead of type.

Does InsertChat support vision?

Yes. Enable vision for assistants when images help clarify a request or context.

What tools and integrations are supported?

Zendesk, HubSpot, Shopify, WooCommerce, calendar booking, web search, Perplexity, and webhooks for your own systems.

Can I control which tools the assistant is allowed to use?

Yes. Tool access is controlled per assistant so you enable only what you need.

Can the agent hand off to a human?

Yes. Configure human handoff so the agent escalates when needed. Full conversation history is passed along.

Do you provide analytics?

Yes. Track chats, leads, feedback, top questions, unanswered questions, most-used sources, and content gaps.

Is it mobile friendly?

Yes. The widget and embeds work well on desktop and mobile with no separate experience needed.

What's the fastest path to a successful deployment?

Start with one assistant and a small set of high-value sources. Iterate using real questions from analytics.

What is the fastest way to get started?

Create an account. Connect one key source. Ask a test question, brand the assistant, then publish it on one page.