Glossary

AI glossary for content assistants

Plain-English definitions of 13,917 AI terms for branded assistant teams.

Plain EnglishRAGLLMs

Start for Free

Search glossary terms

13,917 glossary pages match your filters.

Glossary

13,917 terms. Open one for definitions and related concepts.

Seldon Core

Seldon Core is an open-source platform for deploying ML models on Kubernetes, providing serving, monitoring, and advanced inference capabilities.

Open page

Kedro

Kedro is an open-source Python framework for creating reproducible, maintainable, and modular data science code using software engineering best practices.

Open page

Flyte

Flyte is an open-source workflow orchestration platform designed for ML and data pipelines, providing type-safe, reproducible, and scalable workflow execution.

Open page

Arize AI

Arize AI is an ML observability platform for monitoring model performance in production, detecting data drift, and troubleshooting model degradation.

Open page

WhyLabs

WhyLabs is an AI observability platform built on the open-source whylogs library for profiling and monitoring data and ML model quality in production.

Open page

Mastra

Mastra is a TypeScript framework for building AI applications and agents with built-in support for tool calling, RAG, workflows, and integrations with third-party services.

Open page

Triton Inference Server

Triton Inference Server is NVIDIA's open-source serving platform that deploys models from any framework with dynamic batching, model ensembles, and multi-GPU support.

Open page

Text Generation Inference

Text Generation Inference (TGI) is Hugging Face's production-ready serving solution for LLMs, featuring continuous batching, tensor parallelism, and optimized inference.

Open page

DeepSpeed

DeepSpeed is a deep learning optimization library by Microsoft that enables training of extremely large models through memory-efficient techniques and distributed computing.

Open page

Megatron-LM

Megatron-LM is NVIDIA's framework for training large transformer models using efficient model and pipeline parallelism across GPU clusters.

Open page

Unsloth

Unsloth is a library that makes fine-tuning large language models significantly faster and more memory-efficient through custom CUDA kernels and optimization techniques.

Open page

PEFT

PEFT (Parameter-Efficient Fine-Tuning) is a Hugging Face library implementing techniques like LoRA and adapters that fine-tune large models by updating only a small subset of parameters.

Open page

TRL

TRL (Transformer Reinforcement Learning) is a Hugging Face library for training language models with reinforcement learning from human feedback (RLHF), DPO, and supervised fine-tuning.

Open page

Axolotl

Axolotl is a tool for streamlining LLM fine-tuning with support for multiple model architectures, training techniques, and dataset formats through simple YAML configuration.

Open page

LMQL

LMQL is a query language for large language models that combines natural language prompts with Python logic and output constraints for structured LLM interactions.

Open page

Guidance

Guidance is a programming library by Microsoft for controlling LLM output through interleaved generation and logic, enabling structured and constrained text generation.

Open page

Embedchain

Embedchain (now mem0) is a framework for building RAG applications that automatically handles chunking, embedding, storage, and retrieval from diverse data sources.

Open page

ChromaDB

ChromaDB is an open-source embedding database designed for AI applications, providing simple APIs for storing, searching, and filtering vector embeddings.

Open page

FAISS

FAISS (Facebook AI Similarity Search) is a library for efficient similarity search and clustering of dense vectors, optimized for billion-scale vector operations.

Open page

W&B Weave

W&B Weave is a toolkit from Weights & Biases for building, evaluating, and monitoring LLM applications with tracing, evaluation, and production monitoring.

Open page

Phoenix

Phoenix is an open-source observability tool by Arize for tracing, evaluating, and debugging LLM applications with support for OpenTelemetry-based instrumentation.

Open page

RAGAS

RAGAS is a framework for evaluating retrieval-augmented generation pipelines, providing metrics for faithfulness, answer relevancy, context precision, and context recall.

Open page

LiteLLM

LiteLLM is a lightweight Python library that provides a unified interface for calling 100+ LLM APIs using the OpenAI format, simplifying multi-provider integration.

Open page

Haystack Pipelines

Haystack Pipelines is the core abstraction of the Haystack framework, providing a directed graph system for building composable NLP and LLM application workflows.

Open page

safetensors

safetensors is a file format by Hugging Face for securely storing and loading model tensors, providing fast loading and protection against code execution vulnerabilities.

Open page

GGUF

GGUF (GPT-Generated Unified Format) is a binary file format for storing quantized language models, designed for efficient loading and inference with llama.cpp.

Open page

OpenRouter

OpenRouter is a unified API gateway that provides access to hundreds of AI models from multiple providers through a single OpenAI-compatible endpoint.

Open page

Modal

Modal is a serverless cloud platform for running AI workloads, providing on-demand GPU access, container orchestration, and Python-first infrastructure as code.

Open page

W&B Artifacts

W&B Artifacts is a versioned data and model management system within Weights & Biases for tracking datasets, models, and other ML pipeline outputs.

Open page

Label Studio

Label Studio is an open-source data labeling platform supporting text, image, audio, video, and multi-modal annotation for machine learning projects.

Open page

Prodigy

Prodigy is a commercial annotation tool by Explosion (creators of spaCy) designed for efficient data labeling with active learning and a streamlined annotation workflow.

Open page

Prefect

Prefect is a modern workflow orchestration framework for Python that provides dynamic, code-first pipeline definition with automatic retries, caching, and observability.

Open page

Dagster

Dagster is a data orchestration framework that organizes pipelines around data assets rather than tasks, providing a data-aware approach to workflow management.

Open page

Hugging Face Datasets

Hugging Face Datasets is a library for accessing, processing, and sharing ML datasets with efficient memory-mapped loading and built-in data processing tools.

Open page

Hugging Face Tokenizers

Hugging Face Tokenizers is a fast tokenization library implemented in Rust, providing implementations of popular tokenization algorithms used by modern language models.

Open page

Accelerate

Accelerate is a Hugging Face library that enables PyTorch code to run on any distributed configuration with minimal code changes for multi-GPU and multi-node training.

Open page

whisper.cpp

whisper.cpp is a C/C++ port of OpenAI's Whisper speech recognition model, enabling efficient local audio transcription on CPUs and consumer hardware.

Open page

Stable Diffusion WebUI

Stable Diffusion WebUI (by AUTOMATIC1111) is a browser-based interface for Stable Diffusion with extensive features for image generation, inpainting, and model management.

Open page

ComfyUI

ComfyUI is a node-based visual interface for AI image generation that provides flexible workflow creation through connecting modular processing nodes.

Open page

Diffusers

Diffusers is a Hugging Face library for state-of-the-art diffusion models, providing pretrained pipelines for image, audio, and 3D generation tasks.

Open page

CrewAI Tools

CrewAI Tools is a collection of pre-built tools for CrewAI agents, providing web search, file operations, code execution, and API integration capabilities.

Open page

smolagents

smolagents is a lightweight Hugging Face library for building AI agents that can use tools, write code, and orchestrate multi-step reasoning with minimal complexity.

Open page

AutoGen Studio

AutoGen Studio is a visual interface for building, testing, and deploying multi-agent AI workflows using Microsoft's AutoGen framework without writing code.

Open page

Milvus

Milvus is an open-source vector database designed for scalable similarity search, supporting billions of vectors with high-performance indexing and hybrid search.

Open page

Qdrant

Qdrant is a vector similarity search engine written in Rust, providing fast and scalable vector search with advanced filtering and payload management.

Open page

LangServe

LangServe is a library by LangChain for deploying LangChain chains and agents as REST APIs with automatic documentation, streaming support, and playground UI.

Open page

LangGraph Platform

LangGraph Platform is LangChain's infrastructure for deploying, managing, and scaling stateful AI agent applications built with LangGraph.

Open page

OpenAI SDK

The OpenAI SDK is the official client library for interacting with OpenAI APIs, providing typed interfaces for chat completions, embeddings, assistants, and other AI capabilities.

Open page

Page 121 of 290. Showing 48 of 13,917 matching glossary pages.

Turn owned content into answers

Use InsertChat to launch a branded assistant visitors can ask directly.

Start for Free

7-day free trial · No card required

Interactive FAQ

Try the FAQ like a visitor.

Open product, pricing, security, integration, and free-tool questions in the same chat your visitors use.

InsertChat

Interactive FAQ

Hey. Pick a question below and see how InsertChat turns FAQs into clear, source-backed answers.

Just now

0 of 21 questions explored Instant FAQ answers

Product FAQ

What is InsertChat?

InsertChat is a white-label AI assistant for your website. Train it, brand it, publish it, and learn from visitor questions.

How does InsertChat use my website content?

Connect approved pages, docs, videos, FAQs, policies, and other sources. InsertChat turns them into source-backed answers and next steps.

Can I control the assistant's tone and sources?

Yes. Choose its sources, tone, welcome message, and prompts so it stays on brand.

How does InsertChat stay accurate?

Answers use approved content and source links. Analytics show unclear or missing answers so you can improve coverage.

Can it collect leads or route support questions?

Yes. InsertChat can collect details, qualify intent, add context, and send chats to the right inbox, CRM, workflow, or person.

Can I control how the assistant behaves?

Yes. Control prompts, model choice, tool access, and the branded assistant experience so behavior stays consistent.

Which AI models can I use?

InsertChat supports multiple model providers. Choose each assistant's model for quality, speed, and cost, or use BYOK.

Can I pick different models for different workflows?

Yes. Use a faster model for common questions and a stronger model for complex reasoning. InsertChat supports that balance per conversation.

Where can I deploy an assistant?

Use a widget, embed, full-page assistant, custom domain, in-app embed, or API. Reuse one setup across surfaces.

Do I need coding skills?

No. Build and deploy AI assistants using our visual builder. The embed code is one line of JavaScript.

Can I customize the branding and UI?

Yes. Customize the assistant name, logo, colors, welcome message, suggested prompts, tone, domain, and white-label presentation.

Can I use my own domain?

Yes. Custom domains are supported, typically via enterprise options.

Does InsertChat support voice?

Yes. Voice dictation and text-to-speech let users speak instead of type.

Does InsertChat support vision?

Yes. Enable vision for assistants when images help clarify a request or context.

What tools and integrations are supported?

Zendesk, HubSpot, Shopify, WooCommerce, calendar booking, web search, Perplexity, and webhooks for your own systems.

Can I control which tools the assistant is allowed to use?

Yes. Tool access is controlled per assistant so you enable only what you need.

Can the agent hand off to a human?

Yes. Configure human handoff so the agent escalates when needed. Full conversation history is passed along.

Do you provide analytics?

Yes. Track chats, leads, feedback, top questions, unanswered questions, most-used sources, and content gaps.

Is it mobile friendly?

Yes. The widget and embeds work well on desktop and mobile with no separate experience needed.

What's the fastest path to a successful deployment?

Start with one assistant and a small set of high-value sources. Iterate using real questions from analytics.

What is the fastest way to get started?

Create an account. Connect one key source. Ask a test question, brand the assistant, then publish it on one page.