Glossary

AI Glossary

Plain-English definitions of 13,917 AI and agent terms. Understand RAG, embeddings, LLMs, and the technology behind modern AI assistants.

Clear definitions of the terms you'll encounter when building AI agents — from RAG and embeddings to prompt engineering and fine-tuning. Each entry explains what it means and why it matters for your business.

Search glossary terms

13,917 glossary pages match the current directory state.

Category

Browse by letter

Supervised Learning

Supervised learning is a machine learning approach where models learn from labeled training data, mapping inputs to known correct outputs.

Unsupervised Learning

Unsupervised learning is a machine learning approach where models find patterns and structures in data without labeled examples or predefined outputs.

Semi-Supervised Learning

Semi-supervised learning combines a small amount of labeled data with a large amount of unlabeled data to improve model performance beyond what either could achieve alone.

Reinforcement Learning

Reinforcement learning trains AI agents to make sequential decisions by rewarding desired behaviors and penalizing undesired ones through interaction with an environment.

One-Shot Learning

One-shot learning enables models to learn new concepts from a single example, commonly used in face recognition and image classification tasks.

Active Learning

Active learning is a strategy where the model selects which data points should be labeled next, focusing human annotation effort on the most informative examples.

Federated Learning

Federated learning trains AI models across multiple devices or organizations without sharing raw data, preserving privacy while enabling collaborative model improvement.

Curriculum Learning

Curriculum learning trains models by presenting training examples in a meaningful order, typically from easy to hard, mimicking how humans learn progressively.

Online Learning

Online learning updates the model incrementally as each new data point arrives, rather than training on the entire dataset at once.

Batch Learning

Batch learning trains models on the entire dataset at once, as opposed to online learning which processes examples incrementally.

Domain Adaptation

Domain adaptation transfers a model trained on one data distribution (source domain) to work effectively on a different but related distribution (target domain).

Classification

Classification is a supervised learning task where the model predicts which category or class an input belongs to, such as spam detection or image recognition.

Regression

Regression is a supervised learning task where the model predicts a continuous numerical value, such as price, temperature, or probability.

Clustering

Clustering is an unsupervised learning task that groups similar data points together without predefined labels, discovering natural structures in data.

Time Series Forecasting

Time series forecasting predicts future values based on historical temporal data patterns, used for demand planning, financial analysis, and resource allocation.

Random Forest

Random forest is an ensemble method that combines predictions from many decision trees trained on random subsets of data and features for more accurate, robust predictions.

Gradient Boosting

Gradient boosting builds an ensemble of decision trees sequentially, where each new tree corrects the errors of the previous ones, achieving state-of-the-art results on tabular data.

Support Vector Machine

A support vector machine finds the optimal hyperplane that separates classes with the maximum margin, effective for high-dimensional classification tasks.

K-Nearest Neighbors

K-nearest neighbors classifies data points based on the majority class among their k closest neighbors in feature space, a simple but effective non-parametric method.

Naive Bayes

Naive Bayes is a probabilistic classifier that applies Bayes theorem with a naive independence assumption between features, effective for text classification.

K-Means

K-means is a clustering algorithm that partitions data into k groups by iteratively assigning points to the nearest centroid and updating centroids.

DBSCAN

DBSCAN is a density-based clustering algorithm that groups together closely packed points and identifies outliers as points in low-density regions.

Gaussian Mixture Model

A Gaussian mixture model represents data as a combination of multiple Gaussian distributions, providing probabilistic soft clustering with cluster membership probabilities.

Hidden Markov Model

A hidden Markov model is a probabilistic model for sequential data where the system transitions between hidden states that generate observable outputs.

Bayesian Network

A Bayesian network is a probabilistic graphical model that represents variables and their conditional dependencies as a directed acyclic graph.

Genetic Algorithm

A genetic algorithm is an optimization method inspired by natural evolution that evolves a population of solutions through selection, crossover, and mutation.

AdaBoost

AdaBoost is an ensemble method that combines multiple weak classifiers by weighting them based on their accuracy and focusing on hard-to-classify examples.

Isolation Forest

Isolation forest is an anomaly detection algorithm that identifies outliers as data points that are easy to isolate through random partitioning.

Expectation Maximization

Expectation Maximization is an iterative algorithm for finding maximum likelihood parameters in models with latent variables, used to train Gaussian mixture models and HMMs.

Autoencoders

Autoencoders are neural networks that learn compressed data representations by training to reconstruct their inputs, used for dimensionality reduction and anomaly detection.

Training Set

The training set is the portion of data used to train a machine learning model, from which the model learns patterns and relationships.

Validation Set

The validation set is data held out during training to tune hyperparameters and monitor for overfitting, guiding model selection decisions.

Test Set

The test set is data held out completely during training and validation, used only once for final unbiased evaluation of model performance.

Cross-Validation

Cross-validation is a model evaluation technique that partitions data into multiple folds, training and testing on different splits to get a robust performance estimate.

Class Imbalance

Class imbalance occurs when training data has significantly more examples of some classes than others, causing models to be biased toward the majority class.

Oversampling

Oversampling increases the number of minority class examples in a training set by duplicating or generating synthetic examples to address class imbalance.

SMOTE

SMOTE (Synthetic Minority Over-sampling Technique) creates synthetic training examples for the minority class by interpolating between existing minority samples.

Data Preprocessing

Data preprocessing transforms raw data into a clean, structured format suitable for machine learning, including cleaning, normalization, and feature engineering.

Feature Engineering

Feature engineering creates new input variables from raw data to improve model performance, leveraging domain knowledge to extract predictive signals.

Feature Selection

Feature selection identifies and keeps the most relevant input features while removing irrelevant or redundant ones to improve model performance and reduce complexity.

Feature Importance

Feature importance measures how much each input feature contributes to a model's predictions, helping understand which factors drive outcomes.

Normalization

Normalization scales numerical features to a standard range, typically 0 to 1, ensuring no single feature dominates due to its scale.

Standardization

Standardization transforms features to have zero mean and unit standard deviation, making them comparable regardless of original scale and distribution.

One-Hot Encoding

One-hot encoding converts categorical variables into binary vectors where each category becomes a separate binary feature with a value of 0 or 1.

SHAP Values

SHAP values explain individual predictions by attributing the contribution of each feature based on Shapley values from cooperative game theory.

Adam Optimizer

Adam is an adaptive learning rate optimizer that combines momentum and RMSprop to efficiently train deep learning models with per-parameter learning rates.

Cross-Entropy Loss

Cross-entropy loss measures the difference between predicted probability distributions and true labels, the standard loss function for classification tasks.

Mean Squared Error

Mean squared error measures the average squared difference between predicted and actual values, the standard loss function for regression tasks.

Page 1 of 290. Showing 48 of 13,917 matching glossary pages.

Ready to build your AI agent?

Now that you understand the terminology, see how InsertChat puts it all together.

7-day free trial · No charge during trial

Questions & answers

Frequently asked questions

Tap any question to see how InsertChat would respond.

Contact support
InsertChat

InsertChat

Product FAQ

InsertChat

Hey! 👋 Browsing Product questions. Tap any to get instant answers.

Just now

What is InsertChat?

An AI agent workspace that lets you build agents grounded in your knowledge and deploy them to web, app, or API. Connect tools and integrations to complete workflows.

What's the difference between an agent and an InsertChat agent?

A basic agent is prompt-only. InsertChat agents are grounded in your sources, configurable per use case, and able to use tools and integrations.

How do agents stay accurate and avoid hallucinations?

Ground your agent in a knowledge base your team controls and keep it fresh. Use analytics to find gaps and improve coverage over time.

What can I connect as knowledge?

URLs, sitemaps, documents (PDF and office files), media like YouTube and audio, and structured data. The goal is a clear source of truth for answers.

Do sources stay up to date?

Yes. Refresh sources on demand or set up scheduled refresh depending on the source type.

Can I control how the agent behaves?

Yes. Control prompts, model choice, tool access, and agent experience so behavior stays consistent.

Which AI models can I use?

GPT-5.2, Claude Sonnet 4.5, Gemini 3.0, Llama 4, Grok 4.1, DeepSeek V3.2, and more. Choose the model per chat, or use BYOK to manage provider access yourself.

Can I pick different models for different workflows?

Yes. Use a faster model for common questions and a stronger model for complex reasoning. InsertChat supports that balance per conversation.

Where can I deploy an agent?

Website widget, in-app embed, or API. Keep one agent setup and reuse it across channels.

Do I need coding skills?

No. Build and deploy AI agents using our visual builder. The embed code is one line of JavaScript.

Can I customize the branding and UI?

Yes. Customize the widget to match your brand. White-label options are available for a fully branded experience.

Can I use my own domain?

Yes. Custom domains are supported, typically via enterprise options.

Does InsertChat support voice?

Yes. Voice dictation and text-to-speech let users speak instead of type.

Does InsertChat support vision?

Yes. Enable vision for agents when images help clarify a request or context.

What tools and integrations are supported?

Zendesk, HubSpot, Shopify, WooCommerce, calendar booking, web search, Perplexity, and webhooks for your own systems.

Can I control which tools the agent is allowed to use?

Yes. Tool access is controlled per agent so you enable only what you need.

Can the agent hand off to a human?

Yes. Configure human handoff so the agent escalates when needed. Full conversation history is passed along.

Do you provide analytics?

Yes. Track chats, leads, feedback, and credits used. Find gaps in coverage and prioritize fixes.

Is it mobile friendly?

Yes. The widget and embeds work well on desktop and mobile with no separate experience needed.

What's the fastest path to a successful deployment?

Start with one agent and a small set of high-value sources. Iterate using real questions from analytics.

What is the fastest way to get started?

Create an account, upload one document, and ask your first question. Most teams go live in under 5 minutes.

0 of 21 questions explored Instant replies

Product FAQ

What is InsertChat?

An AI agent workspace that lets you build agents grounded in your knowledge and deploy them to web, app, or API. Connect tools and integrations to complete workflows.

What's the difference between an agent and an InsertChat agent?

A basic agent is prompt-only. InsertChat agents are grounded in your sources, configurable per use case, and able to use tools and integrations.

How do agents stay accurate and avoid hallucinations?

Ground your agent in a knowledge base your team controls and keep it fresh. Use analytics to find gaps and improve coverage over time.

What can I connect as knowledge?

URLs, sitemaps, documents (PDF and office files), media like YouTube and audio, and structured data. The goal is a clear source of truth for answers.

Do sources stay up to date?

Yes. Refresh sources on demand or set up scheduled refresh depending on the source type.

Can I control how the agent behaves?

Yes. Control prompts, model choice, tool access, and agent experience so behavior stays consistent.

Which AI models can I use?

GPT-5.2, Claude Sonnet 4.5, Gemini 3.0, Llama 4, Grok 4.1, DeepSeek V3.2, and more. Choose the model per chat, or use BYOK to manage provider access yourself.

Can I pick different models for different workflows?

Yes. Use a faster model for common questions and a stronger model for complex reasoning. InsertChat supports that balance per conversation.

Where can I deploy an agent?

Website widget, in-app embed, or API. Keep one agent setup and reuse it across channels.

Do I need coding skills?

No. Build and deploy AI agents using our visual builder. The embed code is one line of JavaScript.

Can I customize the branding and UI?

Yes. Customize the widget to match your brand. White-label options are available for a fully branded experience.

Can I use my own domain?

Yes. Custom domains are supported, typically via enterprise options.

Does InsertChat support voice?

Yes. Voice dictation and text-to-speech let users speak instead of type.

Does InsertChat support vision?

Yes. Enable vision for agents when images help clarify a request or context.

What tools and integrations are supported?

Zendesk, HubSpot, Shopify, WooCommerce, calendar booking, web search, Perplexity, and webhooks for your own systems.

Can I control which tools the agent is allowed to use?

Yes. Tool access is controlled per agent so you enable only what you need.

Can the agent hand off to a human?

Yes. Configure human handoff so the agent escalates when needed. Full conversation history is passed along.

Do you provide analytics?

Yes. Track chats, leads, feedback, and credits used. Find gaps in coverage and prioritize fixes.

Is it mobile friendly?

Yes. The widget and embeds work well on desktop and mobile with no separate experience needed.

What's the fastest path to a successful deployment?

Start with one agent and a small set of high-value sources. Iterate using real questions from analytics.

What is the fastest way to get started?

Create an account, upload one document, and ask your first question. Most teams go live in under 5 minutes.