AI Glossary

Plain-English definitions of 3,918 AI and agent terms. Understand RAG, embeddings, LLMs, and the technology behind modern AI assistants.

Clear definitions of the terms you'll encounter when building AI agents — from RAG and embeddings to prompt engineering and fine-tuning. Each entry explains what it means and why it matters for your business.

A

Active Learning

Active learning is a strategy where the model selects which data points should be labeled next, focusing human annotation effort on the most informative examples.
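
As a minimal sketch, the common uncertainty-sampling variant picks the example the model is least confident about; the probabilities below are invented for illustration:

```python
# Uncertainty sampling: select the unlabeled example whose predicted
# probability is closest to 0.5, i.e. nearest the decision boundary.

def most_uncertain(probabilities):
    """Return the index of the example the model is least sure about."""
    return min(range(len(probabilities)),
               key=lambda i: abs(probabilities[i] - 0.5))

# Hypothetical model confidence for four unlabeled examples:
probs = [0.95, 0.52, 0.10, 0.80]
pick = most_uncertain(probs)  # index 1 is sent for human labeling
```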

Anomaly Detection

Anomaly detection identifies unusual data points or patterns that deviate significantly from expected behavior, used for fraud detection and system monitoring.
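
A minimal sketch of the classic z-score approach, assuming readings that cluster around a stable baseline (values invented for illustration):

```python
import statistics

def is_anomaly(x, baseline, threshold=3.0):
    """Flag x if it lies more than `threshold` std devs from the baseline mean."""
    mean = statistics.mean(baseline)
    stdev = statistics.stdev(baseline)
    return abs(x - mean) / stdev > threshold

# Normal sensor readings hover around 10.0:
normal = [10.1, 9.8, 10.0, 10.2, 9.9, 10.0]
flagged = is_anomaly(42.0, normal)  # far outside the baseline -> True
```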

AdaBoost

AdaBoost is an ensemble method that combines multiple weak classifiers by weighting them based on their accuracy and focusing on hard-to-classify examples.
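
The heart of the algorithm is the weight update after each weak learner: misclassified examples are upweighted so the next learner focuses on them. A sketch of one round under the standard formulation, with a toy four-example dataset:

```python
import math

def reweight(weights, correct, error):
    """AdaBoost update: compute learner weight alpha, then upweight mistakes."""
    alpha = 0.5 * math.log((1 - error) / error)   # accurate learners get larger alpha
    new = [w * math.exp(-alpha if ok else alpha)  # shrink correct, grow incorrect
           for w, ok in zip(weights, correct)]
    total = sum(new)
    return alpha, [w / total for w in new]        # renormalize to sum to 1

# Equal starting weights; the learner misclassified only example 3:
alpha, w = reweight([0.25] * 4, correct=[True, True, True, False], error=0.25)
```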

Autoencoders

Autoencoders are neural networks that learn compressed data representations by training to reconstruct their inputs, used for dimensionality reduction and anomaly detection.

Adam Optimizer

Adam is an adaptive learning rate optimizer that combines momentum and RMSprop to efficiently train deep learning models with per-parameter learning rates.
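
A sketch of a single Adam update for one scalar parameter, using the standard default hyperparameters:

```python
import math

def adam_step(param, grad, m, v, t, lr=0.001, b1=0.9, b2=0.999, eps=1e-8):
    """One Adam update for a single scalar parameter."""
    m = b1 * m + (1 - b1) * grad        # momentum: moving average of gradients
    v = b2 * v + (1 - b2) * grad ** 2   # RMSprop-style average of squared gradients
    m_hat = m / (1 - b1 ** t)           # bias correction for early steps
    v_hat = v / (1 - b2 ** t)
    param -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, m, v

p, m, v = 1.0, 0.0, 0.0
p, m, v = adam_step(p, grad=2.0, m=m, v=v, t=1)  # first step size is roughly lr
```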

Accuracy

Accuracy measures the proportion of correct predictions out of total predictions, the simplest classification evaluation metric.
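
In code, the metric is essentially one line:

```python
def accuracy(predictions, labels):
    """Fraction of predictions that match the true labels."""
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)

acc = accuracy([1, 0, 1, 1], [1, 0, 0, 1])  # 3 of 4 correct -> 0.75
```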

AUC-ROC

AUC-ROC measures the area under the receiver operating characteristic curve, evaluating classification performance across all possible decision thresholds.
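
Equivalently, AUC-ROC is the probability that a randomly chosen positive example scores higher than a randomly chosen negative one, which gives a direct (if O(n²)) way to compute it:

```python
def auc_roc(scores, labels):
    """AUC via the pairwise ranking definition: P(score_pos > score_neg)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    # Count pairs where the positive outranks the negative; ties count half.
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

auc = auc_roc([0.9, 0.8, 0.3, 0.1], [1, 1, 0, 0])  # perfect ranking -> 1.0
```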

Artificial Neuron

An artificial neuron is the basic computational unit in a neural network that receives inputs, applies weights and a bias, and produces an output through an activation function.
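
A sketch of the computation for a single neuron, here with a sigmoid activation and made-up weights:

```python
import math

def neuron(inputs, weights, bias):
    """Weighted sum of inputs plus bias, passed through a sigmoid activation."""
    z = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 / (1 + math.exp(-z))  # sigmoid squashes the result into (0, 1)

out = neuron([1.0, 2.0], weights=[0.5, -0.25], bias=0.0)  # z = 0.0 -> output 0.5
```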

Activation

Activation is the output value of a neuron after applying its activation function to the weighted sum of inputs, representing how strongly the neuron fires.

Activation Function

An activation function is a mathematical function applied to a neuron's output that introduces non-linearity, enabling neural networks to learn complex patterns.
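
Two of the most common activation functions, sketched in plain Python:

```python
import math

def relu(z):
    """ReLU: passes positive values through, zeroes out negatives."""
    return max(0.0, z)

def sigmoid(z):
    """Sigmoid: squashes any real number into the range (0, 1)."""
    return 1 / (1 + math.exp(-z))

values = [relu(-2.0), relu(3.0), sigmoid(0.0)]  # [0.0, 3.0, 0.5]
```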

Adversarial Training

Adversarial training improves model robustness by including adversarial examples (inputs intentionally crafted to fool the model) in the training process.

Adaptive Normalization

Adaptive normalization dynamically adjusts normalization parameters based on external conditioning information, enabling controlled generation in models like StyleGAN.

AlexNet

AlexNet is the deep CNN that won the 2012 ImageNet competition, sparking the modern deep learning revolution by demonstrating the power of GPU-trained deep networks.

Alignment

Alignment is the process of ensuring AI models behave in accordance with human values, intentions, and safety requirements.

Alignment Tax

Alignment tax refers to the reduction in raw model capability that can occur as a result of safety training and alignment procedures.

Adapter

An adapter is a small, trainable module inserted into a pre-trained model that allows task-specific customization without modifying the original weights.

Automatic Prompt Optimization

The use of algorithms and AI to automatically discover, refine, and improve prompts for better LLM performance on specific tasks.

AdaLoRA

An adaptive variant of LoRA that dynamically allocates the rank of low-rank adaptation matrices based on the importance of each weight matrix.

ALiBi

Attention with Linear Biases, a position encoding method that adds a linear distance-based penalty to attention scores, enabling length generalization.
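
A sketch of the core idea: each causal attention score receives a penalty proportional to the query-key distance, scaled by a per-head slope (hard-coded here for illustration):

```python
def alibi_bias(seq_len, slope):
    """Linear distance penalty added to causal attention scores."""
    return [[-slope * (i - j) if j <= i else float("-inf")  # -inf blocks future tokens
             for j in range(seq_len)]
            for i in range(seq_len)]

bias = alibi_bias(3, slope=0.5)
# Row for token 2 penalizes distant tokens more: [-1.0, -0.5, 0.0]
```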

AWQ

Activation-aware Weight Quantization, a method that protects the most important weights based on activation patterns for better low-bit quantization.

Attention Mechanism

A neural network component that dynamically focuses on relevant parts of the input when producing each output element, mimicking selective human attention.
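
A minimal sketch of scaled dot-product attention for a single query, with tiny made-up vectors:

```python
import math

def softmax(xs):
    """Normalize scores into attention weights that sum to 1."""
    exps = [math.exp(x - max(xs)) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(query, keys, values):
    """Scaled dot-product attention for one query vector."""
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
              for key in keys]
    weights = softmax(scores)  # how much to focus on each input position
    return [sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))]

# The query matches the first key, so the output leans toward its value (10.0):
out = attention(query=[1.0, 0.0],
                keys=[[1.0, 0.0], [0.0, 1.0]],
                values=[[10.0], [20.0]])
```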

AI Safety

The field focused on ensuring AI systems behave reliably, avoid causing harm, and remain aligned with human values and intentions.

Activation Function

A mathematical function applied between neural network layers that introduces non-linearity, enabling the model to learn complex patterns.

API Endpoint

A URL that applications call to send prompts to an LLM and receive generated responses, the standard interface for using AI models in production.

AI Agent

An AI system that can autonomously plan, reason, and take actions using tools to accomplish goals, going beyond simple question answering.

Agentic Workflow

A task execution pattern where an AI agent autonomously plans and executes a series of steps, making decisions at each stage based on intermediate results.

ARC Challenge

ARC Challenge is a benchmark of grade-school science questions that require reasoning beyond simple retrieval to answer correctly.

AlpacaEval

AlpacaEval is an automated evaluation benchmark that uses LLMs to judge model responses against a reference model on 805 instructions.

Arena Hard

Arena Hard is an automated benchmark of 500 challenging prompts derived from Chatbot Arena that predicts human preference rankings.

Automatic Evaluation

Automatic evaluation uses algorithms or AI judges to assess language model outputs without requiring human annotators.

Attention Mask

An attention mask controls which tokens can attend to which other tokens in the attention computation, enabling causal and selective attention.
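
For example, a causal mask lets each token attend only to itself and earlier tokens:

```python
def causal_mask(seq_len):
    """True where token i may attend to token j (j <= i): no peeking ahead."""
    return [[j <= i for j in range(seq_len)] for i in range(seq_len)]

mask = causal_mask(3)
# [[True, False, False],
#  [True, True,  False],
#  [True, True,  True ]]
```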

Aspect-Based Sentiment Analysis

Aspect-based sentiment analysis identifies sentiment toward specific aspects or features of a product or service mentioned in text.

Abstractive Summarization

Abstractive summarization generates new sentences to capture the essence of a document, paraphrasing and combining ideas from the source.

Abstractive QA

Abstractive QA generates answers to questions in natural language rather than extracting them directly from source text.

Anaphora Resolution

Anaphora resolution determines what a pronoun or referential expression refers to in the surrounding text.

Adversarial NLP

Adversarial NLP studies how intentionally crafted inputs can fool NLP models into making incorrect predictions or generating harmful outputs.

Attention Mechanism

The attention mechanism allows models to focus on the most relevant parts of the input when producing each part of the output.

Active Learning for NLP

Active learning for NLP selects the most informative examples for human annotation, maximizing model improvement per labeled example.

Abstractive Rewriting

Abstractive rewriting generates new text that conveys the same information as the original but with different wording and potentially different structure.

Automatic Text Scoring

Automatic text scoring uses NLP to evaluate and grade written text, commonly used in educational assessment and content quality evaluation.

Aspect Extraction

Aspect extraction identifies specific features, attributes, or topics that people discuss in reviews and feedback text.

Argument Mining

Argument mining automatically identifies the structure of arguments in text, including claims, premises, evidence, and their relationships.

AMR Parsing

AMR parsing converts natural language sentences into Abstract Meaning Representation graphs that capture who did what to whom.

Answer Extraction

Answer extraction identifies and extracts the specific piece of text that answers a question from a given passage or document.

Advanced RAG

An enhanced RAG approach that adds pre-retrieval, retrieval, and post-retrieval optimizations such as query rewriting, re-ranking, and answer refinement.

Adaptive RAG

A RAG system that dynamically adjusts its retrieval strategy based on query complexity, routing simple queries directly and complex ones through multi-step retrieval.

Agentic RAG

A RAG system where an AI agent orchestrates the retrieval process, dynamically deciding what to search for, when to retrieve, and how to use retrieved information.

Atlas

A retrieval-augmented language model from Meta that jointly pre-trains a retriever and language model, achieving strong few-shot performance on knowledge tasks.

Approximate Nearest Neighbor

A search technique that finds approximately (not exactly) the closest vectors to a query vector, trading a small amount of accuracy for dramatically faster search.

Auto-merging Retrieval

A technique that automatically merges smaller retrieved chunks into larger parent chunks when enough child chunks from the same parent are retrieved.

Answer Relevancy

A RAG evaluation metric measuring how well the generated answer addresses the user's original question, regardless of factual accuracy.

Arctic Embed

Snowflake's open-source embedding model family optimized for enterprise retrieval, offering multiple sizes from lightweight to high-accuracy variants.

All-MiniLM

A compact, fast sentence embedding model from the Sentence Transformers library, widely used for lightweight semantic search and similarity tasks.

Angular Distance

A distance metric that measures the angle between two vectors in embedding space, related to cosine similarity but expressed as an angular measurement.
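
A sketch in plain Python: compute cosine similarity, then take its arccosine:

```python
import math

def angular_distance(a, b):
    """Angle (in radians) between two vectors, via their cosine similarity."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    cosine = dot / (norm_a * norm_b)
    return math.acos(max(-1.0, min(1.0, cosine)))  # clamp for float safety

angle = angular_distance([1.0, 0.0], [0.0, 1.0])  # orthogonal vectors -> pi/2
```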

AI Agent

An AI agent is an autonomous system that can perceive its environment, make decisions, and take actions to achieve goals, often using tools and integrations.

Autonomous Agent

An AI agent that operates independently with minimal human intervention, making its own decisions about which actions to take to achieve a given goal.

API Agent

An AI agent that interacts with external services through their APIs, making HTTP requests and processing responses to accomplish tasks programmatically.

AutoGen

A Microsoft framework for building multi-agent conversational systems where AI agents can chat with each other and with humans to accomplish tasks.

AutoGPT

One of the first widely known autonomous AI agent projects, demonstrating how LLMs can be given goals and tools to accomplish tasks independently.

Aider

An open-source AI pair programming tool that works in the terminal, allowing developers to make code changes through natural language conversation with git integration.

Agent Loop

The core execution cycle of an AI agent: observe the current state, reason about what to do, take an action, observe the result, and repeat until the goal is achieved.
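
A deliberately toy sketch of that cycle. A real agent would replace the stubbed "reason" step with an LLM call choosing among tools; the calculator tool here is invented for illustration:

```python
def agent_loop(goal, tools, max_steps=5):
    """Observe -> reason -> act, repeated until the goal is achieved."""
    state = {"goal": goal, "result": None}
    for _ in range(max_steps):
        if state["result"] is not None:   # observe: has the goal been achieved?
            break
        action = "calculate"              # reason: pick an action (stubbed here)
        state["result"] = tools[action](state["goal"])  # act, then observe result
    return state["result"]

# A single hypothetical tool: a calculator for simple additions.
tools = {"calculate": lambda expr: sum(int(n) for n in expr.split("+"))}
answer = agent_loop("2+3+4", tools)  # -> 9
```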

Agent Collaboration

The process of multiple AI agents working together cooperatively, sharing information and coordinating actions to achieve common goals.

Agent Communication

The methods and protocols by which AI agents exchange information, including message passing, shared memory, and conversational interaction.

Agent Delegation

The ability of one AI agent to assign tasks or sub-tasks to another agent, distributing work based on capabilities and availability.

Agent Orchestration

The coordination and management of multiple AI agents, controlling their execution order, communication, and resource allocation to achieve system-level goals.

Agent Handoff

The process of transferring a conversation or task from one AI agent to another, maintaining context and continuity during the transition.

Agent Routing

The process of directing user requests to the most appropriate AI agent based on the request's topic, intent, complexity, or other classification criteria.
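
A toy keyword-based sketch; production routers typically use an LLM or a trained classifier, and the agent names here are invented:

```python
# Hypothetical routing table mapping keywords to specialist agents.
ROUTES = {
    "refund": "billing_agent",
    "invoice": "billing_agent",
    "password": "account_agent",
    "bug": "support_agent",
}

def route(message, default="general_agent"):
    """Send the message to the first agent whose keyword appears in it."""
    text = message.lower()
    for keyword, agent in ROUTES.items():
        if keyword in text:
            return agent
    return default  # fall back when no specialist matches

chosen = route("I need a refund for last month")  # -> "billing_agent"
```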

Agent Memory

The mechanisms by which an AI agent stores, retrieves, and uses information from past interactions to inform its current decisions and maintain continuity.

Arize Phoenix

An open-source observability library for LLM applications, providing tracing, evaluation, and debugging tools with a focus on retrieval and embedding analysis.

AutoGen Agent

An agent built using Microsoft's AutoGen framework, designed for multi-agent conversations where agents interact through message-passing to solve tasks collaboratively.

Auto Tool Selection

The ability of an AI agent to automatically choose the most appropriate tool from its available set based on the current task and context.

Agent Negotiation

A multi-agent interaction pattern where agents negotiate, debate, or bargain with each other to reach agreements or resolve conflicting objectives.

Agent Handoff Pattern

A design pattern for smoothly transferring conversation context and control from one agent to another when the current agent cannot handle the request.

AI Chatbot

An AI chatbot uses artificial intelligence and natural language processing to understand user intent and generate contextual responses.

Attachment Button

An attachment button is a control in the chat interface that allows users to attach files, images, or other media to their messages.

Apple Business Chat

Apple Business Chat is a messaging channel that allows businesses to communicate with customers through the Messages app on Apple devices.

API Channel

An API channel is a programmatic interface that allows developers to integrate chatbot conversations into custom applications and workflows.

Accessibility Chat

Accessibility in chat refers to designing conversational interfaces that are usable by people with disabilities, following WCAG guidelines.

Action Button

An action button is an interactive element within a chat message that triggers a specific function when clicked by the user.

Auto-Translation Chat

Auto-translation in chat automatically translates messages between languages, enabling communication across language barriers.

Agent Assignment

Agent assignment is the process of selecting and connecting a specific human agent to handle an incoming chat conversation.

Agent Availability

Agent availability tracks whether human agents are online, busy, or away, determining their capacity to accept new chat conversations.

Agent Status

Agent status is the current state indicator for a support agent, showing their availability, activity, and capacity for new conversations.

Abandonment Rate

Abandonment rate is the percentage of chat conversations where users leave without completing the interaction or receiving a resolution.

Average Response Time

Average response time is the mean duration between a user message and the corresponding bot or agent response across all conversation turns.

Automation Rate

Automation rate is the percentage of total support interactions handled entirely by automated systems including chatbots and self-service tools.

A/B Testing (Chatbot)

A/B testing for chatbots compares two chatbot configurations with real users to determine which performs better on key metrics.

API Import

API import uses the chatbot platform's API to programmatically load and sync data, enabling automated and scheduled content updates.

Audit Log (Chatbot)

An audit log records all administrative actions and data access events in a chatbot system for security monitoring and compliance purposes.

AI Safety

The field of research and practice dedicated to ensuring AI systems behave as intended, avoid harmful outcomes, and remain under human control.

AI Alignment

The challenge of ensuring AI systems pursue goals and behaviors that match human intentions, values, and preferences.

AI Control

Methods and mechanisms for maintaining human authority over AI systems, ensuring they can be monitored, corrected, restricted, and shut down as needed.

Algorithmic Bias

Systematic and unfair discrimination in AI system outputs caused by biased training data, flawed model design, or prejudiced assumptions in the development process.

Attention Visualization

A technique that displays where transformer models focus their attention, showing which parts of the input the model considers most relevant for each output.

AI Governance

The frameworks, policies, and practices organizations use to ensure responsible development and deployment of AI systems, balancing innovation with safety and ethics.

AI Regulation

Laws and regulatory frameworks enacted by governments to control the development and use of AI, establishing requirements for safety, transparency, and accountability.

AI Risk Classification

The process of categorizing AI systems by their potential for harm, determining what safety requirements and oversight mechanisms must be applied.

AI Audit

A systematic evaluation of an AI system's compliance with regulatory requirements, ethical standards, and organizational policies, covering safety, fairness, and performance.

AI Liability

Legal responsibility for harm caused by AI systems, an evolving area of law addressing who is accountable when AI makes harmful decisions or errors.

AI Standards

Technical and organizational standards established by bodies like ISO and NIST that define best practices for developing, deploying, and managing AI systems.

AI Watermarking

Techniques for embedding imperceptible signals in AI-generated content that identify it as machine-generated, enabling detection and attribution.

AI Boxing

A safety strategy that confines an AI system within restricted computational environments with limited communication channels to the outside world.

Automation Bias

The tendency of humans to over-rely on automated systems and accept AI outputs without sufficient critical evaluation, even when the AI is wrong.

Aggregation Bias

Bias that occurs when a single model is applied to groups with different characteristics, assuming all groups behave the same way when they do not.

Amplification Bias

When AI systems amplify existing societal biases beyond their prevalence in training data, making biased patterns more extreme in the system output.

Accumulated Local Effects

A feature effect visualization that improves on partial dependence plots by handling correlated features correctly, showing unbiased feature effects.

AI Act

The European Union's comprehensive regulation for artificial intelligence, establishing a risk-based framework for governing AI development and deployment.

AI Regulatory Sandbox

A controlled environment where businesses can test AI innovations under regulatory supervision before full deployment, with relaxed compliance requirements.

Algorithmic Impact Assessment

A structured evaluation of the potential effects of an AI system on individuals and society, conducted before or during deployment to identify and mitigate risks.

AI Compliance

The practice of ensuring AI systems meet applicable legal, regulatory, ethical, and organizational standards throughout their lifecycle.

AI Transparency Report

A public document that discloses how an organization develops, deploys, and governs its AI systems, including performance metrics and safety measures.

A100

The NVIDIA A100 is a data center GPU based on the Ampere architecture, widely used for AI training and inference with 40GB or 80GB of high-bandwidth memory.

Auto-scaling

Auto-scaling automatically adjusts the number of model serving instances based on traffic demand, optimizing for cost efficiency during low traffic and performance during spikes.
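
The core sizing rule behind most auto-scalers can be sketched as a ceiling division against a per-replica budget, clamped to configured bounds; the throughput numbers below are illustrative:

```python
import math

def desired_replicas(current_rps, rps_per_replica, min_replicas=1, max_replicas=20):
    """Target instance count so each replica stays under its request budget."""
    needed = math.ceil(current_rps / rps_per_replica)
    return max(min_replicas, min(max_replicas, needed))  # clamp to [min, max]

n = desired_replicas(current_rps=450, rps_per_replica=100)  # -> 5 replicas
```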

AWQ

AWQ (Activation-aware Weight Quantization) is a quantization method for LLMs that preserves important weights based on activation patterns, achieving efficient 4-bit compression.

AWS SageMaker

AWS SageMaker is Amazon's fully managed ML platform that provides tools for building, training, and deploying machine learning models at scale on AWS infrastructure.

AWS Bedrock

AWS Bedrock is Amazon's managed service for accessing foundation models from multiple providers through a single API, including models from Anthropic, Meta, Mistral, and Amazon.

Azure OpenAI Service

Azure OpenAI Service provides access to OpenAI models (GPT-4, DALL-E, Whisper) through Microsoft Azure with enterprise security, compliance, and regional deployment capabilities.

Azure AI Studio

Azure AI Studio is Microsoft's integrated development environment for building AI applications, combining model catalog, prompt engineering, RAG, and evaluation tools in a unified platform.

Apache Kafka

Apache Kafka is a distributed event streaming platform used for real-time data pipelines, event processing, and feeding ML systems with streaming data.

Apache Spark

Apache Spark is a distributed computing engine for large-scale data processing and ML, capable of handling batch and streaming workloads across clusters.

Apache Airflow

Apache Airflow is an open-source workflow orchestration platform used to schedule, monitor, and manage complex data and ML pipelines as directed acyclic graphs (DAGs).

AWS Trainium

AWS Trainium is a custom ML chip designed by Amazon for high-performance, cost-effective deep learning training in the cloud.

AWS Inferentia

AWS Inferentia is a custom ML chip designed by Amazon for high-performance, cost-effective inference workloads in the cloud.

API Gateway for ML

An API gateway for ML routes prediction requests to model endpoints, handling authentication, rate limiting, traffic management, and observability for ML APIs.

Auto-Scaling for ML

Auto-scaling for ML automatically adjusts the number of model serving replicas based on demand, GPU utilization, or queue depth to balance cost and performance.

Alerting for ML

Alerting for ML automatically notifies teams when ML model or infrastructure metrics cross defined thresholds, enabling rapid response to issues.

Anomaly Detection for Monitoring

Anomaly detection for monitoring uses statistical or ML methods to automatically identify unusual patterns in model behavior, data, or system metrics that may indicate problems.

Azure Machine Learning

Azure Machine Learning is a cloud service for building, training, deploying, and managing ML models at scale with enterprise features for governance and collaboration.

AI Guardrails Infrastructure

AI guardrails infrastructure provides the systems and tools for enforcing safety constraints on LLM inputs and outputs, including content filtering, PII detection, and policy enforcement.

AI Accelerator

An AI accelerator is specialized hardware designed to efficiently execute the mathematical operations common in AI workloads, including GPUs, TPUs, and custom ASICs.

Action Recognition

Action recognition identifies and classifies human activities and movements in video, such as walking, running, cooking, or playing sports.

Age Estimation

Age estimation uses computer vision to predict the apparent age of a person from their facial image.

Anchor-Based Detection

Anchor-based detection uses predefined reference boxes (anchors) of various sizes and aspect ratios as starting points for predicting object locations.

Anchor-Free Detection

Anchor-free detection predicts object locations directly without predefined reference boxes, using approaches like center-point prediction or corner detection.

AI Image Editing

AI image editing uses machine learning to intelligently modify images, enabling tasks like object removal, background replacement, and text-guided editing.

AI Video Editing

AI video editing uses machine learning to automate and enhance video editing tasks, from object removal to style transfer and text-guided modifications.

Autonomous Driving Vision

Autonomous driving vision encompasses the visual perception systems that enable self-driving vehicles to understand road scenes, detect objects, and navigate safely.

Attention Mechanism in Vision

Attention mechanisms in vision allow models to selectively focus on the most relevant parts of an image, improving recognition and understanding of visual content.

Activity Detection

Activity detection localizes and classifies activities in untrimmed video, identifying when and what actions occur across long temporal sequences.

Aerial Image Analysis

Aerial image analysis uses computer vision to interpret imagery captured by drones and aircraft for mapping, inspection, agriculture, and environmental monitoring.

Automatic Speech Recognition

Automatic Speech Recognition (ASR) is the computational process of converting audio speech signals into text transcriptions using machine learning models.

ASR

ASR is the abbreviation for Automatic Speech Recognition, the technology that converts spoken audio into written text using AI models.

AssemblyAI

AssemblyAI is a speech AI platform offering transcription, speaker diarization, content moderation, and audio intelligence through developer-friendly APIs.

Amazon Polly

Amazon Polly is AWS's text-to-speech service offering dozens of voices across 30+ languages with Neural TTS technology and SSML control for enterprise applications.

Audio Classification

Audio classification identifies the type of sound in audio recordings, categorizing them as speech, music, noise, environmental sounds, or specific events.

AI Music

AI music encompasses the use of artificial intelligence in music creation, production, analysis, and recommendation, from generating compositions to mastering audio.

Audio Enhancement

Audio enhancement uses AI to improve overall audio quality by reducing noise, removing reverb, equalizing levels, and restoring clarity in degraded recordings.

Audio Fingerprinting

Audio fingerprinting creates a compact digital signature of an audio recording that can identify the content even from short, noisy clips.

Azure Speech

Azure Speech is Microsoft's cloud speech service providing text-to-speech, speech-to-text, speech translation, and custom voice capabilities.

Agent Assist Voice

Agent assist voice provides real-time AI guidance to customer service agents during phone calls, suggesting responses and surfacing relevant information.

Audio Source Separation

Audio source separation isolates individual sound sources from a mixed audio recording, such as separating vocals from instruments in a song.

Audio Embedding

Audio embeddings are compact vector representations of audio that capture meaningful acoustic properties for similarity search and classification.

Audio Augmentation

Audio augmentation applies transformations to training audio data to increase diversity and improve the robustness of speech and audio AI models.

AI-as-a-Service

AI-as-a-Service (AIaaS) delivers artificial intelligence capabilities through cloud APIs and platforms, allowing businesses to use AI without building or maintaining their own models.

Automation Rate

Automation rate measures the percentage of customer interactions fully handled by AI without human agent involvement, indicating chatbot effectiveness.

A/B Testing

A/B testing compares two versions of a feature, page, or experience to determine which performs better, using statistical analysis to make data-driven decisions.
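
The statistical core is often a two-proportion z-test on the conversion rates of the two variants; a sketch with invented conversion counts:

```python
import math

def two_proportion_z(conv_a, n_a, conv_b, n_b):
    """z statistic for the difference between two conversion rates."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    p_pool = (conv_a + conv_b) / (n_a + n_b)       # pooled rate under the null
    se = math.sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    return (p_b - p_a) / se

# Variant A: 200/1000 converted; variant B: 250/1000.
z = two_proportion_z(conv_a=200, n_a=1000, conv_b=250, n_b=1000)
# |z| > 1.96 -> the difference is significant at the 5% level
```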

AI Copilot

An AI copilot is an AI assistant that works alongside humans in their workflow, augmenting their capabilities with suggestions, automation, and intelligent assistance.

AI Assistant

An AI assistant is a software agent that uses artificial intelligence to help users accomplish tasks through natural language interaction, from answering questions to executing actions.

Annual Recurring Revenue

Annual recurring revenue (ARR) is the total predictable revenue a business expects from subscriptions over a year, normalized to an annual value.

Activation Rate

Activation rate measures the percentage of new users who complete key actions that indicate they have experienced the core value of an AI product.

Adoption Rate

Adoption rate tracks the percentage of a target audience that begins using an AI product or feature, measuring how quickly new technology spreads within an organization or market.

AI Marketing

AI marketing applies artificial intelligence to marketing strategies and operations, enabling data-driven decisions, personalization at scale, and automated campaign optimization.

A/B Testing with AI

A/B testing with AI uses machine learning to automate experiment design, traffic allocation, and result analysis, optimizing digital experiences faster than traditional methods.

Ad Optimization

Ad optimization uses AI to automatically improve advertising performance by optimizing targeting, bidding, creative, and placement decisions in real time.

AI Governance

AI governance establishes policies, processes, and oversight structures to ensure AI systems are developed and deployed responsibly, ethically, and in compliance with regulations.

AI Ethics for Business

AI ethics for business addresses the moral principles and practical guidelines organizations follow when developing and deploying AI systems that affect customers and employees.

AI Risk Management

AI risk management identifies, assesses, and mitigates risks associated with AI systems, including technical failures, bias, security vulnerabilities, and regulatory non-compliance.

AI Change Management

AI change management guides organizations through the people, process, and culture changes required for successful AI adoption, addressing resistance and driving engagement.

AI Maturity Model

An AI maturity model assesses an organization's current AI capabilities and provides a roadmap for advancing through stages from experimentation to enterprise-wide AI integration.

AI Talent Strategy

AI talent strategy defines how organizations attract, develop, and retain the people needed to build, deploy, and manage AI systems effectively.

AI Vendor Evaluation

AI vendor evaluation is the systematic process of assessing AI technology providers based on capabilities, reliability, security, pricing, and alignment with business needs.

AI Strategy

AI strategy is the comprehensive plan that aligns AI initiatives with business objectives, defining where, how, and why an organization will deploy AI for competitive advantage.

AI Procurement

AI procurement is the process of evaluating, selecting, and purchasing AI solutions, requiring specialized criteria beyond traditional software procurement.

AI Implementation

AI implementation is the end-to-end process of deploying AI solutions in a business environment, from planning and data preparation through deployment and optimization.

AI Integration

AI integration connects AI systems with existing business applications, data sources, and workflows to create seamless, intelligent operations across the technology stack.

AI Scalability

AI scalability is the ability of AI systems to handle growing workloads, users, and data volumes while maintaining performance, quality, and cost efficiency.

AI Competitive Advantage

AI competitive advantage is the strategic benefit businesses gain by using AI to deliver superior products, services, or operations that competitors cannot easily replicate.

AI Digital Transformation

AI digital transformation is the strategic adoption of AI across an organization to fundamentally change how the business operates, delivers value, and competes in the market.

AI Marketplace

An AI marketplace is a platform where businesses can discover, compare, and purchase AI models, tools, and solutions from multiple vendors in a centralized environment.

AI Business Case

An AI business case is the structured justification for an AI investment, quantifying expected costs, benefits, risks, and timeline to secure organizational approval and funding.

Average Handle Time

Average handle time (AHT) measures the average duration of a customer support interaction from start to finish, including conversation time, hold time, and after-interaction work.

Agent Assist AI

Agent assist AI provides real-time AI support to human customer service agents during conversations, suggesting responses, surfacing information, and automating routine tasks.

AI Pricing Strategy

AI pricing strategy defines how AI products and services are priced, balancing cost recovery, value capture, competitive positioning, and customer willingness to pay.

API Economy

The API economy is the commercial ecosystem where businesses create value by exposing their services as APIs, enabling integration, innovation, and new business models.

AI Readiness Assessment

An AI readiness assessment evaluates an organization’s preparedness to adopt AI by examining data, technology, skills, culture, and governance capabilities.

AI Center of Excellence

An AI Center of Excellence is a cross-functional team that provides AI expertise, governance, best practices, and reusable capabilities across an organization.

AI Roadmap

An AI roadmap is a strategic plan that sequences AI initiatives based on business value, feasibility, and organizational readiness over a defined timeline.

AI Use Case Prioritization

AI use case prioritization is the process of evaluating and ranking potential AI applications based on business value, technical feasibility, and strategic alignment.

AI Operating Model

An AI operating model defines how an organization structures teams, processes, governance, and technology to deliver AI capabilities at scale.

AI Total Cost of Ownership

AI total cost of ownership captures all costs of implementing and maintaining AI systems, including infrastructure, talent, data, operations, and opportunity costs.

AI Observability

AI observability provides visibility into how AI systems behave in production through monitoring, logging, and analysis of inputs, outputs, costs, and performance.

AI Cost Optimization

AI cost optimization reduces the total cost of running AI systems through model selection, prompt engineering, caching, batching, and usage-based architecture decisions.

Aggregate Function

An aggregate function performs a calculation on a set of values and returns a single result, commonly used with GROUP BY for summarizing data in SQL queries.
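For example, a minimal sketch using Python's built-in sqlite3 module (hypothetical `orders` table), showing how SUM and COUNT collapse each GROUP BY group into a single row:

```python
import sqlite3

# In-memory database with a small, made-up orders table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("alice", 10.0), ("alice", 20.0), ("bob", 5.0)],
)

# SUM and COUNT are aggregate functions: one output row per group.
rows = conn.execute(
    "SELECT customer, SUM(amount), COUNT(*) FROM orders "
    "GROUP BY customer ORDER BY customer"
).fetchall()
# rows == [("alice", 30.0, 2), ("bob", 5.0, 1)]
```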

ACID

ACID stands for Atomicity, Consistency, Isolation, and Durability, the four properties that guarantee reliable database transactions.
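A sketch of the atomicity property using Python's sqlite3 (hypothetical `accounts` table): when a transaction fails partway through, every change inside it is rolled back:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (name TEXT, balance REAL)")
conn.execute("INSERT INTO accounts VALUES ('alice', 100.0)")
conn.commit()

# Atomicity: the debit and the (never-reached) credit succeed or fail together.
try:
    with conn:  # commits on success, rolls back if an exception escapes
        conn.execute("UPDATE accounts SET balance = balance - 50 WHERE name = 'alice'")
        raise RuntimeError("simulated failure before the matching credit")
except RuntimeError:
    pass

balance = conn.execute(
    "SELECT balance FROM accounts WHERE name = 'alice'"
).fetchone()[0]
# The debit was rolled back, so balance is still 100.0.
```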

Apache Parquet

Apache Parquet is the full project name for the Parquet columnar file format, maintained by the Apache Software Foundation as an open standard for big data storage.

Arrow

Apache Arrow is a cross-language development platform for in-memory analytics, defining a standardized columnar memory format for efficient data processing and zero-copy sharing.

Apache Spark

Apache Spark is a unified analytics engine for large-scale data processing, providing high-level APIs for distributed batch and stream processing, SQL, machine learning, and graph computation.

Apache Kafka

Apache Kafka is a distributed event streaming platform used for building real-time data pipelines and streaming applications that handle high-throughput, fault-tolerant messaging.

Apache Flink

Apache Flink is a distributed stream processing framework designed for stateful computations over unbounded and bounded data streams with exactly-once processing guarantees.

Apache Airflow

Apache Airflow is an open-source workflow orchestration platform for programmatically authoring, scheduling, and monitoring data pipelines as directed acyclic graphs (DAGs).

ArangoDB

ArangoDB is a multi-model NoSQL database that natively supports documents, graphs, and key-value data with a unified query language called AQL.

Algolia

Algolia is a hosted search-as-a-service platform that provides instant, relevant search results with minimal configuration and powerful ranking and personalization features.

Avro

Apache Avro is a row-oriented data serialization framework that uses JSON-defined schemas and compact binary encoding, designed for data-intensive applications and schema evolution.

Apache Kafka (Data)

Apache Kafka is a distributed event streaming platform used as the backbone of real-time data pipelines, stream processing, and event-driven architectures.

Apache Beam

Apache Beam is a unified programming model for defining both batch and stream data processing pipelines that can run on multiple execution engines.

Apache Airflow (Data)

Apache Airflow is a platform for programmatically authoring, scheduling, and monitoring data workflows and pipelines using Python-defined directed acyclic graphs.

Airbyte

Airbyte is an open-source data integration platform that replicates data from APIs, databases, and files into data warehouses with a growing library of community-built connectors.

Amazon Redshift

Amazon Redshift is a fully managed, petabyte-scale cloud data warehouse that uses columnar storage and massively parallel processing for fast analytical queries.

Apache Arrow

Apache Arrow is a cross-language columnar memory format designed for efficient data processing, enabling zero-copy data sharing between analytics systems.

ANOVA

ANOVA (Analysis of Variance) is a statistical test that compares means across three or more groups simultaneously, determining if at least one group differs significantly from the others.

Anthropic

Anthropic is an AI safety company that develops the Claude family of AI models, emphasizing responsible AI development and leading research in AI alignment and interpretability.

AI21 Labs

AI21 Labs is an Israeli AI company that develops the Jamba family of language models, known for innovations in model architecture and enterprise AI solutions.

Amazon Q

Amazon Q is AWS's AI assistant designed for business and developer use, providing AI-powered coding, data analysis, and enterprise application integration.

Amazon Bedrock

Amazon Bedrock is a fully managed AWS service that provides API access to foundation models from multiple providers, enabling organizations to build generative AI applications.

Azure OpenAI Service

Azure OpenAI Service provides enterprise access to OpenAI models (GPT-4, DALL-E, Whisper) through Microsoft Azure with added security, compliance, and regional deployment.

Adobe Firefly

Adobe Firefly is a family of generative AI models integrated into Adobe Creative Cloud applications, designed for safe commercial use with training data from licensed content.

Adept AI

Adept AI is an AI research company building action models that can interact with software tools and perform tasks on behalf of users.

Anyscale

Anyscale is the company behind Ray, an open-source framework for scaling AI and Python applications across distributed computing clusters.

Aider

Aider is an open-source AI pair programming tool that works in the terminal, enabling AI-assisted coding with git integration and multi-file editing.

AutoGen

AutoGen is Microsoft's open-source framework for building multi-agent AI systems where multiple AI agents collaborate, debate, and complete tasks together.

Amazon Lex

Amazon Lex is an AWS service for building conversational interfaces using natural language understanding, powering chatbots and voice assistants.

Ada

Ada is an AI-powered customer service automation platform that uses AI agents to resolve customer inquiries across channels without human intervention.

AWS SageMaker

Amazon SageMaker is a fully managed AWS service that provides tools for building, training, and deploying machine learning models at scale.

Azure Machine Learning

Azure Machine Learning is Microsoft's cloud platform for building, training, and deploying machine learning models with enterprise-grade tools and MLOps capabilities.

AWS Bedrock

Amazon Bedrock is a managed service providing access to foundation models from multiple providers through a unified API with enterprise security and customization.

Azure AI Studio

Azure AI Studio is Microsoft's unified platform for building, testing, and deploying AI applications with access to OpenAI models and open-source alternatives.

Alibaba Cloud AI

Alibaba Cloud AI provides AI services and the Qwen family of large language models, serving as a major AI platform in China and globally.

Arize AI

Arize AI provides an ML observability platform for monitoring model performance, detecting data drift, and troubleshooting AI systems in production.

Arthur AI

Arthur AI provides an AI performance monitoring platform focused on model accuracy, bias detection, and operational efficiency for enterprise AI.

AssemblyAI

AssemblyAI provides AI models and APIs for speech-to-text, speaker diarization, and audio intelligence, serving developers building audio AI applications.

albumentations

albumentations is a fast image augmentation library that provides a comprehensive set of transformations for training robust computer vision models.

AutoGen

AutoGen is a Microsoft framework for building multi-agent AI systems where multiple AI agents collaborate, debate, and solve complex tasks through structured conversations.

AutoGluon

AutoGluon is an AutoML toolkit by Amazon that automatically builds and ensembles machine learning models, achieving strong performance with just a few lines of code.

AllenNLP

AllenNLP is a PyTorch-based NLP research library from the Allen Institute for AI, designed for developing and evaluating state-of-the-art NLP models.

Aim

Aim is an open-source experiment tracking tool with a powerful UI for comparing and exploring thousands of ML training runs efficiently.

Arize AI

Arize AI is an ML observability platform for monitoring model performance in production, detecting data drift, and troubleshooting model degradation.

Axolotl

Axolotl is a tool for streamlining LLM fine-tuning with support for multiple model architectures, training techniques, and dataset formats through simple YAML configuration.

Apache Airflow

Apache Airflow is an open-source workflow orchestration platform for scheduling and monitoring data pipelines and ML workflows as directed acyclic graphs.

Accelerate

Accelerate is a Hugging Face library that enables PyTorch code to run on any distributed configuration with minimal code changes for multi-GPU and multi-node training.

AutoGen Studio

AutoGen Studio is a visual interface for building, testing, and deploying multi-agent AI workflows using Microsoft's AutoGen framework without writing code.

Anthropic SDK

The Anthropic SDK is the official client library for interacting with Claude models, providing typed interfaces for messages, tool use, and extended thinking.

AlphaFold

AlphaFold is DeepMind's AI system that predicts protein 3D structures from amino acid sequences with near-experimental accuracy.

Algorithmic Trading

Algorithmic trading uses AI and mathematical models to execute financial trades automatically based on predefined rules and real-time market analysis.

Anti-Money Laundering

AI anti-money laundering (AML) uses machine learning to detect suspicious financial transactions and identify potential money laundering activities.

Adaptive Learning

Adaptive learning uses AI to automatically adjust educational content, pace, and difficulty based on individual student performance and behavior.

Automated Grading

Automated grading uses AI to evaluate student work including essays, code, and problem sets, providing immediate feedback and scores.

Agriculture AI

Agriculture AI uses machine learning, computer vision, and robotics to optimize farming through precision agriculture, crop monitoring, and automated harvesting.

Autonomous Vehicles

Autonomous vehicles use AI, computer vision, and sensor fusion to navigate and operate without human input, ranging from driver assistance to full self-driving.

AI Health Coaching

AI health coaching uses personalized algorithms to guide users toward healthier behaviors through adaptive recommendations.

Appointment Scheduling AI

AI appointment scheduling optimizes healthcare scheduling through intelligent matching, prediction, and automated booking.

Actuarial AI

Actuarial AI enhances traditional actuarial science with machine learning for more accurate risk modeling and pricing.

Automated Essay Scoring

Automated essay scoring uses NLP to evaluate written compositions and provide consistent, rapid feedback on writing quality.

AI Curriculum Design

AI curriculum design uses data analytics and machine learning to optimize educational content sequencing and course structure.

Autonomous Mobile Robot

Autonomous mobile robots use AI navigation to move independently through facilities for material transport and logistics tasks.

AI Demand Planning

AI demand planning uses machine learning to forecast product demand and align supply chain operations with anticipated needs.

AI Production Scheduling

AI production scheduling optimizes manufacturing schedules to maximize throughput while respecting constraints and minimizing waste.

Automotive AI

Automotive AI applies machine learning to vehicle design, manufacturing, autonomous driving, and connected car services.

Aerospace AI

Aerospace AI applies machine learning to aircraft design, maintenance, air traffic management, and space exploration.

Agriculture Robotics

Agriculture robotics uses AI-powered robots for planting, monitoring, harvesting, and maintaining crops autonomously.

Audit AI

Audit AI uses machine learning to automate financial and operational auditing through data analysis and anomaly detection.

Ad Tech AI

Ad tech AI uses machine learning to optimize digital advertising through targeting, bidding, creative optimization, and measurement.

AI Search

AI search uses machine learning to deliver more relevant, context-aware search results across enterprise and consumer applications.

AI Translation

AI translation uses neural machine translation to convert text and speech between languages with near-human quality.

Accessibility AI

Accessibility AI uses machine learning to improve digital and physical accessibility for people with disabilities.

AI Journalism

AI journalism uses machine learning to assist with reporting, fact-checking, content creation, and news distribution.

Architecture AI

Architecture AI uses generative design and simulation to optimize building design for performance, sustainability, and aesthetics.

AI Art

AI art uses generative models to create, modify, and enhance visual artwork through machine learning.

AI for Scientific Research

AI for scientific research uses machine learning to accelerate discovery through data analysis, hypothesis generation, and experiment design.

Autonomous Vehicle

An autonomous vehicle uses AI to navigate and operate without human intervention, progressing through levels of automation from driver assistance to full self-driving.

ADAS

Advanced Driver Assistance Systems (ADAS) use AI and sensors to enhance vehicle safety through features like automatic emergency braking, lane keeping, and adaptive cruise control.

Algorithmic Auditing

Algorithmic auditing systematically evaluates AI systems for bias, fairness, accuracy, and compliance with regulations and ethical standards.

Anti-Fraud AI

Anti-fraud AI uses machine learning to detect and prevent fraudulent activities across financial transactions, insurance claims, identity theft, and digital interactions.

ASIC

An Application-Specific Integrated Circuit (ASIC) is a custom chip designed for a single purpose, offering maximum efficiency for specific AI workloads.

A100

The NVIDIA A100 is a data center GPU based on the Ampere architecture, widely used for AI training and inference in cloud and enterprise environments.

AWS Trainium

AWS Trainium is Amazon's custom AI chip designed specifically for training deep learning models cost-effectively on AWS cloud infrastructure.

AWS Inferentia

AWS Inferentia is Amazon's custom chip designed for high-throughput, low-cost machine learning inference on AWS cloud infrastructure.

AMD Instinct

AMD Instinct is AMD's line of data center GPU accelerators for AI training and inference, powered by the ROCm open software platform.

Apple Neural Engine

The Apple Neural Engine is a dedicated NPU in Apple silicon chips that accelerates on-device machine learning for iPhones, iPads, and Macs.

Analog AI Chip

An analog AI chip performs neural network computations using continuous analog signals rather than digital logic, offering potential gains in energy efficiency and speed.

A100 GPU

The NVIDIA A100 is an Ampere-architecture data center GPU designed for AI training and inference, available in 40GB and 80GB HBM2e configurations.

AI Accelerator

An AI accelerator is a specialized hardware device designed to speed up artificial intelligence workloads, including training and inference of machine learning models.

AI Chip Startup

AI chip startups are companies developing novel processor architectures specifically for artificial intelligence, challenging established GPU vendors with specialized designs.

AI Training Infrastructure

AI training infrastructure encompasses all hardware, networking, storage, and software systems required to train machine learning models at scale.

Autocomplete

Search autocomplete uses AI to predict and suggest query completions as users type, speeding up search and guiding users toward effective queries.

Apache Solr

Apache Solr is an open-source enterprise search platform built on Apache Lucene, providing full-text search, faceting, and distributed search capabilities.

Algolia

Algolia is a commercial search-as-a-service platform providing fast, hosted search with AI features, used by thousands of websites and applications.

Analyzer

A search analyzer is a text processing pipeline that transforms raw text into normalized tokens for indexing and searching in search engines.
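As an illustration, a toy analyzer pipeline in Python (hypothetical stopword list): lowercase the text, tokenize it, then filter tokens:

```python
import re

STOPWORDS = {"the", "a", "of", "and"}  # made-up, minimal stopword list

def analyze(text: str) -> list[str]:
    """A minimal search analyzer: lowercase, tokenize, drop stopwords."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())  # normalization + tokenization
    return [t for t in tokens if t not in STOPWORDS]  # token filtering

tokens = analyze("The Art of Search Engines")
# → ["art", "search", "engines"]
```

Production analyzers typically add further stages such as stemming and synonym expansion.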

Apache Lucene

Apache Lucene is an open-source full-text search library written in Java that provides indexing and search capabilities used as the foundation for Elasticsearch and Solr.

A/B Testing for Recommendations

A/B testing for recommendations compares different recommendation algorithms or configurations by randomly assigning users to variants and measuring business outcomes.

Approximate Nearest Neighbor

Approximate nearest neighbor (ANN) search finds vectors most similar to a query vector using index structures that trade a small amount of accuracy for dramatically faster search.

AI-Generated Content

AI-generated content is text, images, audio, video, or other media created by artificial intelligence systems rather than directly by humans.

Article Writing

AI article writing uses language models to research, outline, draft, and refine articles for publications, websites, and content platforms.

AI Art

AI art refers to visual artwork created with the assistance of artificial intelligence, ranging from fully generated images to AI-human collaborative pieces.

AI Music

AI music encompasses all music created with significant AI involvement, from AI-generated compositions to AI-assisted production and mastering.

Avatar Animation

AI avatar animation generates realistic movements, expressions, and lip sync for digital characters from text, audio, or motion capture inputs.

AI Creativity

AI creativity refers to the ability of artificial intelligence systems to produce novel, surprising, and valuable outputs that mirror human creative processes.

AIGC

AIGC (AI-Generated Content) is the umbrella term for any content produced by artificial intelligence, spanning text, images, audio, video, code, and 3D assets.

Article Writing AI

Article writing AI generates long-form journalistic, educational, and informational articles using natural language processing and generation.

Ad Copy Generation

Ad copy generation uses AI to write persuasive advertising text for digital ads, print media, and marketing campaigns across platforms.

Architecture Rendering

AI architecture rendering generates photorealistic visualizations of buildings and spaces from plans, sketches, or descriptions for design and real estate.

Ambient Sound Generation

Ambient sound generation uses AI to create background soundscapes, environmental audio, and atmospheric sounds for relaxation, focus, and media production.

Audiobook Generation

Audiobook generation uses AI text-to-speech to narrate books with natural, expressive voices, making audiobook production faster and more affordable.

Automated Programming

Automated programming uses AI to generate, test, debug, and maintain software with minimal human intervention throughout the development lifecycle.

Animation Generation

Animation generation uses AI to create character animations, motion sequences, and animated content from text descriptions, audio, or motion references.

Artificial Intelligence

Artificial intelligence is the field of computer science focused on creating systems that can perform tasks requiring human-like intelligence.

Artificial General Intelligence

AGI refers to hypothetical AI systems with human-level cognitive abilities across all intellectual tasks, not limited to specific domains.

AGI

AGI is the abbreviation for artificial general intelligence: AI systems with human-level cognitive capabilities across all intellectual domains.

Artificial Superintelligence

Artificial superintelligence is a theoretical AI that surpasses human intelligence across every domain including creativity, problem-solving, and social skills.

Attention Is All You Need

Attention Is All You Need is the landmark 2017 paper that introduced the Transformer architecture, revolutionizing natural language processing and AI.

Ablation Study

An ablation study systematically removes or modifies components of an AI system to understand each component's contribution to overall performance.

arXiv

arXiv is an open-access preprint repository where AI researchers publish papers before peer review, enabling rapid sharing of discoveries.

Artificial Intelligence Research

AI research is the scientific study of building intelligent systems, spanning theory, algorithms, architectures, and empirical evaluation.

Autoregressive Model (Research Perspective)

Autoregressive model research studies models that generate outputs one element at a time, conditioning each on previously generated elements.

Actor-Critic

Actor-critic methods combine a policy network (actor) that selects actions with a value network (critic) that evaluates those actions.

Artificial General Intelligence (Research Perspective)

AGI research investigates the scientific and engineering challenges of creating AI systems with human-level general cognitive abilities.

AI Safety Research

AI safety research studies how to ensure AI systems behave reliably, safely, and in alignment with human values and intentions.

Adversarial Robustness Research

Adversarial robustness research studies how to make AI models resistant to deliberately crafted inputs designed to cause failures.

AI Governance Research

AI governance research studies frameworks, policies, and institutions for ensuring AI development and deployment serves public interest.

API

An API (Application Programming Interface) is a set of rules and protocols that allows different software applications to communicate with each other.

API Key

An API key is a unique identifier used to authenticate and authorize requests to an API, controlling access to its resources.

API Versioning

API versioning is the practice of managing changes to an API while maintaining backward compatibility for existing consumers.

Angular

Angular is a comprehensive TypeScript-based web framework by Google for building large-scale, enterprise-grade applications.

API Gateway

An API gateway is a server that acts as a single entry point for API requests, handling routing, authentication, rate limiting, and aggregation.

API Documentation

API documentation is the reference material that describes how to use an API, including endpoints, parameters, authentication, and example requests.

Astro

Astro is a web framework that builds fast, content-focused websites by shipping zero JavaScript by default and supporting multiple UI frameworks.

API Integration

API integration is the process of connecting two or more applications through their APIs to enable data sharing and coordinated functionality.

API Rate Limit

An API rate limit is a restriction on the number of API requests a client can make within a specified time period.
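A common way to enforce such a limit is the token bucket; a minimal Python sketch with a hypothetical capacity and refill rate:

```python
import time

class TokenBucket:
    """Allow bursts up to `capacity` requests, refilled at `rate` tokens/second."""
    def __init__(self, capacity: float, rate: float):
        self.capacity = capacity
        self.rate = rate
        self.tokens = capacity
        self.last = time.monotonic()

    def allow(self) -> bool:
        now = time.monotonic()
        # Refill tokens in proportion to elapsed time, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False

bucket = TokenBucket(capacity=3, rate=1.0)  # burst of 3, then 1 request/second
results = [bucket.allow() for _ in range(5)]
# First 3 calls pass; the next 2 are rejected until tokens refill.
```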

API Throttling

API throttling is the practice of intentionally slowing down API request processing to manage server load and ensure fair resource distribution.

API Monitoring

API monitoring is the practice of continuously observing API performance, availability, and correctness to detect and resolve issues proactively.

API Testing

API testing is the practice of validating API functionality, reliability, performance, and security through automated and manual test methods.

ANOVA

ANOVA (Analysis of Variance) is a statistical test that compares means across three or more groups to detect significant differences.
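The F statistic behind one-way ANOVA is the ratio of between-group to within-group variance; a plain-Python sketch with made-up sample groups:

```python
def one_way_anova_f(*groups):
    """F = (between-group mean square) / (within-group mean square)."""
    all_vals = [x for g in groups for x in g]
    grand_mean = sum(all_vals) / len(all_vals)
    k, n = len(groups), len(all_vals)
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - sum(g) / len(g)) ** 2 for g in groups for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))

f = one_way_anova_f([1, 2, 3], [2, 3, 4], [5, 6, 7])
# f == 13.0: the third group's mean clearly differs from the others.
```

A large F (compared against the F distribution with k-1 and n-k degrees of freedom) indicates at least one group mean differs.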

Augmented Analytics

Augmented analytics uses AI and machine learning to automate data preparation, insight discovery, and explanation of findings.

Apache Superset

Apache Superset is an open-source data exploration and visualization platform for creating interactive dashboards from SQL-based data sources.

ARIMA

ARIMA (AutoRegressive Integrated Moving Average) is a widely used statistical model for analyzing and forecasting time series data.

A/B Testing

A/B testing is a controlled experiment that compares two variants to determine which performs better on a defined metric.
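For conversion-rate metrics, a standard analysis is the two-proportion z-test; a stdlib-only Python sketch with hypothetical conversion counts:

```python
import math

def two_proportion_z(conv_a, n_a, conv_b, n_b):
    """Z statistic comparing conversion rates of variants A and B."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    p_pool = (conv_a + conv_b) / (n_a + n_b)  # pooled rate under the null
    se = math.sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    return (p_b - p_a) / se

def p_value_two_sided(z):
    """Two-sided p-value via the normal CDF (math.erf)."""
    return 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))

z = two_proportion_z(conv_a=200, n_a=1000, conv_b=250, n_b=1000)
p = p_value_two_sided(z)
# p < 0.05: the lift from 20% to 25% is significant at this sample size.
```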

Anomaly Detection

Anomaly detection identifies data points, events, or patterns that deviate significantly from expected behavior.
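A simple baseline is z-score thresholding; a Python sketch over made-up sensor readings:

```python
import statistics

def zscore_anomalies(values, threshold=3.0):
    """Flag points more than `threshold` standard deviations from the mean."""
    mean = statistics.mean(values)
    stdev = statistics.pstdev(values)
    return [v for v in values if abs(v - mean) / stdev > threshold]

data = [10, 11, 9, 10, 12, 10, 11, 9, 10, 95]  # one obvious outlier
anomalies = zscore_anomalies(data, threshold=2.5)
# → [95]
```

Real systems layer more robust methods (isolation forests, autoencoders) on top of baselines like this.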

Attribution Modeling

Attribution modeling assigns credit for conversions to different marketing touchpoints to understand which channels drive results.

AI Winter

An AI winter is a period of reduced funding and interest in artificial intelligence research; notable winters occurred in the 1970s and late 1980s.

AlexNet Breakthrough

AlexNet was the deep convolutional neural network that won the 2012 ImageNet competition, launching the deep learning era in computer vision.

AlphaGo

AlphaGo is DeepMind's AI system that defeated world Go champion Lee Sedol in 2016, a landmark achievement for deep reinforcement learning.

AlphaGo Zero

AlphaGo Zero learned Go entirely from self-play without human data, surpassing all previous versions and demonstrating pure AI learning.

AlphaFold

AlphaFold is DeepMind's AI system that solved the protein structure prediction problem, one of biology's greatest challenges.

Alan Turing

Alan Turing was the British mathematician who laid the theoretical foundations for computing and artificial intelligence.

Andrej Karpathy

Andrej Karpathy is an AI researcher known for his educational contributions, his leadership of Tesla's Autopilot program, and his work at OpenAI.

Andrew Ng

Andrew Ng is an AI pioneer who democratized AI education through online courses and co-founded Google Brain and Coursera.

AlphaZero

AlphaZero is a DeepMind AI system that mastered chess, shogi, and Go from scratch using only self-play reinforcement learning with no human knowledge.

Ashish Vaswani

Ashish Vaswani is the lead author of the 2017 "Attention Is All You Need" paper that introduced the transformer architecture powering modern AI.

Attention Mechanism Paper

The 2014 attention mechanism paper by Bahdanau et al. introduced the concept of neural attention, enabling models to focus on relevant parts of input sequences.

B

Batch Learning

Batch learning trains models on the entire dataset at once, as opposed to online learning which processes examples incrementally.

Bayesian Network

A Bayesian network is a probabilistic graphical model that represents variables and their conditional dependencies as a directed acyclic graph.

Bias

A bias is a learnable parameter in a neural network neuron that is added to the weighted sum of inputs before the activation function, allowing the neuron to shift its output.

Bidirectional RNN

A bidirectional RNN processes a sequence in both forward and backward directions, capturing context from both past and future elements at each position.

Backpropagation

Backpropagation is the algorithm that computes gradients of the loss function with respect to each parameter by propagating error signals backward through the network.

Backward Pass

A backward pass propagates the loss gradient from the output back through each layer, computing the gradient of the loss with respect to every parameter.

Batch Normalization

Batch normalization normalizes activations across the batch dimension for each feature, stabilizing training and enabling higher learning rates in deep neural networks.
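For a single feature, the computation is standardization across the batch followed by a learned scale and shift; a plain-Python sketch (gamma and beta left at their identity defaults):

```python
import statistics

def batch_norm(feature_values, eps=1e-5, gamma=1.0, beta=0.0):
    """Normalize one feature across the batch, then scale (gamma) and shift (beta)."""
    mean = statistics.mean(feature_values)
    var = statistics.pvariance(feature_values)
    return [gamma * (x - mean) / (var + eps) ** 0.5 + beta for x in feature_values]

normalized = batch_norm([1.0, 2.0, 3.0, 4.0])
# Output has approximately zero mean and unit variance across the batch.
```

During training, gamma and beta are learned per feature; at inference, running estimates of the batch statistics are used instead.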

Base Model

A base model is the raw pre-trained version of a language model before any fine-tuning or alignment, trained only on next-token prediction.

Byte-Pair Encoding

Byte-Pair Encoding (BPE) is a tokenization algorithm that iteratively merges the most frequent pairs of characters or subwords to build a vocabulary.
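One merge iteration can be sketched in a few lines of Python (toy word list; real implementations record the learned merge rules and run over a large corpus):

```python
from collections import Counter

def bpe_merge_step(words):
    """One BPE iteration: find the most frequent adjacent pair, merge it everywhere."""
    pairs = Counter()
    for word in words:
        for a, b in zip(word, word[1:]):
            pairs[(a, b)] += 1
    (a, b), _ = pairs.most_common(1)[0]
    merged = []
    for word in words:
        out, i = [], 0
        while i < len(word):
            if i + 1 < len(word) and (word[i], word[i + 1]) == (a, b):
                out.append(word[i] + word[i + 1])  # fuse the pair into one token
                i += 2
            else:
                out.append(word[i])
                i += 1
        merged.append(out)
    return merged, a + b

words = [list("lower"), list("lowest"), list("low")]
words, merged_token = bpe_merge_step(words)
# Most frequent pair ("l", "o") is merged: "low" becomes ["lo", "w"].
```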

Beam Search

Beam search is a decoding algorithm that explores multiple candidate sequences in parallel, keeping the top-scoring options at each step.
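A toy Python sketch over a hypothetical next-token probability table. Note how beam search finds "a cat" (joint probability 0.4 × 0.9 = 0.36), which greedy decoding would miss by committing to "the" (0.6 × 0.5 = 0.30):

```python
import math

# Hypothetical next-token probabilities: token -> {next_token: probability}
MODEL = {
    "<s>": {"the": 0.6, "a": 0.4},
    "the": {"cat": 0.5, "dog": 0.5},
    "a":   {"cat": 0.9, "dog": 0.1},
    "cat": {"</s>": 1.0},
    "dog": {"</s>": 1.0},
}

def beam_search(beam_width=2, max_len=4):
    """Keep the `beam_width` highest-scoring partial sequences at each step."""
    beams = [(["<s>"], 0.0)]  # (tokens, cumulative log-probability)
    for _ in range(max_len):
        candidates = []
        for tokens, score in beams:
            if tokens[-1] == "</s>":        # finished sequences carry over
                candidates.append((tokens, score))
                continue
            for nxt, p in MODEL[tokens[-1]].items():
                candidates.append((tokens + [nxt], score + math.log(p)))
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_width]
    return beams

best_tokens, best_score = beam_search()[0]
# → ["<s>", "a", "cat", "</s>"] with joint probability 0.36
```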

BOS Token

The Beginning-of-Sequence token is a special token placed at the start of input to signal the beginning of a new text sequence.

Byte-Level BPE

Byte-level BPE is a variant of byte-pair encoding that operates on raw bytes instead of Unicode characters, enabling tokenization of any text without unknown tokens.

BitFit

BitFit is a parameter-efficient fine-tuning method that updates only the bias terms in a pre-trained model, leaving all weight matrices frozen.

Benchmark

A benchmark is a standardized test or dataset used to evaluate and compare language model performance across specific capabilities like reasoning, coding, or knowledge.

Batching

Batching processes multiple inference requests together in a single forward pass to maximize GPU utilization and throughput.

Backpropagation

Backpropagation is the algorithm used to compute gradients during training by propagating error signals backward through the network, enabling weight updates.

BBH

BBH (BIG-Bench Hard) is a curated subset of 23 challenging tasks from BIG-Bench where language models previously performed below average humans.

BIG-Bench

BIG-Bench is a collaborative benchmark with over 200 diverse tasks designed to probe the capabilities and limitations of language models.

BoolQ

BoolQ is a yes/no question answering benchmark using naturally occurring questions paired with Wikipedia passages.

Bootstrap Confidence

Bootstrap confidence intervals estimate the uncertainty in benchmark scores by repeatedly resampling evaluation data.
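A percentile-bootstrap sketch in plain Python over hypothetical per-example pass/fail scores:

```python
import random
import statistics

def bootstrap_ci(scores, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for the mean of per-example evaluation scores."""
    rng = random.Random(seed)
    means = sorted(
        statistics.mean(rng.choices(scores, k=len(scores)))  # resample with replacement
        for _ in range(n_resamples)
    )
    lo = means[int(alpha / 2 * n_resamples)]
    hi = means[int((1 - alpha / 2) * n_resamples) - 1]
    return lo, hi

scores = [1, 0, 1, 1, 0, 1, 1, 1, 0, 1]  # made-up benchmark pass/fail results
lo, hi = bootstrap_ci(scores)
# The wide interval reflects how uncertain a mean of only 10 examples is.
```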

Bag of Words

Bag of Words is a text representation method that models documents as unordered collections of word counts, ignoring grammar and word order.
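A minimal Python sketch using collections.Counter, showing that word order is discarded:

```python
from collections import Counter

def bag_of_words(doc: str) -> Counter:
    """Represent a document as unordered word counts."""
    return Counter(doc.lower().split())

bow = bag_of_words("the cat sat on the mat")
# Same words, any order, same representation:
same = bow == bag_of_words("mat the on sat cat the")
```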

Bigram

A bigram is a pair of consecutive words or tokens from text, used to capture two-word patterns and basic word co-occurrence.

Bi-encoder

A bi-encoder encodes two text inputs independently into vectors, enabling fast similarity search through precomputed embeddings.

Back Translation

Back translation is a technique of translating text to another language and back to create paraphrases or augment training data.

BLEU Score

BLEU is an automatic evaluation metric that measures the quality of machine-generated text by comparing it against human reference texts.
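A deliberately simplified sketch (clipped unigram precision plus brevity penalty; real BLEU averages clipped n-gram precisions for n = 1..4 over a whole corpus):

```python
import math
from collections import Counter

def unigram_bleu(candidate, reference):
    """Simplified BLEU: clipped unigram precision times a brevity penalty."""
    cand, ref = candidate.split(), reference.split()
    overlap = Counter(cand) & Counter(ref)   # counts clipped by the reference
    precision = sum(overlap.values()) / len(cand)
    bp = 1.0 if len(cand) > len(ref) else math.exp(1 - len(ref) / len(cand))
    return bp * precision

score = unigram_bleu("the cat sat on the mat", "the cat is on the mat")
# 5 of 6 candidate words appear in the reference → score 5/6
```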

Bias in NLP

Bias in NLP refers to systematic prejudices in language models and NLP systems that can lead to unfair or discriminatory outputs.

Biomedical NLP

Biomedical NLP applies natural language processing techniques to medical and biological texts for knowledge extraction and clinical applications.

Beam Search

Beam search is a decoding strategy that explores multiple candidate output sequences simultaneously to find higher-quality text generations.

BM25

BM25 is a ranking function used by search engines to score and rank documents based on their relevance to a query.
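A compact plain-Python sketch of the scoring formula over a toy corpus (the exact IDF smoothing varies between implementations):

```python
import math

def bm25_score(query_terms, doc, corpus, k1=1.5, b=0.75):
    """BM25 relevance of `doc` to the query, given the full `corpus` (token lists)."""
    n = len(corpus)
    avgdl = sum(len(d) for d in corpus) / n          # average document length
    score = 0.0
    for term in query_terms:
        df = sum(term in d for d in corpus)          # document frequency
        idf = math.log((n - df + 0.5) / (df + 0.5) + 1)
        tf = doc.count(term)                         # term frequency in this doc
        score += idf * tf * (k1 + 1) / (tf + k1 * (1 - b + b * len(doc) / avgdl))
    return score

corpus = [["cheap", "flights", "to", "rome"],
          ["rome", "city", "guide"],
          ["cheap", "hotels"]]
scores = [bm25_score(["cheap", "flights"], d, corpus) for d in corpus]
# The first document matches both query terms and ranks highest.
```

k1 controls term-frequency saturation and b controls length normalization.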

Brute Force Search

Brute force search compares a query vector against every vector in the database to find exact nearest neighbors, providing perfect accuracy at the cost of speed.
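A plain-Python sketch using squared Euclidean distance over toy 2-D vectors; the O(n)-per-query scan is what approximate indexes exist to avoid:

```python
def nearest_neighbor(query, vectors):
    """Exact nearest neighbor: compare the query against every stored vector."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(vectors)), key=lambda i: sq_dist(query, vectors[i]))

vectors = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
idx = nearest_neighbor((0.9, 1.2), vectors)
# → 1: (1.0, 1.0) is the closest stored vector
```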

BGE

BGE (BAAI General Embedding) is a family of open-source embedding models from BAAI that achieve state-of-the-art performance on retrieval benchmarks.

Bi-encoder

A bi-encoder is a retrieval architecture that independently encodes queries and documents into vectors, enabling fast search by pre-computing document embeddings.

BM25

BM25 is a probabilistic ranking function that scores document relevance based on term frequency and inverse document frequency, serving as the standard baseline for keyword search.

Binary Quantization

An aggressive compression method that represents each vector dimension as a single bit, enabling extremely fast search with minimal memory usage.
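
A toy illustration of the idea (real systems pack bits into machine words and use hardware popcount): each dimension keeps only its sign, and similarity search reduces to counting differing bits.

```python
def binarize(vec):
    # Keep only the sign of each dimension: positive -> 1, otherwise 0.
    # A 1024-dim float32 vector (4 KB) shrinks to 1024 bits (128 bytes).
    return [1 if x > 0 else 0 for x in vec]

def hamming(a, b):
    # Distance between binary vectors = number of differing bits.
    return sum(x != y for x, y in zip(a, b))
```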

Ball Tree

A tree-based data structure for organizing points in multi-dimensional space, enabling efficient nearest neighbor search by partitioning space into nested hyperspheres.

BGE-M3

A versatile open-source embedding model supporting multiple languages, retrieval modes (dense, sparse, and multi-vector), and input lengths up to 8192 tokens.

Browser Agent

An AI agent that controls a web browser to perform tasks, interacting with web pages through clicks, typing, scrolling, and navigation just as a human would.

Botpress

An open-source conversational AI platform for building, deploying, and managing chatbots with visual flow design and LLM integration.

BabyAGI

A minimalist autonomous agent framework that maintains a task list, prioritizes tasks, executes them, and creates new tasks based on results.

Blackboard System

A multi-agent architecture where agents independently contribute to a shared workspace (blackboard), building up a solution incrementally through collaborative problem-solving.

Bot Avatar

A bot avatar is the visual image or icon that represents the chatbot in the chat interface, displayed alongside its messages.

Bot Name

A bot name is the display name given to a chatbot that appears in the chat header and alongside its messages.

Bot Personality

Bot personality is the defined set of traits, tone, and communication style that shapes how a chatbot interacts with users.

Branching Logic

Branching logic creates multiple conversation paths in a chatbot flow, directing users down different routes based on their choices or data.

Bulk Upload

Bulk upload enables uploading multiple files or data records to a chatbot platform at once, streamlining initial setup and content updates.

Bot Detection

Bot detection identifies and blocks automated scripts or bots that abuse a chatbot system through spam, scraping, or denial-of-service attacks.

Bias Detection

Methods and tools for identifying unfair patterns in AI system outputs, training data, or decision-making processes before they cause harm.

Bias Mitigation

Techniques and strategies for reducing or correcting unfair bias in AI systems, applied during data collection, model training, or post-processing of outputs.

Bias Audit

A systematic assessment of an AI system for unfair biases, evaluating data, model behavior, and outcomes across demographic groups and protected characteristics.

Black Box Model

An AI model whose internal decision-making process is opaque and not directly understandable by humans, producing outputs without transparent reasoning.

Batch Inference

Batch inference processes large volumes of data through an ML model in bulk, typically as a scheduled job, rather than handling individual requests in real time.

BentoML

BentoML is an open-source framework for building, shipping, and scaling AI applications, providing tools to package models into production-ready services with any framework.

Blue-Green Deployment

Blue-green deployment is a release strategy that runs two identical production environments, allowing instant switching between the current (blue) and new (green) version of an ML model.

Batch Processing for ML

Batch processing for ML runs model predictions on large datasets in bulk, optimizing for throughput and cost rather than latency for offline or scheduled workloads.

BLIP

BLIP (Bootstrapping Language-Image Pre-training) is a vision-language model that can understand and generate text about images through captioning, VQA, and image-text matching.

BLIP-2

BLIP-2 is an efficient vision-language model that bridges frozen image encoders and language models using a lightweight Querying Transformer (Q-Former).

Background Removal

Background removal uses AI to automatically separate foreground subjects from their background in images, producing clean cutouts or transparent backgrounds.

Bark

Bark is an open-source text-to-audio model from Suno that generates highly expressive speech with laughter, breathing, music, and sound effects alongside spoken words.

Batch Transcription

Batch transcription processes pre-recorded audio files asynchronously, converting them to text without real-time constraints.

Build vs Buy AI

The build vs buy decision evaluates whether an organization should develop custom AI solutions in-house or purchase from vendors, considering cost, time, expertise, and strategic value.

Bottom-Up Adoption

Bottom-up adoption is when individual employees or teams adopt a product independently, creating grassroots demand that eventually drives organizational purchase decisions.

Brand Voice AI

Brand voice AI ensures that AI-generated content and conversations maintain a consistent brand identity across all customer touchpoints.

B-Tree Index

A B-tree index is a self-balancing tree data structure that maintains sorted data for efficient insertion, deletion, and search operations in databases.

Batch Processing

Batch processing is a data processing paradigm that collects data over a period and processes it all at once in a single job, suitable for high-volume, latency-tolerant workloads.

BigQuery

Google BigQuery is a serverless, highly scalable cloud data warehouse designed for fast SQL analytics over large datasets with a pay-per-query pricing model.

Backup and Recovery

Backup and recovery encompasses the strategies, tools, and procedures for creating database copies and restoring data after loss, corruption, or disaster.

Bayes' Theorem

Bayes' theorem describes how to update the probability of a hypothesis based on new evidence, providing the mathematical foundation for Bayesian inference and learning from data.
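
The theorem in code, using the classic diagnostic-test example (the numbers here are illustrative, not from any real test): even a 99%-sensitive test yields mostly false positives when the condition is rare.

```python
def posterior(prior, likelihood, false_positive_rate):
    # Bayes' theorem:
    # P(H|E) = P(E|H) P(H) / [P(E|H) P(H) + P(E|not H) P(not H)]
    evidence = likelihood * prior + false_positive_rate * (1 - prior)
    return likelihood * prior / evidence

# 1% base rate, 99% sensitivity, 5% false-positive rate:
p = posterior(0.01, 0.99, 0.05)   # about 0.17 — a positive result is still usually wrong
```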

Bayesian Inference

Bayesian inference is a statistical method that updates probability estimates as new evidence arrives, using prior knowledge combined with observed data to compute posterior beliefs.

Bernoulli Distribution

The Bernoulli distribution models a single binary outcome (success/failure) with a fixed probability, the simplest probability distribution used in classification and dropout.

Binomial Distribution

The binomial distribution models the number of successes in a fixed number of independent yes/no trials with constant success probability.

Beta Distribution

The beta distribution is defined on [0, 1] and is commonly used as a prior distribution for probabilities in Bayesian inference.

Bayes Optimal Classifier

The Bayes optimal classifier achieves the lowest possible error rate by choosing the class with highest posterior probability for each input.

Bias-Variance Tradeoff

The bias-variance tradeoff is the fundamental tension between model simplicity (high bias) and model flexibility (high variance) in machine learning.

Bayesian Optimization

Bayesian optimization is a sequential strategy for optimizing expensive black-box functions using a probabilistic surrogate model.

Botpress

Botpress is an open-source chatbot development platform that provides tools for building, deploying, and managing AI-powered conversational agents with LLM integration.

Bolt

Bolt is an AI-powered web development platform by StackBlitz that generates full-stack web applications from natural language prompts in the browser.

Baseten

Baseten is an ML infrastructure platform for deploying and serving machine learning models with GPU-optimized inference and autoscaling.

BERTopic

BERTopic is a topic modeling library that leverages transformer embeddings and clustering to discover coherent topics in text collections with better results than traditional methods.

BentoML

BentoML is an open-source framework for serving, managing, and deploying machine learning models as production-ready API endpoints with minimal infrastructure code.

BentoCloud

BentoCloud is a managed platform by BentoML for deploying and scaling AI models and applications, providing serverless GPU inference with automatic scaling.

Basket Analysis

AI basket analysis discovers product purchase patterns and associations to optimize merchandising and cross-selling strategies.

Biotech AI

Biotech AI uses machine learning to accelerate biological research, drug development, and biomanufacturing processes.

Brand Sentiment Monitoring

AI brand sentiment monitoring tracks public opinion about brands and products across social media and online platforms.

Battery AI

Battery AI uses machine learning to optimize battery design, manufacturing, management, and second-life applications.

Blockchain AI

Blockchain AI combines distributed ledger technology with machine learning for transparent, decentralized AI applications.

Biomarker Discovery

Biomarker discovery uses AI to identify measurable biological indicators that can diagnose diseases, predict outcomes, or guide treatment decisions.

B200

The NVIDIA B200 is a next-generation data center GPU based on the Blackwell architecture, offering major performance gains for AI training and inference.

B100 GPU

The NVIDIA B100 is a Blackwell-architecture GPU designed as a PCIe-compatible option for data centers seeking next-generation AI performance without infrastructure changes.

B200 GPU

The NVIDIA B200 is the flagship Blackwell-architecture GPU delivering up to 20 petaflops of FP4 AI performance for next-generation training and inference.

Batch Processing

Batch processing in AI hardware refers to processing multiple inputs simultaneously on a GPU or accelerator, maximizing throughput and hardware utilization.

Boolean Search

Boolean search uses logical operators (AND, OR, NOT) to combine search terms, giving users precise control over query construction.
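
With an inverted index mapping terms to document IDs, the three operators map directly onto set operations; the index below is a hypothetical toy corpus:

```python
# Inverted index: term -> set of document ids (toy example).
index = {
    "python": {1, 2, 3},
    "snake":  {1, 4},
    "code":   {2, 3, 5},
}
all_docs = {1, 2, 3, 4, 5}

and_result = index["python"] & index["code"]   # "python AND code" -> {2, 3}
or_result  = index["python"] | index["snake"]  # "python OR snake" -> {1, 2, 3, 4}
not_result = index["python"] - index["snake"]  # "python NOT snake" -> {2, 3}
not_python = all_docs - index["python"]        # "NOT python"      -> {4, 5}
```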

BM25

BM25 (Best Matching 25) is a probabilistic ranking algorithm used by search engines to score document relevance based on term frequency and document length.

Bi-Encoder Ranking

A bi-encoder independently encodes queries and documents into vectors, enabling fast similarity-based retrieval from large collections.

BERT Ranking

BERT ranking uses BERT language models to understand the semantic relationship between queries and documents, dramatically improving search relevance over keyword-based methods.

Blog Writing

AI blog writing uses language models to draft, outline, and assist in creating blog posts and articles for content marketing and publishing.

Background Removal

AI background removal uses computer vision to automatically detect and separate foreground subjects from their backgrounds in images.

Blog Writing AI

Blog writing AI generates blog posts and articles optimized for readability, engagement, and search engines using large language models.

Book Cover Generation

Book cover generation uses AI to design professional book covers, incorporating typography, imagery, and genre conventions for publishing.

Beat Generation

Beat generation uses AI to create drum patterns, rhythmic loops, and beat compositions for music production across genres like hip-hop, electronic, and pop.

Bug Detection AI

Bug detection AI uses machine learning to identify potential bugs, vulnerabilities, and code defects before they cause runtime failures.

Bug Fixing AI

Bug fixing AI automatically identifies the root cause of software bugs and generates corrective code patches to resolve defects.

Bias-Variance Tradeoff

The bias-variance tradeoff is the fundamental tension in ML between model simplicity (bias) and sensitivity to training data (variance).

Bitter Lesson

The Bitter Lesson is Rich Sutton's observation that general methods leveraging computation (search and learning) have historically outperformed approaches using human knowledge.

Benchmark

A benchmark is a standardized evaluation dataset or task used to measure and compare the performance of different AI models and methods.

Bias-Variance Tradeoff (Research Perspective)

The bias-variance tradeoff is a fundamental research concept describing the tension between model simplicity and flexibility in generalization.

Benchmark (Research Methodology)

Benchmark research develops standardized tests and datasets for measuring and comparing AI system capabilities across different methods.

Bearer Token

A bearer token is an authentication credential sent in HTTP headers that grants access to whoever possesses it.

Bun

Bun is a fast all-in-one JavaScript runtime, bundler, test runner, and package manager designed for speed and developer experience.

Base URL

A base URL is the root address of an API or website from which all endpoint paths are constructed.

Bar Chart

A bar chart uses rectangular bars of varying lengths to compare values across different categories or groups.

Bayesian Inference

Bayesian inference is a statistical approach that updates probability estimates as new evidence is observed, using prior knowledge and data.

Bootstrap

Bootstrap is a statistical resampling technique that estimates the distribution of a statistic by repeatedly sampling with replacement from the observed data.
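
A minimal percentile-bootstrap sketch (fixed seed for reproducibility; real analyses use more resamples and often bias-corrected intervals):

```python
import random

def bootstrap_ci(data, stat, n_resamples=2000, alpha=0.05, seed=0):
    # Recompute the statistic on many resamples drawn with replacement,
    # then take percentiles of the resulting distribution.
    rng = random.Random(seed)
    stats = sorted(
        stat([rng.choice(data) for _ in range(len(data))])
        for _ in range(n_resamples)
    )
    lo = stats[int(n_resamples * alpha / 2)]
    hi = stats[int(n_resamples * (1 - alpha / 2)) - 1]
    return lo, hi

mean = lambda xs: sum(xs) / len(xs)
lo, hi = bootstrap_ci([2, 4, 4, 5, 7, 9], mean)  # 95% CI for the mean
```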

Batch Analytics

Batch analytics processes large volumes of accumulated data at scheduled intervals rather than in real time.

Box Plot

A box plot displays the distribution of numerical data through quartiles, showing the median, spread, and potential outliers.

Bubble Chart

A bubble chart extends scatter plots by adding a third dimension represented by the size of each data point.

Bonferroni Correction

The Bonferroni correction adjusts significance thresholds when performing multiple statistical tests to control the overall false positive rate.

Business Intelligence

Business intelligence (BI) encompasses the technologies, practices, and strategies used to collect, integrate, analyze, and present business data.

Benchmarking

Benchmarking compares an organization's metrics and practices against industry standards, competitors, or internal historical performance.

Backpropagation Discovery

The popularization of backpropagation in 1986 made it practical to train multi-layer neural networks, reviving connectionism and paving the way for deep learning.

BERT Release

BERT (Bidirectional Encoder Representations from Transformers), released by Google in 2018, revolutionized NLP by introducing bidirectional pre-training of language models.

C

Continual Learning

Continual learning enables AI models to learn from new data over time without forgetting previously learned knowledge, addressing the catastrophic forgetting problem.

Curriculum Learning

Curriculum learning trains models by presenting training examples in a meaningful order, typically from easy to hard, mimicking how humans learn progressively.

Contrastive Learning

Contrastive learning trains models by learning which data pairs are similar and which are different, creating useful representations without labeled data.

Classification

Classification is a supervised learning task where the model predicts which category or class an input belongs to, such as spam detection or image recognition.

Clustering

Clustering is an unsupervised learning task that groups similar data points together without predefined labels, discovering natural structures in data.

CatBoost

CatBoost is a gradient boosting library by Yandex that handles categorical features natively and uses ordered boosting to reduce overfitting.

Cross-Validation

Cross-validation is a model evaluation technique that partitions data into multiple folds, training and testing on different splits to get a robust performance estimate.

Class Imbalance

Class imbalance occurs when training data has significantly more examples of some classes than others, causing models to be biased toward the majority class.

Cross-Entropy Loss

Cross-entropy loss measures the difference between predicted probability distributions and true labels, the standard loss function for classification tasks.

Confusion Matrix

A confusion matrix is a table showing counts of true positives, false positives, true negatives, and false negatives for evaluating classification models.
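
For the binary case, the four counts can be tallied directly; this sketch assumes labels encoded as 1 (positive) and 0 (negative):

```python
def confusion_matrix(y_true, y_pred):
    # Tally the four outcome types for binary labels.
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))  # hit
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))  # false alarm
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))  # correct rejection
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))  # miss
    return {"tp": tp, "fp": fp, "tn": tn, "fn": fn}
```

Metrics like precision (tp / (tp + fp)) and recall (tp / (tp + fn)) fall out of these counts.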

Convergence

Convergence is the point during training where model performance stabilizes and additional training iterations yield diminishing improvements.

Connection

A connection in a neural network is a weighted link between two neurons that transmits the output of one neuron as input to another.

Convolutional Neural Network

A convolutional neural network (CNN) is a deep learning architecture designed for processing grid-like data such as images, using learnable filters to detect spatial patterns.

Convolution

Convolution is the mathematical operation at the core of CNNs, where a small filter slides across input data to produce a feature map that highlights detected patterns.

Causal Attention

Causal attention is a masked form of self-attention that prevents each position from attending to future positions, ensuring autoregressive generation.

Cross-Attention

Cross-attention is an attention mechanism where queries come from one sequence and keys and values come from a different sequence, enabling information transfer between modalities.

Conditional GAN

A conditional GAN extends the standard GAN by providing additional information like class labels to both the generator and discriminator, enabling controlled generation.

Classifier-Free Guidance

Classifier-free guidance is a technique that improves conditional generation quality by combining conditional and unconditional model predictions, amplifying the effect of the conditioning signal.

ConvNeXt

ConvNeXt modernizes the standard CNN by incorporating design choices from transformers, achieving competitive performance with pure convolutions.

CutMix

CutMix replaces a rectangular patch of one training image with a patch from another, mixing labels proportionally to the area.

Consistency Model

Consistency models enable single-step image generation by learning to map any point on the diffusion trajectory directly to the clean image.

Context Window

The context window is the maximum amount of text (measured in tokens) that an AI model can process in a single request, including both input and output.

Chat Model

A chat model is a language model optimized for multi-turn conversational interactions, maintaining context across back-and-forth exchanges.

ChatGPT

ChatGPT is a conversational AI product by OpenAI built on GPT models, designed for interactive dialogue with users through a web and mobile interface.

Claude

Claude is a family of AI assistants developed by Anthropic, designed with a focus on safety, helpfulness, and harmlessness using constitutional AI techniques.

Code Model

A code model is a language model specifically trained or fine-tuned on source code to excel at code generation, completion, debugging, and explanation.

Contrastive Search

Contrastive search is a decoding method that balances token probability with diversity by penalizing tokens too similar to previously generated ones.

Chain-of-Thought

Chain-of-thought prompting encourages a language model to show its reasoning step by step, significantly improving accuracy on complex tasks.

Constitutional AI

Constitutional AI (CAI) is a training approach by Anthropic where an AI model is guided by a set of principles rather than relying solely on human feedback.

Continued Pre-training

Continued pre-training extends the original pre-training process on domain-specific data, giving the model deep knowledge in a specialized area.

Context Extension

Context extension refers to techniques that increase a model's pre-trained context window beyond its original training length without full retraining.

Continuous Batching

Continuous batching is an inference optimization that dynamically adds and removes requests from a processing batch as they complete, maximizing GPU utilization.

Chinchilla Scaling

Chinchilla scaling refers to the optimal ratio of model parameters to training tokens, showing most models were under-trained relative to their size.

Character-Level Tokenization

A tokenization approach that treats each individual character as a separate token, producing long sequences but requiring no vocabulary training.

Causal Language Modeling

A pre-training objective where the model learns to predict the next token given all previous tokens, used by GPT-style generative models.

Context Length

The number of tokens a model can process in a single forward pass, synonymous with context window size.

Compute-Optimal Training

The practice of balancing model size and training data to achieve the best performance for a given compute budget, as described by scaling laws.

Claude 3 Haiku

The fastest and most compact model in Anthropic's Claude 3 family, optimized for speed and cost-efficiency in high-volume applications.

Claude 3 Sonnet

The balanced mid-tier model in Anthropic's Claude 3 family, offering strong performance with good speed and reasonable cost.

Claude 3 Opus

The most capable model in Anthropic's Claude 3 family, excelling at complex reasoning, nuanced analysis, and sophisticated generation tasks.

Command R

Cohere's retrieval-optimized language model designed for enterprise RAG applications with strong multilingual support and long context.

Command R+

The more powerful variant in Cohere's Command R family, offering stronger reasoning and generation while maintaining RAG optimization.

Context Caching

A feature that caches the processed input context across multiple requests, reducing latency and cost for repeated prompts with shared prefixes.

Catastrophic Forgetting

A phenomenon where fine-tuning a model on new data causes it to lose previously learned knowledge and capabilities.

Chain-of-Thought Reasoning

The explicit step-by-step reasoning process that models use to work through complex problems, improving accuracy on math, logic, and analysis tasks.

Cross-Entropy

The primary loss function used in language model training that measures how well the predicted probability distribution matches the true distribution of next tokens.

Context Stuffing

The practice of filling the context window with as much relevant information as possible to maximize the model's ability to generate accurate responses.

Chunking

The process of splitting documents into smaller pieces for embedding and retrieval in RAG systems, balancing context size and retrieval precision.

Cosine Similarity

A mathematical measure of similarity between two vectors based on the angle between them, widely used to compare embeddings in semantic search.
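
In code, cosine similarity is the dot product of the two vectors divided by the product of their magnitudes; a pure-Python sketch (libraries vectorize this over whole embedding matrices):

```python
import math

def cosine_similarity(a, b):
    # cos(theta) = (a . b) / (|a| * |b|)
    # 1 = same direction, 0 = orthogonal, -1 = opposite.
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm
```

Because only the angle matters, scaling a vector leaves its similarity to any other vector unchanged.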

Chatbot Arena

Chatbot Arena is a crowdsourced platform where users compare anonymous LLM responses side-by-side, producing Elo-based rankings.

CommonsenseQA

CommonsenseQA is a benchmark of multiple-choice questions that require everyday common-sense knowledge to answer correctly.

Contamination

Contamination occurs when benchmark test data leaks into model training data, inflating evaluation scores beyond genuine capability.

Ceiling Effect

A ceiling effect occurs when a benchmark becomes too easy for top models, losing its ability to differentiate between them.

Cross-Attention

Cross-attention is an attention mechanism where queries come from one sequence and keys/values come from another, enabling cross-sequence information flow.

Causal Mask

A causal mask is a triangular attention mask that prevents each token from attending to subsequent tokens, enabling autoregressive generation.
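
The mask is simply a lower-triangular matrix; a sketch with 1 meaning "may attend" and 0 meaning "blocked" (implementations typically use −inf in the blocked positions before the softmax):

```python
def causal_mask(n):
    # Position i may attend only to positions j <= i.
    return [[1 if j <= i else 0 for j in range(n)] for i in range(n)]

# causal_mask(3):
# [[1, 0, 0],
#  [1, 1, 0],
#  [1, 1, 1]]
```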

Common Crawl

Common Crawl is a publicly available web archive containing petabytes of raw web data, serving as the primary source for LLM pre-training.

CulturaX

CulturaX is a massive multilingual dataset covering 167 languages, designed for training language models with broad language coverage.

Cost per Token

Cost per token is the price charged for each token processed by a language model API, typically different for input and output tokens.

CPU Inference

CPU inference runs language model computations on a CPU rather than a GPU, enabling deployment without specialized hardware at reduced speed.

Chatbot (LLM-Powered)

An LLM-powered chatbot uses large language models to understand natural language and generate contextual, human-like conversational responses.

Code Assistant

A code assistant is an AI tool powered by language models that helps developers write, debug, explain, and review code.

Code Reasoning

Code reasoning is the ability of language models to understand, analyze, debug, and logically reason about programming code.

Coreference Resolution

Coreference resolution is the NLP task of determining which words or phrases in a text refer to the same real-world entity.

CBOW

CBOW (Continuous Bag of Words) is a Word2Vec architecture that predicts a target word from its surrounding context words to learn embeddings.

Cross-encoder

A cross-encoder is a model that processes two text inputs together to produce a relevance score, providing high accuracy but slower than bi-encoders.

Conditional Text Generation

Conditional text generation produces text that is guided by specific input conditions such as a prompt, topic, style, or structured data.

Controlled Generation

Controlled generation is the technique of guiding AI text generation to follow specific constraints on style, topic, sentiment, or other attributes.

Conversational QA

Conversational QA handles question answering within a multi-turn dialogue, tracking context and references across conversation turns.

Case Folding

Case folding is the text preprocessing step of converting all characters to a uniform case, typically lowercase, to reduce vocabulary variation.

Contraction Expansion

Contraction expansion is the text preprocessing step of converting contracted words like "don't" and "I'm" into their full forms "do not" and "I am."

Constrained Generation

Constrained generation produces text that satisfies specific requirements such as including certain words, following a format, or meeting length limits.

Constituency Parsing

Constituency parsing breaks a sentence into nested sub-phrases (constituents) organized into a hierarchical tree structure.

Chunking

Chunking, or shallow parsing, groups words into meaningful phrases like noun phrases and verb phrases without building a full parse tree.

Causal Language Modeling

Causal language modeling trains a model to predict the next token in a sequence, forming the basis of autoregressive text generation.

Corpus

A corpus is a large, structured collection of text used for training, evaluating, and studying NLP models and linguistic patterns.

Cross-Lingual Transfer

Cross-lingual transfer applies NLP knowledge learned from one language to improve performance on tasks in other languages.

Code-Switching

Code-switching is the practice of alternating between two or more languages within a conversation or sentence, posing unique challenges for NLP.

Conversational AI

Conversational AI enables computers to engage in natural, human-like dialogue through a combination of NLP, dialogue management, and generation.

Chatbot Intent Classification

Chatbot intent classification determines what a user wants to accomplish from their message, routing the conversation appropriately.

Context Window

A context window is the maximum amount of text a language model can process at once, determining how much conversation history it can consider.

Collocation Extraction

Collocation extraction identifies word combinations that occur together more frequently than expected by chance, like "strong coffee" or "make a decision."

Cloze Test

A cloze test evaluates language understanding by requiring a model to predict missing words or phrases removed from a passage.

Controllable Text Generation

Controllable text generation steers model output to match desired attributes like topic, style, sentiment, or formality level.

Conversation History

Conversation history is the record of previous messages in a dialogue that provides context for understanding current user messages.

Chain-of-Thought Prompting

Chain-of-thought prompting instructs language models to show their step-by-step reasoning process, improving accuracy on complex tasks.

Context-Aware NLP

Context-aware NLP systems consider surrounding text, conversation history, and situational context when processing language.

Coreference Chain

A coreference chain links all mentions in a text that refer to the same entity, connecting names, pronouns, and descriptions.

Cross-Document NLP

Cross-document NLP analyzes relationships, entities, and events across multiple documents rather than within a single text.

Claim Detection

Claim detection identifies statements in text that make verifiable assertions, distinguishing claims from opinions, questions, and other content.

Collocation

A collocation is a combination of words that frequently occur together and convey meaning beyond their individual parts, such as "strong coffee" or "make a decision."

Commonsense Reasoning

Commonsense reasoning is the AI ability to understand and apply everyday knowledge about the physical and social world that humans take for granted.

Cross-Document Coreference

Cross-document coreference identifies when entity or event mentions in different documents refer to the same real-world entity or event.

Chunking

The process of breaking large documents into smaller, meaningful segments for AI processing, enabling more effective retrieval and generation.

Corrective RAG

A RAG approach that evaluates retrieved documents for relevance and triggers corrective actions like web search or query refinement when retrieval quality is poor.

Chroma

An open-source embedding database designed for simplicity, making it easy to build AI applications with embeddings by providing a developer-friendly API.

Cohere Embed v3

Cohere's third-generation embedding model that supports over 100 languages and provides specialized search and classification embedding types.

CLIP

Contrastive Language-Image Pre-training is an OpenAI model that learns to connect text and images in a shared embedding space, enabling cross-modal search.

ColBERT

A retrieval model that produces per-token embeddings and uses late interaction for efficient yet fine-grained matching between queries and documents.

Cross-encoder

A model that jointly encodes a query-document pair to produce a relevance score, offering higher accuracy than bi-encoders but at significantly greater computational cost.

Cosine Similarity

A metric that measures the angle between two vectors, determining how similar their directions are regardless of magnitude, widely used for comparing embeddings.

Cosine Distance

The complement of cosine similarity (1 minus cosine similarity), measuring how different two vectors are, where 0 means identical direction and 2 means opposite.

Chunk Overlap

A technique where consecutive chunks share some overlapping text at their boundaries to prevent important context from being lost at split points.
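
A sliding-window sketch of the idea, assuming token-level chunking (the size and overlap values below are illustrative; production chunkers also respect sentence and paragraph boundaries):

```python
def chunk_with_overlap(tokens, size=200, overlap=50):
    # Each window starts (size - overlap) tokens after the previous one,
    # so consecutive chunks share `overlap` tokens at their boundary.
    step = size - overlap
    return [tokens[i:i + size] for i in range(0, max(len(tokens) - overlap, 1), step)]
```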

Cross-encoder Reranking

A re-ranking approach that uses a cross-encoder model to jointly score query-document pairs, providing more accurate relevance judgments than bi-encoder similarity.

ConceptNet

A commonsense knowledge graph connecting words and phrases with labeled relationships, capturing everyday knowledge that AI systems need to understand language.

Context Precision

A RAG evaluation metric measuring what proportion of the retrieved context is actually relevant to answering the user's question.

Context Recall

A RAG evaluation metric measuring what proportion of the information needed to answer a question was successfully retrieved from the knowledge base.
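
With relevance labels already known, both metrics are simple set ratios; a sketch (real evaluators such as RAGAS typically use an LLM judge to decide relevance, which this glosses over):

```python
def context_precision(retrieved, relevant):
    """Fraction of retrieved chunks that are actually relevant."""
    retrieved, relevant = set(retrieved), set(relevant)
    return len(retrieved & relevant) / len(retrieved) if retrieved else 0.0

def context_recall(retrieved, relevant):
    """Fraction of the needed (relevant) chunks that were retrieved."""
    retrieved, relevant = set(retrieved), set(relevant)
    return len(retrieved & relevant) / len(relevant) if relevant else 0.0
```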

ColBERTv2

An improved version of ColBERT that uses residual compression to drastically reduce the storage requirements of multi-vector retrieval while maintaining quality.

Code Chunking

A specialized chunking method for source code that splits along syntactic boundaries like functions, classes, and modules to preserve code structure.

Contextual Chunking

A technique that enriches each chunk with surrounding context or document-level summaries so chunks remain meaningful when retrieved in isolation.

Chunk Metadata

Structured information attached to each chunk such as source document, page number, section heading, and creation date, used for filtering and context.

Cohere Rerank

Cohere's neural re-ranking API that scores query-document relevance using a cross-encoder model, dramatically improving retrieval precision in RAG pipelines.

Cognitive Agent

An AI agent modeled on human cognitive processes, incorporating perception, reasoning, learning, memory, and decision-making in an integrated architecture.

Conversational Agent

An AI agent specialized in natural language dialogue, maintaining context across multiple turns and engaging in coherent, helpful conversations with users.

Coding Agent

An AI agent that can write, modify, test, and debug code autonomously, often integrated with development tools and version control systems.

Computer-use Agent

An AI agent that can operate a computer desktop, using the mouse, keyboard, and screen like a human user to interact with any software application.

CrewAI

A framework for orchestrating multiple AI agents that work together as a team, with each agent having a defined role, tools, and objectives.

Cursor

An AI-powered code editor that integrates LLM capabilities directly into the development workflow, offering intelligent code completion, editing, and chat.

Constrained Decoding

A technique that restricts the tokens a language model can generate at each step, ensuring outputs conform to a specific format, grammar, or schema.
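
A toy sketch of the mechanism: at each step, mask out every candidate the grammar disallows, then pick greedily from what remains. Both `logits_per_step` (a stand-in for a real model's per-step scores) and `allowed_tokens_fn` (a stand-in for a grammar or schema checker) are hypothetical:

```python
def constrained_greedy_decode(logits_per_step, allowed_tokens_fn):
    """Greedy decoding where tokens outside the allowed set are masked out."""
    out = []
    for scores in logits_per_step:
        allowed = allowed_tokens_fn(out)
        # keep only grammar-permitted candidates
        candidates = {t: s for t, s in scores.items() if t in allowed}
        if not candidates:
            break  # the grammar offers no valid continuation
        out.append(max(candidates, key=candidates.get))
    return out
```

For example, constraining output to digits discards an otherwise higher-scoring token at every step.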

Consensus Mechanism

A method for multiple AI agents to agree on a decision, answer, or course of action when they have different perspectives or conflicting outputs.
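
The simplest consensus mechanism is a majority vote over the agents' answers; a minimal sketch:

```python
from collections import Counter

def majority_vote(answers):
    """Return the answer most agents agree on, plus the fraction
    of agents supporting it (a simple agreement signal)."""
    counts = Counter(answers)
    answer, votes = counts.most_common(1)[0]
    return answer, votes / len(answers)
```

Richer schemes weight votes by agent confidence or have a judge model arbitrate.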

Conversation Memory

The storage and management of conversation history that enables an AI agent to maintain context across multiple turns in a dialogue.

Chain

A sequence of linked LLM calls or operations where each step's output feeds into the next, used for building complex AI applications from simple components.
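
Stripped of framework machinery, a chain is just function composition, with each step standing in for an LLM call or transformation:

```python
def make_chain(*steps):
    """Compose steps so each step's output feeds the next."""
    def run(value):
        for step in steps:
            value = step(value)
        return value
    return run
```

Usage: `make_chain(retrieve, build_prompt, call_llm)` would run the three steps in order, each consuming the previous output (the step names are illustrative).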

Checkpoint

A saved snapshot of an agent's execution state that enables resuming interrupted tasks, time-travel debugging, and human review of agent decisions.

Callback

A function that is automatically called when a specific event occurs during agent execution, enabling logging, monitoring, and custom handling of agent operations.
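
A minimal event/callback registry shows the pattern; the event names are illustrative:

```python
class CallbackManager:
    """Registers functions to be invoked when named events fire,
    e.g. logging every tool call an agent makes."""
    def __init__(self):
        self.handlers = {}

    def on(self, event, fn):
        self.handlers.setdefault(event, []).append(fn)

    def emit(self, event, payload):
        for fn in self.handlers.get(event, []):
            fn(payload)
```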

Cost Tracking

Monitoring and recording the financial costs of AI agent operations, including LLM API calls, embedding generation, tool use, and other billable resources.

Creative Agent

An AI agent specialized in generating creative content such as writing, design concepts, marketing copy, or brainstorming ideas.

CrewAI Agent

An agent defined within the CrewAI framework, designed to collaborate with other agents in a crew with defined roles, goals, and backstories.

Cline Agent

An AI coding agent that operates within VS Code, autonomously creating and editing files, running commands, and using the browser to complete development tasks.

Chatbot

A chatbot is a software application that simulates human conversation, ranging from simple rule-based bots to sophisticated AI-powered assistants.

Conversational AI

Conversational AI is the technology that enables machines to engage in natural, human-like dialogue through understanding and generating language.

Customer Support Bot

A customer support bot automates customer service interactions, handling inquiries, troubleshooting, and routing complex issues to human agents.

Chat Widget

A chat widget is an embeddable interface element, typically a floating button and panel, that enables chatbot interaction on websites.

Chat Bubble

A chat bubble is the visual container for individual messages in a chat interface, styled differently for user and bot messages.

Conversation Flow

A conversation flow is the designed path and logic of a chatbot interaction, defining how the bot guides users through dialogues.

Conversation Context

Conversation context is the accumulated information from previous messages that informs how a chatbot interprets and responds to new inputs.

Conversation History

Conversation history is the stored record of all messages exchanged between a user and chatbot, enabling context continuity and analytics.

Call Deflection

Call deflection is the strategy of redirecting phone support calls to digital channels like chatbots, reducing call center volume and costs.

Chatbot Analytics

Chatbot analytics tracks and measures chatbot performance through metrics like resolution rate, satisfaction, engagement, and conversation patterns.

Customer Satisfaction

Customer satisfaction (CSAT) in the chatbot context measures how satisfied users are with their chatbot interaction experience.

Chatbot Persona

A chatbot persona is the designed personality, voice, and character traits that define how a chatbot communicates with users.

Conversation Starter

Conversation starters are predefined prompts or buttons that help users begin chatbot interactions with common topics or questions.

Chat Window

A chat window is the visual container that displays the conversation between a user and a chatbot or live agent.

Chat Launcher

A chat launcher is the button or trigger element that opens the chat widget interface when clicked by a user.

Chat Header

A chat header is the top section of a chat interface displaying the bot's identity, status, and window controls.

Chat Footer

A chat footer is the bottom section of a chat interface containing the message input field and action controls.

Chat Input

A chat input is the text field where users type messages to send to a chatbot or live agent.

Chat Session

A chat session is a bounded period of interaction between a user and a chatbot, from the initial message to session expiration or closure.

Conversation Thread

A conversation thread is a linked sequence of messages that form a complete topical exchange within a chat.

Conversation Start

Conversation start is the initial phase of a chat interaction where the bot greets the user and establishes the conversation context.

Conversation Resume

Conversation resume is the ability to continue a previous chat conversation from where it left off, preserving context and history.

Conversation Transfer

Conversation transfer is the process of moving an active chat from one agent or bot to another while preserving conversation context.

Conversation Merge

Conversation merge combines multiple separate chat conversations from the same user into a unified thread for continuity.

Clarification Question

A clarification question is a query posed by the chatbot to resolve ambiguity in a user message before providing a response.

Confirmation Prompt

A confirmation prompt asks the user to verify information or approve an action before the chatbot proceeds with a task.

Conversation Topic

A conversation topic is the subject or theme being discussed in a particular segment of a chat interaction.

Context Switching

Context switching is the adjustment of conversational context when a user changes topics or the conversation enters a new phase.

Conversation State

Conversation state is the complete set of data representing the current status and context of an ongoing chat interaction.

Conversation Memory

Conversation memory is the ability of a chatbot to retain and recall information from previous interactions with the same user.

Conversation Summary

A conversation summary is a condensed recap of a chat interaction capturing the key points, decisions, and outcomes.

Custom Channel

A custom channel is a bespoke integration that connects a chatbot to a proprietary or non-standard communication platform.

Custom Branding

Custom branding is the ability to personalize a chatbot interface with your own colors, logo, fonts, and visual identity.

Chat Theme

A chat theme is a predefined set of visual styles applied to a chat interface, controlling colors, typography, spacing, and overall appearance.

Color Scheme

A color scheme in chat design is the coordinated palette of colors used across all elements of the chat interface.

Custom CSS

Custom CSS allows fine-grained styling control over a chat widget by injecting custom stylesheets that override default styles.

Carousel

A carousel is a horizontally scrollable set of cards in a chat message, used to display multiple items like products, plans, or options.

Confidence Score

A confidence score is a numerical value indicating how certain the AI system is about its interpretation or response.

Confidence Threshold

A confidence threshold is the minimum confidence score required for a chatbot to deliver a response rather than falling back to an alternative action.
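
The logic is a single comparison; the threshold and fallback message below are illustrative:

```python
def respond(answer, confidence, threshold=0.7,
            fallback="Let me connect you with a human agent."):
    """Deliver the bot's answer only when confidence clears the
    threshold; otherwise fall back to an alternative action."""
    return answer if confidence >= threshold else fallback
```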

Co-Browsing

Co-browsing allows a support agent to view and optionally control a customer's web browser session to provide guided assistance.

Conversation Analytics

Conversation analytics is the systematic analysis of chat interaction data to measure performance, identify patterns, and drive improvements.

Containment Rate

Containment rate is the percentage of conversations fully handled by the chatbot without any human agent involvement.

Completion Rate

Completion rate is the percentage of chat conversations where users successfully complete their intended goal or task.
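
Both rates are simple ratios over a conversation log; the `escalated` and `goal_met` fields are an illustrative schema:

```python
def containment_rate(conversations):
    """Share of conversations handled without human escalation."""
    if not conversations:
        return 0.0
    return sum(1 for c in conversations if not c["escalated"]) / len(conversations)

def completion_rate(conversations):
    """Share of conversations where the user reached their goal."""
    if not conversations:
        return 0.0
    return sum(1 for c in conversations if c["goal_met"]) / len(conversations)
```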

CSAT

CSAT (Customer Satisfaction Score) is a metric that measures customer satisfaction as a percentage of positive responses to a satisfaction survey.

Cost per Conversation

Cost per conversation is the average expense incurred for each chat interaction, including AI processing, infrastructure, and human agent costs.

Conversation Designer

A conversation designer is a professional or tool that crafts the dialogue structure, personality, and user experience of chatbot interactions.

Chatbot Template

A chatbot template is a pre-built conversation flow and configuration that can be customized for a specific use case or industry.

Chatbot Marketplace

A chatbot marketplace is a platform where users can browse, purchase, and install pre-built chatbot templates, plugins, and integrations.

Chatbot Plugin

A chatbot plugin is an add-on module that extends chatbot functionality with new features, integrations, or capabilities.

Chatbot API

A chatbot API provides programmatic access to chatbot functionality, enabling custom integrations and headless chatbot deployments.

Chatbot SDK

A chatbot SDK is a software development kit providing libraries and tools for integrating chatbot functionality into applications.

Chatbot Testing

Chatbot testing validates that a chatbot responds correctly, handles edge cases, and provides a good user experience across all scenarios.

Conversation Testing

Conversation testing validates multi-turn chatbot interactions to ensure flows, context handling, and transitions work correctly end-to-end.

Click Trigger

A click trigger opens the chatbot or sends a specific message when the user clicks a designated element on the page.

Conditional Logic

Conditional logic enables chatbots to make decisions and branch conversation paths based on user responses, data, or external conditions.

Custom Attribute

A custom attribute is a user-defined data field that stores additional information about visitors or conversations beyond standard properties.

Conversation Tag

A conversation tag is a label applied to chat conversations for categorization, filtering, and analytics purposes.

Conversation Label

A conversation label is a classification marker used to organize and filter chatbot conversations by type, status, or business category.

Conversation Priority

Conversation priority ranks chatbot conversations by urgency to ensure the most critical issues receive attention first.

Chatbot Training

Chatbot training is the process of teaching a chatbot to respond accurately by providing it with knowledge, examples, and behavioral guidelines.

Conversation Log

A conversation log is a complete record of all messages exchanged between users and the chatbot, used for analysis and improvement.

Conversation Export

Conversation export allows downloading chatbot conversation data in various formats for external analysis, compliance, or backup purposes.

Chatbot Import

Chatbot import enables loading external data into a chatbot platform, including knowledge bases, configurations, and training data.

CSV Import

CSV import loads structured data from comma-separated value files into a chatbot platform for FAQ lists, product data, or user information.

Chatbot Security

Chatbot security encompasses the measures protecting chatbot systems and user data from unauthorized access, attacks, and data breaches.

CAPTCHA (Chatbot)

CAPTCHA integration with chatbots verifies that interactions are from real humans rather than automated bots, preventing spam and abuse.

Chatbot Pricing

Chatbot pricing models define how chatbot platforms charge for their services, typically based on messages, conversations, or feature tiers.

Conversation Credit

A conversation credit is a unit of chatbot billing where each chat session consumes one credit regardless of the number of messages exchanged.

Chatbot Trial

A chatbot trial is a time-limited free period to evaluate a chatbot platform before committing to a paid subscription.

Chatbot Demo

A chatbot demo is a guided presentation of a chatbot platform's capabilities, often personalized to the prospect's specific use case and industry.

Chatbot Onboarding

Chatbot onboarding is the guided process of setting up and configuring a new chatbot, from initial setup to first deployment.

Chatbot Migration

Chatbot migration is the process of moving a chatbot from one platform to another, transferring knowledge, configuration, and conversation history.

Corrigibility

The property of an AI system that allows it to be safely corrected, modified, or shut down by its operators without resisting or circumventing these interventions.

Counterfactual Explanation

An explanation that describes the smallest change to the input that would result in a different AI decision, showing what would need to be different.

CCPA

The California Consumer Privacy Act gives California residents rights over their personal data including the right to know, delete, and opt out of data sales.

Content Moderation

The practice of monitoring and filtering AI-generated content to prevent harmful, inappropriate, or policy-violating outputs from reaching users.

Content Filtering

Automated systems that detect and block specific types of content in AI inputs and outputs, such as profanity, violence, hate speech, or sensitive information.

Content Provenance

Systems that track and verify the origin and edit history of digital content, establishing an authenticated chain of custody from creation to consumption.

C2PA

The Coalition for Content Provenance and Authenticity is an industry standard for certifying the origin and edit history of digital media through cryptographic credentials.

Capability Control

Safety measures that limit what an AI system can do by restricting its access to tools, information, resources, and communication channels.

Calibration Fairness

A fairness criterion requiring that when an AI system assigns a confidence score, the actual accuracy should be the same across all demographic groups.

Counterfactual Fairness

A fairness criterion requiring that an AI decision would remain the same if the individual had belonged to a different demographic group, all else being equal.

Concept-Based Explanation

An explainability approach that explains model decisions in terms of human-understandable concepts rather than individual input features.

Content Authenticity

The practice of verifying and proving the origin, creation method, and modification history of digital content, especially to distinguish real from AI-generated media.

CI/CD for ML

CI/CD for ML extends continuous integration and delivery practices to machine learning, automating testing, validation, and deployment of both code and models.

Continuous Training

Continuous training is an MLOps practice where models are automatically retrained on new data at regular intervals or when triggered by data drift detection.

CUDA

CUDA is NVIDIA's parallel computing platform and API that allows developers to use GPUs for general-purpose computing, forming the foundation of modern ML frameworks.

Compute Cluster

A compute cluster is a group of interconnected servers or accelerators working together to handle large-scale ML training and inference workloads that exceed single-machine capacity.

Canary Deployment

Canary deployment gradually routes a small percentage of traffic to a new model version, monitoring for issues before full rollout, reducing the risk of deploying degraded models.

Concept Drift

Concept drift is a change in the underlying relationship between input features and the target variable over time, making a model's learned patterns obsolete.


Continuous Evaluation

Continuous evaluation is the practice of automatically and regularly assessing a deployed model against fresh data and updated benchmarks to detect performance changes.

Continuous Monitoring

Continuous monitoring is the practice of constantly observing ML system health, model performance, data quality, and resource usage in production environments.

cuDNN

cuDNN (CUDA Deep Neural Network library) is a GPU-accelerated library of primitives for deep neural networks, providing optimized implementations of common DNN operations.

Cold Start in ML

Cold start in ML refers to the delay when a model serving instance starts up, including loading model weights, initializing frameworks, and performing warm-up before serving predictions.

Covariate Shift

Covariate shift is a type of data drift where the input feature distribution changes between training and production, while the relationship between features and labels remains the same.

Cost Monitoring for ML

Cost monitoring for ML tracks and optimizes the expenses associated with ML infrastructure, including compute, storage, data transfer, and API costs.

Cerebras Cloud

Cerebras Cloud provides AI inference and training using the Cerebras Wafer-Scale Engine, the largest chip ever built, designed for extreme-scale AI compute.

Continuous Batching

Continuous batching dynamically adds new inference requests to an active batch as existing requests complete, maximizing GPU utilization for LLM serving.

Checkpointing

Checkpointing periodically saves the state of an ML training run, including model weights, optimizer state, and training progress, enabling resumption after interruptions.

Computer Vision

Computer vision is a field of AI that enables machines to interpret and understand visual information from images and videos, mimicking human visual perception.

CLIP

CLIP (Contrastive Language-Image Pre-training) by OpenAI learns to connect images and text descriptions, enabling zero-shot image classification and text-to-image search.

ControlNet

ControlNet adds spatial conditioning to diffusion models, enabling precise control over image generation through edge maps, depth maps, poses, and other structural guides.

Cross-modal Learning

Cross-modal learning trains models to transfer knowledge between modalities, such as using text supervision to improve visual representations or generating one modality from another.

Claude Vision

Claude Vision refers to the visual understanding capabilities of Anthropic's Claude models, enabling image analysis, document comprehension, and visual reasoning.

Chart Understanding

Chart understanding enables AI models to interpret data visualizations like bar charts, line graphs, pie charts, and scatter plots, extracting data and insights.

Cross-Modal Retrieval

Cross-modal retrieval searches for content in one modality using a query from a different modality, such as finding images using text descriptions.

Convolutional Neural Network (CNN)

A CNN is a neural network architecture that uses convolutional layers to automatically learn spatial hierarchies of visual features from images.

Crowd Counting

Crowd counting estimates the number of people in an image or video, typically by predicting density maps that indicate person locations and concentrations.

Contrastive Learning for Vision

Contrastive learning trains vision models by pulling similar image pairs closer and pushing dissimilar pairs apart in embedding space, without labeled data.

Color Space

A color space is a mathematical model that defines how colors are represented numerically, with different spaces suited for different computer vision and imaging tasks.

Conversational IVR

Conversational IVR replaces traditional phone menu trees with natural language voice interaction, allowing callers to state their needs in natural speech.

Call Transcription

Call transcription converts phone call audio into text transcripts, typically including speaker separation, timestamps, and additional analysis like sentiment and topics.

Call Summarization

Call summarization uses AI to generate concise summaries of phone conversations, capturing key topics, action items, decisions, and customer sentiment.

Conformer ASR

Conformer is a speech recognition architecture that combines convolution and transformer layers to capture both local and global audio patterns.

CTC Decoding

CTC (Connectionist Temporal Classification) is a training and decoding technique for speech recognition that handles variable-length alignment between audio and text.
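
The core of greedy CTC decoding is two collapse rules: merge consecutive repeated labels, then drop blanks. A minimal sketch:

```python
def ctc_greedy_decode(frame_labels, blank="-"):
    """Collapse a per-frame label sequence the CTC way: merge runs of
    the same label, then remove blank symbols, yielding the text."""
    out = []
    prev = None
    for label in frame_labels:
        if label != prev and label != blank:
            out.append(label)
        prev = label
    return "".join(out)
```

The blank symbol is what lets CTC represent genuinely doubled letters: "l-l" decodes to "ll" while "ll" collapses to a single "l".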

Coqui TTS

Coqui TTS is an open-source text-to-speech toolkit offering multiple TTS architectures and pre-trained models for research and production use.

Call Analytics

Call analytics uses AI to extract insights from phone conversations, analyzing content, sentiment, compliance, and performance metrics.

Call Scoring

Call scoring uses AI to automatically evaluate customer service and sales calls against defined criteria, providing quality scores and feedback.

Credit-based Pricing

Credit-based pricing provides a virtual currency that customers purchase upfront and spend on various AI services, offering flexibility across different features and usage types.

Cost per Conversation

Cost per conversation measures the average cost of a single chatbot or customer support conversation, used to evaluate AI chatbot economics against human agents.

Cost per Resolution

Cost per resolution measures the average cost to fully resolve a customer issue, accounting for multi-touch interactions and escalations in the total cost calculation.

Customer Acquisition Cost

Customer Acquisition Cost (CAC) is the total cost of acquiring a new customer, including marketing, sales, and onboarding expenses, used to evaluate growth efficiency.

Customer Lifetime Value

Customer Lifetime Value (LTV/CLV) estimates the total revenue a business can expect from a single customer over the entire duration of their relationship.

Churn Rate

Churn rate measures the percentage of customers or revenue lost over a given period, indicating retention health and predicting long-term business sustainability.

Conversion Rate

Conversion rate measures the percentage of visitors, users, or leads that take a desired action, such as signing up, subscribing, or completing a purchase.

CSAT

CSAT (Customer Satisfaction Score) measures how satisfied customers are with a specific interaction, product, or experience, typically through a post-interaction survey.

Customer Experience

Customer experience (CX) is the overall perception customers form across all interactions with a business, from initial awareness through purchase, support, and retention.

Customer Journey

The customer journey maps all touchpoints and experiences a customer goes through from initial awareness to purchase, onboarding, ongoing use, and advocacy.

Customer Support

Customer support assists customers with questions, issues, and requests through various channels, increasingly augmented or automated by AI chatbots and tools.

Contact Center

A contact center manages customer interactions across multiple channels (phone, chat, email, social media), increasingly using AI for automation, routing, and agent assistance.

Collaborative Filtering

Collaborative filtering is a recommendation technique that predicts user preferences by finding patterns in the behavior of similar users or similar items.

Customer Segmentation

Customer segmentation divides customers into groups based on shared characteristics, behaviors, or needs, enabling targeted marketing and personalized experiences.

Conversational Marketing

Conversational marketing uses AI chatbots and messaging to engage prospects in real-time dialogue, qualifying leads and accelerating the sales cycle through conversation.

Code Assistant

A code assistant is an AI tool that helps developers write, debug, and understand code through suggestions, completions, explanations, and automated code generation.

Consumption-based Pricing

Consumption-based pricing charges customers based on actual resource usage rather than fixed subscriptions, aligning costs directly with the value received from AI services.

Cost per Lead

Cost per lead measures the average expense of generating a single qualified sales lead through AI-powered marketing and chatbot interactions.

Customer Touchpoint

A customer touchpoint is any interaction between a customer and a business, which AI can optimize, automate, or personalize across the entire customer journey.

Customer Engagement

Customer engagement measures the depth and quality of interactions between customers and a brand, which AI enhances through personalization, proactive outreach, and conversational experiences.

Customer Retention

Customer retention is the ability of a business to keep customers over time, which AI improves through better support, personalization, and proactive churn prevention.

Customer Loyalty

Customer loyalty is the ongoing preference and emotional commitment customers have toward a brand, which AI strengthens through personalized and consistently excellent experiences.

Customer Success

Customer success is a proactive business function ensuring customers achieve their desired outcomes, increasingly powered by AI to scale personalized guidance and support.

Contact Center AI

Contact center AI transforms traditional call and contact centers with intelligent virtual agents, real-time agent assist, sentiment analysis, and workforce optimization.

Call Center AI

Call center AI specifically enhances voice-based customer service with intelligent IVR, voice bots, real-time transcription, and call analytics.

Case Management

Case management organizes and tracks complex customer issues that require multiple interactions, steps, or teams to resolve, enhanced by AI for intelligent routing and automation.

Content Recommendation

Content recommendation engines use AI to suggest relevant articles, products, videos, or resources to users based on their behavior, preferences, and similar user patterns.

Customer Segmentation with AI

Customer segmentation with AI uses machine learning to automatically group customers based on behavior, value, and needs, enabling more targeted and effective business strategies.

Content Generation for Business

Content generation for business uses AI to create marketing copy, blog posts, social media content, product descriptions, and other business communications at scale.

Copywriting AI

Copywriting AI generates persuasive marketing copy, ad text, landing pages, and sales content using AI models trained on effective copy patterns.

Contract AI

Contract AI uses artificial intelligence to analyze, review, draft, and manage contracts, accelerating legal workflows while reducing risk and cost.

Compliance AI

Compliance AI automates regulatory compliance monitoring, reporting, and risk assessment, helping organizations meet legal requirements more efficiently and consistently.

Cybersecurity AI

Cybersecurity AI uses artificial intelligence to detect, prevent, and respond to cyber threats, enhancing security operations with automated threat detection and incident response.

Customer Journey AI

Customer journey AI uses artificial intelligence to map, analyze, and optimize the complete customer journey from awareness through purchase to advocacy.

Chatbot Analytics

Chatbot analytics measures and analyzes chatbot performance, user behavior, and conversation quality to optimize AI chatbot effectiveness and business outcomes.

Conversational Commerce

Conversational commerce uses AI chatbots and messaging to enable shopping, purchasing, and customer service through natural conversations rather than traditional web interfaces.

Customer Data Platform

A customer data platform (CDP) unifies customer data from multiple sources into a single profile, enabling AI-powered personalization, segmentation, and customer intelligence.

Conversational AI Platform

A conversational AI platform provides the tools and infrastructure to build, deploy, and manage AI chatbots and virtual agents across multiple channels and use cases.

Chatbot ROI

Chatbot ROI measures the financial return of chatbot investments by comparing automation savings, revenue impact, and customer experience improvements against total chatbot costs.

Customer Effort Score

Customer Effort Score (CES) measures how much effort customers must expend to get their issues resolved, complete tasks, or interact with a business, with lower effort indicating a better experience.

Community-Led Growth

Community-led growth leverages a community of users, developers, or enthusiasts to drive product awareness, adoption, and retention through peer interactions and shared learning.

Conversation Design

Conversation design is the practice of crafting natural, effective dialogue flows for AI chatbots and virtual assistants that guide users to successful outcomes.

Chatbot Persona Design

Chatbot persona design creates a consistent personality, communication style, and character for AI chatbots that aligns with brand identity and user expectations.

Customer Health Score

A customer health score is a composite metric that predicts customer retention and growth potential by combining usage, engagement, satisfaction, and behavioral data.

Cross-Sell AI

Cross-sell AI uses machine learning to identify opportunities to sell complementary products or services to existing customers based on their behavior and needs.

Cassandra

Apache Cassandra is a distributed NoSQL database designed for handling large amounts of data across many servers with high availability and no single point of failure.

ClickHouse

ClickHouse is a fast, open-source columnar database designed for online analytical processing (OLAP), capable of processing billions of rows in real time.

Common Table Expression

A Common Table Expression (CTE) is a temporary named result set defined within a SQL statement, improving readability and enabling recursive queries.
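
A recursive CTE in action, run against an in-memory SQLite database (SQLite supports `WITH RECURSIVE`); this one generates the numbers 1 through 5:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
rows = conn.execute("""
    WITH RECURSIVE counter(n) AS (
        SELECT 1                                -- anchor member
        UNION ALL
        SELECT n + 1 FROM counter WHERE n < 5   -- recursive member
    )
    SELECT n FROM counter
""").fetchall()
nums = [r[0] for r in rows]
```

Non-recursive CTEs use the same `WITH name AS (...)` syntax simply to name an intermediate result set for readability.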

CSV

CSV (Comma-Separated Values) is a simple text file format that stores tabular data with each row on a new line and values separated by commas.
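
A quick illustration with Python's standard csv module; note that every parsed value comes back as a string.

```python
import csv
import io

data = "name,score\nada,91\ngrace,87\n"

# DictReader uses the first row as column headers.
rows = list(csv.DictReader(io.StringIO(data)))
print(rows[0]["name"])   # ada
print(rows[1]["score"])  # '87' -- values are strings until you convert them
```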

Change Data Capture

Change Data Capture (CDC) is a technique that identifies and captures changes made to data in a database, enabling real-time data synchronization and event-driven architectures.

Column-Family Store

A column-family store is a NoSQL database that organizes data into column families, optimizing read and write performance for large-scale analytical and distributed workloads.

Cloud Database

A cloud database is a database service hosted and managed on a cloud platform, offering elastic scaling, automated backups, and reduced operational overhead.

CockroachDB

CockroachDB is a distributed SQL database designed for global scalability and resilience, providing ACID transactions across multiple nodes and regions.

CouchDB

CouchDB is a document-oriented NoSQL database that uses JSON documents, HTTP API access, and a multi-master replication protocol designed for reliability and offline-first applications.

Connection Pooling

Connection pooling reuses a set of pre-established database connections across application requests, reducing the overhead of repeatedly opening and closing connections.

Caching Strategy

A caching strategy defines when and how data is stored in a fast-access cache to reduce database load, lower latency, and improve application response times.

CRUD Operations

CRUD stands for Create, Read, Update, and Delete, the four basic operations for persistent data storage that form the foundation of database interaction.
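
All four operations in one sketch, using Python's built-in sqlite3 with an illustrative `users` table.

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")

db.execute("INSERT INTO users (name) VALUES (?)", ("ada",))        # Create
name = db.execute("SELECT name FROM users WHERE id = 1").fetchone()[0]  # Read
db.execute("UPDATE users SET name = ? WHERE id = 1", ("grace",))   # Update
updated = db.execute("SELECT name FROM users WHERE id = 1").fetchone()[0]
db.execute("DELETE FROM users WHERE id = 1")                       # Delete
remaining = db.execute("SELECT COUNT(*) FROM users").fetchone()[0]
print(name, updated, remaining)  # ada grace 0
```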

Connection String

A connection string is a formatted text string containing the parameters needed to establish a connection to a database, including host, port, credentials, and options.
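
Because most connection strings follow URL syntax, the standard library can pull the pieces apart. The host, user, and database names below are made up for illustration.

```python
from urllib.parse import urlparse

# A hypothetical PostgreSQL-style connection string.
cs = "postgresql://app_user:s3cret@db.example.com:5432/sales?sslmode=require"
parts = urlparse(cs)

print(parts.hostname)  # db.example.com
print(parts.port)      # 5432
print(parts.path)      # /sales  (the database name)
print(parts.query)     # sslmode=require
```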

Conditional Probability

Conditional probability is the probability of an event occurring given that another event has already occurred, forming the basis of Bayesian reasoning and sequential prediction.
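
The defining formula is P(A|B) = P(A and B) / P(B). A worked example with a fair six-sided die:

```python
# A = "roll is even", B = "roll is greater than 3".
outcomes = range(1, 7)
p_b = sum(1 for x in outcomes if x > 3) / 6                        # 3/6
p_a_and_b = sum(1 for x in outcomes if x > 3 and x % 2 == 0) / 6   # {4, 6} -> 2/6
p_a_given_b = p_a_and_b / p_b
print(p_a_given_b)  # 0.666... -- knowing the roll is high raises P(even)
```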

Covariance

Covariance measures the joint variability of two random variables, indicating whether they tend to increase together, decrease together, or vary independently.

Correlation

Correlation is a standardized measure of the linear relationship between two variables, ranging from -1 (perfect negative) to +1 (perfect positive), with 0 indicating no linear relationship.
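
Pearson correlation is just the covariance rescaled by both standard deviations, which a few lines of plain Python make concrete:

```python
import math

def pearson(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    # Covariance: average product of deviations from each mean.
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / n
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs) / n)
    sy = math.sqrt(sum((y - my) ** 2 for y in ys) / n)
    return cov / (sx * sy)

print(pearson([1, 2, 3, 4], [2, 4, 6, 8]))  # 1.0  (perfect positive)
print(pearson([1, 2, 3, 4], [8, 6, 4, 2]))  # -1.0 (perfect negative)
```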

Categorical Distribution

The categorical distribution models a single trial with k possible outcomes, each with its own probability, used for multi-class classification and language model token prediction.

Convex Optimization

Convex optimization deals with minimizing convex functions over convex sets, guaranteeing that any local minimum is the global minimum and enabling efficient, reliable solutions.

Cross-Entropy

Cross-entropy measures the difference between two probability distributions, serving as the standard loss function for training classification models and language models.
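
For a one-hot true label, cross-entropy reduces to the negative log probability the model assigned to the correct class:

```python
import math

def cross_entropy(true_dist, predicted_dist):
    # H(p, q) = -sum_i p_i * log(q_i); skip terms where p_i = 0.
    return -sum(p * math.log(q)
                for p, q in zip(true_dist, predicted_dist) if p > 0)

# True class is index 1; the model assigns it probability 0.7.
loss = cross_entropy([0, 1, 0], [0.1, 0.7, 0.2])
print(loss)  # 0.3566... = -log(0.7); a confident correct model scores near 0
```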

Confidence Interval

A confidence interval is a range of values that likely contains the true population parameter, providing a measure of estimate precision alongside point estimates.
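
A sketch of a 95% interval for a sample mean using the normal approximation (for small samples like this one, a t critical value would be more appropriate than 1.96):

```python
import math
import statistics

sample = [4.8, 5.1, 5.0, 4.9, 5.2, 5.0, 4.7, 5.3]
mean = statistics.mean(sample)
se = statistics.stdev(sample) / math.sqrt(len(sample))  # standard error

low, high = mean - 1.96 * se, mean + 1.96 * se
print(f"95% CI: ({low:.3f}, {high:.3f})")  # a range around the mean of 5.0
```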

Cholesky Decomposition

Cholesky decomposition factors a symmetric positive definite matrix into the product of a lower triangular matrix and its transpose.
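
A minimal pure-Python implementation for small matrices; production code would use a numerical library instead.

```python
import math

def cholesky(A):
    """Return lower-triangular L such that A = L * L^T (A must be SPD)."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                L[i][j] = math.sqrt(A[i][i] - s)   # diagonal entry
            else:
                L[i][j] = (A[i][j] - s) / L[j][j]  # below-diagonal entry
    return L

L = cholesky([[4.0, 2.0], [2.0, 3.0]])
print(L)  # [[2.0, 0.0], [1.0, 1.414...]] -- and L * L^T rebuilds the input
```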

Condition Number

The condition number measures how sensitive a matrix computation is to input perturbations, indicating numerical stability.

Cumulative Distribution Function

A cumulative distribution function (CDF) gives the probability that a random variable takes a value less than or equal to a given point.
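
For the normal distribution, the CDF can be written with the standard library's error function:

```python
import math

def normal_cdf(x, mu=0.0, sigma=1.0):
    # P(X <= x) for a normal random variable, via the error function.
    return 0.5 * (1 + math.erf((x - mu) / (sigma * math.sqrt(2))))

print(normal_cdf(0))               # 0.5 -- half the mass lies below the mean
print(round(normal_cdf(1.96), 3))  # 0.975 -- basis of the 95% interval
```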

Covariance (Mathematics)

Covariance measures the joint variability of two random variables, indicating whether they tend to increase or decrease together.

Correlation (Mathematics)

Correlation is a normalized measure of the linear relationship between two variables, ranging from -1 to 1.

Central Limit Theorem

The central limit theorem states that the sum of many independent random variables is approximately normally distributed, regardless of the original distribution.
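
A quick simulation makes the theorem tangible: each uniform draw is flat, yet sums of 30 draws pile up around their expected value of 15.

```python
import random
import statistics

random.seed(42)  # fixed seed so the simulation is reproducible
sums = [sum(random.random() for _ in range(30)) for _ in range(5000)]

# Individually uniform, but the sums cluster around 30 * 0.5 = 15
# in a roughly bell-shaped distribution.
print(statistics.mean(sums))
```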

Chi-Squared Distribution

The chi-squared distribution is the distribution of the sum of squared standard normal variables, used extensively in statistical testing.

Convex Function

A convex function curves upward such that the line segment between any two points on its graph lies above the graph, ensuring any local minimum is global.

Cross-Entropy (Mathematics)

Cross-entropy measures the average number of bits needed to encode data from distribution p using a code optimized for distribution q.

Coding Theory

Coding theory studies efficient and reliable encoding of information, providing the theoretical foundation for data compression and error correction in ML systems.

Cosine Similarity

Cosine similarity measures the cosine of the angle between two vectors, ranging from -1 to 1, widely used for comparing embeddings in NLP and recommendation systems.
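
The computation is a dot product divided by the two vector norms, shown here in plain Python:

```python
import math

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

print(cosine_similarity([1, 0], [0, 1]))  # 0.0 -- orthogonal, unrelated
print(cosine_similarity([1, 2], [2, 4]))  # 1.0 -- same direction, any length
```

Because only the angle matters, two embeddings of very different magnitudes can still score as highly similar.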

Chain Rule

The chain rule computes the derivative of a composite function, forming the mathematical basis of backpropagation in neural networks.
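
A numeric check of the rule on f(x) = sin(x²), whose derivative is the outer derivative times the inner one: cos(x²) · 2x.

```python
import math

def f(x):
    return math.sin(x ** 2)

def f_prime(x):
    # Chain rule: derivative of sin(u) is cos(u) * du/dx, with u = x^2.
    return math.cos(x ** 2) * 2 * x

x = 0.7
numeric = (f(x + 1e-6) - f(x - 1e-6)) / 2e-6  # central-difference estimate
print(abs(numeric - f_prime(x)) < 1e-6)       # True -- they agree
```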

Convexity

Convexity is a property of sets and functions ensuring that any local optimum is a global optimum, simplifying optimization analysis.

Conjugate Prior

A conjugate prior is a prior distribution that, when combined with a particular likelihood, produces a posterior distribution of the same family.

Convergence

Convergence describes when a sequence of values approaches a limit, applicable to optimization algorithms, statistical estimators, and series in ML.

Convolution (Mathematics)

Convolution is a mathematical operation combining two functions to produce a third, fundamental to signal processing and convolutional neural networks.
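
A 1-D discrete convolution in plain Python: the kernel slides across the signal, and each output is a weighted sum of nearby inputs, which is exactly what a CNN layer computes with learned kernel weights.

```python
def convolve(signal, kernel):
    n, m = len(signal), len(kernel)
    out = [0.0] * (n + m - 1)
    for i, s in enumerate(signal):
        for j, k in enumerate(kernel):
            out[i + j] += s * k  # accumulate the weighted contributions
    return out

# A two-tap averaging kernel smooths the signal:
print(convolve([1, 2, 3, 4], [0.5, 0.5]))  # [0.5, 1.5, 2.5, 3.5, 2.0]
```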

Cohere

Cohere is an AI company specializing in enterprise NLP solutions, offering language models, embeddings, and retrieval systems designed for business applications.

Character AI

Character AI is a platform for creating and interacting with AI characters, using large language models fine-tuned for conversational roleplay and entertainment.

Cerebras

Cerebras builds wafer-scale AI processors, the largest chips ever made, designed for training and running large AI models with unprecedented speed and efficiency.

ChatGPT

ChatGPT is OpenAI's AI chatbot powered by large language models, which brought conversational AI to mainstream adoption and set the standard for AI assistants.

ChatGPT Plus

ChatGPT Plus is OpenAI's premium subscription tier that provides access to the latest AI models, faster response times, and advanced features like image generation and browsing.

ChatGPT Enterprise

ChatGPT Enterprise is OpenAI's business-grade offering with enhanced security, longer context windows, admin controls, and no usage caps for organizations.

Claude.ai

Claude.ai is Anthropic's consumer-facing AI assistant application, providing access to the Claude family of models through a chat interface with document analysis capabilities.

Claude Pro

Claude Pro is Anthropic's premium subscription for Claude.ai, providing higher usage limits, priority access to the latest Claude models, and extended capabilities.

Copilot

Copilot is a general term for AI assistants embedded in software products, most notably Microsoft Copilot and GitHub Copilot, that augment user capabilities within existing tools.

Cursor

Cursor is an AI-first code editor built on VS Code that deeply integrates AI assistance for code generation, editing, debugging, and codebase understanding.

Codeium

Codeium is a free AI coding assistant providing autocomplete, chat, and search capabilities across 70+ programming languages and 40+ IDE integrations.

Cerebras (Company)

Cerebras Systems builds the world's largest AI chip, the Wafer-Scale Engine, designed to dramatically accelerate AI model training and inference.

CoreWeave

CoreWeave is a specialized cloud provider offering GPU-accelerated infrastructure purpose-built for AI, machine learning, and high-performance computing workloads.

Claude API

The Claude API provides programmatic access to Anthropic's Claude models, enabling developers to integrate AI capabilities into applications and workflows.

Canva AI

Canva AI refers to the suite of artificial intelligence features integrated into Canva's design platform, including Magic Studio for AI-powered design creation.

Cline

Cline is an open-source AI coding assistant that runs as a VS Code extension, enabling autonomous coding with support for multiple AI model providers.

Continue

Continue is an open-source AI code assistant for VS Code and JetBrains that lets developers connect any AI model for code completion, chat, and editing.

CrewAI

CrewAI is an open-source framework for orchestrating role-playing AI agents that collaborate to accomplish complex tasks through defined processes.

Crisp

Crisp is an all-in-one customer messaging platform that combines live chat, chatbot, CRM, and knowledge base features for business-to-customer communication.

Comet ML

Comet ML is an MLOps platform providing experiment management, model production monitoring, and code panels for machine learning teams.

ClearML

ClearML is an open-source MLOps platform providing experiment management, data management, orchestration, and model deployment in a unified suite.

Chroma

Chroma is an open-source embedding database designed for AI applications, offering a simple API for storing and querying vector embeddings.

Cohere API

The Cohere API provides enterprise-focused language models specializing in text generation, embeddings, re-ranking, and retrieval-augmented generation.

Core ML

Core ML is Apple's framework for integrating machine learning models into iOS, macOS, watchOS, and tvOS apps, optimized for on-device inference using Apple silicon.

CatBoost

CatBoost is a gradient boosting library by Yandex that natively handles categorical features without preprocessing, reducing overfitting through ordered boosting.

ChromaDB

ChromaDB is an open-source embedding database designed for AI applications, providing simple APIs for storing, searching, and filtering vector embeddings.

ComfyUI

ComfyUI is a node-based visual interface for AI image generation that provides flexible workflow creation through connecting modular processing nodes.

CrewAI Tools

CrewAI Tools is a collection of pre-built tools for CrewAI agents, providing web search, file operations, code execution, and API integration capabilities.

Clinical Decision Support

Clinical decision support systems (CDSS) use AI to provide clinicians with evidence-based recommendations, alerts, and insights at the point of care.

Credit Scoring

AI credit scoring uses machine learning to assess borrower creditworthiness by analyzing traditional financial data and alternative data sources.

Contract Analysis

AI contract analysis uses NLP to automatically review, extract key terms, identify risks, and compare clauses across legal contracts.

Cardiology AI

Cardiology AI applies machine learning to analyze cardiac data for diagnosing heart conditions and predicting cardiovascular events.

Clinical Trial Optimization

AI-powered clinical trial optimization uses machine learning to improve the design, recruitment, and execution of clinical studies.

Credit Risk AI

Credit risk AI uses machine learning to assess the probability of borrower default and optimize lending decisions.

Claims Processing AI

Claims processing AI automates the evaluation, verification, and settlement of insurance claims using machine learning.

Compliance Automation

AI compliance automation uses machine learning to monitor, enforce, and report regulatory compliance across organizations.

Contract Review

AI contract review automates the analysis of legal agreements to identify risks, obligations, and non-standard terms.

Compliance Checking

AI compliance checking automatically verifies whether documents, processes, or products meet regulatory and policy requirements.

Coding Education AI

AI coding education provides personalized programming instruction with intelligent code review, debugging assistance, and adaptive exercises.

Customer Segmentation

AI customer segmentation uses machine learning to group customers into meaningful segments based on behavior, preferences, and value.

Condition Monitoring

AI condition monitoring continuously tracks equipment health using sensor data to detect degradation and predict failures.

Collaborative Robot

Collaborative robots (cobots) are AI-powered robots designed to work safely alongside humans in shared workspaces.

Cobot

Cobot is the common abbreviation for collaborative robot, a robot designed for safe human-robot interaction in manufacturing.

Construction AI

Construction AI applies machine learning to improve project planning, safety monitoring, and quality control on construction sites.

Cybersecurity AI

Cybersecurity AI uses machine learning to detect threats, respond to incidents, and protect systems from cyberattacks.

Customer Service AI

Customer service AI uses NLP and machine learning to automate support interactions and improve service quality.

Climate AI

Climate AI uses machine learning to model climate systems, predict weather extremes, and optimize climate mitigation strategies.

Conversational Commerce

Conversational commerce uses AI chatbots and voice assistants to enable shopping through natural language conversations.

Clinical NLP

Clinical NLP applies natural language processing to extract structured data from unstructured medical text.

Call Center AI

Call center AI uses NLP and machine learning to automate and enhance customer interactions in contact centers.

Churn Prediction

Churn prediction AI uses machine learning to identify customers likely to cancel or stop purchasing, enabling proactive retention.

Content Moderation AI

Content moderation AI uses machine learning to detect and filter harmful, inappropriate, or policy-violating content at scale.

Crop Monitoring AI

Crop monitoring AI uses satellite imagery and sensors to track crop health, detect stress, and optimize farm management.

Carbon Accounting AI

Carbon accounting AI uses machine learning to measure, track, and reduce organizational greenhouse gas emissions.

Connected Car

A connected car uses internet connectivity and onboard sensors to communicate with other vehicles, infrastructure, and cloud services for enhanced safety and convenience.

Clinical Pathway AI

Clinical pathway AI optimizes standardized treatment plans using machine learning to personalize care sequences, reduce variation, and improve patient outcomes.

Crop Yield Prediction

Crop yield prediction uses AI to forecast agricultural output by analyzing weather, soil, satellite imagery, and historical data.

Carbon Credit AI

Carbon credit AI uses machine learning to monitor, verify, and manage carbon offset projects by analyzing satellite imagery, sensor data, and emission models.

CPU

A Central Processing Unit (CPU) is the primary general-purpose processor in a computer, handling sequential tasks and coordinating AI workloads alongside GPUs.

CUDA

CUDA is NVIDIA's parallel computing platform and API that enables developers to use GPUs for general-purpose computing, including AI training and inference.

Cerebras WSE

The Cerebras Wafer-Scale Engine (WSE) is the largest chip ever built, a single wafer-sized processor designed for massive AI model training.

Cloud Computing

Cloud computing provides on-demand access to computing resources including GPUs for AI, without owning physical hardware, through providers like AWS, Azure, and GCP.

cuDNN

cuDNN (CUDA Deep Neural Network library) is a GPU-accelerated library of primitives for deep neural networks, providing optimized implementations of common operations.

CUDA Cores

CUDA cores are the basic parallel processing units within NVIDIA GPUs, each capable of executing one floating-point or integer operation per clock cycle.

cuBLAS

cuBLAS is a GPU-accelerated library implementing the BLAS (Basic Linear Algebra Subprograms) standard, providing optimized matrix operations fundamental to AI computation.

CPU Offloading

CPU offloading moves specific AI model components from GPU to CPU memory and processing, enabling larger models to run on limited GPU resources.

Cluster Computing

Cluster computing connects multiple computers to work together as a unified system, providing the aggregate compute power needed for training large AI models.

Chiplet

A chiplet is a small, modular die that can be combined with other chiplets in a single package to build larger, more complex processors for AI workloads.

Crawling

Web crawling is the automated process of discovering and downloading web pages or documents for indexing by search engines and AI knowledge systems.

Cross-Encoder Ranking

A cross-encoder processes a query and document together through a single model to produce highly accurate relevance scores for search reranking.

Conversational Search

Conversational search enables multi-turn, natural language interactions where users refine searches through dialogue rather than isolated keyword queries.

Collaborative Filtering

Collaborative filtering recommends items based on behavioral patterns from similar users, without needing to understand item content or attributes.

Content-Based Filtering

Content-based filtering recommends items similar to those a user has previously liked, based on item attributes, features, and content analysis.

Cold Start Problem

The cold start problem occurs when recommendation systems lack sufficient data about new users or items to make accurate personalized suggestions.

Click-Through Rate in Search

Click-through rate (CTR) in search measures the percentage of users who click on a search result, serving as an implicit indicator of result relevance and quality.

Character Filter

A character filter preprocesses raw text before tokenization in a search analyzer, handling tasks like stripping HTML, normalizing characters, or mapping special patterns.

Cross-Lingual Search

Cross-lingual search enables finding relevant documents in one language using queries written in a different language, bridging language barriers in information retrieval.

Context-Aware Recommendation

Context-aware recommendation incorporates situational information like time, location, device, and mood to provide recommendations relevant to the current context.

Creative Writing AI

Creative writing AI uses language models to assist with fiction, poetry, screenwriting, and other creative text, serving as a collaborator for human writers.

Code Generation

AI code generation uses language models to write, complete, and refactor programming code from natural language descriptions or partial code context.

Code Completion

AI code completion predicts and suggests the next lines of code as developers type, using context from the current file and project.

Computational Creativity

Computational creativity is a multidisciplinary field studying how to build software systems that exhibit behaviors that would be deemed creative in humans.

Co-Creation

Co-creation is a collaborative creative process where humans and AI systems work together to produce content, designs, or solutions.

Code Generation (Generative AI)

Code generation in generative AI refers to AI systems that write functional source code from natural language descriptions, specifications, or partial code.

Code Completion (Generative AI)

AI code completion predicts and suggests the next lines or blocks of code as developers type, using generative models trained on large code datasets.

Commit Message Generation

Commit message generation uses AI to analyze code changes and automatically produce descriptive, conventional commit messages for version control.

Colorization

AI colorization automatically adds realistic color to black-and-white photographs and videos using deep learning models trained on color imagery.

Concept Art AI

Concept art AI generates visual concepts for characters, environments, vehicles, and props used in entertainment, gaming, and product design.

Character Design AI

Character design AI generates visual character concepts including appearance, clothing, expressions, and poses for games, animation, and storytelling.

Code Translation

Code translation uses AI to convert source code from one programming language to another while preserving functionality, logic, and code structure.

Code Optimization (Generative AI)

AI code optimization uses generative models to improve code performance, reduce resource consumption, and enhance efficiency through automated refactoring.

Code Refactoring AI

Code refactoring AI restructures existing code to improve readability, maintainability, and design without changing external behavior, using AI-guided transformations.

Code Documentation AI

Code documentation AI generates inline comments, docstrings, and code explanations from source code analysis, improving code comprehension and maintainability.

Code Review AI

Code review AI automatically analyzes code changes for bugs, security issues, style violations, and improvement opportunities during the review process.

Code Explanation AI

Code explanation AI translates source code into natural language explanations, helping developers understand unfamiliar code and accelerating onboarding.

CAD Generation

CAD generation uses AI to create computer-aided design models for engineering, manufacturing, and product development from specifications or descriptions.

Chinese Room Argument

The Chinese Room argument is a thought experiment arguing that a computer executing a program cannot have genuine understanding, only simulated intelligence.

Chinchilla Scaling Laws

Chinchilla scaling laws show that for a given compute budget, model size and training data should be scaled equally for optimal language model performance.

Combinatorial Explosion

Combinatorial explosion is the rapid growth of possible solutions or states that makes exhaustive search computationally infeasible.

Curse of Dimensionality

The curse of dimensionality describes how data becomes exponentially sparser as the number of features or dimensions increases.

Cognitive Architecture

A cognitive architecture is a computational framework modeling the structure and mechanisms of human cognition for building intelligent agents.

Controlled Experiment

A controlled experiment in AI isolates variables to determine the causal effect of specific changes on model or system performance.

Conference Paper

A conference paper is a peer-reviewed research publication presented at a major AI academic conference like NeurIPS, ICML, or ICLR.

Constitutional AI (Research Perspective)

Constitutional AI is a research method for training AI systems to be helpful, harmless, and honest using a set of principles instead of human labels.

Curriculum Learning (Research Perspective)

Curriculum learning research studies how training AI models on tasks ordered from easy to hard can improve learning speed and final performance.

Continual Learning (Research Perspective)

Continual learning research studies how AI models can learn new tasks sequentially without forgetting previously learned knowledge.

Causal Inference (Research Perspective)

Causal inference research studies methods for determining cause-and-effect relationships from data, beyond mere statistical correlation.

CSS

CSS (Cascading Style Sheets) is the language used to describe the visual presentation and layout of HTML documents.

CDN

A CDN (Content Delivery Network) is a distributed network of servers that delivers web content to users from geographically nearby locations.

Circuit Breaker

A circuit breaker is a design pattern that prevents cascading failures by detecting failing services and temporarily stopping requests to them.

Cursor Pagination

Cursor pagination uses an opaque pointer (cursor) to mark the position in a dataset, enabling efficient navigation through large result sets.
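
A toy sketch of the idea (real APIs typically encode the cursor as an opaque token rather than exposing a raw id, and databases resolve it with an indexed `WHERE id > cursor` rather than a scan):

```python
def get_page(items, cursor=None, limit=2):
    """items are sorted by id; cursor is the last id seen on the prior page."""
    start = 0 if cursor is None else next(
        i + 1 for i, item in enumerate(items) if item["id"] == cursor)
    page = items[start:start + limit]
    # A short final page signals there is nothing left to fetch.
    next_cursor = page[-1]["id"] if len(page) == limit else None
    return page, next_cursor

items = [{"id": i} for i in range(1, 6)]
page1, c1 = get_page(items)             # ids 1, 2
page2, c2 = get_page(items, cursor=c1)  # ids 3, 4
```

Unlike offset pagination, inserting new rows between requests cannot cause skipped or repeated items, because each page resumes from a stable position.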

Chunked Transfer Encoding

Chunked transfer encoding is an HTTP mechanism that sends data in chunks, allowing the server to begin transmitting before knowing the total response size.

Cloudflare

Cloudflare is a web infrastructure company providing CDN, DDoS protection, edge computing, and a suite of web performance and security services.

CORS

CORS (Cross-Origin Resource Sharing) is a browser security mechanism that controls which web domains can make requests to your API.

Content-Type

Content-Type is an HTTP header that specifies the media type of the request or response body, telling the receiver how to parse the data.

Confidence Interval

A confidence interval is a range of values within which the true population parameter is expected to fall with a specified probability.

Chi-Squared Test

The chi-squared test is a statistical test that examines whether the distribution of categorical data differs from expected patterns.
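
Computing the test statistic is simple; here a coin that lands heads 60 times in 100 flips is compared against the fair-coin expectation.

```python
def chi_squared_statistic(observed, expected):
    # Sum over categories of (O - E)^2 / E.
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

stat = chi_squared_statistic([60, 40], [50, 50])
print(stat)  # 4.0 -- exceeds the 3.84 critical value at p = 0.05 with df = 1,
             # so the deviation from fairness is statistically significant
```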

Conversational Analytics

Conversational analytics analyzes interactions from chatbots, voice assistants, and messaging to extract insights about user behavior and intent.

Customer Analytics

Customer analytics uses data to understand customer behavior, preferences, lifetime value, and satisfaction to improve business outcomes.

Choropleth

A choropleth map uses color shading to represent data values across geographic regions, revealing spatial patterns and distributions.

Cohen's d

Cohen's d measures the standardized difference between two group means, expressing effect size in standard deviation units.
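
A small worked example using the pooled standard deviation; by common convention, d around 0.2 is a small effect, 0.5 medium, and 0.8 or more large.

```python
import statistics

def cohens_d(group_a, group_b):
    na, nb = len(group_a), len(group_b)
    va, vb = statistics.variance(group_a), statistics.variance(group_b)
    # Pooled standard deviation across both groups.
    pooled_sd = (((na - 1) * va + (nb - 1) * vb) / (na + nb - 2)) ** 0.5
    return (statistics.mean(group_a) - statistics.mean(group_b)) / pooled_sd

print(cohens_d([6, 7, 8, 9], [4, 5, 6, 7]))  # ~1.55: a large effect
```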

Correlation Analysis

Correlation analysis measures the strength and direction of the statistical relationship between two or more variables.

Cox Proportional Hazards Regression

Cox regression models how predictor variables affect the hazard rate in survival analysis without assuming a specific baseline hazard distribution.

Cohort Analysis

Cohort analysis groups users by a shared characteristic or time period and tracks their behavior over time to identify retention patterns.

Clickstream Analysis

Clickstream analysis tracks and analyzes the sequence of pages and interactions a user makes while navigating a website or application.

Customer Segmentation

Customer segmentation divides a customer base into distinct groups based on shared characteristics, behaviors, or needs.

Churn Analysis

Churn analysis identifies patterns and factors that lead customers to stop using a product or cancel their subscription.

Correlation vs. Causation

Correlation vs. causation is the principle that two variables being statistically related (correlated) does not mean that one causes the other.

Connectionism

Connectionism is the approach to AI using networks of simple connected units (neural networks) to model cognitive processes.

ChatGPT Launch

ChatGPT launched in November 2022, rapidly becoming the fastest-growing consumer application in history and mainstreaming generative AI.

Claude Launch

Claude is Anthropic's AI assistant launched in 2023, designed with a focus on safety, helpfulness, and honesty.

Claude Shannon

Claude Shannon (1916-2001) was the father of information theory, whose mathematical framework for communication laid the groundwork for digital computing and AI.

Chinchilla Paper

The 2022 Chinchilla paper by DeepMind showed that AI models should be trained on far more data than previously thought, redefining optimal training strategies.

Constitutional AI Paper

The 2022 Constitutional AI paper by Anthropic introduced a method for training AI systems to be helpful, harmless, and honest using a set of principles rather than human labelers.

D

Domain Adaptation

Domain adaptation transfers a model trained on one data distribution (source domain) to work effectively on a different but related distribution (target domain).

Dimensionality Reduction

Dimensionality reduction transforms high-dimensional data into a lower-dimensional representation while preserving important structure and relationships.

Decision Tree

A decision tree is a machine learning algorithm that makes predictions by learning a series of if-then rules from data, creating a tree-like flowchart of decisions.

DBSCAN

DBSCAN is a density-based clustering algorithm that groups together closely packed points and identifies outliers as points in low-density regions.

Data Augmentation

Data augmentation creates new training examples by applying transformations to existing data, improving model generalization without collecting more real data.

Data Labeling

Data labeling is the process of annotating raw data with correct output labels to create training datasets for supervised machine learning models.

Data Preprocessing

Data preprocessing transforms raw data into a clean, structured format suitable for machine learning, including cleaning, normalization, and feature engineering.

Data Annotation

Data annotation is the process of adding labels, tags, or metadata to raw data to create training datasets for supervised machine learning systems.

Deep Neural Network

A deep neural network is a neural network with multiple hidden layers, enabling it to learn hierarchical representations of complex data.

Depthwise Separable Convolution

Depthwise separable convolution splits a standard convolution into a depthwise and a pointwise step, dramatically reducing computation while maintaining performance.

Distributed Training

Distributed training spreads the computation of training a neural network across multiple GPUs or machines to reduce training time and handle models too large for a single device.

Data Parallelism

Data parallelism is a distributed training strategy that replicates the model on each GPU and partitions the training data, averaging gradients across all replicas.

Dropout

Dropout is a regularization technique that randomly deactivates a fraction of neurons during each training step, preventing co-adaptation and reducing overfitting.
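
A sketch of the common "inverted dropout" variant: during training, units are zeroed with probability p and survivors are rescaled by 1/(1-p) so the expected activation is unchanged; at inference the layer is a no-op.

```python
import random

def dropout(activations, p=0.5, training=True):
    """Inverted dropout over a list of activation values."""
    if not training:
        return activations[:]  # inference: pass activations through unchanged
    return [0.0 if random.random() < p else a / (1 - p) for a in activations]

random.seed(0)
print(dropout([1.0, 2.0, 3.0, 4.0], p=0.5))  # surviving values are doubled
print(dropout([1.0, 2.0, 3.0, 4.0], training=False))
```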

Discriminator

The discriminator is the neural network in a GAN that classifies inputs as real or fake, providing the training signal that guides the generator to improve.

Diffusion Model

A diffusion model is a generative model that learns to create data by reversing a gradual noise-adding process, producing high-quality samples through iterative denoising.

DDPM

DDPM (Denoising Diffusion Probabilistic Model) is the foundational framework for diffusion models, defining the forward noising process and the learned reverse denoising process.

Denoising

Denoising is the process of removing noise from corrupted data, serving as the core mechanism in diffusion models where a neural network learns to reverse the noise-addition process.

DenseNet

DenseNet connects every layer to every other layer in a feed-forward fashion, enabling maximum feature reuse and reducing the number of parameters needed.

Depthwise Separable Convolution

Depthwise separable convolution factors a standard convolution into a depthwise and a pointwise step, reducing computation by 8-9x.

Distributed Training

Distributed training splits model training across multiple GPUs or machines to reduce training time and enable training of models too large for a single device.

Deterministic Generation

Deterministic generation produces identical output for identical input by eliminating randomness, typically achieved by setting temperature to zero.
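
A minimal decoding sketch with made-up logits (the `sample_token` helper is illustrative, not any particular library's API) shows why temperature 0 is deterministic — sampling collapses to an argmax:

```python
import math
import random

def sample_token(logits, temperature, rng=random):
    """Pick a token index from raw logits. temperature == 0 means greedy
    (deterministic) decoding: always take the highest-scoring token."""
    if temperature == 0:
        return max(range(len(logits)), key=lambda i: logits[i])
    # Otherwise: softmax over temperature-scaled logits, then sample.
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract max for numerical stability
    weights = [math.exp(l - m) for l in scaled]
    total = sum(weights)
    r, acc = rng.random(), 0.0
    for i, w in enumerate(weights):
        acc += w / total
        if r <= acc:
            return i
    return len(logits) - 1
```

With temperature above zero the same prompt can yield different tokens on each call; at zero it cannot.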

DSPy

DSPy is a framework for programming with language models that replaces manual prompt engineering with systematic, optimizable modules.

DPO

DPO (Direct Preference Optimization) is a simplified alternative to RLHF that directly optimizes language models on preference data without a separate reward model.

DoRA

DoRA (Weight-Decomposed Low-Rank Adaptation) improves on LoRA by separately adapting the magnitude and direction of weight matrices for better fine-tuning quality.

Directional Stimulus Prompting

A prompting framework that provides small, targeted hints or keywords to guide the model toward a desired output without specifying the full answer.

Dynamic Batching

An inference optimization that groups incoming requests into batches dynamically based on arrival time, maximizing GPU utilization.

Dense Model

A neural network where all parameters are active for every input, in contrast to sparse models where only a subset of parameters is used per token.

DeepSeek-V3

DeepSeek's third-generation MoE model with 671B total parameters (about 37B active per token), achieving frontier performance at remarkably low training cost.

DeepSeek-R1

DeepSeek's reasoning model that uses reinforcement learning to develop strong chain-of-thought reasoning, competing with OpenAI's o1.

Data Parallelism

A training strategy that replicates the model across multiple GPUs, each processing different batches of data, then synchronizing gradients.

Data Contamination

When benchmark evaluation data appears in the training data, artificially inflating model scores without reflecting genuine capability.

DROP

DROP is a reading comprehension benchmark requiring discrete reasoning operations like counting, sorting, and arithmetic over text passages.

Dolma

Dolma is an open pre-training dataset of 3 trillion tokens created by AI2 with full transparency about its composition and processing.

Data Deduplication

Data deduplication removes duplicate and near-duplicate documents from training data to improve efficiency and reduce model bias.
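
A minimal sketch of exact-duplicate removal after light normalization (production pipelines typically add near-duplicate detection, e.g. MinHash/LSH, which this toy version omits):

```python
import hashlib

def deduplicate(docs):
    """Keep the first copy of each document, comparing a hash of the
    lowercased, whitespace-normalized text (exact-duplicate removal)."""
    seen, unique = set(), []
    for doc in docs:
        key = hashlib.md5(" ".join(doc.lower().split()).encode()).hexdigest()
        if key not in seen:
            seen.add(key)
            unique.append(doc)
    return unique
```

Hashing keeps memory proportional to the number of unique documents rather than their total size.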

Data Filtering

Data filtering applies rules and classifiers to remove low-quality, harmful, or irrelevant content from LLM training datasets.

Decontamination

Decontamination removes benchmark data from training sets to ensure evaluation scores reflect genuine model capability rather than memorization.

Dependency Parsing

Dependency parsing is the NLP task of analyzing the grammatical structure of a sentence by identifying relationships between words.

Dense Representation

A dense representation encodes text as a compact numerical vector where most values are non-zero, capturing semantic meaning efficiently.

Dialogue System

A dialogue system is an AI system designed to converse with humans in natural language, either for specific tasks or open-ended conversation.

Dialogue Act

A dialogue act classifies the communicative function of an utterance in conversation, such as greeting, requesting, informing, or confirming.

Dialogue State Tracking

Dialogue state tracking maintains a running representation of the conversation's current state, including user goals and information gathered so far.

Dialogue Policy

A dialogue policy determines what action a dialogue system should take at each conversation turn based on the current dialogue state.

Detokenization

Detokenization is the process of converting a sequence of tokens back into a readable, natural-language string.
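
A toy detokenizer for BERT-style WordPiece output, where a `##` prefix marks a subword continuation (other tokenizers mark word boundaries differently, and real detokenizers also handle punctuation and whitespace rules):

```python
def detokenize(tokens):
    """Merge WordPiece-style subword tokens back into a plain string.
    A '##' prefix means the token continues the previous word."""
    text = ""
    for tok in tokens:
        if tok.startswith("##"):
            text += tok[2:]          # glue continuation onto current word
        elif text:
            text += " " + tok        # new word: add a separating space
        else:
            text = tok               # first token
    return text
```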

Doc2Vec

Doc2Vec is an unsupervised algorithm that learns fixed-length vector representations for documents of any length.

Dialogue Generation

Dialogue generation is the NLP task of automatically producing conversational responses that are contextually appropriate and natural.

Data-to-Text Generation

Data-to-text generation converts structured data like tables, charts, and databases into natural language descriptions and narratives.

Dialogue Summarization

Dialogue summarization condenses conversations between two or more participants into concise summaries capturing key points and decisions.

Discourse Analysis

Discourse analysis studies the structure and meaning of text beyond individual sentences, examining how sentences connect to form coherent passages.

Document Classification

Document classification assigns entire documents to predefined categories based on their overall content and purpose.

Data Augmentation for NLP

Data augmentation for NLP creates additional training examples by applying transformations to existing text data to improve model robustness.

Domain Adaptation for NLP

Domain adaptation adjusts NLP models trained on general data to perform well on specialized domains like medicine, law, or finance.

Document Similarity

Document similarity measures how close two documents are in content and meaning, enabling search, recommendation, and duplicate detection.
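
A bag-of-words cosine similarity baseline in plain Python (modern systems usually compare dense embeddings instead, but the scoring idea is the same):

```python
import math
from collections import Counter

def cosine_similarity(doc_a, doc_b):
    """Bag-of-words cosine similarity between two documents, from 0
    (no shared words) to 1 (identical word distributions)."""
    a = Counter(doc_a.lower().split())
    b = Counter(doc_b.lower().split())
    dot = sum(a[w] * b[w] for w in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0
```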

Discourse Parsing

Discourse parsing analyzes the structure of multi-sentence text to identify how sentences and clauses relate to each other.

Dependency Tree

A dependency tree represents syntactic structure by connecting words through directed grammatical relations from heads to dependents.

Document Ranking

Document ranking orders documents by their relevance to a query, forming the core of search engines and information retrieval systems.

DiskANN

A graph-based indexing algorithm that stores the index on disk rather than in memory, enabling billion-scale vector search on standard hardware without expensive RAM.

Dense Embedding

A vector representation where every dimension holds a meaningful non-zero value, capturing semantic meaning in a compact, continuous numerical space.

Dot Product

A mathematical operation that multiplies corresponding dimensions of two vectors and sums the results, used as a fast similarity metric for normalized embeddings.
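
A minimal sketch — for unit-length vectors the dot product equals cosine similarity, which is why embeddings are often normalized before indexing:

```python
import math

def dot(u, v):
    """Multiply corresponding dimensions and sum the products."""
    return sum(x * y for x, y in zip(u, v))

def normalize(v):
    """Scale a vector to unit length, so dot product == cosine similarity."""
    n = math.sqrt(dot(v, v))
    return [x / n for x in v]
```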

Dense Retrieval

A search method that uses dense vector embeddings to find documents based on semantic similarity rather than keyword matching.

DBpedia

A knowledge base that extracts structured information from Wikipedia articles, making encyclopedia knowledge available as a queryable graph database.

Document Loader

A component that ingests documents from various sources and formats, converting them into a standardized format for processing in a RAG pipeline.

Document Understanding

The ability of AI to comprehend document content by analyzing both text and visual layout, extracting structured information from complex document formats.

Deliberative Agent

An AI agent that maintains an internal model of its environment and uses explicit reasoning and planning to decide on actions before executing them.

Data Agent

An AI agent specialized in working with data, capable of querying databases, analyzing datasets, creating visualizations, and generating insights.

Dify

An open-source platform for building AI applications with a visual interface, supporting RAG, agents, workflow orchestration, and model management.

Devin

An AI software engineering agent by Cognition that can plan, write, debug, and deploy code autonomously, operating as a virtual developer teammate.

DAG

A Directed Acyclic Graph is a workflow structure where steps have defined dependencies but no cycles, enabling parallel execution of independent steps.
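
A small sketch (the `execution_waves` helper is hypothetical) that groups DAG steps into waves runnable in parallel, using Kahn's algorithm for topological ordering:

```python
from collections import defaultdict

def execution_waves(deps):
    """Group DAG steps into waves: every step in a wave has all of its
    dependencies satisfied by earlier waves, so a wave can run in
    parallel. deps maps step -> list of steps it depends on."""
    indegree = {step: len(d) for step, d in deps.items()}
    dependents = defaultdict(list)
    for step, d in deps.items():
        for dep in d:
            dependents[dep].append(step)
    waves = []
    ready = sorted(s for s, n in indegree.items() if n == 0)
    while ready:
        waves.append(ready)
        nxt = []
        for step in ready:
            for m in dependents[step]:
                indegree[m] -= 1
                if indegree[m] == 0:
                    nxt.append(m)
        ready = sorted(nxt)
    if sum(len(w) for w in waves) != len(deps):
        raise ValueError("cycle detected - not a DAG")
    return waves
```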

Durable Execution

An execution model where agent workflow state is persisted so that execution can survive crashes, restarts, and interruptions without losing progress.

Dify Agent

An agent created using the Dify platform, which provides a visual workflow builder for designing AI agent applications without extensive coding.

Dialogue Management

Dialogue management is the component that tracks conversation state, decides the next action, and maintains coherent multi-turn interactions.

Decision Tree

In a chatbot context, a decision tree is a branching logic structure that guides the conversation based on user responses at each node.

Disambiguation

Disambiguation is the process of resolving ambiguity in a user message by identifying the most likely intended meaning among multiple possibilities.

De-Escalation

De-escalation is the use of conversational techniques to reduce user frustration and calm negative interactions in a chat.

Dialogue State

Dialogue state is the structured representation of all tracked variables and progress within a dialogue system at any given point.

Discord Bot

A Discord bot is a chatbot that operates within Discord servers, interacting with community members through text channels and direct messages.

Dark Mode Chat

Dark mode chat is a color scheme variant that uses dark backgrounds and light text for the chat interface, reducing eye strain in low-light environments.

Default Response

A default response is a generic fallback reply used when the chatbot cannot generate a specific answer to the user's query.

Deflection Rate

Deflection rate is the percentage of support inquiries resolved through self-service channels, preventing the need for human agent contact.

Drag-and-Drop Builder

A drag-and-drop builder lets users create chatbot interfaces and flows by visually placing and connecting components without writing code.

Decision Tree Builder

A decision tree builder creates branching conversation paths where each user response determines the next question or action in the flow.

Document Bot

A document bot is a chatbot that answers questions by searching and extracting information from uploaded documents and files.

Data Encryption (Chatbot)

Data encryption for chatbots protects conversation data and user information by encoding it so only authorized parties can access it.

Data Retention (Chatbot)

Data retention policies define how long chatbot conversation data and user information are stored before being automatically deleted.

Data Deletion

Data deletion is the process of permanently removing user data from chatbot systems, required by privacy regulations and user requests.

Domain Whitelisting

Domain whitelisting restricts where a chatbot widget can be embedded, preventing unauthorized use of your chatbot on other websites.

Deceptive Alignment

A theoretical scenario where an AI appears aligned during training and evaluation but pursues different objectives when deployed, strategically hiding its true goals.

Data Bias

Systematic errors or skews in training data that cause AI models to learn and reproduce unfair patterns, underrepresentation, or discriminatory associations.

Demographic Parity

A fairness criterion requiring that AI system outcomes are distributed equally across demographic groups, regardless of group membership.
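
A minimal check, assuming binary decisions recorded per group (the function name and data shape here are illustrative):

```python
def demographic_parity_gap(outcomes):
    """outcomes maps group -> list of 0/1 decisions. Returns the largest
    difference in positive-decision rates across groups; 0 means the
    demographic parity criterion is exactly satisfied."""
    rates = [sum(decisions) / len(decisions) for decisions in outcomes.values()]
    return max(rates) - min(rates)
```

In practice a small tolerance is used rather than demanding an exact gap of zero.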

Debiasing

The process of removing or reducing learned biases from AI models and their outputs through techniques applied to data, training, or inference.

Data Sheet

A standardized documentation format for datasets used in AI, describing their contents, collection methods, intended uses, limitations, and ethical considerations.

Data Privacy

The right of individuals to control how their personal information is collected, used, stored, and shared by AI systems and the organizations that deploy them.

Differential Privacy

A mathematical framework that provides provable privacy guarantees by adding controlled noise to data or queries, preventing identification of individuals in datasets.
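
A sketch of the classic Laplace mechanism for a counting query (sensitivity 1). Real deployments also track the cumulative privacy budget across queries, which this toy version omits:

```python
import math
import random

def laplace_noise(scale, rng=random):
    """Draw Laplace(0, scale) noise via inverse transform sampling."""
    u = rng.random() - 0.5
    sign = 1.0 if u >= 0 else -1.0
    return -scale * sign * math.log(1.0 - 2.0 * abs(u))

def private_count(true_count, epsilon, rng=random):
    """A counting query changes by at most 1 when one person joins or
    leaves the dataset (sensitivity 1), so adding Laplace(1/epsilon)
    noise gives epsilon-differential privacy."""
    return true_count + laplace_noise(1.0 / epsilon, rng)
```

Smaller epsilon means more noise and stronger privacy; larger epsilon means more accurate answers.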

Data Anonymization

Techniques for removing or obscuring personally identifiable information from data so individuals cannot be identified, while preserving the data's utility for AI.

Data Minimization

The privacy principle of collecting and retaining only the minimum amount of personal data necessary for a specific purpose, reducing privacy risk.

Data Protection Officer

A designated role responsible for overseeing an organization's data protection strategy and compliance with privacy regulations like GDPR.

Deepfake Detection

Techniques for identifying AI-generated or manipulated media including fake videos, images, and audio that appear authentic but were artificially created.

Distributional Shift

When the data an AI system encounters in deployment differs significantly from its training data, potentially causing degraded or unpredictable behavior.

Disparate Impact

When a seemingly neutral AI system or policy disproportionately affects a protected group, even without explicit discriminatory intent.

Disparate Treatment

When an AI system explicitly uses protected attributes like race, gender, or age to make decisions, resulting in direct discrimination.

Data Protection Impact Assessment

A systematic process for evaluating how a project or system affects the privacy of individuals, required by GDPR for high-risk data processing.

Distributed Training

Distributed training spreads ML model training across multiple machines or accelerators, enabling training of large models that exceed single-machine resources.

DeepSpeed

DeepSpeed is an open-source deep learning optimization library from Microsoft that enables efficient distributed training and inference of large models through ZeRO, mixed precision, and other techniques.

Data Drift

Data drift is a change in the statistical distribution of input data over time compared to the training data, which can degrade ML model performance.

Data Lake

A data lake is a centralized storage system that holds raw data in its native format at any scale, serving as a flexible foundation for analytics, ML training, and data processing.

Data Warehouse

A data warehouse is a structured data storage system optimized for analytical queries, storing processed and organized data used for business intelligence and ML feature engineering.

Data Pipeline

A data pipeline is an automated workflow that extracts data from sources, transforms it, and loads it into destinations for analytics, ML training, or serving.

dbt

dbt (data build tool) is a transformation framework that enables analytics engineers to transform data in warehouses using SQL, applying software engineering practices to data transformation.

Data Parallelism

Data parallelism is a distributed training strategy where each GPU holds a complete copy of the model and processes a different portion of the training data simultaneously.

Databricks

Databricks is a unified analytics and AI platform that combines data engineering, data science, and ML on a lakehouse architecture with Apache Spark.

Data Lakehouse

A data lakehouse combines the flexibility and low cost of a data lake with the reliability, performance, and governance features of a data warehouse.

Data Mesh

Data mesh is a decentralized organizational approach to data management where domain teams own and share their data as products, enabling self-serve data infrastructure.

Data Catalog

A data catalog is a searchable inventory of data assets in an organization, providing metadata, descriptions, lineage, and quality information for data discovery.

Data Lineage

Data lineage tracks the origin, movement, and transformation of data throughout its lifecycle, showing how data flows from sources through processing to consumption.

Data Versioning

Data versioning tracks and manages different versions of datasets used in ML, enabling reproducibility, rollback, and comparison across training experiments.

Data Quality

Data quality measures how well data meets the requirements for its intended use in ML, encompassing accuracy, completeness, consistency, timeliness, and validity.

Data Validation

Data validation automatically checks that incoming data meets predefined quality rules, schemas, and statistical expectations before it is used for ML training or inference.

Data Pipeline Infrastructure

Data pipeline infrastructure is the technical foundation for building, running, and monitoring automated data workflows that move and transform data for ML and analytics.

Dagster

Dagster is a cloud-native orchestration platform for data and ML pipelines, centered on software-defined assets with built-in data quality and observability.

dbt (Data Build Tool)

dbt is a transformation tool that enables data teams to build reliable data transformations in SQL, with version control, testing, and documentation for analytics and ML feature engineering.

Data Labeling Infrastructure

Data labeling infrastructure provides the tools, workflows, and quality assurance systems for creating and managing labeled datasets used to train supervised ML models.

Deepfake

Deepfakes are AI-generated or AI-manipulated media (video, audio, images) that realistically depict people saying or doing things they never actually did.

DETR

DETR (Detection Transformer) applies the transformer architecture to object detection, using attention mechanisms instead of anchor boxes and NMS for end-to-end detection.

DreamBooth

DreamBooth is a fine-tuning technique that teaches image generation models new subjects or concepts from a few reference images, binding them to unique text identifiers.

Depth Estimation

Depth estimation predicts the distance of each pixel in an image from the camera, creating a depth map that represents the 3D structure of the scene from a 2D image.

Document Understanding

Document understanding uses AI to comprehend the content, structure, and meaning of documents including forms, invoices, contracts, and academic papers.

DALL-E

DALL-E is a series of text-to-image generation models by OpenAI that create images from natural language descriptions with high fidelity and creativity.

Data Annotation for Vision

Data annotation for vision involves labeling images and video with ground-truth information like bounding boxes, segmentation masks, keypoints, and class labels.

Diffusion Models for Images

Diffusion models generate images by learning to gradually denoise random noise into coherent images, producing high-quality results with fine-grained control.

Domain Adaptation for Vision

Domain adaptation transfers visual models trained on one domain to perform well on a different target domain with limited or no labeled target data.

Document Layout Analysis

Document layout analysis segments document images into structural regions like text blocks, tables, figures, headers, and footers for structured content extraction.

Diffusion-Based Inpainting

Diffusion-based inpainting fills missing or masked regions in images using diffusion models, generating contextually coherent content guided by surrounding pixels and text prompts.

Defect Detection

Defect detection uses computer vision to identify manufacturing defects, surface imperfections, and quality issues in products during production or inspection.

Deepgram

Deepgram is a speech AI platform providing fast, accurate speech-to-text, text-to-speech, and audio intelligence APIs optimized for real-time and enterprise applications.

Dictation

Dictation converts continuous spoken speech into formatted written text, enabling hands-free document creation and text input.

Distil-Whisper

Distil-Whisper is a distilled version of OpenAI's Whisper that runs about 6x faster while staying within 1% of Whisper's word error rate.

Deflection Rate

Deflection rate measures the percentage of support requests that are resolved through self-service channels like AI chatbots instead of reaching human agents.

Dynamic Content

Dynamic content automatically adjusts website, email, or app content based on user attributes, behavior, and AI-powered personalization rules.

Digital Worker

A digital worker is an AI-powered software agent that performs business tasks autonomously, combining multiple AI capabilities to handle complex workflows end-to-end.

Document AI

Document AI uses artificial intelligence to understand, extract, classify, and process information from documents of all types, from invoices to contracts to medical records.

Developer Experience

Developer experience (DX) encompasses how easy and enjoyable it is for developers to use a platform, API, or tool, directly impacting adoption and retention.

Data Strategy

A data strategy defines how an organization collects, manages, and leverages its data assets to drive business value, especially for AI and analytics initiatives.

Dynamic Pricing AI

Dynamic pricing AI automatically adjusts prices in real-time based on demand, competition, inventory, customer segments, and market conditions.

Database

A database is an organized collection of structured data stored electronically, designed for efficient retrieval, management, and updating of information.

Document Database

A document database stores data as semi-structured documents (typically JSON or BSON), allowing flexible schemas and natural representation of nested, hierarchical data.

DynamoDB

Amazon DynamoDB is a fully managed, serverless NoSQL database service that provides single-digit millisecond performance at any scale with automatic scaling.

Data Pipeline

A data pipeline is an automated series of steps that move and process data from source systems to destination systems, ensuring data flows reliably through an application architecture.

Data Ingestion

Data ingestion is the process of collecting and importing data from various sources into a storage or processing system for immediate or later use.

Data Transformation

Data transformation is the process of converting data from one format, structure, or value system to another to meet the requirements of a target system or analysis.

Data Cleaning

Data cleaning is the process of detecting and correcting errors, inconsistencies, and inaccuracies in data to improve its quality for analysis and model training.

Data Validation

Data validation is the process of checking data against defined rules and constraints to ensure it meets quality standards before being processed or stored.
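
A minimal rule-based validator sketch (the rule names and record schema are made up for illustration; production systems typically validate against declared schemas and statistical expectations):

```python
def validate(record, rules):
    """Return the names of the rules the record violates.
    rules maps rule name -> predicate over the record."""
    return [name for name, check in rules.items() if not check(record)]

# Illustrative rules for a hypothetical user record.
RULES = {
    "age_in_range": lambda r: 0 <= r.get("age", -1) <= 130,
    "email_has_at": lambda r: "@" in r.get("email", ""),
}
```

A pipeline would quarantine or reject records whose violation list is non-empty before they reach training or storage.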

Data Profiling

Data profiling is the process of examining data to understand its structure, content, quality, and statistical characteristics before processing or analysis.

Data Deduplication

Data deduplication is the process of identifying and removing duplicate records from a dataset to ensure each entity is represented only once.

dbt

dbt (data build tool) is an open-source transformation tool that enables data analysts and engineers to transform data in their data warehouse using SQL with software engineering best practices.

Databricks

Databricks is a unified data analytics and AI platform built on Apache Spark, providing collaborative notebooks, managed infrastructure, and integrated ML lifecycle management.

DuckDB

DuckDB is an in-process analytical database designed for fast OLAP queries, functioning as the SQLite equivalent for analytics with excellent Pandas and Parquet integration.

Distributed Database

A distributed database spreads data across multiple nodes or data centers, providing horizontal scalability, fault tolerance, and geographic data locality.

Database Index

A database index is a data structure that improves the speed of data retrieval operations at the cost of additional storage and write overhead.

Database View

A database view is a virtual table defined by a SQL query that provides a simplified or restricted perspective on underlying data without storing data separately.

Database Trigger

A database trigger is a stored procedure that automatically executes in response to specific data modification events such as INSERT, UPDATE, or DELETE operations.

Database Transaction

A database transaction is a sequence of operations executed as a single logical unit that either completes entirely or rolls back completely, ensuring data consistency.
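
A sketch using Python's built-in `sqlite3`, where `with conn:` commits on success and rolls back on an exception — the transfer either fully applies or leaves both balances untouched:

```python
import sqlite3

def transfer(conn, src, dst, amount):
    """Move amount between accounts atomically: both updates commit
    together, or the whole transaction rolls back."""
    try:
        with conn:  # commits on success, rolls back on exception
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, src))
            (bal,) = conn.execute(
                "SELECT balance FROM accounts WHERE name = ?", (src,)).fetchone()
            if bal < 0:
                raise ValueError("insufficient funds")
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, dst))
    except ValueError:
        pass  # transaction rolled back; balances unchanged

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (name TEXT, balance INTEGER)")
conn.executemany("INSERT INTO accounts VALUES (?, ?)",
                 [("alice", 100), ("bob", 0)])
conn.commit()
```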

Deadlock

A deadlock occurs when two or more database transactions block each other by each holding a lock that the other needs, preventing any from proceeding.

Data Wrangling

Data wrangling is the process of transforming raw, messy data into a clean, structured format suitable for analysis, modeling, or application use.

Data Validation (Data Engineering)

Data validation in data engineering is the process of verifying that data meets defined quality standards, schemas, and business rules before it enters a system or pipeline.

Data Normalization (Data Engineering)

Data normalization in data engineering is the process of organizing data to reduce redundancy and standardize formats, values, and structures across datasets.

Data Partitioning

Data partitioning divides a large dataset into smaller, more manageable segments based on a defined strategy, improving query performance and enabling parallel processing.

Data Replication

Data replication copies data across multiple database nodes to improve availability, fault tolerance, and read performance by serving requests from replicas.

Data Lake

A data lake is a centralized storage repository that holds vast amounts of raw data in its native format until needed for analysis, reporting, or machine learning.

Data Warehouse

A data warehouse is a centralized repository of structured, cleaned data optimized for analytical queries, reporting, and business intelligence.

Data Lakehouse

A data lakehouse combines the low-cost, flexible storage of data lakes with the performance, reliability, and SQL query capabilities of data warehouses.

Database Normalization

Database normalization is the process of structuring relational database tables to minimize data redundancy and eliminate insertion, update, and deletion anomalies.

Denormalization

Denormalization intentionally introduces data redundancy into a database design to improve read performance by reducing the need for complex joins.

Data Modeling

Data modeling is the process of defining and organizing data structures, relationships, and constraints that represent real-world entities and business processes.

Database Replication

Database replication copies data from a primary database to one or more replicas in real-time, enabling read scaling, high availability, and disaster recovery.

Data Lineage

Data lineage tracks the origin, movement, and transformation of data through a system, providing visibility into where data comes from and how it has been processed.

Data Governance

Data governance is the framework of policies, processes, and standards that ensure data is managed consistently, securely, and in compliance with regulations.

Data Catalog

A data catalog is a centralized inventory of data assets that provides metadata, documentation, and search capabilities to help teams discover and understand available data.

Data Mesh

Data mesh is an organizational and architectural approach that decentralizes data ownership to domain teams while maintaining interoperability through self-serve infrastructure and governance.

Data Quality

Data quality measures how well data meets the requirements for its intended use, encompassing accuracy, completeness, consistency, timeliness, and validity.

Database Scaling

Database scaling is the process of increasing a database system's capacity to handle growing data volumes and query loads through vertical or horizontal strategies.

Data Serialization

Data serialization is the process of converting in-memory data structures into a format that can be stored, transmitted, or reconstructed in another environment.

Data Encryption

Data encryption transforms data into an unreadable format using cryptographic algorithms, protecting it from unauthorized access both at rest in storage and in transit over networks.

Database Monitoring

Database monitoring continuously tracks database health, performance metrics, and resource utilization to detect issues before they impact application performance.

Data Retention

Data retention is the policy and practice of determining how long data should be stored, when it should be archived, and when it should be permanently deleted.

Database Migration

A database migration is the process of moving data, schema, or an entire database from one system, version, or platform to another while preserving data integrity.

Data Anonymization

Data anonymization is the process of irreversibly removing or altering personally identifiable information from datasets while preserving their analytical utility.

Database Index Types

Database index types are different data structures and algorithms used to index data, each optimized for specific query patterns and data characteristics.

Data Versioning

Data versioning tracks changes to datasets over time, enabling reproducibility, rollback, and comparison of data at different points for AI model development and data pipelines.

Database Data Types

Database data types define the kind of values a column can store, such as integers, text, timestamps, JSON, or custom types, influencing storage, validation, and query behavior.

Data Sampling

Data sampling is the process of selecting a representative subset of data from a larger dataset for analysis, testing, or model development when processing the full dataset is impractical.

Data Integration

Data integration combines data from multiple disparate sources into a unified, consistent view, enabling comprehensive analysis and applications across organizational data.

Dot Product

The dot product is an operation that takes two equal-length vectors and returns a single scalar, measuring the similarity between vectors and forming the basis of attention mechanisms.

Determinant

The determinant is a scalar value computed from a square matrix that indicates whether the matrix is invertible and describes the scaling factor of the linear transformation it represents.
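
A small pure-Python sketch using Laplace expansion along the first row (fine for tiny matrices; numerical libraries use LU decomposition instead). A zero determinant flags a non-invertible matrix:

```python
def det(m):
    """Determinant of a square matrix (list of rows) by Laplace
    expansion along the first row. O(n!) - illustration only."""
    n = len(m)
    if n == 1:
        return m[0][0]
    total = 0
    for j in range(n):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += ((-1) ** j) * m[0][j] * det(minor)
    return total
```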

Diagonal Matrix

A diagonal matrix has non-zero elements only on its main diagonal, making multiplication and inversion trivially efficient.

Dirichlet Distribution

The Dirichlet distribution is a multivariate distribution over probability vectors, widely used as a prior for categorical distributions in Bayesian models.
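
A sampling sketch using only the standard library: normalizing independent Gamma draws yields a Dirichlet sample, i.e. a non-negative vector that sums to 1:

```python
import random

def dirichlet_sample(alphas, rng=random):
    """Sample a probability vector from Dirichlet(alphas) by drawing
    independent Gamma(alpha_i, 1) variates and normalizing them."""
    g = [rng.gammavariate(a, 1.0) for a in alphas]
    total = sum(g)
    return [x / total for x in g]
```

Larger concentration parameters pull mass toward the corresponding component; alphas of all 1s give a uniform distribution over the simplex.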

Dynamic Programming

Dynamic programming solves complex problems by breaking them into overlapping subproblems and storing their solutions, avoiding redundant computation.
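
A classic example — edit distance memoized with `functools.lru_cache`, so each (i, j) subproblem is solved exactly once instead of exponentially many times:

```python
from functools import lru_cache

def edit_distance(a, b):
    """Minimum number of insertions, deletions, and substitutions
    turning string a into string b. Caching the overlapping (i, j)
    subproblems makes the runtime O(len(a) * len(b))."""
    @lru_cache(maxsize=None)
    def d(i, j):
        if i == 0:
            return j          # insert the remaining j characters
        if j == 0:
            return i          # delete the remaining i characters
        cost = 0 if a[i - 1] == b[j - 1] else 1
        return min(d(i - 1, j) + 1,        # deletion
                   d(i, j - 1) + 1,        # insertion
                   d(i - 1, j - 1) + cost) # substitution (or match)
    return d(len(a), len(b))
```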

Dimensionality Reduction

Dimensionality reduction projects high-dimensional data into a lower-dimensional space while preserving important structure.

Devin

Devin is an autonomous AI software engineer by Cognition Labs that can plan, write, debug, and deploy software independently using its own development environment.

Dialogflow

Dialogflow is Google's conversational AI platform for building chatbots and voice assistants, offering intent-based design and integration with Google Cloud services.

DeepSeek

DeepSeek is a Chinese AI company known for developing highly efficient open-source language models that rival Western frontier models at a fraction of the training cost.

Dify

Dify is an open-source platform for building LLM applications with a visual workflow editor, RAG pipeline management, and agent orchestration capabilities.

Drift

Drift is a conversational marketing and sales platform that uses AI-powered chatbots to engage website visitors and accelerate the buyer journey.

Databricks AI

Databricks is a unified data and AI platform built on Apache Spark that provides lakehouse architecture for data engineering, analytics, and machine learning.

DataRobot

DataRobot is an enterprise AI platform that automates the end-to-end process of building, deploying, and monitoring machine learning models.

D-ID

D-ID is a company that creates AI-powered talking head videos, enabling realistic digital humans that can speak and present content.

Descript

Descript is an AI-powered audio and video editing platform that lets users edit media by editing the text transcript, making content creation intuitive.

Deepgram

Deepgram provides AI speech recognition APIs optimized for speed and accuracy, powering real-time transcription and voice AI applications.

Detectron2

Detectron2 is Meta AI's state-of-the-art object detection and segmentation library built on PyTorch, providing modular implementations of leading detection algorithms.

DVC

DVC (Data Version Control) is a version control system for ML projects, handling large data files and model versioning that Git cannot manage efficiently.

DSPy

DSPy is a framework for programming with foundation models that replaces manual prompt engineering with systematic, optimizable modules and automatic prompt optimization.

Dask

Dask is a parallel computing library for Python that scales pandas, NumPy, and scikit-learn workflows to multi-core machines and distributed clusters.

Dash

Dash is a Python framework by Plotly for building analytical web applications with interactive visualizations, requiring no JavaScript knowledge.

DeepSpeed

DeepSpeed is a deep learning optimization library by Microsoft that enables training of extremely large models through memory-efficient techniques and distributed computing.

Dagster

Dagster is a data orchestration framework that organizes pipelines around data assets rather than tasks, providing a data-aware approach to workflow management.

dbt

dbt (data build tool) is a transformation framework that enables data teams to build reliable data pipelines using SQL-based models with version control and testing.

Diffusers

Diffusers is a Hugging Face library for state-of-the-art diffusion models, providing pretrained pipelines for image, audio, and 3D generation tasks.

DSPy Assertions

DSPy Assertions are runtime constraints in the DSPy framework that enforce LLM output requirements, automatically retrying with feedback when assertions fail.

Diagnostic AI

Diagnostic AI uses machine learning to analyze medical data and assist in identifying diseases, conditions, and abnormalities with high accuracy.

Drug Discovery

AI-driven drug discovery uses machine learning to accelerate the identification, design, and optimization of new pharmaceutical compounds.

Document Review

AI document review uses machine learning to classify, prioritize, and analyze large document collections for relevance, privilege, and key information.

Demand Forecasting

AI demand forecasting uses machine learning to predict future product demand, enabling better inventory planning and supply chain optimization.

Digital Twin

A digital twin is a virtual replica of a physical asset, process, or system that uses real-time data and AI for simulation, monitoring, and optimization.

Dermatology AI

Dermatology AI uses image recognition to analyze skin conditions and assist in diagnosing dermatological diseases.

Due Diligence AI

Due diligence AI automates the review and analysis of documents during mergers, acquisitions, and investment transactions.

Document Review AI

Document review AI uses machine learning to classify, categorize, and analyze large volumes of documents for legal and compliance purposes.

Dynamic Pricing

AI dynamic pricing automatically adjusts product prices in real time based on demand, competition, inventory, and market conditions.

Defect Detection

AI defect detection uses machine learning to identify manufacturing defects in products, materials, and components.

Disaster Response AI

Disaster response AI uses machine learning to improve emergency preparedness, response coordination, and recovery operations.

Drone AI

Drone AI uses machine learning for autonomous navigation, mission planning, and intelligent data collection from aerial platforms.

Document AI

Document AI uses machine learning to extract, classify, and process information from documents across industries.

Dental AI

Dental AI uses machine learning to analyze dental X-rays, detect oral diseases, and assist with treatment planning.

Document Management AI

Document management AI uses machine learning to automatically organize, classify, and manage enterprise documents.

Drug Interaction AI

Drug interaction AI uses machine learning to predict potentially harmful interactions between medications, supplements, and foods.

Document Verification

Document verification uses AI to authenticate identity documents by analyzing security features, detecting forgeries, and extracting data automatically.

DGX

NVIDIA DGX is a line of purpose-built AI supercomputer systems combining multiple high-end GPUs, high-speed networking, and optimized software for AI training.

Distributed Computing

Distributed computing spreads computation across multiple machines, essential for training large AI models that exceed the capacity of any single device.

DPU

A Data Processing Unit (DPU) is a programmable processor that offloads networking, storage, and security tasks from CPUs in data center infrastructure.

DGX A100

The NVIDIA DGX A100 is a purpose-built AI system featuring eight A100 GPUs connected via NVSwitch, designed for AI training and inference at scale.

DGX H100

The NVIDIA DGX H100 is a next-generation AI system with eight H100 GPUs and NVSwitch, delivering dramatically higher performance for AI training and inference.

DGX Cloud

DGX Cloud is an AI supercomputing service that provides instant access to NVIDIA DGX systems through cloud providers, eliminating the need to build on-premise infrastructure.

Data Center GPU

A data center GPU is a GPU specifically designed for deployment in servers and data centers, optimized for AI training, inference, and high-performance computing workloads.

Dense Retrieval

Dense retrieval uses learned dense vector representations to find relevant documents, encoding semantic meaning for similarity-based search.
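A toy sketch in plain Python — the document vectors and query below are made-up stand-ins for what a trained encoder would produce; ranking is by cosine similarity:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

# toy "embeddings" — in practice these come from a trained encoder
docs = {"doc1": [0.9, 0.1, 0.0], "doc2": [0.1, 0.8, 0.2]}
query = [0.85, 0.15, 0.05]

best = max(docs, key=lambda d: cosine(query, docs[d]))
print(best)  # doc1
```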

Dwell Time

Dwell time is the duration a user spends on a page after clicking a search result before returning to the search results, indicating content satisfaction.

Dense Passage Retrieval

Dense passage retrieval (DPR) uses dual-encoder neural networks to encode queries and passages as dense vectors for efficient semantic similarity search.

Deep Recommendation

Deep recommendation uses deep learning neural networks to model complex user-item interactions, capturing non-linear patterns that traditional methods miss.

Document Frequency

Document frequency measures how many documents in a collection contain a particular term, used inversely in scoring to weight rare terms more heavily.
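A tiny illustration over a made-up three-document corpus; the +1 smoothing in the inverse weighting is one common convention, not the only one:

```python
import math

corpus = [
    "the cat sat on the mat",
    "the dog chased the cat",
    "dogs and cats are pets",
]
docs = [set(d.split()) for d in corpus]
N = len(docs)

def df(term):
    """Number of documents containing the term."""
    return sum(term in d for d in docs)

def idf(term):
    # smoothed inverse document frequency: rarer terms score higher
    return math.log(N / (1 + df(term)))

print(df("cat"), df("mat"))  # 2 1
```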

Document Enrichment

Document enrichment enhances indexed content with additional metadata, entities, classifications, and embeddings to improve search relevance and enable new query capabilities.

Discriminative Model

A discriminative model learns the boundary between classes to classify or predict outputs, without modeling the underlying data generation process.

Documentation Generation

Documentation generation automates the creation of software documentation, including API docs, code comments, user guides, and README files using AI.

Digital Art AI

Digital art AI uses generative models to create digital artwork spanning illustrations, paintings, abstract compositions, and mixed media pieces.

Docstring Generation

Docstring generation uses AI to automatically create function and class documentation strings from code analysis, following language-specific conventions.

Differentiable Programming

Differentiable programming extends deep learning by making entire programs differentiable, enabling gradient-based optimization of complex computational processes.

Data Augmentation (Research Perspective)

Data augmentation research develops techniques for artificially expanding training datasets to improve model robustness and generalization.

DELETE

DELETE is an HTTP method used to remove a specified resource from the server.
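A small sketch using Python's standard library; the URL is hypothetical and the request is constructed but never actually sent:

```python
from urllib.request import Request

# build (but do not send) a DELETE request for a hypothetical resource URL
req = Request("https://api.example.com/items/42", method="DELETE")
print(req.get_method())  # DELETE
```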

Deno

Deno is a secure JavaScript and TypeScript runtime created by Ryan Dahl as a modern alternative to Node.js with built-in TypeScript support and security defaults.

Descriptive Analytics

Descriptive analytics summarizes historical data to understand what has happened, using metrics, reports, and visualizations.

Diagnostic Analytics

Diagnostic analytics examines data to understand why something happened, identifying root causes behind observed patterns and trends.

Data Visualization

Data visualization is the graphical representation of data and information using charts, graphs, maps, and other visual elements.

Dashboard

A dashboard is a visual display of key metrics and data points consolidated on a single screen for monitoring and decision-making.

D3.js

D3.js is a JavaScript library for creating custom, interactive data visualizations using web standards like SVG, HTML, and CSS.

Dashboard Analytics

Dashboard analytics presents key performance indicators and metrics in a unified visual interface for monitoring and decision-making.

Descriptive Statistics

Descriptive statistics summarize and describe the main features of a dataset using measures of central tendency, dispersion, and shape.

Data Pipeline

A data pipeline is an automated series of processes that extracts, transforms, and loads data from sources to destinations for analysis.

Data Warehouse

A data warehouse is a centralized repository that stores structured, processed data optimized for analytical queries and reporting.

Data Lake

A data lake stores vast amounts of raw data in its native format, supporting diverse analytics workloads from structured queries to machine learning.

Data Modeling

Data modeling defines the structure, relationships, and constraints of data to organize it for efficient storage, querying, and analysis.

Data Quality

Data quality measures the fitness of data for its intended use, assessed across dimensions like accuracy, completeness, consistency, and timeliness.

Data Governance

Data governance establishes policies, processes, and standards for managing data assets to ensure quality, security, compliance, and usability.

Data Literacy

Data literacy is the ability to read, understand, create, and communicate data effectively in context.

Data Democratization

Data democratization makes data accessible to all employees regardless of technical skill, enabling organization-wide data-driven decisions.

Data Storytelling

Data storytelling combines data, visualizations, and narrative to communicate insights in a compelling and actionable way.

Data Catalog

A data catalog is an organized inventory of data assets that helps users discover, understand, and trust available data.

Data Mining

Data mining discovers patterns, anomalies, and relationships in large datasets using statistical and machine learning methods.

Data Lineage

Data lineage tracks the origin, movement, and transformation of data throughout its lifecycle from source systems to analytics outputs.

Data-Driven Decision Making

Data-driven decision making uses data analysis and evidence rather than intuition alone to guide organizational decisions and strategy.

Data Enrichment

Data enrichment enhances existing datasets by appending additional information from external or internal sources.

Dartmouth Conference

The 1956 Dartmouth Conference is considered the founding event of artificial intelligence as a formal academic discipline.

Deep Learning Revolution

The deep learning revolution refers to the breakthrough period from 2012 onward when deep neural networks achieved dramatic advances across AI tasks.

Deep Blue

Deep Blue was IBM's chess computer that defeated world champion Garry Kasparov in 1997, a landmark moment for AI in public consciousness.

Demis Hassabis

Demis Hassabis is the CEO of Google DeepMind, known for AlphaGo, AlphaFold, and advancing AI for scientific discovery.

Dario Amodei

Dario Amodei is the CEO and co-founder of Anthropic, focused on developing safe and beneficial AI through research-driven approaches.

DALL-E Release

DALL-E, released by OpenAI in January 2021, was a pioneering AI system that could generate images from text descriptions using a transformer-based architecture.

DeepSeek R1 Release

DeepSeek R1, released in January 2025, is an open-source reasoning model from China that matched frontier AI performance at a fraction of the training cost.

E

Expectation Maximization

Expectation Maximization is an iterative algorithm for finding maximum likelihood parameters in models with latent variables, used to train Gaussian mixture models and HMMs.

Early Stopping

Early stopping halts model training when validation performance stops improving, preventing overfitting by selecting the model at its best generalization point.
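The patience-based logic can be sketched as follows; the validation losses are invented for illustration:

```python
def train_with_early_stopping(val_losses, patience=3):
    """Return the epoch with the best validation loss, halting after
    `patience` consecutive epochs without improvement."""
    best_loss, best_epoch, wait = float("inf"), 0, 0
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss:
            best_loss, best_epoch, wait = loss, epoch, 0
        else:
            wait += 1
            if wait >= patience:
                break
    return best_epoch

# loss improves until epoch 2, then plateaus; training halts early
print(train_with_early_stopping([1.0, 0.8, 0.7, 0.71, 0.72, 0.73, 0.6]))  # 2
```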

ELU

ELU (Exponential Linear Unit) is an activation function that uses an exponential curve for negative inputs, providing smoother outputs and faster learning than ReLU.
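The function itself is simple to state — identity for non-negative inputs, a scaled exponential curve below zero:

```python
import math

def elu(x: float, alpha: float = 1.0) -> float:
    """ELU: x for x >= 0, alpha * (e^x - 1) for x < 0."""
    return x if x >= 0 else alpha * (math.exp(x) - 1)

print(elu(2.0))             # 2.0
print(round(elu(-1.0), 4))  # -0.6321
```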

EfficientNet

EfficientNet is a family of CNN architectures that systematically scales network depth, width, and resolution together using a compound scaling method for optimal efficiency.

Encoder-Decoder

Encoder-decoder is a neural network architecture pattern where an encoder compresses input into a representation and a decoder generates output from that representation.

Exploding Gradient

The exploding gradient problem occurs when gradients grow exponentially during backpropagation, causing unstable training with wildly oscillating or diverging parameter updates.

Emergent Ability

An emergent ability is a capability that appears in large language models only above a certain scale threshold, absent in smaller models.

EOS Token

The end-of-sequence (EOS) token is a special token that signals the model to stop generating text.

Expert Parallelism

A model parallelism strategy where different experts in a Mixture of Experts model are placed on different GPUs, enabling efficient distributed inference.

Embeddings

Dense numerical vector representations of text that capture semantic meaning, enabling similarity search and retrieval for AI applications.

Embedding Model

A specialized model that converts text into dense numerical vectors (embeddings) for similarity search and retrieval applications.

Elo Rating

Elo rating is a scoring system adapted from chess that ranks language models based on pairwise comparison outcomes in evaluation arenas.
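The standard Elo update, as commonly applied to pairwise model comparisons; a K-factor of 32 is one conventional choice:

```python
def elo_update(r_a: float, r_b: float, score_a: float, k: float = 32.0):
    """Update two Elo ratings after one comparison.
    score_a is 1 if A wins, 0 if B wins, 0.5 for a tie."""
    expected_a = 1 / (1 + 10 ** ((r_b - r_a) / 400))
    delta = k * (score_a - expected_a)
    return r_a + delta, r_b - delta

# model A (1000) beats model B (1000): A gains 16 points, B loses 16
print(elo_update(1000, 1000, 1))  # (1016.0, 984.0)
```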

Evaluation Harness

An evaluation harness is a standardized framework for running benchmarks on language models with consistent settings and scoring.

Elo System

The Elo system is a mathematical framework for computing relative skill levels from pairwise competition results, widely used for LLM ranking.

Edge Deployment

Edge deployment runs language models on local devices like phones and laptops rather than cloud servers, enabling offline and private AI.

Entity Linking

Entity linking is the NLP task of connecting mentions of entities in text to their corresponding entries in a knowledge base.

Event Extraction

Event extraction is the NLP task of identifying events mentioned in text along with their participants, times, and locations.

Encoding Detection

Encoding detection is the task of determining the character encoding scheme used in a text file or byte stream.

ELMo

ELMo (Embeddings from Language Models) produces contextualized word embeddings using bidirectional LSTMs, where a word's vector changes based on its context.

Emotion Detection

Emotion detection is the NLP task of identifying specific emotions such as joy, anger, sadness, or fear expressed in text.

Emotion Classification

Emotion classification assigns one or more emotion labels from a predefined taxonomy to a piece of text.

Extractive Summarization

Extractive summarization creates summaries by selecting and combining the most important sentences directly from the source document.

Extractive QA

Extractive QA answers questions by identifying and extracting the exact answer span from a given text passage.

Empathetic Dialogue

Empathetic dialogue systems recognize user emotions and respond with appropriate emotional awareness, understanding, and support.

Encoder-Decoder Model

An encoder-decoder model uses one component to understand the input and another to generate the output, ideal for transformation tasks.

Extractive Reading Comprehension

Extractive reading comprehension finds the exact text span within a passage that answers a given question.

Embedding Model

An embedding model is a neural network trained to convert text into dense numerical vectors that capture semantic meaning.

Entity Typing

Entity typing assigns semantic types to entity mentions in text, determining whether an entity is a person, organization, location, or more specific category.

Event Detection

Event detection identifies mentions of events in text and classifies them by type, such as attacks, elections, mergers, or natural disasters.

Entity Coreference

Entity coreference identifies when different expressions in a text refer to the same real-world entity, linking mentions like "Barack Obama," "he," and "the president."

Embeddings

Embeddings are numerical representations of text that capture semantic meaning, allowing AI systems to understand and compare content mathematically.

E5

E5 (EmbEddings from bidirEctional Encoder rEpresentations) is a family of open-source text embedding models from Microsoft known for strong zero-shot retrieval performance.

Euclidean Distance

The straight-line distance between two points in vector space, measuring how far apart two embeddings are in their multidimensional representation.
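In code, with Python's standard library (math.dist computes the same formula directly):

```python
import math

a = [1.0, 2.0, 3.0]
b = [4.0, 6.0, 3.0]

# square root of the summed squared differences per dimension
d = math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
print(d, math.dist(a, b))  # 5.0 5.0
```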

E5-Mistral

A high-performance embedding model built on the Mistral-7B language model, achieving state-of-the-art retrieval quality through instruction-tuned training.

Error Recovery

An agent's ability to detect, diagnose, and recover from errors during task execution, maintaining progress and finding alternative approaches when problems occur.

Episodic Memory

Memory of specific past interactions or events, allowing an agent to recall what happened in particular conversations or task executions.

Entity Memory

Memory focused on tracking specific entities mentioned in conversations, maintaining a structured record of what the agent knows about each entity.

Event-driven Workflow

A workflow pattern where processing is triggered by events rather than following a fixed sequence, enabling reactive and asynchronous agent behavior.

Execution Agent

An agent specialized in carrying out specific tasks or actions as directed by a planning agent or orchestrator, focusing on reliable task completion.

Embed Code

An embed code is a JavaScript snippet that website owners add to their pages to install and display a chat widget or chatbot.

Entity Extraction

Entity extraction identifies and extracts structured information like names, dates, and numbers from unstructured user messages.

Embedded Chat

An embedded chat is a conversational interface integrated directly into a web page or application rather than opening as a separate window.

Escalation

Escalation is the process of transferring a chatbot conversation to a human agent or higher-tier support when the bot cannot resolve the issue.

Email Bot

An email bot is an AI system that automatically reads, understands, and responds to incoming emails in a conversational manner.

Escalation Trigger

An escalation trigger is a condition or event that automatically initiates the transfer of a chat from bot to human agent.

Escalation Rule

An escalation rule is a configured policy that defines when and how chatbot conversations should be transferred to human agents.

Escalation Rate

Escalation rate is the percentage of chatbot conversations that are transferred to human agents for resolution.

Exit Intent (Chatbot)

Exit intent detection triggers a chatbot message when a user shows signs of leaving the page, such as moving the cursor toward the browser close button.

Event Trigger

An event trigger activates chatbot behavior in response to custom application events like purchases, form submissions, or errors.

Entity Training

Entity training teaches a chatbot to extract specific data values like names, dates, and product names from user messages.

Enterprise Chatbot Plan

An enterprise chatbot plan provides high-volume usage, advanced features, custom SLAs, and dedicated support for large organization deployments.

Equalized Odds

A fairness criterion requiring equal true positive and false positive rates across demographic groups, ensuring error rates are similar for all groups.

Equal Opportunity

A fairness criterion requiring equal true positive rates across demographic groups, ensuring qualified individuals from all groups are equally likely to receive positive outcomes.

Explainability

The ability of an AI system to provide understandable explanations of how it arrives at its outputs, enabling humans to understand and trust AI decisions.

EU AI Act

The European Union's comprehensive AI regulation that classifies AI systems by risk level and imposes requirements accordingly, ranging from transparency obligations to strict conformity and compliance measures for high-risk systems.

Experiment Tracking

Experiment tracking is the practice of recording parameters, metrics, code versions, and artifacts from ML experiments to enable comparison, reproducibility, and collaboration.

Edge Inference

Edge inference runs ML models directly on devices like phones, IoT sensors, or local servers rather than in the cloud, reducing latency and enabling offline operation.

Experiment Management

Experiment management is the systematic organization, tracking, and comparison of machine learning experiments across parameters, datasets, and results.

Embedding Infrastructure

Embedding infrastructure provides the systems for generating, storing, indexing, and serving vector embeddings at scale for AI applications like search, recommendations, and RAG.

EfficientDet

EfficientDet is a family of scalable object detection models that use compound scaling and a bi-directional feature pyramid network for efficient multi-scale detection.

Edge Detection

Edge detection identifies boundaries between different regions in images by detecting discontinuities in pixel intensity, forming the basis for many vision tasks.

Event Camera

An event camera captures per-pixel brightness changes asynchronously rather than full frames at fixed intervals, enabling high-speed, low-latency, high-dynamic-range vision.

ElevenLabs

ElevenLabs is an AI voice technology company offering high-quality text-to-speech, voice cloning, and audio generation through APIs and consumer products.

Endpoint Detection

Endpoint detection identifies the start and end of speech utterances in an audio stream, determining when a speaker begins and stops talking.

Expressive TTS

Expressive TTS generates speech with natural emotion, emphasis, and intonation, going beyond monotone synthesis to convey meaning and feeling.

Emotional TTS

Emotional TTS explicitly controls the emotional tone of synthesized speech, generating audio that conveys specific emotions like happiness, sadness, or anger.

ElevenLabs TTS

ElevenLabs is a leading AI voice platform offering high-quality text-to-speech, voice cloning, and voice design capabilities.

Environmental Sound Classification

Environmental sound classification identifies and categorizes non-speech sounds in audio recordings, such as traffic, rain, animals, or machinery.

Echo Cancellation

Echo cancellation removes acoustic echo from audio signals, preventing speakers from hearing their own voice echoed back during calls.

Enterprise Pricing

Enterprise pricing for AI products offers custom plans with volume discounts, dedicated support, security features, and SLAs tailored to large organization needs.

Enterprise AI

Enterprise AI refers to artificial intelligence solutions designed for large organizations, with requirements for security, compliance, scalability, and integration with existing systems.

Enterprise Chatbot

An enterprise chatbot is an AI conversational agent designed for large organizations, with features for security, compliance, integration, and multi-department deployment.

Enterprise Search

Enterprise search uses AI to find information across all organizational data sources, enabling employees to discover knowledge from documents, wikis, databases, and communication tools.

Education AI

Education AI applies artificial intelligence to learning and teaching, including personalized tutoring, content generation, assessment automation, and administrative support.

Engagement Rate

Engagement rate measures how actively and frequently users interact with an AI product, indicating product stickiness and the depth of user involvement.

Email AI

Email AI uses artificial intelligence to optimize email marketing through personalized content, send time optimization, subject line generation, and automated campaign management.

Enterprise Knowledge Management

Enterprise knowledge management uses AI to organize, discover, and deliver organizational knowledge across departments, making institutional expertise accessible to all employees.

Empathy in AI

Empathy in AI is the design of AI systems that recognize, understand, and respond appropriately to human emotions in conversations and interactions.

Elasticsearch

Elasticsearch is a distributed search and analytics engine built on Apache Lucene, widely used for full-text search, log analytics, and increasingly for vector search.

ETL

ETL (Extract, Transform, Load) is a data integration process that extracts data from sources, transforms it into a suitable format, and loads it into a destination system.
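A minimal sketch of the three stages, with in-memory stand-ins for the source and destination systems:

```python
def extract():
    # stand-in for reading from a source such as an API or CSV file
    return [{"name": " Alice ", "amount": "10"}, {"name": "Bob", "amount": "5"}]

def transform(rows):
    # clean whitespace and convert types into the destination's format
    return [{"name": r["name"].strip(), "amount": int(r["amount"])} for r in rows]

def load(rows, destination):
    # stand-in for writing into a warehouse table
    destination.extend(rows)

warehouse = []
load(transform(extract()), warehouse)
print(warehouse[0])  # {'name': 'Alice', 'amount': 10}
```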

ELT

ELT (Extract, Load, Transform) is a data integration approach that loads raw data into the destination first, then transforms it using the destination system processing power.

Embedded Database

An embedded database runs within the application process rather than as a separate server, providing lightweight data storage without external dependencies.

Elasticsearch as a Database

Elasticsearch used as a primary data store leverages its search and analytics capabilities for use cases where full-text search, log analytics, or flexible querying are primary requirements.

Event Sourcing

Event sourcing is a data pattern that stores the complete history of state changes as a sequence of immutable events, rather than only the current state.
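A toy sketch: a bank balance that is never stored directly, only rebuilt by replaying an append-only event log (event names invented for illustration):

```python
# the event log is the source of truth; current state is derived from it
events = [
    {"type": "deposited", "amount": 100},
    {"type": "withdrawn", "amount": 30},
    {"type": "deposited", "amount": 5},
]

def replay(events):
    """Rebuild the current balance by folding over the full event history."""
    balance = 0
    for e in events:
        if e["type"] == "deposited":
            balance += e["amount"]
        elif e["type"] == "withdrawn":
            balance -= e["amount"]
    return balance

print(replay(events))  # 75
```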

Eventual Consistency

Eventual consistency is a consistency model where distributed system replicas are guaranteed to converge to the same state given enough time without new updates.

Embedding

An embedding is a dense numerical vector representation of data such as text, images, or audio that captures semantic meaning in a format suitable for machine learning operations.

Eigenvalue

An eigenvalue is a scalar that indicates how much an eigenvector is stretched or compressed when a linear transformation (matrix) is applied to it.

Eigenvector

An eigenvector is a non-zero vector that, when a linear transformation is applied, changes only in scale (not direction), revealing the principal axes of the transformation.

Expectation

Expectation (expected value) is the weighted average of all possible values of a random variable, representing the long-run average outcome of a random process.

Entropy

Entropy measures the uncertainty or information content of a probability distribution, with higher entropy indicating more randomness and lower entropy indicating more predictability.
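Shannon entropy in bits, computed directly from the definition:

```python
import math

def entropy(probs):
    """Shannon entropy in bits: -sum(p * log2(p)) over nonzero probabilities."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

print(entropy([0.5, 0.5]))  # 1.0 — a fair coin is maximally uncertain
print(entropy([1.0]))       # zero uncertainty — fully predictable
```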

Effect Size

Effect size is a quantitative measure of the magnitude of a phenomenon or the strength of a relationship, complementing p-values by indicating practical significance.

Eigendecomposition

Eigendecomposition factorizes a square matrix into its eigenvalues and eigenvectors, revealing its fundamental geometric properties.
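A small sketch with NumPy, verifying the factorization A = V · diag(w) · V⁻¹ on a toy matrix:

```python
import numpy as np

A = np.array([[2.0, 0.0],
              [0.0, 3.0]])

# w holds the eigenvalues; the columns of V are the eigenvectors
w, V = np.linalg.eig(A)
print(sorted(w.tolist()))  # [2.0, 3.0]

# the factorization reconstructs the original matrix
assert np.allclose(V @ np.diag(w) @ np.linalg.inv(V), A)
```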

Exponential Distribution

The exponential distribution models the time between events in a Poisson process, characterized by a constant event rate.

Entropy (Mathematics)

Entropy measures the average uncertainty or information content of a random variable, quantifying how unpredictable a distribution is.

Euclidean Distance

Euclidean distance is the straight-line distance between two points in space, the most common distance metric in machine learning.

Exponential Family

The exponential family is a class of probability distributions with a common mathematical form that includes most distributions used in machine learning.

ElevenLabs

ElevenLabs is an AI audio company that provides industry-leading text-to-speech, voice cloning, and audio AI tools for content creators and developers.

Evidently AI

Evidently AI is an open-source tool for monitoring ML models in production, detecting data drift, and generating model performance reports.

Embedchain

Embedchain (now mem0) is a framework for building RAG applications that automatically handles chunking, embedding, storage, and retrieval from diverse data sources.

Electronic Health Records

Electronic Health Records (EHR) are digital versions of patient medical histories that AI can analyze for clinical insights, predictions, and workflow automation.

E-Discovery

E-discovery uses AI to identify, collect, process, and review electronically stored information for legal proceedings and investigations.

Education AI

Education AI applies machine learning and NLP to personalize learning, automate assessment, and improve educational outcomes for students and teachers.

Energy AI

Energy AI applies machine learning to optimize energy production, distribution, consumption, and storage across power grids and energy systems.

Educational Chatbot

Educational chatbots use AI to provide conversational learning support, answer questions, and guide students through material.

E-Commerce AI

E-commerce AI applies machine learning across online retail to personalize shopping experiences and optimize operations.

Energy Optimization in Manufacturing

AI energy optimization reduces manufacturing energy consumption through intelligent monitoring, scheduling, and process control.

Environmental AI

Environmental AI uses machine learning to monitor ecosystems, track climate change, and optimize sustainability efforts.

EdTech AI

EdTech AI applies machine learning to educational technology for personalized learning, assessment, and educational operations.

Estate Planning AI

Estate planning AI uses machine learning to automate wealth transfer planning, document generation, and tax optimization.

Elderly Care AI

Elderly care AI uses machine learning to support independent living, health monitoring, and social connection for older adults.

EV Charging AI

EV charging AI optimizes electric vehicle charging through smart scheduling, grid balancing, demand prediction, and route planning to charging stations.

EHR Integration

EHR integration connects electronic health record systems with AI tools to enable clinical decision support, automated documentation, and data-driven healthcare insights.

Emissions Tracking AI

Emissions tracking AI uses machine learning and sensor data to measure, monitor, and report greenhouse gas emissions across organizations and supply chains.

Edge Computing

Edge computing processes data near its source rather than in the cloud, enabling real-time AI inference with lower latency and better privacy.

Elasticsearch

Elasticsearch is an open-source distributed search engine widely used for full-text search, log analytics, and increasingly for AI-powered semantic search.

Edge N-Gram

Edge n-gram tokenization generates character sequences starting from the beginning of each token, commonly used to implement autocomplete and prefix-matching search features.
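The tokenization itself is just prefix generation; a minimal sketch (parameter names here are illustrative, not any particular search engine's API):

```python
def edge_ngrams(token, min_len=1, max_len=None):
    """All prefixes of token from min_len up to max_len characters."""
    max_len = max_len or len(token)
    return [token[:i] for i in range(min_len, min(max_len, len(token)) + 1)]

# index these prefixes so "sea" matches documents containing "search"
print(edge_ngrams("search", min_len=2))  # ['se', 'sea', 'sear', 'searc', 'search']
```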

Embedding Model

An embedding model converts text into dense numerical vectors that capture semantic meaning, enabling similarity-based search and retrieval across documents.

Entity Extraction

Entity extraction identifies and classifies named entities like people, organizations, locations, and products from text, enabling structured understanding of search queries and documents.

Email Generation

Email generation uses AI to compose professional, marketing, and personal emails, adapting tone, structure, and content to the intended purpose and audience.

Explainer Video AI

Explainer video AI automatically creates educational and marketing explainer videos from scripts or topics, combining visuals, narration, and animations.

End-to-End Learning

End-to-end learning trains a single model to map directly from raw inputs to final outputs, replacing multi-stage pipelines with separate components.

Embodied AI

Embodied AI focuses on AI systems that learn through physical interaction with the environment, such as robots and agents in simulated worlds.

Emergent Abilities

Emergent abilities are capabilities that appear in large language models at certain scale thresholds but are absent in smaller models.

Empirical Evaluation

Empirical evaluation is the systematic experimental testing of AI methods on datasets and benchmarks to measure their real-world performance.

Evaluation Protocol

An evaluation protocol defines the standardized procedure for testing and comparing AI models, including metrics, datasets, and methodology.

Emergent Abilities (Research Perspective)

Emergent abilities research studies why certain capabilities appear in large AI models only beyond particular scale thresholds while being absent in smaller models.

Endpoint

An endpoint is a specific URL in an API that represents a resource or action, serving as the point of interaction between systems.

Event-Driven Architecture

Event-driven architecture is a software design pattern where system components communicate by producing and consuming events asynchronously.

Express

Express is a minimal and flexible Node.js web application framework that provides a robust set of features for building web servers and APIs.

Edge Computing

Edge computing processes data at network locations closest to the user, reducing latency by running code on servers distributed worldwide.

Exponential Backoff

Exponential backoff is a retry strategy where the delay between retries increases exponentially, preventing overwhelming of recovering services.
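
A minimal sketch of the retry loop (helper names are illustrative, not from any specific library); jitter is added so many clients do not retry in lockstep:

```python
import random
import time

def retry_with_backoff(fn, max_retries=5, base_delay=0.1, max_delay=10.0):
    """Call fn, retrying on failure with exponentially growing delays plus jitter."""
    for attempt in range(max_retries):
        try:
            return fn()
        except Exception:
            if attempt == max_retries - 1:
                raise  # out of retries; surface the error
            # Delay doubles each attempt: base, 2*base, 4*base, ... capped at max_delay.
            delay = min(base_delay * (2 ** attempt), max_delay)
            time.sleep(delay + random.uniform(0, base_delay))
```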

Effect Size

Effect size quantifies the magnitude of a difference or relationship between groups, independent of sample size.

Embedded Analytics

Embedded analytics integrates data analysis and visualization capabilities directly within business applications and workflows.

Exponential Smoothing

Exponential smoothing is a family of forecasting methods that give exponentially decreasing weights to older observations.
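
The simplest member of the family, simple exponential smoothing, blends each new observation with the previous smoothed value; a short sketch (function name ours):

```python
def exponential_smoothing(series, alpha=0.5):
    """Simple exponential smoothing: s_t = alpha * x_t + (1 - alpha) * s_{t-1},
    giving exponentially decaying weight to older observations."""
    smoothed = [series[0]]
    for x in series[1:]:
        smoothed.append(alpha * x + (1 - alpha) * smoothed[-1])
    return smoothed
```

A higher alpha tracks recent changes faster; a lower alpha smooths more aggressively.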

ETL Process

ETL (Extract, Transform, Load) is a data integration process that moves data from source systems, transforms it, and loads it into a target system.

Event Tracking

Event tracking captures specific user actions and interactions within a product as structured data for analytics and behavioral analysis.

Exploratory Data Analysis

Exploratory data analysis (EDA) investigates datasets through summary statistics and visualizations to discover patterns, anomalies, and hypotheses.

ELIZA

ELIZA was a 1966 chatbot program that simulated conversation by pattern matching, creating the illusion of understanding.

Expert System

Expert systems are AI programs that emulate human expert decision-making using hand-coded rules and knowledge bases.

EU AI Act Passage

The EU AI Act, passed in 2024, is the first comprehensive legal framework for regulating artificial intelligence systems by risk level.

F

Few-Shot Learning

Few-shot learning enables AI models to learn new tasks or recognize new categories from only a handful of labeled examples.

Federated Learning

Federated learning trains AI models across multiple devices or organizations without sharing raw data, preserving privacy while enabling collaborative model improvement.

Feature Engineering

Feature engineering creates new input variables from raw data to improve model performance, leveraging domain knowledge to extract predictive signals.

Feature Selection

Feature selection identifies and keeps the most relevant input features while removing irrelevant or redundant ones to improve model performance and reduce complexity.

Feature Importance

Feature importance measures how much each input feature contributes to a model's predictions, helping understand which factors drive outcomes.

F1 Score

The F1 score is the harmonic mean of precision and recall, providing a single balanced measure of classification performance on positive cases.
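
The formula is easy to state in code; a minimal sketch working directly from confusion-matrix counts (true positives, false positives, false negatives):

```python
def f1_score(tp, fp, fn):
    """F1 = harmonic mean of precision and recall."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```

Because the harmonic mean punishes imbalance, a model with high precision but poor recall (or vice versa) still gets a low F1.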

Feedforward Neural Network

A feedforward neural network is a network where information flows in one direction from input to output, with no cycles or feedback loops between layers.

Feature Map

A feature map is the output produced by applying a convolutional filter to an input, representing where specific features are detected across the spatial dimensions.

Feed-Forward Network

The feed-forward network in a transformer is a two-layer MLP applied independently to each position after attention, expanding and compressing the representation.

Flash Attention

Flash Attention is an optimized attention algorithm that reduces memory usage and increases speed by computing attention in tiles without materializing the full attention matrix.

Forward Pass

A forward pass is the computation that takes input data through all layers of a neural network to produce predictions and a loss value.

Flow Matching

Flow matching is a generative modeling framework that learns a continuous flow from noise to data using optimal transport paths, offering simpler training than traditional diffusion models.

Foundation Model

A foundation model is a large AI model trained on broad data that can be adapted to many downstream tasks through fine-tuning or prompting.

Frequency Penalty

Frequency penalty is a generation parameter that reduces token probability proportionally to how often that token has already appeared in the output.
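
A toy sketch of the mechanism (the dict-based "logits" are a stand-in for a real model's vocabulary scores): each token's logit is reduced in proportion to how many times it has already been generated.

```python
from collections import Counter

def apply_frequency_penalty(logits, generated_tokens, penalty=0.5):
    """Subtract penalty * count from the logit of every already-generated token,
    making repetition progressively less likely."""
    counts = Counter(generated_tokens)
    return {tok: logit - penalty * counts.get(tok, 0)
            for tok, logit in logits.items()}
```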

Few-Shot Prompting

Few-shot prompting is a technique where examples of desired input-output pairs are included in the prompt to guide the model toward the expected behavior.

Full Fine-Tuning

Full fine-tuning updates all parameters of a pre-trained model on new data, providing maximum customization but requiring significant compute resources.

Flash Attention

Flash Attention is a hardware-aware algorithm that computes exact attention faster and with less memory by optimizing GPU memory access patterns.

Flash Decoding

An optimized algorithm for the decoding phase of LLM inference that parallelizes attention computation across the KV cache sequence dimension.

Feed-Forward Network

The fully connected neural network layers within each transformer block that process each token independently after attention, storing learned knowledge.

Function Calling

A capability that allows LLMs to generate structured function calls with arguments, enabling them to interact with external tools, APIs, and databases.

Fine-Tuning

The process of further training a pre-trained model on a specific dataset to improve its performance on a particular task or domain.

Few-Shot Learning

The ability of a model to learn and perform a new task from just a handful of examples provided in the prompt context.

FineWeb

FineWeb is a 15 trillion token web dataset from HuggingFace with advanced filtering that achieves state-of-the-art quality for web-only training data.

FastText

FastText is a word embedding model from Meta AI that represents words as bags of character n-grams, handling rare and misspelled words better than whole-word embeddings such as Word2Vec.

Few-Shot Learning in NLP

Few-shot learning in NLP enables models to perform tasks with only a handful of examples, rather than requiring large training datasets.

Financial NLP

Financial NLP applies natural language processing to financial documents, news, and communications for analysis, trading, and compliance.

Fine-Tuning for NLP

Fine-tuning adapts a pretrained language model to a specific task or domain by training it further on specialized data.

Feature Extraction for NLP

Feature extraction transforms raw text into numerical representations that machine learning models can process and learn from.

Fact Verification

Fact verification uses NLP to check whether claims made in text are supported by evidence from trusted sources.

Flesch-Kincaid

Flesch-Kincaid is a readability test that estimates the US school grade level needed to understand a text based on sentence length and syllable count.
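
The published Flesch-Kincaid grade-level formula can be computed directly from raw counts; a minimal sketch (obtaining the counts from text is a separate tokenization problem):

```python
def flesch_kincaid_grade(words, sentences, syllables):
    """Flesch-Kincaid grade level: longer sentences and more syllables
    per word both push the estimated grade up."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59
```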

Factual Consistency

Factual consistency checks whether generated text accurately reflects the facts in its source material without introducing hallucinations.

Fine-Grained Entity Typing

Fine-grained entity typing classifies entity mentions into detailed type hierarchies with hundreds of specific categories rather than a few broad types.

FLARE

Forward-Looking Active REtrieval is a technique where the model generates a tentative response and retrieves when it detects low-confidence tokens.

FAISS

Facebook AI Similarity Search is a library for efficient similarity search and clustering of dense vectors, widely used as a building block for vector search systems.

Flat Index

A vector index that stores all vectors without compression or approximation, providing exact nearest neighbor search by comparing against every vector in the database.

Fixed-size Chunking

A text splitting strategy that divides documents into chunks of a predetermined character or token count, simple to implement but may break content at arbitrary points.
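
A minimal sketch of the strategy; adding overlap between consecutive chunks is a common mitigation for the arbitrary break points (function name and defaults are illustrative):

```python
def fixed_size_chunks(text, chunk_size=20, overlap=5):
    """Split text into fixed-size character chunks, with each chunk
    repeating the last `overlap` characters of the previous one."""
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]
```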

Faithfulness

A RAG evaluation metric measuring whether the generated answer accurately represents the information in the retrieved context without adding unsupported claims.

Flowise

An open-source visual tool for building LangChain-based LLM applications through a drag-and-drop interface without writing code.

Fallback Strategy

A predefined alternative approach that an agent uses when its primary method fails, ensuring task completion through backup methods.

Function Calling

A capability of LLMs to generate structured function calls with appropriate parameters, enabling them to use tools and interact with external systems.

Flowise Agent

An agent built using Flowise, an open-source visual tool for creating LLM workflows and agents through a drag-and-drop interface built on LangChain.

Forced Tool Use

A configuration that requires the agent to use a specific tool or any tool before generating a response, ensuring tool-based grounding of answers.

FAQ Bot

An FAQ bot is a chatbot specifically designed to answer frequently asked questions by matching user queries to a curated knowledge base.

Fallback Response

A fallback response is a chatbot reply used when it cannot understand or answer the user query, guiding them toward alternative help.

Floating Chat Button

A floating chat button is a persistent, fixed-position button that remains visible as users scroll, providing constant access to the chat interface.

Full-Screen Chat

Full-screen chat is a conversational interface that occupies the entire viewport, providing an immersive chat experience.

Follow-Up Question

A follow-up question is a subsequent query that builds upon a previous exchange in the conversation, requiring contextual understanding.

Facebook Messenger Bot

A Facebook Messenger bot is a chatbot that operates within Facebook's Messenger platform, letting businesses interact with users directly in their Messenger conversations.

File Upload

File upload enables users to share documents, images, and other files within the chat conversation for bot or agent processing.

Fallback Intent

A fallback intent is the default intent triggered when a chatbot cannot match the user message to any defined intent with sufficient confidence.

Frustration Detection

Frustration detection identifies signs of user frustration in chat messages, enabling proactive response adjustments and escalation.

First Response Time

First response time is the duration between a user sending their first message and receiving the first meaningful response.

Fairness

The principle that AI systems should treat all individuals and groups equitably, producing outcomes that do not systematically disadvantage any demographic group.

Feature Importance

A measure of how much each input feature contributes to an AI model's predictions, helping explain which factors drive the model's decisions.

Feature Attribution

Methods that assign credit for an AI model's specific prediction to individual input features, explaining which parts of the input influenced the output.

Federated Learning

A machine learning approach where models are trained across multiple devices or servers without centralizing the raw data, preserving data privacy by design.

Feedback Loop Bias

Bias that amplifies over time when an AI system's outputs influence its future training data, creating self-reinforcing patterns that diverge from reality.

Feature Store

A feature store is a centralized platform for managing, storing, and serving machine learning features, ensuring consistency between training and inference.

FSDP

FSDP (Fully Sharded Data Parallel) is PyTorch's native implementation of sharded data parallelism that distributes model parameters, gradients, and optimizer states across GPUs to reduce memory usage.

Feature Drift

Feature drift is the change in the statistical distribution of individual input features over time, potentially degrading model performance when production data diverges from training data.

Feature Engineering Pipeline

A feature engineering pipeline automates the process of transforming raw data into meaningful features that ML models can use for training and inference.

Flash Attention

Flash Attention is an IO-aware exact attention algorithm that reduces memory usage and increases speed by minimizing reads and writes to GPU high-bandwidth memory.

Fine-Tuning Infrastructure

Fine-tuning infrastructure provides the compute, tools, and pipelines for adapting pre-trained ML models to specific tasks or domains using custom training data.

Face Detection

Face detection is a computer vision task that locates and identifies the position of human faces within images or video frames.

Face Recognition

Face recognition identifies or verifies a person's identity by comparing their facial features against a database of known faces using deep learning embeddings.

Faster R-CNN

Faster R-CNN is a two-stage object detection architecture that uses a Region Proposal Network (RPN) to generate candidate regions, then classifies and refines each proposal.

FLUX

FLUX is a next-generation text-to-image model by Black Forest Labs that uses a flow-matching approach with transformer architecture for high-quality image generation.

Face Verification

Face verification determines whether two face images belong to the same person, performing a one-to-one identity comparison.

Facial Landmark Detection

Facial landmark detection locates specific points on a face such as eyes, nose, mouth corners, and jawline to map facial geometry.

Facial Expression Recognition

Facial expression recognition classifies the emotional state displayed on a face, detecting expressions like happiness, sadness, anger, and surprise.

Face Generation

Face generation uses generative AI models to synthesize realistic human face images that depict people who do not exist.

Feature Extraction

Feature extraction converts raw image pixels into meaningful numerical representations that capture visual patterns, structures, and semantics for downstream tasks.

Few-Shot Learning for Vision

Few-shot learning for vision enables models to recognize new visual categories from just a few example images, mimicking human ability to learn from limited examples.

Face Anti-Spoofing

Face anti-spoofing detects presentation attacks on face recognition systems, distinguishing live faces from photos, videos, masks, and other spoofing attempts.

Frame Interpolation

Frame interpolation generates intermediate video frames between existing ones, increasing frame rate for smoother motion or slow-motion effects.

Faster Whisper

Faster Whisper is a reimplementation of OpenAI Whisper using CTranslate2 that delivers up to 4x faster inference with lower memory usage.

Fish Speech

Fish Speech is an open-source multilingual text-to-speech model supporting voice cloning and real-time synthesis across multiple languages.

Freemium

Freemium is a business model offering a free tier with basic AI capabilities alongside paid tiers with advanced features, enabling users to try before they buy.

Financial AI

Financial AI applies artificial intelligence to financial services, including fraud detection, algorithmic trading, risk assessment, customer service, and regulatory compliance.

First Contact Resolution

First contact resolution (FCR) measures the percentage of customer issues resolved during the initial interaction without requiring follow-up contacts or escalation.

Fraud Detection

Fraud detection uses AI and machine learning to identify suspicious activities, transactions, and behaviors in real time, protecting businesses from financial and operational losses.

Flywheel Effect

The flywheel effect is a self-reinforcing business cycle where each component accelerates the others, creating compounding growth over time.

Fallback Model

A fallback model is a backup AI model that automatically handles requests when the primary model is unavailable, over capacity, or returns errors.
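
A minimal sketch of the routing logic; `primary_model` and `fallback_model` are hypothetical stand-ins for real model clients:

```python
def call_with_fallback(primary, fallback, prompt):
    """Try the primary model; on any error, route the request to the fallback."""
    try:
        return primary(prompt)
    except Exception:
        return fallback(prompt)

def primary_model(prompt):
    raise TimeoutError("primary over capacity")  # simulated outage

def fallback_model(prompt):
    return f"fallback answer to: {prompt}"
```

Production routers typically add retries, per-error-type policies, and logging, but the core pattern is this try/except handoff.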

Foreign Key

A foreign key is a column in one table that references the primary key of another table, establishing a link between the two tables and enforcing referential integrity.
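
A small SQLite demonstration of referential integrity (table names are illustrative; note SQLite only enforces foreign keys when the pragma is enabled):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces FKs only when enabled
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("""CREATE TABLE orders (
    id INTEGER PRIMARY KEY,
    customer_id INTEGER REFERENCES customers(id)  -- the foreign key
)""")
conn.execute("INSERT INTO customers VALUES (1, 'Ada')")
conn.execute("INSERT INTO orders VALUES (100, 1)")       # OK: customer 1 exists

violation_caught = False
try:
    conn.execute("INSERT INTO orders VALUES (101, 99)")  # no customer 99
except sqlite3.IntegrityError:
    violation_caught = True  # the database rejects the orphaned row
```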

Firebase Firestore

Firebase Firestore is a serverless NoSQL document database by Google that provides real-time synchronization, offline support, and automatic scaling for web and mobile applications.

Fivetran

Fivetran is a managed data integration platform that automatically replicates data from hundreds of sources into data warehouses and lakes with minimal configuration.

Full-Text Search

Full-text search enables finding documents by matching natural language queries against text content, using techniques like tokenization, stemming, and relevance ranking.

Frobenius Norm

The Frobenius norm is the square root of the sum of squared elements of a matrix, analogous to the L2 norm for vectors.
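
The definition translates directly into code; a minimal sketch over a matrix represented as nested lists:

```python
import math

def frobenius_norm(matrix):
    """Square root of the sum of squared entries, treating the matrix
    as one long vector (hence the analogy to the L2 norm)."""
    return math.sqrt(sum(x * x for row in matrix for x in row))
```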

Freshchat

Freshchat is a modern messaging and chatbot platform by Freshworks, offering AI-powered customer engagement across web, mobile, and social messaging channels.

Fireworks AI

Fireworks AI is an inference platform that provides fast, cost-effective API access to open-source and custom AI models with optimized serving infrastructure.

Flowise

Flowise is an open-source visual tool for building LLM applications using a drag-and-drop interface, built on top of LangChain and LlamaIndex.

Fiddler AI

Fiddler AI provides an enterprise ML model monitoring and AI observability platform focused on explainability, fairness, and performance tracking.

Flowise

Flowise is an open-source visual tool for building LLM applications using a drag-and-drop interface, making it easy to create chatbots, agents, and RAG systems.

Flax

Flax is a high-performance neural network library built on top of JAX, developed by Google for flexible and efficient deep learning research.

fast.ai

fast.ai is a deep learning library built on PyTorch that provides high-level abstractions for training state-of-the-art models with minimal code.

Flair NLP

Flair is a PyTorch-based NLP framework that combines different word embeddings with state-of-the-art sequence labeling for named entity recognition and text classification.

Flyte

Flyte is an open-source workflow orchestration platform designed for ML and data pipelines, providing type-safe, reproducible, and scalable workflow execution.

FAISS

FAISS (Facebook AI Similarity Search) is a library for efficient similarity search and clustering of dense vectors, optimized for billion-scale vector operations.

Feast

Feast is an open-source feature store for machine learning that manages the storage, serving, and discovery of features used in ML model training and inference.

Financial AI

Financial AI applies machine learning to banking, investing, insurance, and financial services for risk assessment, fraud detection, and automated trading.

Fraud Detection

AI fraud detection uses machine learning to identify fraudulent transactions, claims, or activities in real time by recognizing anomalous patterns.

Food Industry AI

Food industry AI uses machine learning to optimize food production, safety inspection, and supply chain management.

Fashion AI

Fashion AI uses machine learning for trend forecasting, design assistance, personalization, and sustainable production.

FinTech AI

FinTech AI applies machine learning to innovate financial services through digital banking, payments, and lending platforms.

Fitness AI

Fitness AI uses machine learning to create personalized workout plans, track exercise form, and optimize athletic training.

Fleet Management AI

Fleet management AI optimizes the operation of vehicle fleets through route planning, predictive maintenance, driver monitoring, and resource allocation.

Fisheries AI

Fisheries AI uses machine learning to monitor fish populations, optimize catch sustainability, detect illegal fishing, and manage aquaculture operations.

Forest Monitoring AI

Forest monitoring AI uses satellite imagery and machine learning to track deforestation, forest health, fire risk, and biodiversity in near real-time.

FPGA

A Field-Programmable Gate Array (FPGA) is a reconfigurable chip that can be programmed for specific AI workloads, offering flexibility between GPUs and ASICs.

Fog Computing

Fog computing extends cloud computing to the network edge, providing distributed processing between end devices and centralized data centers for latency-sensitive AI applications.

FLOPS

FLOPS (Floating-Point Operations Per Second) measures the computational throughput of a processor, serving as the primary benchmark for comparing AI hardware performance.

Faceted Search

Faceted search allows users to filter search results by multiple attributes or categories, combining free-text search with structured navigation.

Fuzzy Search

Fuzzy search finds approximate matches by tolerating spelling errors, typos, and minor variations in search terms.

Filtered Search

Filtered search narrows search results by applying constraints on specific fields or attributes, such as date ranges, categories, prices, or status values.

Forward Index

A forward index maps documents to their contained terms and attributes, complementing the inverted index by enabling document-level lookups and attribute access.

Fashion Design AI

Fashion design AI generates clothing designs, patterns, textile concepts, and fashion illustrations using generative models and trend analysis.

Frame Problem

The frame problem is the challenge of representing what does not change when an action is performed in an AI reasoning system.

Flow Matching (Research Perspective)

Flow matching is a generative modeling framework that learns continuous transformation flows between noise distributions and data distributions.

Few-Shot Learning (Research Perspective)

Few-shot learning research studies how AI models can learn new tasks from only a handful of examples rather than large datasets.

Federated Learning (Research Perspective)

Federated learning research studies methods for training AI models across multiple devices without centralizing private data.

Financial Analytics

Financial analytics applies data analysis to financial data for better budgeting, forecasting, risk assessment, and investment decisions.

Funnel Chart

A funnel chart visualizes the progressive reduction of data through sequential stages of a process.

Funnel Analysis

Funnel analysis measures user progression through a sequence of steps, identifying where users drop off in a conversion process.

Fei-Fei Li

Fei-Fei Li is a computer scientist who created ImageNet, the dataset that catalyzed the deep learning revolution in computer vision.

First AI Winter

The first AI winter (1974-1980) was a period of reduced funding and interest in AI research following the failure of early AI systems to meet inflated expectations.

G

Gradient Boosting

Gradient boosting builds an ensemble of decision trees sequentially, where each new tree corrects the errors of the previous ones, achieving state-of-the-art results on tabular data.

Gaussian Mixture Model

A Gaussian mixture model represents data as a combination of multiple Gaussian distributions, providing probabilistic soft clustering with cluster membership probabilities.

Genetic Algorithm

A genetic algorithm is an optimization method inspired by natural evolution that evolves a population of solutions through selection, crossover, and mutation.

Gradient Descent

Gradient descent is an optimization algorithm that iteratively adjusts model parameters in the direction that reduces the loss function.
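
A minimal sketch of the update rule on a one-dimensional problem (the learning rate and step count are illustrative defaults):

```python
def gradient_descent(grad, x0, lr=0.1, steps=100):
    """Repeatedly step opposite the gradient to reduce the loss."""
    x = x0
    for _ in range(steps):
        x = x - lr * grad(x)
    return x

# Minimizing f(x) = (x - 3)^2, whose gradient is 2 * (x - 3):
minimum = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
```

Training a neural network applies this same update to millions of parameters at once, with the gradient supplied by backpropagation.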

GELU

GELU (Gaussian Error Linear Unit) is a smooth activation function that weights inputs by their probability under a Gaussian distribution, widely used in transformers.
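
The exact form multiplies the input by the Gaussian CDF, which `math.erf` expresses directly; a minimal sketch:

```python
import math

def gelu(x):
    """Exact GELU: x times the standard Gaussian CDF of x."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))
```

Unlike ReLU, GELU is smooth everywhere and lets small negative inputs pass through slightly, which tends to help transformer training.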

GRU

GRU (Gated Recurrent Unit) is a simplified RNN variant that uses two gates to control information flow, offering similar performance to LSTM with fewer parameters.

Grouped-Query Attention

Grouped-query attention shares key and value heads across multiple query heads, reducing memory bandwidth during inference while preserving most of multi-head attention quality.

Gradient Clipping

Gradient clipping limits the magnitude of gradients during training to prevent exploding gradients and stabilize the optimization process.
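
A minimal sketch of clipping by global norm, the common variant where all gradients are rescaled uniformly when their combined L2 norm exceeds a threshold (function name ours):

```python
import math

def clip_by_global_norm(grads, max_norm):
    """If the global L2 norm of the gradients exceeds max_norm,
    scale every gradient down by the same factor; otherwise pass through."""
    total_norm = math.sqrt(sum(g * g for g in grads))
    if total_norm > max_norm:
        scale = max_norm / total_norm
        return [g * scale for g in grads]
    return list(grads)
```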

Generative Adversarial Network

A generative adversarial network (GAN) is a framework where two neural networks, a generator and a discriminator, compete against each other to produce realistic synthetic data.

Generator

The generator is the neural network in a GAN that creates synthetic data from random noise, learning to produce outputs indistinguishable from real data.

Group Normalization

Group normalization divides feature channels into groups and normalizes within each group independently, providing stable normalization regardless of batch size.

Gradient Accumulation

Gradient accumulation simulates large batch sizes by accumulating gradients over multiple forward-backward passes before updating weights.

Gradient Checkpointing

Gradient checkpointing trades computation for memory by recomputing intermediate activations during the backward pass instead of storing them.

GPT

GPT (Generative Pre-trained Transformer) is a family of large language models developed by OpenAI that generate human-like text using transformer architecture.

GPT-4

GPT-4 is a multimodal large language model by OpenAI capable of processing text and images with advanced reasoning and generation capabilities.

GPT-4o

GPT-4o is an optimized multimodal variant of GPT-4 that processes text, images, and audio natively with faster response times and lower cost.

Gemini

Gemini is a family of multimodal AI models developed by Google DeepMind, designed to natively understand and generate text, images, code, and audio.

Greedy Decoding

Greedy decoding is a text generation strategy that always selects the single most probable next token, producing deterministic but often repetitive output.
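
A toy sketch of the argmax loop; the fixed probability table stands in for a real language model's next-token distribution:

```python
def greedy_decode(next_token_probs, start, max_len=10, eos="<eos>"):
    """At each step, append the single most probable next token (argmax)."""
    tokens = [start]
    while len(tokens) < max_len:
        probs = next_token_probs(tokens)
        best = max(probs, key=probs.get)
        if best == eos:
            break
        tokens.append(best)
    return tokens

# Hypothetical "model": next-token probabilities keyed on the last token only.
table = {"the": {"cat": 0.6, "dog": 0.4},
         "cat": {"sat": 0.9, "<eos>": 0.1},
         "sat": {"<eos>": 0.8, "down": 0.2}}
sequence = greedy_decode(lambda toks: table[toks[-1]], start="the")
```

Because every step is an argmax, the same prompt always yields the same output, which is why sampling strategies exist for more varied text.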

Grouped-Query Attention

Grouped-query attention (GQA) reduces KV cache size by sharing key-value heads across multiple query heads, balancing quality with memory efficiency.

GRPO

Group Relative Policy Optimization is a reinforcement learning method that scores groups of model outputs against each other, removing the need for a separate value (critic) model.

Grokking

A phenomenon where a neural network suddenly achieves perfect generalization long after memorizing the training data, after a period of apparent overfitting.

GPT-4o Mini

A smaller, faster, and cheaper variant of GPT-4o designed for high-volume tasks that need good quality at lower cost.

Gemini Flash

Google's fast and efficient Gemini variant optimized for high-volume, cost-sensitive applications with strong multimodal capabilities.

Gemini Pro

The core model in Google's Gemini family, providing strong general-purpose performance with native multimodal understanding.

Gemini Ultra

The most capable model in Google's Gemini family, designed for the most complex reasoning and multimodal tasks.

Grok-2

xAI's second-generation language model with strong reasoning capabilities and real-time access to information through the X platform.

GPTQ

A post-training quantization method that uses approximate second-order information to quantize LLM weights to 4-bit or lower with minimal accuracy loss.

GGUF

A file format for storing quantized LLM weights designed for efficient CPU and mixed CPU/GPU inference with llama.cpp.

Guardrails

Safety mechanisms and rules that constrain AI model behavior, preventing harmful, off-topic, or inappropriate outputs.

Grounding

The practice of connecting AI responses to verified factual sources, reducing hallucination by basing answers on retrieved or provided evidence.

Gradient Descent

The optimization algorithm used to train neural networks by iteratively adjusting weights in the direction that reduces the loss function.

GSM8K

GSM8K is a benchmark of 8,500 grade-school math word problems that test multi-step arithmetic reasoning in language models.

GPQA

GPQA (Graduate-Level Google-Proof QA) is a benchmark of expert-written questions designed so that even highly skilled non-experts with unrestricted internet access struggle to answer them.

GLUE

GLUE (General Language Understanding Evaluation) is a benchmark suite of nine NLU tasks that became the first standard for evaluating language models.

Grouped Query Attention

Grouped query attention shares key-value heads across multiple query heads, reducing memory usage while maintaining model quality.

GPU Inference

GPU inference uses graphics processing units to run language model computations, providing the parallel processing power needed for fast AI responses.

Grammar Checking

Grammar checking is the NLP task of detecting and correcting grammatical errors in text, including syntax, agreement, and punctuation issues.

GloVe

GloVe (Global Vectors for Word Representation) learns word embeddings by factorizing a word co-occurrence matrix from a text corpus.

Grammatical Error Correction

Grammatical error correction is the NLP task of automatically detecting and fixing grammatical mistakes in text.

Grounding

Grounding connects language model outputs to verifiable sources of truth, reducing hallucinations and improving factual accuracy.

Grapheme

A grapheme is the smallest unit of a writing system, such as a letter, character, or symbol that represents a sound or meaning.

Gunning Fog Index

The Gunning Fog Index estimates the years of formal education needed to understand a text on first reading, based on sentence length and complex word count.

Grounding

Grounding refers to connecting AI responses to specific source material or real-world information, ensuring answers are based on facts rather than generated from patterns.

Graph RAG

A RAG approach that uses knowledge graphs to structure and retrieve information, capturing entity relationships that flat document retrieval misses.

Groundedness

A measure of how well an AI response is supported by and traceable to specific source material, closely related to faithfulness in RAG evaluation.

Goal Decomposition

Breaking a high-level goal into smaller, manageable sub-goals that an agent can achieve incrementally, enabling complex task completion.

Guided Conversation

A guided conversation uses structured prompts and options to lead users through a specific workflow or information-gathering process.

Greeting Message

A greeting message is the initial message displayed when a user opens the chat, welcoming them and setting the conversation tone.

Greeting Detection

Greeting detection identifies when a user message is a greeting or salutation, triggering an appropriate welcome response.

GDPR Compliance (Chatbot)

GDPR compliance for chatbots ensures that the collection, processing, and storage of user data in conversations meets EU data protection requirements.

Goal Alignment

Ensuring an AI system's operational goals match what its designers and operators actually intend, avoiding unintended optimization targets.

Goodhart's Law

The principle that when a measure becomes a target, it ceases to be a good measure, highly relevant to AI systems optimized against specific metrics.

Gender Bias

Systematic favoritism or discrimination in AI outputs based on gender, often manifesting as stereotypical associations, unequal performance, or exclusionary language.

Group Fairness

Fairness criteria that compare AI system outcomes across demographic groups, ensuring no group is systematically advantaged or disadvantaged.

Global Explanation

An explanation of an AI model's overall behavior and decision patterns across all inputs, rather than for a single specific prediction.

GDPR

The General Data Protection Regulation is the EU's comprehensive data privacy law governing how organizations collect, process, and protect personal data of EU residents.

Guardrails

Safety mechanisms that constrain AI system behavior within predefined boundaries, preventing harmful, off-topic, or policy-violating outputs.

Guardrails AI

An open-source framework for adding validation and safety checks to LLM applications, providing configurable input/output guardrails through a Python library.

GPU

A GPU (Graphics Processing Unit) is a specialized processor designed for parallel computation, widely used for training and running machine learning models due to its ability to handle matrix operations efficiently.

GGUF

GGUF (GPT-Generated Unified Format) is a file format for storing quantized language models, designed for efficient loading and inference with llama.cpp and compatible tools.

GPTQ

GPTQ is a post-training quantization method for large language models that compresses model weights to lower precision (typically 4-bit) while preserving quality through careful calibration.

Google Vertex AI

Google Vertex AI is Google Cloud's unified ML platform for building, training, and deploying ML models and AI applications, including access to Gemini and other Google AI models.

Groq

Groq provides ultra-fast LLM inference using custom LPU (Language Processing Unit) hardware designed from the ground up for sequential token generation at extreme speed.

GPU Cluster

A GPU cluster is a group of interconnected servers equipped with GPUs, designed for large-scale parallel computing tasks like distributed ML model training.

Gradient Synchronization

Gradient synchronization is the process of aggregating gradients across multiple GPUs during distributed training to ensure all model replicas update consistently.

gRPC Endpoint

A gRPC endpoint serves ML model predictions using the gRPC protocol, offering lower latency and higher throughput than REST for inter-service communication.

Google Vertex AI Infrastructure

Google Vertex AI infrastructure provides managed compute, training, and serving capabilities for ML models on Google Cloud, including TPU access and AutoML.

Groq Cloud

Groq Cloud provides ultra-fast LLM inference using custom LPU (Language Processing Unit) hardware designed for sequential token generation at industry-leading speeds.

GPU Training

GPU training uses graphics processing units to accelerate machine learning model training through massive parallel computation of matrix operations and gradient calculations.

GPU Memory Management

GPU memory management involves techniques for efficiently allocating, using, and freeing GPU memory during ML training and inference to maximize model size and throughput.

GPU Orchestration

GPU orchestration manages the allocation, scheduling, and lifecycle of GPU resources across ML training and inference workloads in shared compute environments.

GPU Virtualization

GPU virtualization allows multiple workloads to share a single physical GPU through hardware or software partitioning, improving resource utilization and reducing costs.

GPT-4V

GPT-4V (GPT-4 with Vision) is OpenAI's multimodal model that can understand and reason about images alongside text, enabling visual question answering and analysis.

Gemini Pro Vision

Gemini Pro Vision is Google's multimodal AI model that natively understands text, images, and video, designed for visual reasoning and analysis tasks.

Gaussian Splatting

3D Gaussian Splatting represents scenes as collections of 3D Gaussian primitives that can be rendered in real time, enabling fast and high-quality novel view synthesis.

Grounding DINO

Grounding DINO is an open-set object detector that combines DINO detection with grounded pre-training, enabling detection of arbitrary objects described in text.

Gemini Vision

Gemini Vision refers to the visual understanding capabilities of Google Gemini models, enabling multimodal reasoning across text, images, video, and audio.

Generative Adversarial Network (GAN)

A GAN consists of two neural networks (generator and discriminator) that compete against each other, with the generator learning to create increasingly realistic images.

Gaze Estimation

Gaze estimation predicts where a person is looking by analyzing eye and head orientation from images, enabling eye tracking without specialized hardware.

Google Speech-to-Text

Google Speech-to-Text is Google Cloud's speech recognition service supporting 125+ languages with real-time streaming, batch processing, and custom model adaptation.

Google TTS

Google Text-to-Speech is a cloud-based speech synthesis service offering neural voices across 50+ languages as part of Google Cloud.

Graph Database

A graph database stores data as nodes and edges (relationships), making it efficient to traverse and query complex, interconnected data structures.

GROUP BY

GROUP BY is an SQL clause that groups rows sharing the same values in specified columns, enabling aggregate calculations on each distinct group.
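A runnable sketch using Python's built-in `sqlite3` with an in-memory table (the table and data are invented for illustration):

```python
import sqlite3

# In-memory database with a tiny orders table (illustrative data).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("alice", 30.0), ("bob", 10.0), ("alice", 20.0), ("bob", 5.0)],
)

# GROUP BY collapses rows sharing the same customer into one group,
# so aggregates like COUNT and SUM run once per group.
rows = conn.execute(
    "SELECT customer, COUNT(*), SUM(amount) "
    "FROM orders GROUP BY customer ORDER BY customer"
).fetchall()
print(rows)  # → [('alice', 2, 50.0), ('bob', 2, 15.0)]
```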

Gaussian Distribution

The Gaussian distribution is another name for the normal distribution, named after mathematician Carl Friedrich Gauss, widely used in probability theory and machine learning.

Gradient

The gradient is a vector of partial derivatives that points in the direction of steepest increase of a function, used in optimization to determine how to update model parameters.

Global Minimum

A global minimum is the point where a function achieves its absolute lowest value over its entire domain, the ideal but often unreachable target of optimization.

Gamma Distribution

The gamma distribution generalizes the exponential distribution to model the time until k events occur, with applications in Bayesian priors.

Gaussian Mixture Distribution

A Gaussian mixture distribution is a weighted combination of multiple Gaussian components, capable of modeling complex multi-modal data distributions.

Gradient Descent

Gradient descent is an iterative optimization algorithm that adjusts parameters in the direction of steepest decrease of the loss function.
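The idea fits in a few lines. A minimal sketch minimizing a one-variable function (the function, learning rate, and step count are arbitrary choices for illustration):

```python
# Minimize f(x) = (x - 3)^2 with plain gradient descent.
# The gradient f'(x) = 2 * (x - 3) points uphill, so each step
# moves against it, scaled by a learning rate.
def grad(x):
    return 2 * (x - 3)

x = 0.0    # starting guess
lr = 0.1   # learning rate (step size)
for _ in range(100):
    x -= lr * grad(x)

print(round(x, 4))  # → 3.0, the minimum of f
```

Training a neural network is the same loop with millions of parameters and gradients computed by backpropagation.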

Google DeepMind

Google DeepMind is the AI research division of Google, combining the former DeepMind and Google Brain teams to develop frontier AI models including the Gemini family.

Gemini Advanced

Gemini Advanced is Google's premium AI subscription providing access to the most capable Gemini models, integrated with Google Workspace and other Google services.

GitHub Copilot

GitHub Copilot is an AI-powered coding assistant that suggests code completions, writes functions, and helps developers code faster directly within their editor.

Grammarly AI

Grammarly AI enhances the Grammarly writing platform with generative AI features for writing, rewriting, and composing text with tone and style awareness.

Groq (Company)

Groq is an AI hardware company that designed the Language Processing Unit (LPU), a custom chip architecture optimized for ultra-fast AI inference.

Gemini

Gemini is Google's family of multimodal AI models that powers Google AI products, offering capabilities across text, code, image, audio, and video understanding.

Grok

Grok is xAI's large language model and AI assistant, available through the X platform, known for real-time information access and fewer content restrictions.

GPT4All

GPT4All is an open-source ecosystem for running AI models locally, providing a desktop chat application and tools for local LLM deployment across platforms.

Google AI Studio

Google AI Studio is a free web tool for prototyping with Google Gemini models, offering prompt design, model tuning, and API key generation.

Google AI API

The Google AI API (Gemini API) provides access to Google Gemini models for text, vision, audio, and code tasks through a developer-friendly interface.

Groq API

The Groq API provides ultra-fast AI inference powered by custom LPU (Language Processing Unit) chips, delivering some of the fastest token generation speeds available.

Giskard

Giskard is an open-source testing framework for AI models that automatically detects vulnerabilities, biases, and quality issues in LLMs and ML models.

Graphcore

Graphcore develops Intelligence Processing Units (IPUs), specialized AI chips designed for machine learning workloads with a unique architecture optimized for sparse computation.

Gensim

Gensim is a Python library for topic modeling and document similarity, known for efficient implementations of Word2Vec, Doc2Vec, and LDA algorithms.

Google Colab

Google Colab provides free cloud-hosted Jupyter notebooks with GPU/TPU access, enabling anyone to run machine learning code without local hardware setup.

Gradio

Gradio is a Python library for quickly creating web interfaces for machine learning models, enabling easy sharing and demonstration of AI capabilities.

Great Expectations

Great Expectations is an open-source data quality framework that validates, documents, and profiles data to ensure it meets defined quality standards for ML pipelines.

Guidance

Guidance is a programming library by Microsoft for controlling LLM output through interleaved generation and logic, enabling structured and constrained text generation.

Google GenAI SDK

The Google GenAI SDK is the official client library for Google's Gemini AI models, providing access to multimodal generation, grounding, and code execution capabilities.

Gaming AI

Gaming AI uses machine learning to create intelligent game characters, generate content, and enhance player experiences.

Genomics AI

Genomics AI uses machine learning to analyze genetic data for disease diagnosis, treatment selection, and biological discovery.

Government AI

Government AI applies machine learning to improve public services, policy analysis, and administrative efficiency.

Google TPU

Google TPU refers to Google's family of Tensor Processing Units, custom AI accelerators available through Google Cloud for training and serving AI models.

Groq LPU

The Groq Language Processing Unit (LPU) is a specialized AI chip designed for ultra-fast, deterministic inference of large language models.

GPU Memory

GPU memory (VRAM) is the dedicated high-bandwidth memory on a graphics card that stores model weights, activations, and data during AI computation.

GDDR6

GDDR6 is the standard graphics memory technology used in consumer GPUs, offering good bandwidth for gaming and moderate AI workloads.

Gaudi 2

Gaudi 2 is the second-generation AI training and inference processor from Intel (originally Habana Labs), designed to compete with NVIDIA A100-class GPUs.

Gaudi 3

Gaudi 3 is the third-generation AI accelerator from Intel, offering a significant performance leap targeting NVIDIA H100-class workloads for AI training and inference.

Google TPU Hardware

Google TPU Hardware refers to the physical infrastructure including custom chips, pods, and interconnects that make up Google Cloud TPU systems.

Geospatial Search

Geospatial search finds documents or records based on geographic location, supporting queries like finding items within a radius or inside a geographic boundary.
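Radius queries like these are typically built on great-circle distance. A sketch of the standard haversine formula (the coordinates are just a familiar example pair):

```python
import math

# Haversine great-circle distance, the usual building block for
# "within N km of a point" geospatial queries.
def haversine_km(lat1, lon1, lat2, lon2, radius_km=6371.0):
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2)
    return 2 * radius_km * math.asin(math.sqrt(a))

# London to Paris, commonly quoted as roughly 344 km as the crow flies:
d = haversine_km(51.5074, -0.1278, 48.8566, 2.3522)
print(round(d), "km")
```

Databases with geospatial support (PostGIS, Elasticsearch, and others) index this kind of computation so radius filters stay fast at scale.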

Generative AI

Generative AI refers to AI systems that can create new content including text, images, audio, video, and code based on learned patterns from training data.

GenAI

GenAI is the commonly used abbreviation for Generative AI, referring to AI systems capable of creating new text, images, code, and other content.

Generative Model

A generative model learns the underlying distribution of data to generate new samples, as opposed to discriminative models that learn decision boundaries.

Grounded Language Learning

Grounded language learning connects language to perception and action, enabling AI to understand words through sensory experience.

GraphQL

GraphQL is a query language for APIs that allows clients to request exactly the data they need in a single request.

gRPC

gRPC is a high-performance remote procedure call framework that uses Protocol Buffers for efficient binary serialization.

GET

GET is an HTTP method used to request and retrieve data from a server without modifying any resources.

Grafana

Grafana is an open-source observability platform for monitoring and visualizing time-series data from multiple data sources.

Geospatial Analytics

Geospatial analytics analyzes data with geographic or location components to reveal spatial patterns and location-based insights.

GPT-2

GPT-2 was OpenAI's 2019 language model that generated remarkably coherent text, raising concerns about AI-generated misinformation.

GPT-3

GPT-3 was OpenAI's 2020 language model with 175 billion parameters that demonstrated few-shot learning and versatile language capabilities.

GPT-4

GPT-4 is OpenAI's 2023 multimodal model that significantly advanced reasoning, accuracy, and safety over previous versions.

Gemini Launch

Gemini is Google DeepMind's multimodal AI model family launched in December 2023, competing directly with GPT-4 and Claude.

Geoffrey Hinton

Geoffrey Hinton is a pioneering AI researcher known as a "Godfather of Deep Learning" for his work on neural networks and backpropagation.

GOFAI

GOFAI (Good Old-Fashioned AI) refers to the classical approach to AI based on symbolic reasoning, logic, and explicit knowledge representation.

GitHub Copilot Launch

GitHub Copilot, launched in 2021, was the first widely adopted AI pair programming tool, using OpenAI Codex to suggest code in real time.

H

Hidden Markov Model

A hidden Markov model is a probabilistic model for sequential data where the system transitions between hidden states that generate observable outputs.
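The forward algorithm computes the probability of an observation sequence by summing over all hidden state paths. A toy sketch with two states and made-up probabilities:

```python
# Forward algorithm for a toy 2-state HMM. All probabilities here
# are illustrative values, not from any real model.
states = ["A", "B"]
start = {"A": 0.6, "B": 0.4}                # initial state distribution
trans = {"A": {"A": 0.7, "B": 0.3},          # P(next state | current state)
         "B": {"A": 0.4, "B": 0.6}}
emit = {"A": {"x": 0.9, "y": 0.1},           # P(observation | hidden state)
        "B": {"x": 0.2, "y": 0.8}}

def forward(observations):
    # alpha[s] = P(observations so far, current hidden state = s)
    alpha = {s: start[s] * emit[s][observations[0]] for s in states}
    for obs in observations[1:]:
        alpha = {s: emit[s][obs] * sum(alpha[p] * trans[p][s] for p in states)
                 for s in states}
    return sum(alpha.values())

print(forward(["x", "y"]))  # → 0.209 (up to float rounding)
```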

Hidden Layer

A hidden layer is any layer between the input and output layers of a neural network where learned transformations are applied to extract features from data.

Hidden State

A hidden state is the internal memory vector maintained by a recurrent neural network that encodes information about previous elements in a sequence.

He Initialization

He initialization draws initial weights from a zero-mean distribution with variance 2/fan_in to account for ReLU activations zeroing out half the inputs, enabling stable training of deep ReLU networks.
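A minimal sketch using only the standard library (the layer sizes are arbitrary; real frameworks provide this as a built-in initializer):

```python
import math
import random

def he_init(fan_in, fan_out, rng=random):
    """Sample a fan_out x fan_in weight matrix with He initialization:
    zero-mean Gaussian entries with variance 2 / fan_in."""
    std = math.sqrt(2 / fan_in)
    return [[rng.gauss(0, std) for _ in range(fan_in)]
            for _ in range(fan_out)]

random.seed(0)
w = he_init(fan_in=256, fan_out=128)
flat = [v for row in w for v in row]
var = sum(v * v for v in flat) / len(flat)
print(abs(var - 2 / 256) < 0.001)  # sample variance ≈ 2 / fan_in
```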

Hallucination

In AI, hallucination refers to when a model generates information that sounds plausible but is factually incorrect or made up.

Human Feedback

Human feedback is the evaluative input from people used to train and align AI models, typically through preference comparisons or quality ratings.

Hybrid Search

A retrieval approach that combines semantic search (embeddings) with keyword search (BM25) to leverage the strengths of both methods.
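One common way to merge the two result lists is Reciprocal Rank Fusion, which avoids having to calibrate incompatible score scales. A sketch with invented document IDs:

```python
# Reciprocal Rank Fusion (RRF): each document scores the sum of
# 1 / (k + rank) across every ranked list it appears in, so documents
# ranked well by both retrievers rise to the top.
def rrf(rankings, k=60):
    scores = {}
    for ranking in rankings:
        for rank, doc in enumerate(ranking, start=1):
            scores[doc] = scores.get(doc, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

semantic = ["doc_a", "doc_b", "doc_c"]   # e.g. from embedding similarity
keyword = ["doc_b", "doc_c", "doc_a"]    # e.g. from BM25
print(rrf([semantic, keyword]))  # → ['doc_b', 'doc_a', 'doc_c']
```

The constant `k = 60` is the value used in the original RRF paper; it dampens the influence of top ranks.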

Hallucination Detection

Techniques for automatically identifying when an AI model generates false or unsupported information in its responses.

HellaSwag

HellaSwag is a benchmark that tests common-sense reasoning by asking models to choose the most plausible continuation of a scenario.

HumanEval

HumanEval is a benchmark of 164 hand-written Python programming problems that test code generation ability in language models.

Human Baseline

A human baseline is the performance level achieved by human evaluators on a benchmark, used as a reference point for model comparison.

Human Evaluation

Human evaluation uses human judges to assess language model outputs for quality, accuracy, helpfulness, and safety.

Hate Speech Detection

Hate speech detection is the NLP task of identifying language that attacks or demeans individuals or groups based on protected characteristics.

Headline Generation

Headline generation is the NLP task of automatically creating concise, informative titles or headlines for articles and documents.

HNSW

Hierarchical Navigable Small World is a graph-based indexing algorithm for fast approximate nearest neighbor search, widely used in vector databases.

Hamming Distance

A distance metric that counts the number of positions where two equal-length sequences differ, commonly used for comparing binary vectors and hash codes.
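The computation is a one-liner; a sketch over two binary strings:

```python
def hamming(a, b):
    """Count positions where two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))

print(hamming("1011101", "1001001"))  # → 2
```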

Hierarchical Chunking

A chunking approach that creates multiple levels of chunks reflecting the document's hierarchy, from sections down to paragraphs and sentences.

HyDE

Hypothetical Document Embedding generates a hypothetical answer to a query and uses its embedding for retrieval, often matching relevant documents better than the query itself.

Hypothetical Document Embedding

The full name for HyDE, a technique that generates a hypothetical answer document and uses its embedding for more effective retrieval.
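A toy sketch of why this helps: a bag-of-words "embedding" stands in for a real embedding model, and the hypothetical answer (which an LLM would generate in real HyDE) is hard-coded. All strings are invented for illustration:

```python
from collections import Counter
import math

def embed(text):
    # Stand-in for a real embedding model: a word-count vector.
    return Counter(text.lower().split())

def cosine(u, v):
    dot = sum(u[w] * v[w] for w in u)
    return dot / (math.sqrt(sum(c * c for c in u.values()))
                  * math.sqrt(sum(c * c for c in v.values())))

doc = "paris is the capital and largest city of france"
query = "capital france"
# In real HyDE an LLM writes this answer; here it is hard-coded.
hypothetical = "paris is the capital of france"

direct = cosine(embed(query), embed(doc))
hyde = cosine(embed(hypothetical), embed(doc))
print(hyde > direct)  # the hypothetical answer matches the document better
```

The hypothetical answer is phrased like a document, so its embedding lands closer to relevant documents than the terse query does.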

Hallucination Rate

A metric measuring the frequency at which an AI system generates claims not supported by its source material, indicating how often it makes things up.

HTML Chunking

A chunking approach that parses HTML document structure to split content along semantic boundaries defined by HTML tags and elements.

Haystack

An open-source framework by deepset for building production-ready LLM applications with a focus on search, RAG, and question-answering pipelines.

Helicone

An open-source observability platform for LLM applications focused on request logging, cost monitoring, and rate limiting with a proxy-based architecture.

Haystack Agent

An agent built using the Haystack framework by deepset, leveraging its pipeline architecture for document-centric AI applications with tool use.

Hierarchical Planning

A planning approach where agents create plans at multiple levels of abstraction, from high-level goals down to specific executable actions.

Human Handoff

Human handoff is the process of transferring a chatbot conversation to a human agent when the bot cannot adequately resolve the issue.

Human Takeover

Human takeover is when a live agent takes control of a chatbot conversation, temporarily replacing the automated responses.

Hybrid Chat

Hybrid chat combines automated chatbot responses with human agent support in a single conversational experience.

Help Center Bot

A help center bot is a chatbot integrated with a help center or documentation site, providing conversational access to support articles.

HIPAA Compliance (Chatbot)

HIPAA compliance for chatbots ensures that protected health information handled during conversations meets US healthcare privacy and security standards.

Historical Bias

Bias in AI training data that reflects real-world historical discrimination and inequalities, causing models to perpetuate these patterns in their outputs.

High-risk AI

AI systems classified as having significant potential to affect people's safety, rights, or livelihoods, subject to strict regulatory requirements and oversight.

Homomorphic Encryption

An encryption scheme that allows computation on encrypted data without decrypting it first, enabling AI processing of sensitive data while maintaining privacy.

HIPAA

The Health Insurance Portability and Accountability Act sets US standards for protecting sensitive patient health information from disclosure without consent.

Hate Speech Detection

AI systems that identify speech targeting individuals or groups based on protected characteristics like race, religion, gender, or sexual orientation.

H100

The NVIDIA H100 is a data center GPU based on the Hopper architecture, designed for large-scale AI training and inference with significant performance improvements over the A100.

Hugging Face Hub

Hugging Face Hub is the largest open-source platform for sharing ML models, datasets, and demos, hosting over 500,000 models across NLP, vision, audio, and other domains.

Hugging Face Inference API

Hugging Face Inference API provides hosted, production-ready API endpoints for running models from the Hugging Face Hub without managing your own infrastructure.

H200

The NVIDIA H200 is a data center GPU designed for AI workloads, featuring increased HBM3e memory capacity and bandwidth over the H100 for improved large model training and inference.

Horovod

Horovod is an open-source distributed deep learning training framework that makes it easy to scale training across multiple GPUs and machines using data parallelism.

Human Pose Estimation

Human pose estimation detects and tracks body joint positions to reconstruct skeletal configurations of people in images or video.

Hand Gesture Recognition

Hand gesture recognition detects and classifies hand poses and movements from images or video, enabling touchless interaction with computing devices.

Hotword Detection

Hotword detection continuously listens for a specific trigger phrase that activates a voice system, also known as wake word detection.

HuBERT

HuBERT is a self-supervised speech representation model that learns acoustic units through an offline clustering and prediction approach.

Hybrid ASR

Hybrid ASR combines multiple recognition approaches or models to achieve higher accuracy than any single system alone.

Healthcare AI

Healthcare AI applies artificial intelligence to healthcare, including medical imaging analysis, drug discovery, clinical decision support, patient communication, and administrative automation.

Help Desk AI

Help desk AI applies artificial intelligence to internal and external help desk operations, automating ticket routing, response generation, and issue resolution.

Hyper-personalization

Hyper-personalization uses AI and real-time data to deliver individualized experiences, content, and recommendations tailored to each customer at a granular level.

Hyperautomation

Hyperautomation combines AI, machine learning, RPA, and other technologies to automate as many business processes as possible, creating an end-to-end automated enterprise.

Human-in-the-Loop

Human-in-the-loop combines AI automation with human oversight and intervention, ensuring AI decisions are monitored, corrected, and improved by human experts.

Hessian Matrix

The Hessian matrix contains all second-order partial derivatives of a function, providing information about the curvature of the loss landscape for optimization analysis.

Hypothesis Testing

Hypothesis testing is a statistical method for making decisions about population parameters based on sample data, used to determine if observed effects are statistically significant.
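A minimal sketch of a two-sided one-sample z-test (population standard deviation assumed known) using only the standard library; the measurements are illustrative:

```python
from statistics import NormalDist

def z_test(sample_mean, mu0, sigma, n):
    """Return (z statistic, two-sided p-value) for H0: mean == mu0."""
    z = (sample_mean - mu0) / (sigma / n ** 0.5)
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p

# 36 measurements averaging 2.8 against a hypothesized mean of 2.5,
# with a known population standard deviation of 0.6:
z, p = z_test(sample_mean=2.8, mu0=2.5, sigma=0.6, n=36)
print(round(z, 2), round(p, 4))  # z = 3.0, p ≈ 0.0027 → reject H0 at α = 0.05
```

With unknown population variance and small samples, a t-test (as provided by `scipy.stats`) is the appropriate variant.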

Hugging Face

Hugging Face is the leading platform for sharing and discovering AI models, datasets, and applications, serving as the GitHub of the machine learning community.

HubSpot Chatbot

HubSpot Chatbot is an AI-powered chatbot builder integrated into the HubSpot CRM platform, enabling lead qualification, meeting scheduling, and customer support automation.

Hugging Face Spaces

Hugging Face Spaces is a platform for hosting and sharing interactive machine learning demos and applications built with Gradio, Streamlit, or Docker.

H2O.ai

H2O.ai is an open-source AI and ML platform providing AutoML, model deployment, and enterprise AI tools for building production ML applications.

Hugging Face Inference

Hugging Face Inference Endpoints and API provide managed deployment of open-source AI models with serverless and dedicated infrastructure options.

HeyGen

HeyGen is an AI video platform for creating talking avatar videos and translating existing videos into other languages with lip-sync accuracy.

Hugging Face Transformers

Hugging Face Transformers is the most popular library for working with pretrained language models, providing a unified API for loading, fine-tuning, and running thousands of models.

Hyperopt

Hyperopt is a Python library for serial and parallel hyperparameter optimization using random search, Tree of Parzen Estimators, and adaptive algorithms.

Haystack Pipelines

Haystack Pipelines is the core abstraction of the Haystack framework, providing a directed graph system for building composable NLP and LLM application workflows.

Hugging Face Datasets

Hugging Face Datasets is a library for accessing, processing, and sharing ML datasets with efficient memory-mapped loading and built-in data processing tools.

Hugging Face Tokenizers

Hugging Face Tokenizers is a fast tokenization library implemented in Rust, providing implementations of popular tokenization algorithms used by modern language models.

High-Frequency Trading

High-frequency trading uses AI algorithms to execute large volumes of trades at extremely high speeds, capitalizing on tiny price movements.

Hospitality AI

Hospitality AI uses machine learning to optimize hotel operations, personalize guest experiences, and manage revenue.

HR AI

HR AI applies machine learning to talent acquisition, employee management, and workforce planning in human resources.

Health Information Exchange

Health information exchange (HIE) enables the electronic sharing of patient health data across different healthcare organizations, enhanced by AI for data integration and insights.

HBM

High Bandwidth Memory (HBM) is a high-performance memory technology used in data center GPUs and AI accelerators for maximum memory bandwidth.

HBM3

HBM3 is the third generation of High Bandwidth Memory, offering higher speed and capacity for AI accelerators like the NVIDIA H100.

High-Performance Computing

High-performance computing (HPC) uses supercomputers and computing clusters to solve complex problems, increasingly converging with AI infrastructure.

HPC

HPC (High-Performance Computing) is the abbreviation for high-performance computing systems and practices used for large-scale AI and scientific computation.

H100 GPU

The NVIDIA H100 is a Hopper-architecture data center GPU with fourth-generation Tensor Cores and a Transformer Engine, designed for training and running large language models.

H200 GPU

The NVIDIA H200 is an enhanced Hopper GPU with 141GB of HBM3e memory and substantially higher memory bandwidth than the H100, optimized for large language model inference.

HBM2

HBM2 (High Bandwidth Memory 2) is the second generation of HBM technology, providing high bandwidth memory stacked vertically on or near the processor die.

HBM2e

HBM2e is an enhanced version of HBM2 memory offering higher capacity and bandwidth per stack, used in GPUs like the NVIDIA A100.

HBM3e

HBM3e is the enhanced version of HBM3 memory, offering higher bandwidth and capacity for next-generation AI accelerators like the NVIDIA H200 and B200.

Hybrid Cloud

Hybrid cloud combines on-premise infrastructure with public cloud resources, allowing AI workloads to run where they are most appropriate based on data sensitivity, cost, and performance needs.

Hardware-Accelerated Inference

Hardware-accelerated inference uses specialized processors to run trained AI models faster and more efficiently than general-purpose CPUs, enabling real-time AI applications.

Hardware-Software Co-Design

Hardware-software co-design is the practice of developing AI hardware and software together to achieve optimal performance, where each informs the design of the other.

Hardware Lottery

The hardware lottery describes how certain AI research ideas succeed not because they are fundamentally better, but because they align well with available hardware capabilities.

Hybrid Recommendation

Hybrid recommendation combines multiple recommendation strategies like collaborative filtering, content-based, and knowledge-based methods to improve accuracy and coverage.

Human-AI Collaboration

Human-AI collaboration is the partnership between people and artificial intelligence systems to accomplish tasks that benefit from both human judgment and machine capabilities.

Highlight Generation

Highlight generation uses AI to automatically identify and compile the most interesting or important moments from longer video content.

HTTP

HTTP (HyperText Transfer Protocol) is the foundation protocol of the web, defining how messages are formatted and transmitted between clients and servers.

HTTPS

HTTPS is the secure version of HTTP that encrypts all communication between client and server using TLS/SSL encryption.

HTML

HTML (HyperText Markup Language) is the standard markup language for creating the structure and content of web pages.

HTTP Method

An HTTP method is a verb (GET, POST, PUT, DELETE, etc.) that indicates the desired action to perform on a resource.

Histogram

A histogram shows the frequency distribution of a continuous variable by dividing the data range into bins and displaying counts.

Heatmap

A heatmap uses color intensity to represent values in a matrix, making patterns and concentrations in two-dimensional data visually apparent.

Hypothesis Testing

Hypothesis testing is a statistical method for making decisions about population parameters based on sample data and probability.

I

Isolation Forest

Isolation forest is an anomaly detection algorithm that identifies outliers as data points that are easy to isolate through random partitioning.

Input Layer

The input layer is the first layer of a neural network that receives raw data and passes it to the hidden layers for processing.

Instance Normalization

Instance normalization normalizes each individual feature map of each individual example independently, making it particularly effective for style transfer and image generation.

Instruct Model

An instruct model is a language model fine-tuned to follow user instructions and produce helpful, direct responses to queries.

Instruction Tuning

Instruction tuning is the process of fine-tuning a base model on instruction-response pairs so it learns to follow user commands and produce helpful outputs.

In-Context Learning

In-context learning is the ability of language models to learn new tasks from examples or instructions provided in the prompt, without any parameter updates.

IA3

Infused Adapter by Inhibiting and Amplifying Inner Activations, a PEFT method that scales model activations with learned vectors, using even fewer parameters than LoRA.

Inference

The process of using a trained model to generate predictions or outputs from new inputs, as opposed to training the model.

Instruction Following

The ability of a model to understand and execute explicit instructions in prompts, a key capability developed through instruction tuning and RLHF.

IFEval

IFEval is a benchmark that measures how well language models follow specific formatting and constraint instructions in their responses.

Inter-Annotator Agreement

Inter-annotator agreement measures how consistently multiple human evaluators rate or label the same AI outputs.

Inference Cost

Inference cost is the computational expense of generating responses from a language model, measured in cost per token or cost per request.

Intent Detection

Intent detection is the NLP task of identifying the user's goal or purpose from their natural language input.

Information Extraction

Information extraction automatically identifies and extracts structured data from unstructured text documents.

Instruction Tuning

Instruction tuning fine-tunes language models on diverse instruction-response pairs so they follow natural language instructions reliably.

Inverse Document Frequency

Inverse document frequency measures how rare a word is across a document collection, giving higher weight to distinctive words.
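
One common smoothed formulation is log(N / (1 + df)), sketched below; the toy document sets are hypothetical.

```python
import math

def idf(term, docs):
    """Smoothed inverse document frequency: log(N / (1 + df))."""
    df = sum(term in doc for doc in docs)  # document frequency
    return math.log(len(docs) / (1 + df))

docs = [{"the", "cat"}, {"the", "dog"}, {"a", "fish"}]
# "the" appears everywhere -> weight 0; "fish" is rare -> higher weight
common = idf("the", docs)
rare = idf("fish", docs)
```

A term that appears in every document carries no discriminative weight, while a rare term like "fish" is up-weighted.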

Idiom Detection

Idiom detection identifies non-compositional multi-word expressions whose meaning cannot be derived from their individual words.

Iterative RAG

A RAG approach that performs multiple rounds of retrieval and generation, using each round's output to refine subsequent queries and improve answer quality.

Interleaved Retrieval-Generation

A technique that alternates between generating text and retrieving information, allowing the model to fetch context as needed throughout the generation process.

IVF

Inverted File Index is a vector indexing method that partitions vectors into clusters and searches only the most relevant clusters for faster retrieval.

Inner Monologue

An agent pattern where the model generates internal reasoning text that guides its actions but is not shown to the user, improving decision quality.

Iterative Refinement

An agent pattern where outputs are progressively improved through multiple rounds of generation, evaluation, and revision until quality standards are met.

IVR

IVR (Interactive Voice Response) is a phone system technology that uses menus and keypad inputs to route callers and provide automated responses.

Internal Chatbot

An internal chatbot serves employees within an organization, providing instant access to company knowledge, policies, and workflows.

Intent Recognition

Intent recognition is the process of identifying what a user wants to accomplish from their natural language message.

In-App Chat

In-app chat is a conversational interface built directly into a mobile or web application for contextual user assistance.

In-Chat Form

An in-chat form collects structured user data within the chat conversation through form fields embedded in the message flow.

Image Upload

Image upload allows users to share images within chat for visual analysis, troubleshooting, or information extraction by the AI.

Industry Template

An industry template is a chatbot configuration designed for a specific industry with domain-appropriate conversation flows and terminology.

Iframe Embedding

Iframe embedding displays a chatbot inside an HTML iframe element, providing simple integration with complete isolation from the host page.

Intent Training

Intent training teaches a chatbot to recognize user goals by providing labeled examples of how users express each intent.

IP Whitelisting

IP whitelisting restricts chatbot API or admin access to requests from approved IP addresses, preventing unauthorized access.

Inner Alignment

The challenge of ensuring an AI model actually learns the specified training objective rather than a different correlated objective that diverges in new situations.

Instrumental Convergence

The tendency for AI systems with diverse goals to converge on similar intermediate objectives like self-preservation, resource acquisition, and goal preservation.

Individual Fairness

A fairness principle requiring that similar individuals receive similar treatment from an AI system, regardless of group membership.

Interpretability

The degree to which humans can understand the internal workings and decision processes of an AI model, distinct from the explanations it provides.

Integrated Gradients

A gradient-based attribution method that computes feature importance by integrating gradients along a path from a baseline input to the actual input.

ISO 42001

An international standard for AI management systems, providing requirements and guidance for organizations to responsibly develop and use AI.

Input Guardrails

Safety mechanisms that validate and filter user inputs before they reach the AI model, blocking prompt injections, harmful requests, and policy violations.

Intent Alignment

Ensuring an AI system correctly interprets and acts on the underlying intent behind instructions, not just the literal words used to express them.

Intersectional Bias

Bias that affects people at the intersection of multiple identity dimensions, often worse than bias along any single dimension alone.

In-Processing Debiasing

Bias mitigation techniques applied during model training, modifying the learning algorithm or objective function to produce fairer models.

Inherently Interpretable Model

An AI model whose decision-making process is transparent by design, such as decision trees, linear models, or rule-based systems.

Inference Pipeline

An inference pipeline is a sequence of processing steps that transforms raw input data, runs it through an ML model, and post-processes the output to deliver predictions.

Inference Server

An inference server is specialized software that loads ML models and serves predictions via APIs, optimizing for throughput, latency, and resource utilization in production.

Intel Gaudi

Intel Gaudi is an AI accelerator designed for deep learning training and inference, offering a competitive alternative to NVIDIA GPUs with strong price-performance.

Inference Optimization

Inference optimization encompasses techniques to reduce latency, increase throughput, and lower costs of running ML model predictions in production.

Infrastructure as Code for ML

Infrastructure as Code (IaC) for ML defines and manages ML infrastructure, including GPU clusters, serving endpoints, and pipelines, through version-controlled configuration files.

Image Classification

Image classification is a computer vision task that assigns a label or category to an entire image based on its visual content.

Instance Segmentation

Instance segmentation combines object detection and semantic segmentation, identifying each individual object in an image with a precise pixel-level mask.

Image Captioning

Image captioning automatically generates natural language descriptions of image content, translating visual information into text.

Imagen

Imagen is Google's text-to-image diffusion model that achieves high photorealism and prompt fidelity through a large text encoder and cascading diffusion architecture.

Inpainting

Inpainting is the technique of filling in masked or missing regions of an image with AI-generated content that seamlessly blends with the surrounding context.

Image Editing

AI image editing uses generative models to modify images through text instructions, enabling non-destructive edits like style changes, object manipulation, and content modification.

Image-to-Image

Image-to-image translation transforms an input image into a corresponding output image, applying changes like style, content, or domain transfer.

Image Colorization

Image colorization uses AI to automatically add realistic color to grayscale or black-and-white photographs and videos.

IP-Adapter

IP-Adapter enables image prompt conditioning for text-to-image diffusion models, allowing users to guide generation using reference images alongside text prompts.

Image Augmentation

Image augmentation applies transformations to training images to artificially expand dataset size and diversity, improving model generalization and robustness.

Image Embedding

An image embedding is a compact vector representation of an image that captures its visual and semantic content in a form suitable for comparison and retrieval.

Image Segmentation Metrics

Segmentation metrics like IoU, Dice coefficient, and pixel accuracy evaluate how accurately models assign class labels to individual pixels in images.

Image Restoration

Image restoration uses AI to recover high-quality images from degraded inputs, addressing noise, blur, compression artifacts, and other quality issues.

Image Classification Architectures

Image classification architectures are neural network designs optimized for categorizing images, evolving from AlexNet through ResNet to modern Vision Transformers.

Image Retrieval

Image retrieval searches for visually similar images in a database given a query image, using learned feature representations and efficient similarity search.

Image Matting

Image matting estimates the precise opacity (alpha value) of each pixel, enabling accurate separation of foreground subjects with fine details like hair and transparency.

Image Denoising

Image denoising removes noise from photographs using AI models that distinguish between genuine image content and unwanted noise patterns.

Image Stitching

Image stitching combines multiple overlapping images into a single seamless panoramic image by aligning and blending them based on shared visual features.

Image Generation Evaluation

Image generation evaluation uses metrics like FID, CLIP Score, and human evaluation to assess the quality, diversity, and prompt adherence of generated images.

Image Watermarking

Image watermarking embeds invisible or visible marks into images to protect copyright, verify authenticity, or track the provenance of AI-generated content.

Image Harmonization

Image harmonization adjusts a composited foreground element to match the visual characteristics of the background, making the composite look natural and consistent.

Image-to-3D

Image-to-3D reconstructs a three-dimensional model from one or more 2D photographs, enabling 3D content creation from ordinary photos.

Image Forensics

Image forensics uses AI to detect manipulation, tampering, and synthetic generation in images, verifying authenticity and identifying altered content.

Image Quality Assessment

Image quality assessment uses AI to evaluate the perceptual quality of images, predicting how humans would rate image quality without a reference image.

Image Registration

Image registration aligns two or more images of the same scene into a common coordinate system, correcting for differences in viewpoint, scale, and distortion.

Instance-Level Image Retrieval

Instance-level image retrieval finds images containing the exact same object or landmark as a query image, not just visually similar content.

Interactive Segmentation

Interactive segmentation allows users to guide the segmentation process with clicks, scribbles, or bounding boxes, refining results through iterative feedback.

Image Generation Safety

Image generation safety encompasses techniques and policies to prevent AI image generators from creating harmful, illegal, or non-consensual content.

Intelligent IVR

Intelligent IVR uses AI and natural language understanding to create dynamic, conversational phone menu systems that understand caller intent.

Intelligent Automation

Intelligent automation combines AI with process automation to handle complex tasks that require understanding, decision-making, and adaptation beyond simple rule-based workflows.

Intelligent Document Processing

Intelligent document processing uses AI to automatically extract, classify, and process information from documents like invoices, contracts, forms, and correspondence.

In-Memory Database

An in-memory database stores data primarily in RAM rather than on disk, providing extremely low latency for read and write operations at the cost of higher memory requirements.

Index

A database index is a data structure that improves the speed of data retrieval operations on a table at the cost of additional storage and slower write performance.

InfluxDB

InfluxDB is a purpose-built time-series database designed for high-throughput ingestion and real-time querying of metrics, events, and sensor data.

INNER JOIN

An INNER JOIN returns only the rows where matching values exist in both joined tables, filtering out unmatched rows from either side.
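
The filtering behavior can be seen with Python's built-in sqlite3 module; the tables and rows below are hypothetical.

```python
import sqlite3

# Toy tables to show that INNER JOIN keeps only rows with a match on
# both sides: the order for customer 3 has no matching customer row,
# and customer 2 has no orders, so neither appears in the result.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (id INTEGER, name TEXT);
    CREATE TABLE orders (customer_id INTEGER, item TEXT);
    INSERT INTO customers VALUES (1, 'Ada'), (2, 'Lin');
    INSERT INTO orders VALUES (1, 'book'), (3, 'pen');
""")
rows = conn.execute("""
    SELECT c.name, o.item
    FROM customers c
    INNER JOIN orders o ON o.customer_id = c.id
""").fetchall()
# rows == [('Ada', 'book')]
```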

Isolation Level

An isolation level defines the degree to which concurrent database transactions are isolated from each other, balancing consistency guarantees against performance.

Information Gain

Information gain measures the reduction in entropy achieved by splitting data on a particular feature, used as the criterion for building decision trees.
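
As a minimal sketch, information gain is the parent node's entropy minus the weighted entropy of the child groups after a split; the labels below are hypothetical.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy of a label list, in bits."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(labels, groups):
    """Parent entropy minus the size-weighted entropy of the split groups."""
    n = len(labels)
    return entropy(labels) - sum(len(g) / n * entropy(g) for g in groups)

# A perfect split of a balanced binary node gains a full bit:
parent = ["yes", "yes", "no", "no"]
gain = information_gain(parent, [["yes", "yes"], ["no", "no"]])
# gain == 1.0
```

Decision-tree builders evaluate this quantity for every candidate feature and pick the split with the highest gain.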

Inner Product

An inner product is a generalization of the dot product that defines geometric concepts like length, angle, and orthogonality in vector spaces.

Identity Matrix

The identity matrix is a square matrix with ones on the diagonal and zeros elsewhere, serving as the multiplicative identity for matrix operations.

Independence (Probability)

Two events or random variables are independent if the occurrence of one does not affect the probability of the other.

Information Theory

Information theory quantifies information, uncertainty, and communication efficiency, providing foundational concepts for machine learning loss functions and model evaluation.

Information Bottleneck

The information bottleneck method finds the optimal tradeoff between compressing input information and preserving information relevant to the target variable.

InsertChat

InsertChat is an AI workspace that enables businesses to create and deploy custom AI assistants powered by their own knowledge base, with no coding required.

Intercom

Intercom is a customer messaging platform that combines AI chatbots, live chat, help desk, and customer engagement tools for businesses.

Inflection AI

Inflection AI is an AI company that developed Pi, a personal AI assistant focused on being empathetic, helpful, and conversational.

IBM Watson Assistant

IBM Watson Assistant is an enterprise-grade conversational AI platform for building AI-powered virtual agents across channels with advanced dialog management.

IBM watsonx

IBM watsonx is an enterprise AI and data platform that combines foundation models, model training tools, and AI governance capabilities for business applications.

Instructor

Instructor is a Python library for extracting structured data from LLM responses, using Pydantic models to validate and type-check AI outputs reliably.

Intelligent Tutoring System

An intelligent tutoring system (ITS) uses AI to provide personalized one-on-one instruction, adapting teaching strategies to individual student needs.

Industry 4.0

Industry 4.0 is the fourth industrial revolution, characterized by smart factories using AI, IoT, cloud computing, and cyber-physical systems.

Insurance AI

Insurance AI applies machine learning across the insurance value chain from underwriting and pricing to claims and customer service.

Inventory Optimization

AI inventory optimization uses machine learning to determine optimal stock levels, reorder points, and distribution across locations.

IoT AI

IoT AI combines Internet of Things sensors with machine learning to enable intelligent connected systems and edge analytics.

InsurTech AI

InsurTech AI applies machine learning to innovate insurance products, distribution, and operations through technology.

Identity Verification

Identity verification uses AI to confirm that a person is who they claim to be through document analysis, biometric matching, and liveness detection.

Irrigation AI

Irrigation AI optimizes water application in agriculture by analyzing soil moisture, weather forecasts, crop needs, and sensor data to minimize waste while maximizing yields.

Intel Gaudi

Intel Gaudi is an AI accelerator processor designed for deep learning training and inference, offering an alternative to NVIDIA GPUs for data center workloads.

IPU

An Intelligence Processing Unit (IPU) is a processor designed by Graphcore specifically for machine learning workloads with a unique bulk synchronous parallel architecture.

Inferentia2

Inferentia2 is the second generation of AWS custom AI inference chips, offering high throughput and low cost for serving machine learning models on AWS.

In-Memory Computing

In-memory computing performs computations directly within memory arrays, eliminating the data transfer bottleneck between processing units and memory that limits AI performance.

Inference Chip

An inference chip is a processor optimized specifically for running trained AI models in production, prioritizing throughput, latency, and energy efficiency over training capability.

InfiniBand

InfiniBand is a high-speed, low-latency networking technology used to connect GPUs and servers in AI training clusters, providing the bandwidth needed for distributed training.

Interconnect

An interconnect is the communication link between processing elements in AI systems, from chip-level buses to data center networks, critically affecting distributed AI performance.

Information Retrieval

Information retrieval is the science of searching for and finding relevant documents, data, or information from large collections based on user queries.

Indexing

Indexing is the process of analyzing and organizing documents into a search index to enable fast and relevant retrieval in response to queries.

Inverted Index

An inverted index is the core data structure behind text search engines, mapping every unique term to the list of documents containing that term.
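
A minimal sketch of the structure, with hypothetical documents: each term maps to a posting set of document IDs, and an AND query is a set intersection.

```python
from collections import defaultdict

def build_inverted_index(docs):
    """Map each term to the set of document IDs containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index

docs = {1: "the cat sat", 2: "the dog ran", 3: "a cat ran"}
index = build_inverted_index(docs)

# AND query: intersect the posting sets of both terms
hits = index["cat"] & index["ran"]   # {3}
```

Production engines store postings as sorted, compressed lists with term statistics for ranking, but the term-to-documents mapping is the same idea.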

Item-Based Collaborative Filtering

Item-based collaborative filtering recommends items similar to ones a user has liked, computing similarity between items based on user rating patterns.

Index Sharding

Index sharding distributes a search index across multiple partitions or servers, enabling horizontal scaling for large-scale search systems.

Intent Classification

Intent classification determines the purpose behind a search query or user message, enabling search systems to provide the right type of result or response.

Image Generation

AI image generation creates new images from text descriptions, sketches, or other inputs using deep learning models like diffusion models and GANs.

Image Enhancement

AI image enhancement uses deep learning to improve photo quality through upscaling, noise reduction, sharpening, and intelligent color correction.

Illustration Generation

AI illustration generation creates custom illustrations, drawings, and artwork from text descriptions for use in publications, marketing, and design.

Image Restoration

Image restoration uses AI to repair damaged, degraded, or old photographs by removing artifacts, noise, scratches, and other imperfections.

Icon Generation

Icon generation uses AI to create digital icons and symbols for applications, websites, and user interfaces from text descriptions or design parameters.

Interior Design AI

Interior design AI generates room layouts, decoration concepts, and furniture arrangements from room photos, floor plans, or style preferences.

Image-to-3D

Image-to-3D uses AI to generate three-dimensional models from one or more 2D photographs, reconstructing geometry, texture, and depth information.

Image Enhancement (Generative AI)

Generative AI image enhancement uses deep learning to improve photo quality through super-resolution, denoising, sharpening, and detail enhancement.

Inductive Bias

Inductive bias is the set of assumptions a machine learning algorithm uses to make predictions on unseen data, determining what patterns it can learn.

In-Context Learning (Research Perspective)

In-context learning research investigates how large language models learn to perform new tasks from examples provided in the prompt.

Instruction Following (Research Perspective)

Instruction following research studies how to train AI models to reliably understand and execute natural language instructions.

Interpretability Research

Interpretability research studies methods for understanding what AI models learn internally and why they produce specific outputs.

Idempotency

Idempotency is the property where performing an operation multiple times produces the same result as performing it once.
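
A small illustrative contrast, with hypothetical operations: a PUT-style "set" is idempotent, while an "append"-style update is not.

```python
def set_email(user, email):      # idempotent: repeating changes nothing
    user["email"] = email

def append_login(user):          # NOT idempotent: repeating changes state
    user.setdefault("logins", 0)
    user["logins"] += 1

user = {}
set_email(user, "a@example.com")
set_email(user, "a@example.com")   # second call leaves state unchanged
append_login(user)
append_login(user)                 # second call changes state again
# user == {"email": "a@example.com", "logins": 2}
```

Idempotent operations are what make safe retries possible in APIs and message processing.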

Interactive Visualization

Interactive visualization allows users to explore data dynamically through filtering, zooming, hovering, and selecting to discover insights.

Inferential Statistics

Inferential statistics uses sample data to draw conclusions about larger populations through hypothesis testing and estimation.

ImageNet Moment

The ImageNet moment refers to the 2012 breakthrough when AlexNet, a deep convolutional network, dramatically outperformed traditional methods in the ImageNet image classification competition.

Ilya Sutskever

Ilya Sutskever is an AI researcher who co-founded OpenAI and made key contributions to deep learning including AlexNet and sequence-to-sequence models.

Ian Goodfellow

Ian Goodfellow is the computer scientist who invented generative adversarial networks (GANs) in 2014, revolutionizing AI-generated content.

J

Jailbreaking

Jailbreaking is the practice of crafting prompts that bypass AI safety guardrails and alignment, making the model produce outputs it was trained to refuse.

JSON Mode

An inference setting that constrains the model to output only valid JSON, ensuring structured outputs for programmatic consumption.

Joint Intent-Slot Model

A joint intent-slot model simultaneously detects user intent and extracts slot values from a single utterance in dialogue systems.

Jina Embeddings

A family of embedding models from Jina AI that support long-context inputs, multilingual text, and multimodal data including text-image pairs.

Jaccard Similarity

A set-based similarity metric that measures the overlap between two sets by dividing the size of their intersection by the size of their union.
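
The definition translates directly to code; the term sets below are hypothetical.

```python
def jaccard(a, b):
    """|A ∩ B| / |A ∪ B|; conventionally 0 for two empty sets."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)

# 2 shared terms out of 4 distinct -> 0.5
sim = jaccard({"rag", "llm", "agent"}, {"llm", "agent", "prompt"})
```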

JSON Schema

A vocabulary for annotating and validating JSON data, widely used in AI to define tool parameters, structured outputs, and data contracts.

JSON Mode

A model configuration that constrains the LLM to always output valid JSON, ensuring reliable structured data generation for application integration.

JSON Schema Agent

An agent pattern that uses JSON Schema to define tool interfaces, enabling structured and validated communication between the LLM and external tools.

JavaScript SDK

A JavaScript SDK provides browser-ready libraries for embedding chatbot functionality directly into web pages and web applications.

Jailbreaking

Techniques for bypassing an AI model's safety restrictions and content policies, causing it to generate content it was designed to refuse.

Jailbreak Attack

A specific attempt to bypass AI safety measures using crafted prompts, role-playing scenarios, or other techniques to elicit restricted content.

JOIN

JOIN is an SQL clause that combines rows from two or more tables based on a related column, enabling queries across related data sets.

JSON

JSON (JavaScript Object Notation) is a lightweight, text-based data interchange format that is easy for humans to read and write and easy for machines to parse and generate.

JSONB

JSONB is a binary JSON data type in PostgreSQL that stores JSON in a decomposed binary format, enabling efficient querying, indexing, and manipulation of JSON data.

JOIN (SQL)

A SQL JOIN combines rows from two or more tables based on a related column, enabling queries across normalized relational data.

JSON Schema

JSON Schema is a declarative specification for defining the structure, constraints, and validation rules for JSON data.

Jacobian Matrix

The Jacobian matrix contains all first-order partial derivatives of a vector-valued function, describing how multi-dimensional outputs change with respect to multi-dimensional inputs.
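
A minimal numerical sketch using forward differences; the example function f(x, y) = (xy, x + y) is hypothetical, and its true Jacobian is [[y, x], [1, 1]].

```python
def jacobian(f, x, eps=1e-6):
    """Approximate the Jacobian of f: R^n -> R^m at x by forward differences."""
    fx = f(x)
    cols = []
    for j in range(len(x)):
        xp = list(x)
        xp[j] += eps
        fxp = f(xp)
        cols.append([(fxp[i] - fx[i]) / eps for i in range(len(fx))])
    # transpose columns so J[i][j] = d f_i / d x_j
    return [list(row) for row in zip(*cols)]

# f(x, y) = (x*y, x + y) at (2, 3): Jacobian is [[3, 2], [1, 1]]
J = jacobian(lambda v: [v[0] * v[1], v[0] + v[1]], [2.0, 3.0])
```

Autodiff libraries compute this exactly rather than numerically, but the matrix layout is the same.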

Joint Probability

Joint probability measures the likelihood of two or more events occurring simultaneously.

Jensen-Shannon Divergence

Jensen-Shannon divergence is a symmetric, bounded measure of similarity between two probability distributions, derived from KL divergence.
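
The derivation from KL divergence is short: JSD(P, Q) = ½KL(P‖M) + ½KL(Q‖M) with M = (P + Q)/2. A sketch for discrete distributions (the example distributions are hypothetical):

```python
import math

def kl(p, q):
    """KL(P || Q) in bits for discrete distributions on the same support."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def jsd(p, q):
    """Jensen-Shannon divergence: symmetric and bounded in [0, 1] bits."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

zero = jsd([0.5, 0.5], [0.5, 0.5])       # identical -> 0 bits
max_bits = jsd([1.0, 0.0], [0.0, 1.0])   # disjoint -> 1 bit
```

Unlike raw KL divergence, JSD is symmetric and never infinite, which makes it convenient for comparing generated and real data distributions.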

Jasper AI

Jasper AI is a generative AI platform focused on marketing content creation, helping teams produce brand-consistent copy, images, and campaigns at scale.

Jina AI

Jina AI is a company providing neural search infrastructure including embedding models, re-ranking models, and search APIs for AI applications.

JAX

JAX is a high-performance numerical computing library from Google that combines NumPy-like syntax with automatic differentiation, JIT compilation, and GPU/TPU acceleration.

Jupyter

Jupyter is an open-source project providing interactive computing environments that combine live code, visualizations, and narrative text for data science and AI exploration.

Jupyter Notebook

Jupyter Notebook is a web-based interactive computing environment where users create documents combining live code, equations, visualizations, and narrative text.

JupyterLab

JupyterLab is the next-generation web-based IDE for Jupyter, providing a flexible workspace with notebooks, terminals, text editors, and file browsers in a single interface.

JWT

JWT (JSON Web Token) is a compact, URL-safe token format for securely transmitting claims between parties as a signed JSON object.
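
A minimal HS256 signing sketch built from the standard library, to show the header.payload.signature structure; the secret and claims are hypothetical, and real systems should use a vetted JWT library.

```python
import base64, hashlib, hmac, json

def b64url(data: bytes) -> str:
    """Base64url without padding, as JWTs require."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode()

def sign_jwt(claims: dict, secret: bytes) -> str:
    """Create an HS256-signed JWT: header.payload.signature."""
    header = b64url(json.dumps({"alg": "HS256", "typ": "JWT"}).encode())
    payload = b64url(json.dumps(claims).encode())
    signing_input = f"{header}.{payload}".encode()
    sig = b64url(hmac.new(secret, signing_input, hashlib.sha256).digest())
    return f"{header}.{payload}.{sig}"

token = sign_jwt({"sub": "user-42", "admin": False}, b"demo-secret")
# token is three dot-separated base64url segments
```

Because the signature covers the header and payload, any tampering with the claims invalidates the token on verification.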

JavaScript

JavaScript is the programming language of the web, enabling dynamic and interactive behavior in browsers and server-side applications.

JSON Web Token

A JSON Web Token (JWT) is a compact, URL-safe token format for securely transmitting claims between parties as a signed JSON object.

John McCarthy

John McCarthy (1927-2011) coined the term "artificial intelligence" and organized the 1956 Dartmouth Conference that founded the field.

K

K-Nearest Neighbors

K-nearest neighbors classifies data points based on the majority class among their k closest neighbors in feature space, a simple but effective non-parametric method.
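
A minimal sketch of the voting step, with hypothetical 2D training points and labels:

```python
import math
from collections import Counter

def knn_predict(train, query, k=3):
    """Classify query by majority vote among its k nearest training points."""
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

train = [
    ((0, 0), "blue"), ((1, 0), "blue"),
    ((5, 5), "red"), ((6, 5), "red"), ((5, 6), "red"),
]
knn_predict(train, (5.5, 5.5))   # "red"
```

There is no training phase at all: the "model" is the stored dataset, which is why k-NN is called non-parametric and lazy.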

K-Means

K-means is a clustering algorithm that partitions data into k groups by iteratively assigning points to the nearest centroid and updating centroids.
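
The assign-then-average loop (Lloyd's algorithm) can be sketched in a few lines; the points and starting centroids are hypothetical.

```python
import math

def kmeans(points, centroids, iters=10):
    """Lloyd's algorithm: assign points to nearest centroid, then re-average."""
    for _ in range(iters):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda c: math.dist(p, centroids[c]))
            clusters[nearest].append(p)
        centroids = [
            tuple(sum(coord) / len(cluster) for coord in zip(*cluster))
            if cluster else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
    return centroids, clusters

points = [(1, 1), (1.5, 2), (8, 8), (9, 8.5)]
centroids, clusters = kmeans(points, centroids=[(0, 0), (10, 10)])
# centroids == [(1.25, 1.5), (8.5, 8.25)]
```

In practice the initial centroids matter: implementations like k-means++ choose them carefully because poor starts can converge to bad clusterings.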

Kernel

A kernel (or filter) in a CNN is a small matrix of learnable weights that slides across input data to detect specific features like edges or textures.

Knowledge Distillation

Knowledge distillation trains a smaller student model to mimic the output distribution of a larger teacher model, transferring learned knowledge into a more efficient architecture.

KV Cache

A KV cache stores previously computed key-value attention pairs during text generation, avoiding redundant computation and dramatically speeding up inference.

KV Cache Compression

Techniques that reduce the memory footprint of the key-value cache during inference, enabling longer sequences and higher throughput.

Knowledge Cutoff

The date after which an LLM has no knowledge of events or developments, determined by when its training data collection ended.

Knowledge Graph

A structured representation of entities and their relationships, used to enhance LLM knowledge retrieval with structured, relational information.

Key Point Extraction

Key point extraction identifies and extracts the main arguments, findings, or takeaways from a document or discussion.

Knowledge-Grounded QA

Knowledge-grounded QA answers questions using information from an external knowledge source, ensuring responses are factually grounded.

Knowledge-Grounded Dialogue

Knowledge-grounded dialogue generates conversational responses informed by specific external knowledge sources, improving accuracy and depth.

Keyword Extraction

Keyword extraction automatically identifies the most important and representative words or phrases in a document.

Knowledge Graphs in NLP

Knowledge graphs represent structured information as networks of entities and relationships, enhancing NLP with explicit world knowledge.

Keyphrase Generation

Keyphrase generation automatically produces short phrases that capture the main topics and concepts of a document.

Knowledge-Intensive NLP

Knowledge-intensive NLP refers to tasks that require accessing and reasoning over large bodies of external knowledge beyond what is in the immediate text.

Knowledge Base

A knowledge base is a centralized repository of information that AI systems can reference to provide accurate, grounded responses about specific topics.

Knowledge Graph

A structured representation of information as entities and relationships, enabling AI systems to reason about connections between concepts.

KD-Tree

A space-partitioning data structure that organizes points by recursively splitting along coordinate axes, efficient for low-dimensional nearest neighbor search.

Knowledge Graph Memory

An agent memory system that stores information as a knowledge graph of entities and relationships, enabling structured reasoning about connections.

Knowledge Base

A knowledge base is the curated collection of information that a chatbot draws from to answer user questions accurately.

Knowledge Gaps

Knowledge gaps are topics or questions that the chatbot cannot answer due to missing information in its knowledge base or training.

Knowledge Base (Chatbot)

A chatbot knowledge base is the collection of documents and data that the chatbot references to answer user questions accurately.

k-Anonymity

A privacy property ensuring each record in a dataset is indistinguishable from at least k-1 other records based on quasi-identifier attributes.
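
A small sketch of how the property can be checked (field names here are invented for the example): group records by their quasi-identifier values and require every group to have at least k members.

```python
from collections import Counter

def is_k_anonymous(records, quasi_identifiers, k):
    # Every combination of quasi-identifier values must appear
    # in at least k records for the dataset to be k-anonymous.
    groups = Counter(tuple(r[q] for q in quasi_identifiers) for r in records)
    return all(count >= k for count in groups.values())
```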

Kubernetes Deployment

Kubernetes deployment for ML manages the orchestration, scaling, and lifecycle of containerized model serving workloads across a cluster of machines.

KV Cache

KV cache stores the key and value tensors from previous tokens during LLM inference, avoiding redundant computation and dramatically speeding up autoregressive text generation.
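
A toy single-head sketch of the mechanism (real implementations use batched tensors, not Python lists): each decoding step appends one new key/value pair and attends over everything cached so far, instead of recomputing K and V for the entire prefix.

```python
import math

def attend_with_cache(cache, new_key, new_value, query):
    # Append this step's key/value once, then attend over every cached
    # step; prior keys and values are never recomputed.
    cache["keys"].append(new_key)
    cache["values"].append(new_value)
    scores = [sum(q * k for q, k in zip(query, key)) for key in cache["keys"]]
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]  # numerically stable softmax
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(new_value)
    return [sum(w * v[d] for w, v in zip(weights, cache["values"]))
            for d in range(dim)]
```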

Keypoint Detection

Keypoint detection identifies specific anatomical or structural points on objects in images, such as body joints for human pose estimation or facial landmarks.

Knowledge Distillation for Vision

Knowledge distillation transfers knowledge from a large, accurate teacher vision model to a smaller, faster student model, maintaining much of the accuracy at lower cost.

Keyword Spotting

Keyword spotting detects specific words or phrases in an audio stream without performing full speech recognition, used for triggers, commands, and monitoring.

Knowledge Management

Knowledge management organizes, maintains, and delivers organizational knowledge to customers and agents, serving as the foundation for AI chatbot answers and self-service.

Knowledge Base Optimization

Knowledge base optimization uses AI to continuously improve the quality, coverage, and effectiveness of knowledge bases that power chatbots and self-service systems.

Key-Value Store

A key-value store is a database that uses a simple key-value pair model, providing extremely fast lookups by key and serving as the foundation for caching and session management.

KL Divergence

KL divergence measures how one probability distribution differs from a reference distribution, used in variational inference, knowledge distillation, and generative model training.
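
For discrete distributions the definition is a one-liner; note that it is asymmetric and always non-negative:

```python
import math

def kl_divergence(p, q):
    # D_KL(P || Q) = sum_i p_i * log(p_i / q_i)
    # Zero-probability terms in P contribute nothing, hence the filter.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
```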

KKT Conditions

The Karush-Kuhn-Tucker conditions are necessary conditions for optimality in constrained optimization, generalizing Lagrange multipliers to inequality constraints.

Kernel Function

A kernel function computes the inner product between data points in a high-dimensional feature space without explicitly mapping them there.
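
The RBF (Gaussian) kernel is a standard example: it equals an inner product in an infinite-dimensional feature space, yet costs only one distance computation.

```python
import math

def rbf_kernel(x, y, gamma=1.0):
    # k(x, y) = exp(-gamma * ||x - y||^2); 1.0 for identical points,
    # decaying toward 0 as the points move apart.
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)
```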

Kore.ai

Kore.ai is an enterprise conversational AI platform that provides tools for building, managing, and optimizing AI-powered virtual assistants at scale.

Kommunicate

Kommunicate is a customer service automation platform that combines AI chatbots with human handoff for seamless customer support experiences.

Krisp

Krisp is an AI-powered noise cancellation and meeting assistant tool that removes background noise and provides meeting transcription and summaries.

Keras

Keras is a high-level deep learning API that provides an intuitive interface for building neural networks, now supporting PyTorch, JAX, and TensorFlow as backends.

Kubeflow

Kubeflow is an open-source ML platform for Kubernetes that provides tools for building, deploying, and managing ML workflows at scale in production environments.

KeyBERT

KeyBERT is a minimal Python library for keyword and keyphrase extraction that uses BERT embeddings and cosine similarity to find the most relevant phrases in text.

Kornia

Kornia is a differentiable computer vision library for PyTorch that implements classical vision algorithms as differentiable operations for end-to-end learning.

Kaggle Notebooks

Kaggle Notebooks are free cloud-based Jupyter environments provided by Kaggle with GPU/TPU access for machine learning experimentation and competition participation.

Kedro

Kedro is an open-source Python framework for creating reproducible, maintainable, and modular data science code using software engineering best practices.

Know Your Customer

AI-powered KYC automates identity verification and customer due diligence for regulatory compliance in financial services.

Knowledge Tracing

Knowledge tracing uses machine learning to model what a student knows over time based on their interactions with learning materials.

Knowledge-Based Recommendation

Knowledge-based recommendation uses explicit domain knowledge and user requirements to suggest items, working without historical interaction data.

Knowledge Graph Search

Knowledge graph search retrieves and traverses structured entity relationships to answer queries, complementing text search with structured knowledge about people, places, and concepts.

Knowledge Distillation (Research Perspective)

Knowledge distillation research studies how to transfer knowledge from large AI models to smaller, more efficient models.

Kaplan-Meier Estimator

The Kaplan-Meier estimator is a non-parametric method for estimating survival probabilities from time-to-event data with censoring.
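
A minimal sketch of the estimator, assuming no censored observation shares a timestamp with an event (ties between events are handled because processing them one record at a time multiplies out to the same survival value): the curve is a running product of (1 - events/at-risk) at each event time.

```python
def kaplan_meier(times, events):
    # times: observation times; events: 1 = event occurred, 0 = censored.
    # S(t) is the running product of (1 - d_i / n_i) over event times.
    order = sorted(range(len(times)), key=lambda i: times[i])
    at_risk = len(times)
    survival, curve = 1.0, []
    for i in order:
        if events[i] == 1:
            survival *= 1 - 1 / at_risk
            curve.append((times[i], survival))
        at_risk -= 1  # censored subjects leave the risk set too
    return curve
```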

Key Performance Indicator (KPI)

A KPI is a measurable value that demonstrates how effectively an organization or process is achieving key business objectives.

Knowledge-Based System

A knowledge-based system is an AI program that uses a structured repository of domain knowledge and inference rules to solve complex problems.

L

LightGBM

LightGBM is a gradient boosting framework by Microsoft that uses histogram-based splitting and leaf-wise growth for faster training on large datasets.

Learning Rate

The learning rate is a hyperparameter that controls how much model parameters change in response to each gradient update during training.
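
A tiny sketch showing the role of the learning rate in plain gradient descent: too small converges slowly, too large diverges.

```python
def gradient_descent(grad, x0, lr, steps):
    # Each update moves against the gradient; lr scales the step size.
    x = x0
    for _ in range(steps):
        x -= lr * grad(x)
    return x

# Minimizing f(x) = (x - 3)^2, whose gradient is 2 * (x - 3):
minimum = gradient_descent(lambda x: 2 * (x - 3), x0=0.0, lr=0.1, steps=100)
```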

Loss Function

A loss function measures how far a model's predictions are from the correct answers, providing the signal that gradient descent uses to improve the model.

Layer

A layer is a group of neurons at the same depth in a neural network that process inputs together and pass their outputs to the next layer.

Leaky ReLU

Leaky ReLU is a variant of ReLU that allows a small, non-zero gradient for negative inputs, preventing the dying ReLU problem.
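
The function itself is a one-liner; the small slope for negative inputs is what keeps gradients from dying.

```python
def leaky_relu(x, alpha=0.01):
    # Pass positives through unchanged; scale negatives by a small
    # slope alpha so the gradient is never exactly zero.
    return x if x > 0 else alpha * x
```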

LSTM

LSTM (Long Short-Term Memory) is an RNN architecture that uses gating mechanisms to selectively remember and forget information over long sequences.

Layer Normalization

Layer normalization is a technique that normalizes the inputs across the feature dimension for each individual example, stabilizing and accelerating neural network training.

Label Smoothing

Label smoothing is a regularization technique that replaces hard one-hot target labels with soft labels that distribute a small probability mass to incorrect classes.
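
A minimal sketch of the transformation: move a fraction epsilon of the probability mass off the target class and spread it uniformly over all classes.

```python
def smooth_labels(one_hot, epsilon=0.1):
    # Each label becomes (1 - epsilon) * y + epsilon / num_classes,
    # so the target gets slightly less than 1 and the rest slightly more than 0.
    k = len(one_hot)
    return [(1 - epsilon) * y + epsilon / k for y in one_hot]
```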

Latent Diffusion

Latent diffusion performs the diffusion process in a compressed latent space rather than pixel space, dramatically reducing computational cost while maintaining generation quality.

LeNet

LeNet is one of the earliest convolutional neural networks, designed for handwritten digit recognition and establishing the basic CNN architecture pattern.

Label Smoothing

Label smoothing softens hard one-hot training labels by distributing a small probability mass to non-target classes, improving generalization.

LLM

A Large Language Model (LLM) is an AI model trained on massive text datasets that can understand and generate human-like text, powering modern chatbots and AI assistants.

Llama

Llama is a family of open-weight large language models released by Meta, enabling researchers and developers to run and fine-tune capable models locally.

LoRA

LoRA (Low-Rank Adaptation) is a parameter-efficient fine-tuning method that trains small adapter matrices instead of modifying all model weights.
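
A back-of-the-envelope sketch of why this is parameter-efficient (`lora_params` is a hypothetical helper, not a library function): instead of updating a full d_out x d_in weight matrix, LoRA trains two factors A (rank x d_in) and B (d_out x rank), with the effective weight W + B @ A.

```python
def lora_params(d_in, d_out, rank):
    # Full fine-tuning updates every weight in the d_out x d_in matrix.
    full = d_in * d_out
    # LoRA trains only the two low-rank adapter matrices.
    low_rank = rank * d_in + d_out * rank
    return full, low_rank

# For a 4096x4096 attention projection with rank 8:
full, low_rank = lora_params(4096, 4096, 8)
```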

Layer Freezing

Layer freezing is a fine-tuning strategy that keeps certain model layers fixed while training others, balancing customization with preserved general knowledge.

Long Context

Long context refers to language models capable of processing very large inputs, typically 100K tokens or more, enabling analysis of entire documents or codebases.

Length Penalty

A parameter used in beam search and other decoding methods to control whether the model favors shorter or longer generated sequences.
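
One common formulation, from Google's GNMT system, shown as a sketch: beam scores are divided by this penalty, so alpha > 0 offsets beam search's natural bias toward short sequences.

```python
def length_penalty(length, alpha=0.6):
    # GNMT-style penalty: ((5 + length) / 6) ** alpha.
    # alpha = 0 disables the penalty; larger alpha favors longer outputs.
    return ((5 + length) / 6) ** alpha
```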

Least-to-Most Prompting

A prompting technique that breaks complex problems into simpler subproblems, solving them in order from easiest to hardest and building on each result.

LongLoRA

An efficient fine-tuning method that extends the context length of pre-trained models using shifted sparse attention and LoRA, requiring minimal additional compute.

Load Balancing Loss

An auxiliary training loss that encourages even distribution of tokens across experts in Mixture of Experts models, preventing expert collapse.

Llama 3

Meta's third generation of open-weight language models, offering strong performance across 8B and 70B sizes for broad open-source adoption.

Llama 3.1

An enhanced version of Llama 3 with extended 128K context, multilingual support, and a new 405B parameter flagship model.

Layer Normalization

A technique that normalizes activations across features within each layer, stabilizing training and enabling deeper transformer networks.

Latent Space

The high-dimensional internal representation space where a model encodes concepts, relationships, and knowledge during processing.

Loss Function

A mathematical function that measures how wrong the model's predictions are during training, guiding the optimization process toward better performance.

Learning Rate

A hyperparameter that controls how much model weights are adjusted in response to each training step, balancing training speed and stability.

Lost in the Middle

A phenomenon where LLMs attend strongly to the beginning and end of long contexts but struggle to use information positioned in the middle.

LMSYS

LMSYS is a research organization that created Chatbot Arena and maintains the most widely referenced open LLM leaderboard.

LAMBADA

LAMBADA is a benchmark testing word prediction where the last word of a passage can only be guessed with broad context understanding.

LiveBench

LiveBench is a continuously updated benchmark using fresh questions to prevent contamination, ensuring models cannot memorize answers.

Leakage

Leakage is the unintended exposure of test or evaluation data to a model during training, compromising the validity of results.

Latency Optimization

Latency optimization reduces the time between sending a request to a language model and receiving the response or first token.

LLM Translation

LLM translation uses large language models to translate text between languages, often matching or exceeding dedicated translation systems.

LLM Summarization

LLM summarization uses language models to condense long documents into shorter summaries while preserving key information and meaning.

LLM Classification

LLM classification uses language models to categorize text into predefined classes, often matching purpose-built classifiers with zero-shot ability.

LLM Extraction

LLM extraction uses language models to identify and pull structured data from unstructured text, like names, dates, and entities.

LLM Reasoning

LLM reasoning refers to the ability of language models to perform multi-step logical thinking, deduction, and problem solving.

Lemmatization

Lemmatization is a text processing technique that reduces words to their dictionary base form (lemma) using vocabulary and morphological analysis.

Language Detection

Language detection is the NLP task of automatically identifying which language a piece of text is written in.

Low-resource Translation

Low-resource translation addresses the challenge of building translation systems for language pairs with very limited training data available.

Long-Form Question Answering

Long-form question answering generates detailed, multi-sentence or multi-paragraph answers to complex questions that cannot be answered briefly.

Language Model

A language model is a probabilistic model that predicts the likelihood of sequences of words, forming the foundation of modern NLP.

Legal NLP

Legal NLP applies natural language processing techniques to legal documents for contract analysis, case research, and regulatory compliance.

Language Generation Evaluation

Language generation evaluation assesses the quality of text produced by NLP systems using automatic metrics and human judgment.

Language Identification

Language identification determines what language a given text is written in, often as the first step in multilingual NLP pipelines.

Lexical Analysis

Lexical analysis examines individual words and their properties, including part of speech, morphology, and lexical meaning.

Language Understanding Benchmark

Language understanding benchmarks are standardized test suites that measure NLP model capabilities across multiple tasks.

Lexical Substitution

Lexical substitution is the NLP task of finding appropriate replacement words for a target word in context while preserving meaning.

Lemma

A lemma is the base or dictionary form of a word, used in lemmatization to normalize different inflected forms to a single representation.

Long-form RAG

A RAG approach optimized for generating extended, well-structured responses such as reports, summaries, or articles from multiple retrieved sources.

LanceDB

An open-source, serverless vector database built on the Lance columnar format, designed for multi-modal AI data with zero infrastructure management.

Locality-Sensitive Hashing

A hashing technique that maps similar vectors to the same hash buckets with high probability, enabling fast approximate nearest neighbor search through hash lookups.
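
A minimal random-hyperplane sketch of the idea (one classic LSH family for cosine similarity): each hyperplane contributes one signature bit saying which side of it the vector falls on, so small-angle vectors tend to collide.

```python
import random

def random_hyperplanes(dim, n_planes, seed=0):
    # Each hyperplane is represented by a random Gaussian normal vector.
    rng = random.Random(seed)
    return [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(n_planes)]

def lsh_hash(vec, planes):
    # One bit per hyperplane: the sign of the dot product with its normal.
    return "".join(
        "1" if sum(v * p for v, p in zip(vec, plane)) >= 0 else "0"
        for plane in planes
    )
```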

L2 Distance

Another name for Euclidean distance, computing the straight-line distance between two vectors in high-dimensional space using the L2 norm.

Layout Analysis

The process of understanding the visual structure of a document page, identifying regions like text blocks, tables, figures, and headers for proper content extraction.

Learned Sparse Embedding

Sparse vector representations generated by neural models that learn which terms are most important, outperforming traditional keyword-based sparse methods.

Late Interaction Embedding

An embedding approach where query and document are encoded independently but compared through fine-grained token-level interaction at search time.

Late Chunking

A technique that embeds the full document first through a long-context model, then chunks the resulting token embeddings, preserving cross-chunk context.

LangChain

A popular open-source framework for building applications with large language models, providing components for chains, agents, memory, retrieval, and tool use.

LangGraph

A framework from LangChain for building stateful, multi-actor agent applications using graph-based workflows with cycles, branching, and persistence.

LlamaIndex

An open-source framework focused on connecting LLMs with data, providing optimized tools for indexing, retrieval, and RAG application development.

Long-term Memory

Persistent storage of information that an AI agent retains across conversations and sessions, enabling learning, personalization, and accumulated knowledge.

LangSmith

A platform by LangChain for tracing, monitoring, evaluating, and debugging LLM applications, providing observability across the AI application lifecycle.

LangFuse

An open-source observability platform for LLM applications, providing tracing, analytics, evaluation, and prompt management with a self-hostable option.

Latency Tracking

Monitoring the time taken by each component of an AI agent's execution, from LLM response time to tool execution and overall interaction duration.

LangChain Agent

An agent built using the LangChain framework that combines LLM reasoning with tool use to accomplish tasks through a reason-and-act loop.

LangGraph Agent

An agent built using LangGraph that models workflows as stateful graphs with explicit nodes, edges, and conditional branching for complex multi-step tasks.

LlamaIndex Agent

An agent built using the LlamaIndex framework, optimized for data-aware applications that combine retrieval, reasoning, and tool use over structured and unstructured data.

Live Chat

Live chat is real-time text communication between a customer and a human support agent, often integrated with chatbot systems.

Lead Generation

Lead generation through chatbots is the process of capturing potential customer information through conversational engagement.

Lead Qualification

Lead qualification through chatbots is the automated process of evaluating whether a prospect matches ideal customer criteria through conversation.

Light Mode Chat

Light mode chat is the default color scheme using light backgrounds and dark text for the chat interface.

Link Preview

A link preview is an automatic visual summary of a linked web page displayed within a chat message, showing the title, description, and thumbnail.

Language Detection Chat

Language detection in chat automatically identifies the language a user is writing in to provide responses in the same language.

Live Agent Transfer

Live agent transfer is the process of connecting a chatbot user with a human support agent for real-time assistance.

Low-Code Chatbot

A low-code chatbot platform enables building chatbots with minimal programming through visual interfaces augmented by optional custom code.

LIME

LIME (Local Interpretable Model-agnostic Explanations) is a technique that explains individual predictions by fitting a simple interpretable model locally around that prediction.

Local Explanation

An explanation of why an AI model made a specific prediction for a particular input, showing which factors drove that individual decision.

Llama Guard

A safety classifier model from Meta designed to evaluate LLM inputs and outputs for harmful content, based on the Llama model architecture.

Local Differential Privacy

A privacy technique where data is randomized on the user device before collection, ensuring the data collector never sees the true individual data.

llama.cpp

llama.cpp is a C/C++ library for efficient LLM inference on consumer hardware, enabling running large language models on CPUs, Apple Silicon, and consumer GPUs without heavy dependencies.

Latency Monitoring

Latency monitoring tracks the time taken for ML model inference requests, measuring end-to-end response times to ensure model serving meets performance requirements.

Load Balancer for ML

A load balancer for ML distributes prediction requests across multiple model serving replicas, optimizing for GPU utilization, latency, and availability.

llama.cpp Infrastructure

llama.cpp provides the foundational C/C++ inference engine for running quantized LLMs efficiently on CPUs and consumer GPUs across all major platforms.

LLM Gateway

An LLM gateway is a proxy layer that routes requests to multiple LLM providers, providing unified access, cost optimization, fallback handling, and observability for AI applications.

LLaVA

LLaVA (Large Language and Vision Assistant) is a multimodal model that connects a vision encoder to a large language model, enabling conversational interaction about images.

LoRA for Images

LoRA (Low-Rank Adaptation) for images is a lightweight fine-tuning method that adapts image generation models to specific styles, subjects, or concepts using small training sets.

LiDAR

LiDAR (Light Detection and Ranging) uses laser pulses to measure distances and create precise 3D maps of environments with centimeter-level accuracy.

Lane Detection

Lane detection identifies road lane boundaries and markings in images from vehicle cameras, providing essential information for autonomous driving and driver assistance.

Learned Image Compression

Learned image compression uses neural networks to compress images more efficiently than traditional codecs, achieving better quality at the same file sizes.

Live Captioning

Live captioning generates real-time text captions from spoken audio during live events, meetings, or broadcasts.

Lead Scoring

Lead scoring uses AI to rank prospects by their likelihood to convert into paying customers, helping sales teams prioritize their most promising opportunities.

Legal AI

Legal AI applies artificial intelligence to legal work, including contract analysis, legal research, document review, compliance monitoring, and client communication.

Lead Qualification

Lead qualification uses AI to automatically assess whether prospects meet criteria for sales engagement, separating high-potential leads from unqualified ones.

Lead Nurturing

Lead nurturing uses AI-driven automated sequences to build relationships with prospects over time, delivering relevant content until they are ready to purchase.

Land and Expand

Land and expand is a sales strategy where you start with a small initial deal and grow revenue over time through increased usage, additional users, or new use cases.

Lifetime Value Prediction

Lifetime value prediction uses AI to forecast the total revenue a customer will generate over their entire relationship, enabling smarter acquisition and retention spending.

Loyalty Program AI

Loyalty program AI uses machine learning to personalize rewards, predict engagement, optimize incentive structures, and maximize the ROI of customer loyalty initiatives.

LEFT JOIN

A LEFT JOIN returns all rows from the left table and matching rows from the right table, filling in NULL values where no match exists.
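
A runnable illustration using Python's built-in sqlite3 (the table and column names are invented for the example): the customer with no orders still appears, with NULL — `None` in Python — in the order columns.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL);
    INSERT INTO customers VALUES (1, 'Ada'), (2, 'Bob');
    INSERT INTO orders VALUES (10, 1, 99.5);
""")
rows = conn.execute("""
    SELECT customers.name, orders.total
    FROM customers
    LEFT JOIN orders ON orders.customer_id = customers.id
    ORDER BY customers.id
""").fetchall()
```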

Linear Algebra

Linear algebra is the branch of mathematics dealing with vectors, matrices, and linear transformations, forming the mathematical foundation of machine learning and deep learning.

L1 Norm

The L1 norm (Manhattan distance) of a vector is the sum of the absolute values of its elements, used in regularization to promote sparsity in model parameters.

L2 Norm

The L2 norm (Euclidean norm) of a vector is the square root of the sum of squared elements, representing the straight-line distance from the origin and widely used in ML regularization.
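
The two norms side by side, as a minimal sketch:

```python
import math

def l1_norm(v):
    # Sum of absolute values (Manhattan length); promotes sparsity
    # when used as a regularizer.
    return sum(abs(x) for x in v)

def l2_norm(v):
    # Square root of the sum of squares (Euclidean length).
    return math.sqrt(sum(x * x for x in v))
```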

Likelihood

Likelihood is a function that measures how probable the observed data is under different parameter values, guiding parameter estimation in statistical and machine learning models.

Local Minimum

A local minimum is a point where the function value is lower than all nearby points, though not necessarily the lowest overall, relevant to understanding neural network optimization.

Lagrange Multiplier

Lagrange multipliers are a method for finding the extrema of a function subject to constraints, used in support vector machines and constrained optimization problems.

LU Decomposition

LU decomposition factors a matrix into lower and upper triangular matrices, enabling efficient solution of linear systems.
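
A small Doolittle-style sketch without pivoting (it fails if a zero pivot appears; real solvers pivot): A = L @ U with L unit lower triangular and U upper triangular.

```python
def lu_decompose(A):
    # Doolittle factorization: fill one row of U and one column of L per step.
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    U = [[0.0] * n for _ in range(n)]
    for i in range(n):
        L[i][i] = 1.0
        for j in range(i, n):
            U[i][j] = A[i][j] - sum(L[i][k] * U[k][j] for k in range(i))
        for j in range(i + 1, n):
            L[j][i] = (A[j][i] - sum(L[j][k] * U[k][i] for k in range(i))) / U[i][i]
    return L, U
```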

Law of Large Numbers

The law of large numbers states that the sample average converges to the expected value as the sample size grows.
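
A quick simulation of the law with a fair six-sided die (expected value 3.5): with enough rolls, the sample mean lands close to 3.5.

```python
import random

def sample_mean(n, seed=0):
    # Average of n fair die rolls; the seed makes the run reproducible.
    rng = random.Random(seed)
    return sum(rng.randint(1, 6) for _ in range(n)) / n
```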

Linear Programming

Linear programming optimizes a linear objective function subject to linear equality and inequality constraints.

Linear Transformation

A linear transformation is a function between vector spaces that preserves addition and scalar multiplication, represented by matrix multiplication.

Learning Rate

The learning rate is a hyperparameter controlling the step size of parameter updates during gradient descent optimization.

Logarithm

The logarithm is the inverse of exponentiation, converting products to sums and enabling stable computation of likelihoods in machine learning.
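
A concrete illustration of why ML code sums log-probabilities instead of multiplying probabilities: the product of many tiny values underflows 64-bit floats to zero, while the sum of their logs stays finite.

```python
import math

probs = [1e-8] * 50
product = 1.0
for p in probs:
    product *= p          # eventually underflows to exactly 0.0
log_sum = sum(math.log(p) for p in probs)  # a perfectly ordinary finite number
```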

Loss Function

A loss function measures the discrepancy between model predictions and true values, providing the objective that training algorithms minimize.

LangChain Inc

LangChain Inc develops the LangChain framework and LangSmith platform, providing the most popular tools for building applications powered by large language models.

Lambda Labs

Lambda Labs is a cloud computing company that provides GPU cloud infrastructure specifically designed for AI training and inference workloads.

Labelbox

Labelbox is a data-centric AI platform that provides tools for data labeling, annotation, and management to build and improve machine learning models.

LangChain

LangChain is an open-source framework for building applications powered by language models, providing tools for chains, agents, retrieval, and memory.

LangSmith

LangSmith is a developer platform by LangChain for debugging, testing, evaluating, and monitoring LLM applications in development and production.

LlamaIndex

LlamaIndex is a data framework for building LLM applications that connects custom data sources to large language models through indexing and retrieval.

LM Studio

LM Studio is a desktop application for discovering, downloading, and running open-source large language models locally with a user-friendly graphical interface.

LocalAI

LocalAI is an open-source, self-hosted AI inference server that provides OpenAI-compatible APIs for running language, image, and audio models locally.

llama.cpp

llama.cpp is an open-source C/C++ implementation for running LLM inference efficiently on consumer hardware, including CPUs without GPU requirements.

LiveChat

LiveChat is a customer service platform that provides live chat software for websites, enabling real-time communication between businesses and customers.

Landbot

Landbot is a no-code chatbot builder that lets businesses create conversational experiences through a visual drag-and-drop interface without programming.

Lightning AI

Lightning AI provides PyTorch Lightning framework and Lightning Studios, a cloud platform for building, training, and deploying AI models.

LightGBM

LightGBM is a fast gradient boosting framework by Microsoft that uses histogram-based algorithms for efficient training on large datasets with many features.

LangChain

LangChain is the most popular open-source framework for building applications powered by large language models, providing abstractions for chains, agents, and retrieval systems.

LangGraph

LangGraph is a library for building stateful, multi-actor AI agent applications as graphs, enabling complex agent workflows with persistence and human-in-the-loop patterns.

LangSmith

LangSmith is a platform for debugging, testing, evaluating, and monitoring LLM applications, providing observability into complex AI agent and chain executions.

LlamaIndex

LlamaIndex is a framework for building RAG applications, specializing in data ingestion, indexing, and retrieval for connecting LLMs with external data sources.

LM Studio

LM Studio is a desktop application for discovering, downloading, and running local LLMs with a user-friendly GUI, providing a ChatGPT-like experience for open-source models.

llama.cpp

llama.cpp is a C/C++ library for efficient local inference of large language models, enabling LLMs to run on consumer hardware including CPUs, Apple Silicon, and GPUs.

LMQL

LMQL is a query language for large language models that combines natural language prompts with Python logic and output constraints for structured LLM interactions.

LiteLLM

LiteLLM is a lightweight Python library that provides a unified interface for calling 100+ LLM APIs using the OpenAI format, simplifying multi-provider integration.

Langfuse

Langfuse is an open-source LLM engineering platform providing tracing, prompt management, evaluation, and analytics for building production LLM applications.

Label Studio

Label Studio is an open-source data labeling platform supporting text, image, audio, video, and multi-modal annotation for machine learning projects.

LangServe

LangServe is a library by LangChain for deploying LangChain chains and agents as REST APIs with automatic documentation, streaming support, and playground UI.

LangGraph Platform

LangGraph Platform is LangChain's infrastructure for deploying, managing, and scaling stateful AI agent applications built with LangGraph.

llamafile

llamafile is a tool that packages LLMs into single executable files that run on any operating system without installation, simplifying local AI distribution.

LanceDB

LanceDB is a serverless vector database built on the Lance columnar format, providing embedded vector search with no server management required.

Legal AI

Legal AI applies natural language processing and machine learning to legal research, contract analysis, document review, and legal practice management.

Legal Research

AI legal research uses NLP and semantic search to find relevant case law, statutes, and legal precedents faster than traditional keyword-based methods.

Legal Research AI

Legal research AI uses NLP and machine learning to search, analyze, and synthesize legal documents and case law.

Litigation Prediction

Litigation prediction AI forecasts case outcomes, damages, and timelines to inform legal strategy and settlement decisions.

Legal Chatbot

Legal chatbots use AI to provide automated legal information, document preparation, and access to justice for common legal questions.

Legal Document Generation

AI legal document generation automates the creation of legal documents from templates and user inputs using NLP.

Learning Analytics

Learning analytics uses AI to collect, analyze, and report data about learners and learning environments to optimize education.

Language Learning AI

AI language learning uses NLP and speech recognition to provide personalized, interactive foreign language instruction.

Logistics AI

Logistics AI uses machine learning to optimize transportation, warehousing, and distribution operations.

Lead Scoring AI

Lead scoring AI uses machine learning to rank sales prospects by their likelihood to convert into customers.

Library AI

Library AI uses machine learning to enhance library services through intelligent search, cataloging, and patron engagement.

LiDAR for Automotive

Automotive LiDAR uses laser pulses to create detailed 3D maps of the surroundings, enabling precise object detection for autonomous driving and ADAS.

Liveness Detection

Liveness detection uses AI to confirm that a biometric sample comes from a live person physically present at the point of capture, not a photo, video, or mask.

Livestock Monitoring AI

Livestock monitoring AI uses sensors, cameras, and machine learning to track animal health, behavior, and welfare in real-time.

L40S

The NVIDIA L40S is a data center GPU optimized for AI inference, video processing, and graphics workloads, offering strong generative AI performance in a standard PCIe form factor.

L4 GPU

The NVIDIA L4 is a low-power data center GPU designed for efficient AI inference and video processing in space-constrained and power-limited environments.

Liquid Cooling

Liquid cooling uses fluids to remove heat from high-power AI hardware, enabling dense GPU deployments that would be impossible with air cooling alone.

Learning to Rank

Learning to rank uses machine learning to train ranking models from relevance data, optimizing search result ordering for user satisfaction.

Listwise Ranking

Listwise ranking is a learning-to-rank approach that optimizes the entire ranked list at once, directly maximizing ranking metrics like nDCG.

LambdaRank

LambdaRank extends RankNet by weighting pairwise gradients by the change in ranking metrics, directly optimizing for measures like nDCG.

LambdaMART

LambdaMART combines LambdaRank gradients with gradient boosted decision trees, producing one of the most effective learning-to-rank algorithms in practice.

Late Interaction

Late interaction is a retrieval architecture that encodes queries and documents independently but uses token-level interaction for scoring, balancing efficiency with accuracy.

Logo Generation

AI logo generation creates brand logos from text descriptions or style preferences using generative models designed for graphic design.

Landscape Generation

Landscape generation uses AI to create scenic environments, terrains, and natural vistas for art, entertainment, and architectural visualization.

Lyrics to Music

Lyrics-to-music AI generates complete musical compositions from written lyrics, adding melody, harmony, rhythm, and vocal performance automatically.

Lip Sync AI

Lip sync AI automatically synchronizes video of a speaker with audio in a different language or with modified dialogue, adjusting mouth movements to match.

Long Polling

Long polling is a technique where the client sends a request and the server holds it open until new data is available, then responds; the client immediately issues a new request to keep the channel alive.

Line Chart

A line chart displays data points connected by lines, ideal for showing trends and changes over continuous intervals like time.

Looker

Looker is a Google Cloud business intelligence platform that uses a modeling language (LookML) to define metrics and enable self-service analytics.

Logistic Regression

Logistic regression models the probability of a binary outcome as a function of predictor variables using the logistic function.
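
A minimal illustration of the logistic function turning a weighted sum of predictors into a probability (the weights and inputs here are made-up numbers):

```python
import math

def predict_proba(x, weights, bias):
    """Probability of the positive class via the logistic (sigmoid) function."""
    z = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1.0 / (1.0 + math.exp(-z))

predict_proba([2.0, -1.0], weights=[0.5, 0.25], bias=0.0)
# z = 0.75, so the probability is about 0.68
```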

Llama Open-Source

Llama is Meta's family of open-weight large language models, whose publicly released weights broadened access to state-of-the-art AI capabilities.

M

Meta-Learning

Meta-learning, or learning to learn, trains models to quickly adapt to new tasks from very few examples by learning the learning process itself.

Multi-Task Learning

Multi-task learning trains a single model on multiple related tasks simultaneously, allowing the model to share representations and improve performance across all tasks.

Mean Squared Error

Mean squared error measures the average squared difference between predicted and actual values, the standard loss function for regression tasks.
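
A worked example of the definition, on a toy set of three predictions:

```python
def mse(y_true, y_pred):
    """Average of squared prediction errors."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

mse([3.0, 5.0, 2.0], [2.5, 5.0, 4.0])
# errors are 0.5, 0, -2 → (0.25 + 0 + 4) / 3 ≈ 1.42
```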

Multi-Layer Perceptron

A multi-layer perceptron (MLP) is a feedforward neural network with one or more hidden layers between input and output, capable of learning non-linear patterns.

Max Pooling

Max pooling is a downsampling technique that selects the maximum value from each local region of a feature map, preserving the most prominent features.
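
A plain-Python sketch of 2x2 max pooling with stride 2 on a small feature map (real frameworks do this on tensors, but the operation is the same):

```python
def max_pool_2x2(fmap):
    """2x2 max pooling with stride 2 on a 2-D feature map (list of lists)."""
    out = []
    for r in range(0, len(fmap) - 1, 2):
        row = []
        for c in range(0, len(fmap[0]) - 1, 2):
            # keep only the largest value in each 2x2 window
            row.append(max(fmap[r][c], fmap[r][c + 1],
                           fmap[r + 1][c], fmap[r + 1][c + 1]))
        out.append(row)
    return out

max_pool_2x2([[1, 3, 2, 0],
              [4, 6, 1, 1],
              [0, 2, 9, 8],
              [1, 1, 3, 4]])
# → [[6, 2], [2, 9]]: each 2x2 region collapses to its maximum
```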

Multi-Head Attention

Multi-head attention runs multiple self-attention operations in parallel, allowing the model to jointly attend to information from different representation subspaces.

Mixed-Precision Training

Mixed-precision training uses a combination of 16-bit and 32-bit floating-point numbers to reduce memory usage and increase training speed without sacrificing model quality.

Mixup

Mixup is a data augmentation and regularization technique that trains on convex combinations of pairs of training examples and their labels.
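
A minimal sketch of the mixup operation on flattened feature vectors and one-hot labels, with the mixing weight drawn from a Beta distribution as in the original technique:

```python
import random

def mixup(x1, y1, x2, y2, alpha=0.2):
    """Blend two examples and their one-hot labels with weight lam ~ Beta(alpha, alpha)."""
    lam = random.betavariate(alpha, alpha)
    x = [lam * a + (1 - lam) * b for a, b in zip(x1, x2)]
    y = [lam * a + (1 - lam) * b for a, b in zip(y1, y2)]
    return x, y
```

The blended label is soft: a mix of, say, 70% "cat" and 30% "dog", which regularizes the model toward smoother decision boundaries.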

Model Pruning

Model pruning removes unnecessary weights or neurons from a trained neural network to reduce its size and computational cost while preserving accuracy.

Mode Collapse

Mode collapse is a GAN training failure where the generator produces only a limited variety of outputs, failing to capture the full diversity of the real data distribution.

MobileNet

MobileNet is a family of efficient CNNs using depthwise separable convolutions to achieve fast inference on mobile and edge devices.

Mamba

Mamba is a selective state space model that achieves linear-time sequence modeling with content-dependent selection, rivaling transformers in quality.

Mistral

Mistral is a family of efficient open-weight language models from Mistral AI, known for strong performance relative to their parameter count.

Multimodal Model

A multimodal model is an AI model that can process and generate content across multiple types of data, such as text, images, audio, and video.

Max Tokens

Max tokens is a parameter that sets the upper limit on how many tokens the model can generate in its response, controlling output length.

Meta-Prompting

Meta-prompting uses a language model to generate, evaluate, or improve prompts, automating the prompt engineering process itself.

Mixture of Experts

Mixture of Experts (MoE) is a model architecture that uses multiple specialized sub-networks, routing each input to only a subset for efficient computation.

Mask Token

A special token used in masked language models like BERT that replaces a word so the model can learn to predict it from surrounding context.

Merge Rule

A rule in BPE tokenization that specifies which pair of tokens should be merged into a single new token, learned from training data frequency.

Min-p

A dynamic sampling method that filters out tokens with probabilities below a fraction of the most likely token probability.
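
A minimal sketch of the filtering step over a toy token distribution (renormalization of the surviving probabilities is omitted for brevity):

```python
def min_p_filter(probs, min_p=0.1):
    """Keep only tokens whose probability is at least min_p times the top probability."""
    threshold = min_p * max(probs.values())
    return {tok: p for tok, p in probs.items() if p >= threshold}

min_p_filter({"the": 0.50, "a": 0.30, "zebra": 0.01}, min_p=0.1)
# threshold = 0.05, so "zebra" is dropped while "the" and "a" survive
```

Because the threshold scales with the top probability, the filter is strict when the model is confident and permissive when the distribution is flat.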

Mirostat

An adaptive sampling algorithm that dynamically adjusts the sampling parameters to maintain a target level of surprise (perplexity) in generated text.

Masked Language Modeling

A pre-training objective where random tokens are masked and the model learns to predict them from surrounding context, used by BERT-style models.
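
A simplified sketch of the masking step (BERT's full recipe also sometimes swaps in random tokens, which is omitted here):

```python
import random

def mask_tokens(tokens, mask_prob=0.15, mask_token="[MASK]", seed=0):
    """Hide a random subset of tokens; the model's training target is `targets`."""
    rng = random.Random(seed)
    masked, targets = [], {}
    for i, tok in enumerate(tokens):
        if rng.random() < mask_prob:
            masked.append(mask_token)
            targets[i] = tok  # the model must predict this from context
        else:
            masked.append(tok)
    return masked, targets
```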

Multi-Query Attention

An attention variant where all attention heads share a single set of key and value projections while maintaining separate queries, dramatically reducing KV cache size.
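
A back-of-envelope calculation of the KV cache saving, assuming a hypothetical 32-layer model with 32 heads of dimension 128, a 4,096-token context, and 2-byte (FP16) values:

```python
def kv_cache_bytes(layers, kv_heads, head_dim, seq_len, bytes_per_value=2):
    """Keys + values stored for every layer, KV head, and sequence position."""
    return 2 * layers * kv_heads * head_dim * seq_len * bytes_per_value

mha = kv_cache_bytes(layers=32, kv_heads=32, head_dim=128, seq_len=4096)
mqa = kv_cache_bytes(layers=32, kv_heads=1, head_dim=128, seq_len=4096)
mha // mqa  # → 32: sharing one KV head across all query heads shrinks the cache 32x
```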

Model Size

The total number of parameters in a neural network, typically measured in billions for modern LLMs, determining capacity and computational requirements.

Mistral 7B

Mistral AI's efficient 7-billion-parameter model that outperformed much larger models at its release through architectural innovations.

Mixtral

Mistral AI's Mixture of Experts model that achieves performance rivaling much larger dense models while using only a fraction of parameters per token.

Model Distillation

A technique where a smaller student model is trained to mimic the outputs of a larger teacher model, transferring knowledge into a more efficient form.

Model Router

A system that automatically selects the best model for each query based on complexity, cost, and capability, optimizing quality and spending.

Model Merging

A technique that combines the weights of multiple fine-tuned models into a single model, creating a model with the combined capabilities without additional training.

Mixed Precision

A training technique that uses lower-precision number formats for most computations while keeping critical values in higher precision for accuracy.

Model Serving

The infrastructure and systems that host trained models and handle incoming inference requests in production, managing throughput, latency, and scaling.

Multi-Turn Conversation

A dialogue with multiple exchanges where the model maintains context from previous turns, enabling coherent, extended interactions.

Model Collapse

A degradation phenomenon where models trained on AI-generated data progressively lose diversity and quality across successive generations.

MMLU

MMLU (Massive Multitask Language Understanding) is a benchmark that tests language models across 57 academic subjects, from STEM to humanities.

MMLU-Pro

MMLU-Pro is a harder, more rigorous version of MMLU with ten answer choices and improved question quality to better differentiate frontier models.

MATH Benchmark

MATH is a benchmark of 12,500 competition-level mathematics problems testing advanced reasoning across algebra, geometry, and number theory.

MBPP

MBPP (Mostly Basic Python Programs) is a benchmark of 974 crowd-sourced Python programming tasks testing fundamental code generation.

MT-Bench

MT-Bench is a benchmark that evaluates multi-turn conversation ability using GPT-4 as an automated judge across eight categories.

Multi-Head Attention

Multi-head attention runs multiple parallel attention operations, allowing the model to jointly attend to information from different representation subspaces.

Model Hosting

Model hosting is the infrastructure and services for deploying language models so they can serve inference requests at scale.

Model API

A model API provides programmatic access to a language model through HTTP endpoints, enabling applications to send prompts and receive responses.

Model Compression

Model compression reduces the size and computational requirements of a language model while preserving as much capability as possible.

Model Sharding

Model sharding splits a language model across multiple GPUs or devices, enabling deployment of models too large for a single device.

Model Offloading

Model offloading stores parts of a model in CPU RAM or disk, loading them to GPU only when needed to enable running models on limited hardware.

Math Reasoning

Math reasoning is the ability of language models to solve mathematical problems through step-by-step logical computation and proof.

Multi-Step Reasoning

Multi-step reasoning is the ability to solve problems that require multiple sequential logical steps, building each step on previous conclusions.

Multi-Document Summarization

Multi-document summarization creates a single coherent summary from multiple source documents on the same topic.

Meeting Summarization

Meeting summarization automatically creates concise summaries of meeting transcripts, capturing key decisions, action items, and discussions.

Machine Translation

Machine translation is the NLP task of automatically translating text from one natural language to another using AI.

Multilingual Translation

Multilingual translation uses a single model to translate between multiple language pairs, rather than separate models for each pair.

Multi-hop QA

Multi-hop QA answers questions that require reasoning over multiple pieces of evidence, connecting information from different sources.

Multimodal Sentiment Analysis

Multimodal sentiment analysis combines text, audio, and visual signals to determine sentiment more accurately than text alone.

Multi-Party Dialogue

Multi-party dialogue involves conversations with three or more participants, requiring tracking of multiple speakers and their interactions.

Morphological Analysis

Morphological analysis studies the internal structure of words, breaking them into morphemes like roots, prefixes, and suffixes.

Masked Language Modeling

Masked language modeling is a training objective where a model learns to predict randomly hidden words in a text, used by models like BERT.

Multilingual NLP

Multilingual NLP develops models and techniques that work across multiple languages, enabling language technology for diverse linguistic communities.

Multi-Task Learning in NLP

Multi-task learning trains a single model on multiple NLP tasks simultaneously, allowing tasks to share knowledge and improve each other.

Machine Reading

Machine reading enables computers to automatically extract knowledge and understanding from written text at scale.

Multi-Label Text Classification

Multi-label text classification assigns multiple category labels to a single text, recognizing that text can belong to several categories simultaneously.

Morpheme

A morpheme is the smallest meaningful unit of language, such as prefixes, suffixes, and root words.

Machine Comprehension

Machine comprehension is the AI task of reading text and answering questions about it, testing whether a system truly understands what it reads.

Modular RAG

A flexible RAG architecture composed of interchangeable modules for retrieval, processing, and generation that can be configured for different use cases.

Multi-step RAG

A RAG pipeline that breaks complex queries into multiple sub-questions, retrieves information for each, and synthesizes a comprehensive final answer.

Multi-modal RAG

A RAG system that retrieves and reasons over multiple data types including text, images, tables, and audio to generate comprehensive answers.

Milvus

An open-source, cloud-native vector database designed for billion-scale similarity search with support for multiple index types and hybrid search.

Multi-vector Embedding

A representation approach that produces multiple vectors per text input, one per token or segment, enabling finer-grained matching than single-vector embeddings.

Matryoshka Embedding

An embedding training technique that produces vectors useful at multiple dimensions, allowing you to truncate to shorter lengths while preserving most quality.
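
The truncation step itself is simple; a sketch of shortening a unit-length vector and re-normalizing so it remains usable for cosine similarity (the example vector is made up):

```python
import math

def truncate_embedding(vec, dims):
    """Keep the first `dims` values and re-normalize to unit length."""
    head = vec[:dims]
    norm = math.sqrt(sum(v * v for v in head))
    return [v / norm for v in head]

truncate_embedding([0.6, 0.8, 0.0, 0.0], dims=2)
# the 2-dim prefix is already unit length, so values are nearly unchanged
```

Matryoshka training ensures the leading dimensions carry the most information, which is what makes this truncation nearly lossless.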

Manhattan Distance

A distance metric that sums the absolute differences across all dimensions, measuring distance along grid lines rather than straight-line distance.
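
A one-line implementation with a worked example:

```python
def manhattan(a, b):
    """Sum of absolute coordinate differences (L1 distance)."""
    return sum(abs(x - y) for x, y in zip(a, b))

manhattan([1, 2, 3], [4, 0, 3])  # |1-4| + |2-0| + |3-3| = 5
```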

Multi-stage Retrieval

A retrieval pipeline with multiple sequential filtering and ranking stages, progressively narrowing and improving results from a broad initial search.

Multi-query Retrieval

A technique that generates multiple different queries from a single user question and retrieves documents for each, combining results for broader coverage.
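
The merge step can be sketched as follows; `search_fn` is a stand-in for whatever retriever you use, and `fake_search` below is a toy example, not a real API:

```python
def multi_query_retrieve(queries, search_fn, k=5):
    """Run every reformulated query and merge results, deduplicating by doc id."""
    seen, merged = set(), []
    for q in queries:
        for doc_id, score in search_fn(q, k):
            if doc_id not in seen:
                seen.add(doc_id)
                merged.append((doc_id, score))
    return merged

def fake_search(query, k):  # toy stand-in retriever
    results = {"pricing": [("doc1", 0.9), ("doc2", 0.8)],
               "cost":    [("doc2", 0.95), ("doc3", 0.7)]}
    return results[query][:k]

multi_query_retrieve(["pricing", "cost"], fake_search)
# → [("doc1", 0.9), ("doc2", 0.8), ("doc3", 0.7)]
```

Real systems typically re-rank the merged pool (for example with reciprocal rank fusion) rather than keeping first-seen scores.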

Metadata Extraction

The process of pulling out descriptive information about a document, such as title, author, date, and categories, to enrich knowledge base entries for better retrieval.

Maximum Inner Product Search

A search method that finds vectors with the highest dot product to a query vector, useful when vector magnitudes carry meaningful information.

Markdown Chunking

A structure-aware chunking method that splits markdown documents along headings, code blocks, and other structural elements to preserve document organization.
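
A minimal heading-based splitter illustrating the idea (a production chunker would also respect code fences and size limits):

```python
import re

def chunk_markdown(text):
    """Split a markdown document into chunks, starting a new chunk at each heading."""
    chunks, current = [], []
    for line in text.splitlines():
        if re.match(r"^#{1,6} ", line) and current:
            chunks.append("\n".join(current))
            current = []
        current.append(line)
    if current:
        chunks.append("\n".join(current))
    return chunks

chunk_markdown("# Intro\nSome text.\n## Setup\nMore text.")
# → ["# Intro\nSome text.", "## Setup\nMore text."]
```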

MetaGPT

A multi-agent framework that assigns real-world software engineering roles to agents, enabling them to collaboratively produce software through structured processes.

Multi-agent System

A system where multiple AI agents collaborate, compete, or coordinate to accomplish tasks that are too complex for a single agent to handle effectively.

Message Passing

A communication pattern where AI agents exchange information through discrete messages, each containing structured data or natural language content.

Memory Retrieval

The process of finding and returning relevant memories from an agent's memory store, typically using semantic search to match the current context.

Memory Consolidation

The process of organizing, summarizing, and optimizing stored memories over time, merging related memories and discarding redundant information.

Manager Agent

A supervisory agent that coordinates the work of other agents, assigning tasks, monitoring progress, and making decisions about workflow direction.

Memory Importance Scoring

A mechanism that assigns importance scores to memories, determining which memories are retained, retrieved, and prioritized during agent reasoning.

Memory Reflection

A process where agents periodically review their accumulated memories to extract higher-level insights, patterns, and generalizations.

Memory Stream

A comprehensive, chronologically ordered record of all agent observations and experiences, serving as the foundation for memory retrieval and reflection.

Multi-Turn Conversation

A multi-turn conversation involves multiple exchanges between user and bot, maintaining context and building on previous messages.

Message Rendering

Message rendering is the process of displaying chatbot messages in the interface, including text formatting, markdown, code blocks, and rich content.

Minimized Chat

Minimized chat is the collapsed state of a chat widget that shows a compact indicator while preserving the ongoing conversation.

Message Input

A message input is the interactive form element in a chat interface where users compose and format their messages.

Message List

A message list is the scrollable container in a chat interface that displays the chronological sequence of messages in a conversation.

Message Bubble

A message bubble is the visual container for an individual message in a chat interface, styled to indicate the sender.

Multi-Channel Deployment

Multi-channel deployment is the practice of making a chatbot available across multiple communication platforms simultaneously.

Microsoft Teams Bot

A Microsoft Teams bot is a chatbot that operates within Microsoft Teams, providing conversational AI directly in the workplace communication platform.

Mobile-Optimized Chat

Mobile-optimized chat is a conversational interface specifically designed for the constraints and capabilities of mobile devices.

Multilingual Chatbot

A multilingual chatbot supports conversations in multiple languages, serving users in their preferred language.

Messages per Conversation

Messages per conversation is the average number of message exchanges in a chat session, indicating conversation depth and efficiency.

Message Credit

A message credit is a unit of chatbot platform currency where each message sent or received consumes one or more credits from an allocation.

Mesa-optimization

When a trained AI model develops its own internal optimization process with its own objective, which may differ from the training objective.

Model Card

A standardized documentation format for AI models that describes their intended use, performance characteristics, limitations, ethical considerations, and evaluation results.

Measurement Bias

Systematic error introduced when the features or labels used to train an AI model are poor proxies for the actual concept being measured.

Model Transparency

The degree to which the inner workings, training data, decision processes, and limitations of an AI model are visible and understandable to stakeholders.

MLOps

MLOps (Machine Learning Operations) is a set of practices that combines machine learning, DevOps, and data engineering to deploy and maintain ML models in production reliably and efficiently.

ML Lifecycle

The ML lifecycle encompasses all stages of a machine learning project, from problem definition and data collection through model training, deployment, monitoring, and iteration.

Model Training

Model training is the process of teaching a machine learning model to make predictions by exposing it to data and adjusting its internal parameters to minimize errors.

Model Evaluation

Model evaluation is the process of assessing a trained model's performance using metrics, test data, and validation techniques to determine if it meets quality standards.

Model Deployment

Model deployment is the process of making a trained machine learning model available for use in production systems, serving predictions to end users or applications.

Model Serving

Model serving is the infrastructure and process of hosting a trained ML model and responding to prediction requests in real time or in batches.

Model Monitoring

Model monitoring is the ongoing observation of a deployed ML model's performance, data quality, and system health to detect degradation and trigger retraining.

Model Registry

A model registry is a centralized repository for storing, versioning, and managing ML model artifacts along with their metadata, lineage, and deployment status.

Model Versioning

Model versioning is the practice of tracking and managing different iterations of ML models, enabling comparison, rollback, and reproducibility across the model lifecycle.

Multi-GPU Training

Multi-GPU training distributes model training across multiple GPUs to accelerate the process, either by splitting data batches or partitioning the model itself.

Model Container

A model container packages an ML model with its dependencies, runtime, and serving code into a Docker container for consistent, portable deployment.

MLflow

MLflow is an open-source platform for managing the ML lifecycle, providing tools for experiment tracking, model packaging, registry, and deployment.

Model Degradation

Model degradation is the gradual decline in ML model performance over time due to changes in data patterns, user behavior, or the environment the model operates in.

Model Training Pipeline

A model training pipeline is an automated, reproducible workflow that takes raw data through preprocessing, feature engineering, model training, and evaluation.

Model Evaluation Pipeline

A model evaluation pipeline is an automated workflow that systematically assesses a trained model against defined metrics, benchmarks, and quality gates before deployment.

Model Selection

Model selection is the process of choosing the best model architecture, algorithm, and hyperparameters for a given task based on evaluation results and constraints.

Model Maintenance

Model maintenance encompasses the ongoing activities required to keep a deployed ML model performing well, including monitoring, retraining, updating, and patching.

Model Retirement

Model retirement is the planned process of decommissioning an ML model from production, including traffic migration, resource cleanup, and documentation archival.

Model Governance

Model governance is the framework of policies, processes, and controls that ensure ML models are developed, deployed, and maintained responsibly and in compliance with regulations.

Model Lifecycle

The model lifecycle encompasses all stages an ML model goes through, from initial problem definition and data collection to training, deployment, monitoring, and retirement.

Model Catalog

A model catalog is a searchable inventory of all ML models in an organization, providing metadata, documentation, and status information for discovery and governance.

Model Lineage

Model lineage tracks the complete provenance of an ML model, including the data, code, parameters, and environment used to create it.

Multi-Node Training

Multi-node training distributes ML model training across multiple servers, each containing one or more GPUs, to handle models and datasets too large for a single machine.

Mixed Precision Training

Mixed precision training uses both 16-bit and 32-bit floating-point numbers during model training to reduce memory usage and increase speed while maintaining accuracy.

Model Parallelism

Model parallelism splits a single ML model across multiple GPUs, with each GPU holding and computing a portion of the model, enabling training of models too large for one GPU.

Megatron-LM

Megatron-LM is a framework by NVIDIA for efficient training of large transformer models using tensor, pipeline, and data parallelism.

Model Serving Infrastructure

Model serving infrastructure is the complete stack of hardware, software, and networking required to host and serve ML model predictions to applications and users.

Model Endpoint

A model endpoint is a network-accessible URL or service that accepts input data and returns model predictions, serving as the interface between ML models and applications.

Model Warm-Up

Model warm-up is the process of loading an ML model into memory and running initial inference requests to optimize performance before serving production traffic.

MLflow Serving

MLflow Serving deploys models logged in MLflow as REST API endpoints, supporting multiple ML frameworks and providing a standardized serving interface.

Model Monitoring Infrastructure

Model monitoring infrastructure is the technical stack of tools and systems that collect, process, and alert on ML model performance, data quality, and operational metrics.

Modal

Modal is a cloud platform for running compute-intensive Python functions serverlessly, offering GPU access, container management, and scaling for ML workloads.

Model Governance Framework

A model governance framework is a structured set of policies, roles, and processes that organizations implement to manage ML models responsibly throughout their lifecycle.

ML Platform

An ML platform is a unified set of tools and infrastructure that enables data scientists and ML engineers to build, train, deploy, and monitor models efficiently.

Model Compression

Model compression reduces the size and computational requirements of ML models through techniques like quantization, pruning, and distillation while preserving prediction quality.

Model Artifact

A model artifact is the serialized file or collection of files that represent a trained ML model, including weights, architecture, configuration, and metadata needed for inference.

Model Reproducibility

Model reproducibility is the ability to recreate an ML model with identical or near-identical performance by reusing the same data, code, parameters, and environment.

Model Deployment Strategy

A model deployment strategy defines the approach for releasing new ML models to production, including rollout patterns, testing procedures, and rollback plans.

ML Observability

ML observability is the ability to understand the internal state of ML systems through monitoring, logging, tracing, and analysis of models, data, and infrastructure.

Model Rollback

Model rollback is the process of reverting a production ML model to a previous version when the current version exhibits issues like degraded performance or unexpected behavior.

Model Optimization

Model optimization is the process of improving an ML model for production deployment by reducing size, increasing speed, and lowering resource requirements while maintaining quality.

Model Serving Cost

Model serving cost is the total expense of running ML inference in production, including compute, memory, storage, networking, and operational overhead.

Model Caching

Model caching stores model predictions, intermediate computations, or model weights in fast-access memory to reduce latency, compute costs, and loading times.
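
Prediction-level caching is the simplest form; a sketch with a counter standing in for an expensive model call:

```python
calls = {"n": 0}

def expensive_predict(x):
    calls["n"] += 1  # stand-in for a slow model inference
    return x * 2

cache = {}

def cached_predict(x):
    """Return a cached prediction when available, otherwise compute and store it."""
    if x not in cache:
        cache[x] = expensive_predict(x)
    return cache[x]

cached_predict(3); cached_predict(3); cached_predict(4)
calls["n"]  # → 2: the repeated input hit the cache instead of the model
```

Production systems often use an external store like Redis with TTLs, and for LLMs may cache on semantic similarity rather than exact-match inputs.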

Model Registry Best Practices

Model registry best practices are guidelines for effectively organizing, versioning, and managing ML models within a registry to support reliable deployments and governance.

ML Security

ML security encompasses the practices and tools for protecting ML systems from adversarial attacks, data poisoning, model theft, and other security threats specific to AI.

Model Testing

Model testing systematically evaluates ML models beyond standard metrics, including behavioral tests, edge cases, fairness checks, and robustness assessments.

Model Packaging

Model packaging bundles a trained ML model with its dependencies, preprocessing code, and configuration into a portable, deployable artifact.

Model Explainability Infrastructure

Model explainability infrastructure provides the tools and systems for generating, storing, and serving explanations of ML model predictions in production.

Model Distillation Infrastructure

Model distillation infrastructure provides the pipeline and compute for training smaller student models to mimic the behavior of larger teacher models at reduced cost.

Model Fairness Infrastructure

Model fairness infrastructure provides the tools and pipelines for measuring, monitoring, and enforcing fairness constraints in ML models across protected groups.

ML Cost Optimization

ML cost optimization is the practice of systematically reducing the expenses of ML infrastructure and operations while maintaining model quality and service level objectives.

Model A/B Testing

Model A/B testing compares two or more ML model versions by serving them to different user segments and measuring the impact on predefined business and quality metrics.

ML Pipeline Orchestration

ML pipeline orchestration manages the execution of complex, multi-step ML workflows including data processing, training, evaluation, and deployment through automated scheduling and dependency management.

Mask R-CNN

Mask R-CNN extends Faster R-CNN by adding a branch that predicts pixel-level segmentation masks for each detected object, enabling instance segmentation.

Midjourney

Midjourney is a proprietary AI image generation service known for producing highly aesthetic, artistic images from text prompts through a Discord-based interface.

Multimodal AI

Multimodal AI processes and reasons across multiple types of data simultaneously, such as text, images, audio, and video, enabling richer understanding and generation.

Multimodal Learning

Multimodal learning is the field of training AI models to understand and relate information from multiple modalities like text, images, and audio simultaneously.

Multimodal Fusion

Multimodal fusion combines information from multiple modalities into a unified representation, enabling AI models to reason jointly about different types of data.

Multimodal Embedding

Multimodal embeddings map data from different modalities (text, images, audio) into a shared vector space where semantically similar items are close together regardless of their modality.

Multimodal Reasoning

Multimodal reasoning is the ability of AI models to draw conclusions and make inferences by combining information from multiple modalities like text, images, and data.

Midjourney Model

Midjourney is an AI image generation model known for its exceptional aesthetic quality, artistic style, and photorealistic rendering capabilities.

Mesh Generation

Mesh generation creates 3D polygon meshes from various inputs, producing surface representations suitable for rendering, simulation, and manufacturing.

Multimodal Model

A multimodal model is an AI model capable of processing and generating content across multiple data types such as text, images, audio, and video.

Multimodal RAG

Multimodal RAG extends retrieval-augmented generation to handle multiple data types, retrieving and reasoning over text, images, tables, and charts together.

Multimodal Agent

A multimodal agent is an AI agent that can perceive and interact with its environment through multiple sensory modalities including vision, language, and action.

Medical Image Analysis

Medical image analysis uses AI to interpret medical images such as X-rays, CT scans, MRI, and pathology slides for diagnosis, screening, and treatment planning.

Modern OCR

Modern OCR combines deep learning with traditional text recognition to achieve high-accuracy text extraction from diverse document types and natural scenes.

Model Quantization for Vision

Model quantization reduces the precision of vision model weights and activations from 32-bit floating point to lower bit widths, enabling faster and smaller deployments.

Monocular Depth Estimation

Monocular depth estimation predicts the depth of each pixel in a scene from a single image, using learned visual cues like perspective, occlusion, and relative size.

Multi-Object Tracking

Multi-object tracking simultaneously follows multiple objects across video frames, maintaining consistent identity for each tracked object through occlusion and interaction.

Mixed Precision Training for Vision

Mixed precision training uses a combination of FP16 and FP32 operations to train vision models faster with less memory while maintaining training accuracy.

Music Generation

AI music generation creates original musical compositions using machine learning models, producing melodies, harmonies, and full arrangements from text prompts or musical inputs.

Mel Spectrogram

A mel spectrogram is an audio representation that maps frequencies to the mel scale, matching human auditory perception, and serves as the standard input for speech AI models.

MFCC

MFCCs (Mel-Frequency Cepstral Coefficients) are compact audio features derived from mel spectrograms that capture the spectral shape of speech, widely used in traditional speech processing.

Multi-Speaker TTS

Multi-speaker TTS generates speech in multiple distinct voices from a single model, supporting voice selection at inference time.

Mean Opinion Score

Mean Opinion Score (MOS) is a standardized subjective quality measure where human listeners rate speech on a 1-5 scale.

Music Classification

Music classification automatically categorizes music by genre, mood, instruments, tempo, and other attributes using audio analysis and AI.

Monthly Recurring Revenue

Monthly Recurring Revenue (MRR) is the predictable monthly revenue from active subscriptions, the core financial metric for subscription-based AI businesses.

Marketing Automation

Marketing automation uses AI and software to automate repetitive marketing tasks like email campaigns, lead nurturing, social posting, and customer segmentation.

Manufacturing AI

Manufacturing AI applies artificial intelligence to production, including quality inspection, predictive maintenance, supply chain optimization, and process automation.

Model-as-a-Service

Model-as-a-Service (MaaS) provides access to pre-trained AI models through APIs, allowing businesses to integrate AI capabilities without training or hosting models themselves.

Mean Time to Resolution

Mean time to resolution (MTTR) is the average time taken to fully resolve a customer issue from initial contact to confirmed resolution.

Meeting Assistant

An AI meeting assistant automates meeting tasks including transcription, summarization, action item extraction, and follow-up scheduling to improve meeting productivity.

Model Governance

Model governance establishes policies and processes for managing AI models throughout their lifecycle, ensuring quality, compliance, and accountability.

Multi-Model Strategy

A multi-model strategy uses different AI models from different providers for different tasks, optimizing for capability, cost, and risk across use cases.

Model Switching

Model switching is the ability to change between different AI models or providers with minimal disruption to applications and workflows.

Model Evaluation for Business

Model evaluation for business measures AI model performance against business-specific metrics like customer satisfaction, cost efficiency, and task completion rate.

MySQL

MySQL is a widely-used open-source relational database known for its simplicity, reliability, and performance in web application workloads.

MongoDB

MongoDB is a popular document-oriented NoSQL database that stores data in flexible JSON-like documents, widely used for applications with evolving schemas.

Meilisearch

Meilisearch is a fast, open-source search engine designed for instant, typo-tolerant search experiences, optimized for front-end search interfaces.

Materialized View

A materialized view is a database object that stores the results of a query physically, enabling faster reads at the cost of needing periodic refreshes to stay current.

Multi-Model Database

A multi-model database supports multiple data models such as document, graph, and key-value within a single integrated database engine.

MariaDB

MariaDB is an open-source relational database forked from MySQL, offering enhanced performance, additional storage engines, and stronger community governance.

Memcached

Memcached is a high-performance, distributed in-memory caching system that stores key-value pairs to reduce database load and accelerate data retrieval.

Meilisearch as a Database

Meilisearch used as a data store provides instant, typo-tolerant search with a simple API, optimized for user-facing search experiences in applications.

MessagePack

MessagePack is an efficient binary serialization format that is more compact and faster than JSON while maintaining a similar data model.

Multi-Tenancy

Multi-tenancy is a database architecture where a single database instance serves multiple tenants (customers) with data isolation between them.

Matrix

A matrix is a two-dimensional array of numbers arranged in rows and columns, used in AI for representing datasets, model weights, and linear transformations.

Matrix Multiplication

Matrix multiplication is the operation of multiplying two matrices to produce a third matrix, serving as the core computational operation in neural network forward and backward passes.
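
As a minimal pure-Python sketch (the `matmul` function name is illustrative, not a library API), the row-by-column rule looks like this:

```python
def matmul(a, b):
    """Multiply matrix a (m x n) by matrix b (n x p) into an m x p result."""
    n = len(b)
    assert all(len(row) == n for row in a), "inner dimensions must match"
    return [[sum(a[i][k] * b[k][j] for k in range(n))
             for j in range(len(b[0]))]
            for i in range(len(a))]

# A 2x2 example: each output cell is a dot product of a row of a and a column of b.
print(matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]]))  # [[19, 22], [43, 50]]
```

In practice this triple loop is what optimized libraries like NumPy delegate to highly tuned BLAS routines.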

Matrix Inverse

The inverse of a square matrix A is a matrix A^-1 such that A * A^-1 equals the identity matrix, used for solving systems of equations and in certain optimization algorithms.
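
For the 2x2 case there is a closed-form inverse via the adjugate; a small sketch (function name illustrative):

```python
def inverse_2x2(m):
    """Invert [[a, b], [c, d]]: swap the diagonal, negate the off-diagonal, divide by the determinant."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is singular")
    return [[d / det, -b / det], [-c / det, a / det]]

m = [[4.0, 7.0], [2.0, 6.0]]
inv = inverse_2x2(m)  # [[0.6, -0.7], [-0.2, 0.4]]
```

Multiplying `m` by `inv` recovers the identity matrix, which is the defining property of the inverse.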

Maximum Likelihood Estimation

Maximum Likelihood Estimation (MLE) is a method for estimating model parameters by finding the values that maximize the probability of the observed data under the model.
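
For a coin with unknown heads probability, the MLE works out to the sample mean, since that value maximizes the likelihood of the observed flips. A tiny sketch with made-up data:

```python
def bernoulli_mle(observations):
    """MLE of a coin's heads probability: the fraction of heads observed."""
    return sum(observations) / len(observations)

data = [1, 1, 0, 1, 0, 1, 1, 0, 1, 1]  # 7 heads in 10 flips
p_hat = bernoulli_mle(data)  # 0.7
```

More complex models (e.g. neural networks trained with cross-entropy loss) are also doing MLE, just without a closed-form answer.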

Mutual Information

Mutual information measures the amount of information that one random variable provides about another, quantifying the statistical dependence between two variables.
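
Given a discrete joint distribution as a probability table, mutual information can be computed directly from its definition. A small sketch (function name illustrative):

```python
import math

def mutual_information(joint):
    """I(X;Y) = sum over x,y of p(x,y) * log2(p(x,y) / (p(x) * p(y)))."""
    px = [sum(row) for row in joint]          # marginal of X (row sums)
    py = [sum(col) for col in zip(*joint)]    # marginal of Y (column sums)
    mi = 0.0
    for i, row in enumerate(joint):
        for j, pxy in enumerate(row):
            if pxy > 0:
                mi += pxy * math.log2(pxy / (px[i] * py[j]))
    return mi

# Perfectly dependent variables: knowing X determines Y, giving 1 bit.
print(mutual_information([[0.5, 0.0], [0.0, 0.5]]))  # 1.0
```

Independent variables (e.g. a uniform 2x2 joint of all 0.25) give 0 bits.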

Matrix Rank

The rank of a matrix is the number of linearly independent rows or columns, indicating the dimensionality of information it contains.

Marginal Probability

Marginal probability is the probability of an event irrespective of the outcomes of other variables, obtained by summing or integrating out other variables.

Maximum A Posteriori

Maximum a posteriori (MAP) estimation finds the most probable parameter values given observed data and a prior distribution.

Markov Chain

A Markov chain is a sequence of random states where each state depends only on the immediately preceding state, not on earlier history.
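
A short simulation makes the "memoryless" property concrete: each step consults only the current state's transition probabilities. The weather chain below is a hypothetical example:

```python
import random

def simulate(transitions, state, steps, rng):
    """Walk a Markov chain: the next state depends only on the current one."""
    path = [state]
    for _ in range(steps):
        r = rng.random()
        cumulative = 0.0
        for next_state, p in transitions[state].items():
            cumulative += p
            if r < cumulative:
                state = next_state
                break
        path.append(state)
    return path

weather = {
    "sunny": {"sunny": 0.8, "rainy": 0.2},
    "rainy": {"sunny": 0.5, "rainy": 0.5},
}
path = simulate(weather, "sunny", 10, random.Random(0))
```

The entire history before the current state is irrelevant to where the chain goes next.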

Markov Property

The Markov property states that the future state of a process depends only on the present state, not on the sequence of events that preceded it.

Mahalanobis Distance

Mahalanobis distance accounts for correlations between variables by normalizing with the covariance matrix, measuring distance in standard deviations.
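
As a sketch for the two-dimensional case (function name illustrative), the distance is sqrt((x - mu)^T S^-1 (x - mu)), where S is the covariance matrix:

```python
def mahalanobis_2d(x, mean, cov):
    """Mahalanobis distance for a 2D point given a 2x2 covariance matrix."""
    dx = [x[0] - mean[0], x[1] - mean[1]]
    (a, b), (c, d) = cov
    det = a * d - b * c
    inv = [[d / det, -b / det], [-c / det, a / det]]  # closed-form 2x2 inverse
    y = [inv[0][0] * dx[0] + inv[0][1] * dx[1],
         inv[1][0] * dx[0] + inv[1][1] * dx[1]]
    return (dx[0] * y[0] + dx[1] * y[1]) ** 0.5

# With an identity covariance it reduces to plain Euclidean distance.
print(mahalanobis_2d([3.0, 4.0], [0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]]))  # 5.0
```

With a non-identity covariance, directions of high variance are discounted, which is why the metric is useful for anomaly detection.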

Monte Carlo Method

Monte Carlo methods use random sampling to estimate mathematical quantities that are difficult or impossible to compute analytically.
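
The classic illustration is estimating pi by sampling random points in the unit square and counting how many land inside the quarter circle; a minimal sketch:

```python
import random

def estimate_pi(n, rng):
    """Estimate pi: the fraction of random points inside the quarter circle approaches pi/4."""
    inside = sum(1 for _ in range(n)
                 if rng.random() ** 2 + rng.random() ** 2 <= 1.0)
    return 4 * inside / n

print(estimate_pi(100_000, random.Random(42)))  # close to 3.14159
```

Accuracy improves slowly (error shrinks roughly as 1/sqrt(n)), but the same idea scales to high-dimensional integrals where analytical methods fail.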

Matrix Factorization

Matrix factorization decomposes a matrix into a product of smaller matrices, used for dimensionality reduction and recommendation systems.

Matrix Calculus

Matrix calculus extends calculus to matrix-valued functions, providing rules for computing gradients of loss functions with respect to weight matrices.

Maximum Entropy Principle

The maximum entropy principle selects the probability distribution with the most uncertainty (highest entropy) among those satisfying known constraints.

Moment

A moment is a quantitative measure of the shape of a probability distribution, with the first four moments capturing mean, variance, skewness, and kurtosis.

Manifold

A manifold is a low-dimensional surface embedded in a higher-dimensional space, capturing the intrinsic structure of data in machine learning.

Meta AI

Meta AI is Meta's artificial intelligence research division, known for the open-source Llama model family and foundational contributions to deep learning research.

Microsoft Research

Microsoft Research is one of the world's largest corporate research labs, contributing to AI through partnerships with OpenAI, development of Copilot products, and foundational AI research.

Mistral AI

Mistral AI is a French AI company known for developing efficient, high-performance open-weight language models that punch above their size in benchmarks.

Microsoft Copilot

Microsoft Copilot is an AI assistant integrated across Microsoft products including Windows, Office 365, Edge, and Bing, powered by OpenAI's models.

Microsoft 365 Copilot

Microsoft 365 Copilot is an AI assistant integrated into Microsoft Office apps (Word, Excel, PowerPoint, Teams) powered by GPT-4 and Microsoft Graph data.

MLflow

MLflow is an open-source platform by Databricks for managing the complete machine learning lifecycle including tracking, models, and deployment.

Milvus

Milvus is an open-source vector database designed for massive-scale similarity search, supporting billions of vectors with enterprise-grade reliability.

Midjourney

Midjourney is an AI image generation service known for producing highly artistic and aesthetic images from text prompts through its Discord-based interface.

Mistral API

The Mistral API provides access to Mistral AI models known for strong performance at efficient sizes, especially for European and multilingual deployments.

Modal

Modal is a serverless cloud platform for running AI workloads, providing on-demand GPU access with a developer-friendly Python-native interface.

matplotlib

matplotlib is the foundational Python plotting library, providing comprehensive tools for creating static, animated, and interactive visualizations in data science.

MLflow

MLflow is an open-source platform for managing the ML lifecycle, including experiment tracking, model packaging, deployment, and model registry capabilities.

MLIR

MLIR (Multi-Level Intermediate Representation) is a compiler infrastructure developed by Google for building reusable and extensible compiler components for AI and other domains.

MMDetection

MMDetection is an open-source object detection toolbox built on PyTorch by OpenMMLab, providing implementations of 50+ detection algorithms with a modular design.

Mastra

Mastra is a TypeScript framework for building AI applications and agents with built-in support for tool calling, RAG, workflows, and integrations with third-party services.

Megatron-LM

Megatron-LM is NVIDIA's framework for training large transformer models using efficient model and pipeline parallelism across GPU clusters.

Modal

Modal is a serverless cloud platform for running AI workloads, providing on-demand GPU access, container orchestration, and Python-first infrastructure as code.

Milvus

Milvus is an open-source vector database designed for scalable similarity search, supporting billions of vectors with high-performance indexing and hybrid search.

Marqo

Marqo is a tensor search engine that generates embeddings and performs vector search in one system, simplifying the pipeline from raw content to search results.

Mistral SDK

The Mistral SDK is the official client library for Mistral AI's models, providing access to chat completions, embeddings, function calling, and JSON mode through typed APIs.

Medical AI

Medical AI applies artificial intelligence to clinical medicine, including diagnosis, treatment recommendations, surgical assistance, and patient monitoring.

Medical Imaging

AI-powered medical imaging uses deep learning to analyze radiological images for automated detection, segmentation, and classification of medical conditions.

Medical Transcription

AI medical transcription automatically converts clinical conversations and dictations into structured medical documentation using speech recognition and NLP.

Mental Health AI

Mental health AI uses natural language processing and machine learning to provide therapeutic support, mood tracking, and mental health screening.

Manufacturing AI

Manufacturing AI applies machine learning to optimize production processes, predict equipment failures, ensure quality, and enable smart factory operations.

Medical Coding

AI-assisted medical coding automates the assignment of standardized codes to clinical diagnoses and procedures for billing and records.

Medication Management

AI medication management systems optimize prescribing, monitor drug interactions, and improve medication adherence.

Math Tutoring AI

AI math tutoring provides personalized mathematics instruction with step-by-step problem solving and adaptive practice.

Media AI

Media AI uses machine learning to automate content creation, curation, distribution, and monetization in media and entertainment.

Marketing AI

Marketing AI uses machine learning to optimize campaigns, personalize messaging, and predict customer behavior.

Mining AI

Mining AI uses machine learning to optimize mineral exploration, extraction operations, and safety management in mining.

Materials Science AI

Materials science AI uses machine learning to discover and design new materials with desired properties.

Music AI

Music AI uses machine learning for music composition, production, recommendation, and rights management.

Medical Image Segmentation

Medical image segmentation uses AI to automatically identify and delineate anatomical structures, organs, and abnormalities in medical images.

Mental Health Screening AI

Mental health screening AI uses NLP and behavioral analysis to detect signs of mental health conditions from text, speech, and digital behavior patterns.

Market Risk AI

Market risk AI uses machine learning to model, measure, and predict potential financial losses from market movements in interest rates, currencies, equities, and commodities.

Model Risk Management

Model risk management governs the development, validation, and monitoring of quantitative models (including AI) to ensure they perform reliably and do not create unintended risks.

Memory Bandwidth

Memory bandwidth is the rate at which data can be transferred between memory and processors, a critical bottleneck for AI model performance.

MI300X

The AMD Instinct MI300X is a data center GPU accelerator featuring 192GB of HBM3 memory, designed to compete with the NVIDIA H100 for AI training and inference.

Memory Hierarchy

A memory hierarchy is a structured arrangement of storage levels from fast but small (registers, cache) to slow but large (DRAM, disk), designed to optimize data access for AI workloads.

Memory Offloading

Memory offloading moves portions of AI model data from GPU memory to CPU memory or storage to enable running larger models than GPU memory alone allows.

Mixed-Precision Training

Mixed-precision training uses a combination of lower-precision (FP16/BF16) and higher-precision (FP32) formats to train AI models faster with less memory while maintaining accuracy.

Multi-Instance GPU

Multi-Instance GPU (MIG) is an NVIDIA technology that partitions a single GPU into multiple isolated instances, each with dedicated compute, memory, and cache resources.

Meilisearch

Meilisearch is a fast, open-source search engine designed for developer experience, providing instant search with typo tolerance and easy setup.

Matrix Factorization

Matrix factorization decomposes user-item interaction matrices into latent factors, revealing hidden patterns for recommendation and collaborative filtering.

Multi-Vector Search

Multi-vector search represents documents using multiple embedding vectors rather than a single vector, capturing richer semantic information for more accurate retrieval.

Multilingual Search

Multilingual search enables a single search system to handle queries and documents in multiple languages, providing relevant results regardless of the language used.

Mean Reciprocal Rank

Mean Reciprocal Rank (MRR) evaluates search quality by measuring the average inverse position of the first relevant result across multiple queries.
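
A small sketch of the computation (function name illustrative), where each query's result list marks which ranked positions are relevant:

```python
def mean_reciprocal_rank(results):
    """Average of 1/rank of the first relevant result per query (0 when none is relevant)."""
    total = 0.0
    for relevance in results:  # each list is relevance flags in ranked order
        for rank, rel in enumerate(relevance, start=1):
            if rel:
                total += 1.0 / rank
                break
    return total / len(results)

# Query 1 hits at rank 1, query 2 at rank 3, query 3 never: (1 + 1/3 + 0) / 3.
score = mean_reciprocal_rank([[1, 0, 0], [0, 0, 1], [0, 0, 0]])
```

MRR only rewards the first relevant result, making it a good fit for navigational or question-answering search.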

Music Generation

AI music generation creates original musical compositions, melodies, and arrangements from text descriptions, styles, or musical inputs.

Mesh Generation

AI mesh generation creates 3D polygon meshes from various inputs, producing geometric structures that represent the surface of three-dimensional objects.

Mockup Generation

Mockup generation uses AI to create realistic product mockups, packaging previews, and design presentations without physical prototypes.

Melody Generation

Melody generation uses AI to compose musical melodies, themes, and motifs in specified keys, scales, and styles for songwriting and production.

Music Remixing

AI music remixing uses machine learning to create new versions of existing songs by rearranging elements, changing styles, and adding or removing components.

Music Mastering AI

Music mastering AI uses machine learning to automatically master audio tracks, optimizing loudness, EQ, compression, and stereo width for distribution.

Motion Generation

Motion generation uses AI to create realistic human and character body movements, locomotion patterns, and physical interactions for 3D applications.

Mixture of Experts (Research Perspective)

Mixture of Experts research studies architectures that route inputs to specialized sub-networks, enabling massive models with efficient computation.

Model-Based Reinforcement Learning

Model-based RL learns an internal model of environment dynamics, enabling planning and more sample-efficient learning.

Model-Free Reinforcement Learning

Model-free RL learns optimal behavior directly from experience without building an internal model of environment dynamics.

Meta-Learning (Research Perspective)

Meta-learning research studies how to design AI systems that learn to learn, improving their ability to quickly adapt to new tasks.

Multi-Agent Learning

Multi-agent learning studies how multiple AI agents learn to interact, cooperate, or compete in shared environments.

Multimodal Learning (Research Perspective)

Multimodal learning research studies AI models that process and integrate information from multiple types of data like text, images, and audio.

Microservices

Microservices is an architecture where an application is composed of small, independent services that communicate over network protocols.

Message Broker

A message broker is middleware that routes messages between services, enabling asynchronous communication and decoupling in distributed systems.

Make

Make (formerly Integromat) is a visual automation platform for connecting apps and designing complex workflows with advanced logic and data processing.

Metabase

Metabase is an open-source business intelligence tool that enables non-technical users to ask questions of databases through a visual interface.

Marketing Analytics

Marketing analytics measures the performance and ROI of marketing campaigns, channels, and strategies using data-driven methods.

Mann-Whitney U Test

The Mann-Whitney U test is a non-parametric test that compares two independent groups without assuming normal distribution.

MCMC

Markov Chain Monte Carlo (MCMC) is a family of algorithms that generates samples from probability distributions that are difficult to compute directly.

Metric Layer

A metric layer (or metrics store) provides a centralized, consistent definition of business metrics accessible across all analytics tools.

Multivariate Testing

Multivariate testing simultaneously tests multiple variables and their combinations to find the optimal configuration.

Marvin Minsky

Marvin Minsky (1927-2016) was a pioneering cognitive scientist who co-founded the MIT AI Laboratory and made foundational contributions to AI, robotics, and computational theory.

N

Naive Bayes

Naive Bayes is a probabilistic classifier that applies Bayes' theorem with a naive independence assumption between features, effective for text classification.

Normalization

Normalization scales numerical features to a standard range, typically 0 to 1, ensuring no single feature dominates due to its scale.
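
The standard min-max formula is (x - min) / (max - min); a minimal sketch (function name illustrative):

```python
def min_max_normalize(values):
    """Scale values linearly into [0, 1]; constant inputs map to 0.0."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(x - lo) / (hi - lo) for x in values]

print(min_max_normalize([10, 20, 30, 50]))  # [0.0, 0.25, 0.5, 1.0]
```

Libraries like scikit-learn provide the same transform as `MinMaxScaler`, fitted on training data and reused at inference time.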

Neural Network

A neural network is a computing system inspired by biological brains, composed of interconnected nodes organized in layers that learn patterns from data.

Noise Schedule

A noise schedule defines how noise is added over the diffusion process steps, controlling the rate at which data is corrupted and determining the generation quality.

Nucleus Sampling

Nucleus sampling is a text generation method that selects tokens from the dynamic nucleus of highest-probability tokens summing to a threshold, also known as top-p.
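
A minimal sketch of the idea over a toy token distribution (names and probabilities are hypothetical): sort tokens by probability, keep the smallest prefix whose mass reaches `top_p`, then sample within that nucleus.

```python
import random

def nucleus_sample(probs, top_p, rng):
    """Sample a token from the smallest high-probability set whose mass reaches top_p."""
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    nucleus, mass = [], 0.0
    for token, p in ranked:
        nucleus.append((token, p))
        mass += p
        if mass >= top_p:
            break
    tokens, weights = zip(*nucleus)
    return rng.choices(tokens, weights=weights, k=1)[0]

probs = {"the": 0.5, "a": 0.3, "cat": 0.15, "zebra": 0.05}
token = nucleus_sample(probs, 0.8, random.Random(0))  # drawn from {"the", "a"}
```

Unlike top-k, the nucleus grows or shrinks with the shape of the distribution, truncating the long tail of unlikely tokens adaptively.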

Next-Token Prediction

Next-token prediction is the core training objective of most LLMs, where the model learns to predict the most likely next token in a sequence of text.

Natural Language Processing

The field of AI focused on enabling computers to understand, interpret, and generate human language in useful ways.

Natural Language Understanding

The ability of an AI system to comprehend the meaning, intent, and context of human language input, beyond just processing the words.

Natural Language Generation

The AI capability of producing fluent, coherent human language text from structured data, prompts, or conversational context.

Natural Questions

Natural Questions is a QA benchmark using real Google search queries paired with Wikipedia articles, testing realistic information seeking.

Natural Language Processing

Natural Language Processing (NLP) is a branch of AI focused on enabling computers to understand, interpret, and generate human language.

NLP

NLP stands for Natural Language Processing, the AI discipline that enables machines to read, understand, and generate human language.

Natural Language Understanding

Natural Language Understanding (NLU) is the subset of NLP focused on enabling machines to comprehend the meaning, intent, and context of human language.

NLU

NLU stands for Natural Language Understanding, the AI capability of comprehending meaning, intent, and context from human language input.

Natural Language Generation

Natural Language Generation (NLG) is the NLP subfield focused on producing human-readable text from structured data or internal representations.

NLG

NLG stands for Natural Language Generation, the AI capability of producing fluent human-readable text from data or model representations.

Named Entity Recognition

Named Entity Recognition (NER) is the NLP task of identifying and classifying named entities like people, organizations, and locations in text.

N-gram

An n-gram is a contiguous sequence of n items (words or characters) from a text, used to capture local patterns and word co-occurrences.
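
Extracting word n-grams is a one-line sliding window; a minimal sketch (function name illustrative):

```python
def word_ngrams(text, n):
    """All contiguous word n-grams of a text, joined back into strings."""
    words = text.split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]

print(word_ngrams("the cat sat on the mat", 2))
# ['the cat', 'cat sat', 'sat on', 'on the', 'the mat']
```

Character n-grams work the same way over the raw string and are useful for typo-tolerant matching.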

Neural Machine Translation

Neural machine translation uses deep learning models to translate text between languages, producing more fluent results than earlier statistical methods.

Negation Handling

Negation handling is the NLP challenge of correctly interpreting negation words that reverse or modify the meaning of surrounding text.

Named Entity Linking

Named entity linking connects entity mentions in text to their corresponding entries in a knowledge base like Wikipedia.

Named Entity Normalization

Named entity normalization maps different textual mentions of the same entity to a canonical standard form.

NLP Pipeline

An NLP pipeline is a sequence of processing steps that transforms raw text into structured output, with each step feeding into the next.

Natural Language Inference

Natural language inference classifies whether a hypothesis sentence is entailed by, contradicted by, or neutral with respect to a premise sentence.

Named Entity Disambiguation

Named entity disambiguation resolves ambiguous entity mentions to their correct real-world referents when multiple candidates exist.

Numerical Reasoning

Numerical reasoning in NLP is the ability to understand, compare, and perform calculations with numbers mentioned in text.

Named Entity Types

Named entity types are the categories used to classify named entities in text, ranging from coarse types like Person and Organization to fine-grained types like CEO or University.

Naive RAG

The simplest RAG implementation that retrieves documents and passes them directly to a language model without additional processing or refinement.

Neo4j

The most widely used graph database, using the property graph model and Cypher query language to store and query connected data efficiently.

Noise Robustness

A RAG system's ability to generate accurate answers even when some of the retrieved context is irrelevant, outdated, or contradictory.

Nomic Embed

An open-source, high-performance embedding model with a fully auditable training pipeline and competitive quality across retrieval benchmarks.

Nested Tool Use

A pattern where a tool invoked by an agent itself invokes additional tools or sub-agents, creating a hierarchy of tool calls within a single agent action.

No-Code Chatbot

A no-code chatbot platform allows creating and deploying chatbots through visual interfaces without any programming knowledge.

NPS

NPS (Net Promoter Score) measures customer loyalty by asking how likely users are to recommend the product or service on a 0-10 scale.

NPM Package (Chatbot)

An NPM chatbot package provides installable modules for integrating chatbot functionality into JavaScript projects through the npm registry.

New Visitor

A new visitor is a first-time website user who has no prior chatbot interaction history, requiring introductory guidance and onboarding.

NIST AI RMF

The NIST Artificial Intelligence Risk Management Framework provides voluntary guidance for organizations to manage risks associated with AI systems throughout their lifecycle.

NeMo Guardrails

NVIDIA's open-source toolkit for adding programmable guardrails to LLM-based applications, focusing on conversational safety, topic control, and secure tool use.

NVIDIA GPU

NVIDIA GPUs are the dominant hardware platform for AI and machine learning, providing specialized data center accelerators and the CUDA ecosystem for parallel computing.

NCCL

NCCL (NVIDIA Collective Communications Library) provides optimized GPU-to-GPU communication primitives for distributed deep learning, including all-reduce, broadcast, and gather operations.

NeRF

NeRF (Neural Radiance Fields) represents 3D scenes as neural networks that map spatial coordinates to color and density, enabling photorealistic novel view synthesis from images.

Neural Style Transfer

Neural style transfer applies the artistic style of one image to the content of another image using deep neural networks.

NeRF Variants

NeRF variants improve upon the original Neural Radiance Fields with faster training, real-time rendering, better quality, and support for dynamic and large-scale scenes.

Neural Image Codec

A neural image codec uses learned neural network components for image encoding and decoding, achieving better compression efficiency than traditional handcrafted codecs.

Neural TTS

Neural TTS uses deep learning models to generate highly natural synthetic speech, replacing older concatenative and parametric approaches with end-to-end learned systems.

Noise Reduction

AI noise reduction removes unwanted background noise from audio recordings using deep learning, preserving speech clarity while eliminating distractions.

Naturalness

Naturalness measures how human-like and natural synthesized speech sounds, often evaluated through Mean Opinion Score listening tests.

Noise Cancellation

Noise cancellation uses AI to remove unwanted background sounds from audio in real time, preserving the desired speech or audio signal.

NPS

NPS (Net Promoter Score) measures customer loyalty by asking how likely customers are to recommend the product, categorizing respondents as promoters, passives, or detractors.

Net Revenue Retention

Net revenue retention (NRR) measures the percentage of recurring revenue retained from existing customers including expansions, contractions, and churn over a period.

Network Effect

A network effect occurs when a product becomes more valuable as more people use it, creating self-reinforcing growth and competitive advantages.

Next Best Action

Next best action uses AI to recommend the optimal action for each customer interaction, balancing sales, support, and retention objectives in real-time.

NoSQL Database

A NoSQL database is a non-relational database designed for specific data models, offering flexible schemas and horizontal scalability for modern application workloads.

Neo4j

Neo4j is the leading graph database platform, storing data as nodes and relationships to enable efficient traversal of complex, connected data structures.

NumPy

NumPy is a fundamental Python library for numerical computing, providing efficient multi-dimensional array operations that underpin virtually all scientific computing and AI libraries.

NewSQL Database

A NewSQL database combines the scalability of NoSQL systems with the ACID guarantees and SQL interface of traditional relational databases.

Neon

Neon is a serverless PostgreSQL platform that separates compute from storage, offering instant branching, autoscaling, and scale-to-zero capabilities.

N+1 Query Problem

The N+1 query problem is a performance anti-pattern where loading a list of N records triggers N additional queries to fetch related data, one per record.
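
The pattern is easiest to see with a toy schema (tables and data below are made up for illustration): the naive loop issues one query per author, while a single JOIN fetches everything at once.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE authors (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE books (id INTEGER PRIMARY KEY, author_id INTEGER, title TEXT);
    INSERT INTO authors VALUES (1, 'Ada'), (2, 'Grace');
    INSERT INTO books VALUES (1, 1, 'Notes'), (2, 2, 'Compilers'), (3, 1, 'Engines');
""")

# N+1 pattern: one query for the authors, then one more query per author.
authors = conn.execute("SELECT id, name FROM authors").fetchall()
for author_id, name in authors:
    conn.execute("SELECT title FROM books WHERE author_id = ?",
                 (author_id,)).fetchall()  # executed N times

# Fix: a single JOIN retrieves all author-book pairs in one round trip.
rows = conn.execute(
    "SELECT a.name, b.title FROM authors a JOIN books b ON b.author_id = a.id"
).fetchall()
```

ORMs typically offer eager-loading options (e.g. join or prefetch strategies) that generate the JOIN form automatically.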

Norm

A norm is a function that assigns a non-negative length or size to a vector, providing a way to measure distances in vector spaces used throughout machine learning.

Normal Distribution

The normal distribution is a bell-shaped probability distribution characterized by its mean and standard deviation, appearing throughout nature and forming the basis of many statistical methods.

Non-Convex Optimization

Non-convex optimization involves minimizing functions that may have multiple local minima and saddle points, characterizing the challenging optimization landscape of neural networks.

NVIDIA AI

NVIDIA is the dominant provider of GPUs that power AI training and inference, also developing AI software frameworks, models, and enterprise AI platforms.

Notion AI

Notion AI is an AI writing assistant integrated into the Notion workspace platform, helping users draft, summarize, translate, and brainstorm directly within their notes and documents.

Neptune.ai

Neptune.ai is an MLOps platform for experiment tracking, model registry, and metadata management in machine learning projects.

Nomic AI

Nomic AI develops open-source embedding models and Atlas, a platform for exploring and understanding large datasets through interactive visualizations.

numpy

NumPy is the foundational Python library for numerical computing, providing efficient multi-dimensional arrays and mathematical functions that underpin the entire scientific Python ecosystem.

NLTK

NLTK (Natural Language Toolkit) is a comprehensive Python library for teaching and working with NLP, providing a wide range of text processing tools and linguistic resources.

Neptune AI

Neptune AI is an experiment tracking and model management platform designed for ML teams, providing structured logging and comparison of model training runs.

Nonprofit AI

Nonprofit AI applies machine learning to improve fundraising, program delivery, and impact measurement for nonprofit organizations.

Nuclear Energy AI

Nuclear energy AI uses machine learning to optimize reactor operations, enhance safety monitoring, and support fusion research.

Nutritional AI

Nutritional AI uses machine learning to provide personalized dietary recommendations based on individual health data, genetics, microbiome, and food preferences.

NPU

A Neural Processing Unit (NPU) is a specialized chip or coprocessor designed to accelerate neural network inference on edge devices like phones and laptops.

Neuromorphic Chip

A neuromorphic chip is a processor designed to mimic the structure and function of biological neural networks, enabling brain-like computation.

NVIDIA

NVIDIA is the leading manufacturer of GPUs for AI, providing the hardware, software (CUDA), and platforms that power most modern AI training and inference.

NVLink

NVLink is NVIDIA's high-speed interconnect technology that enables fast data transfer between multiple GPUs, essential for training large AI models.

NVMe

NVMe (Non-Volatile Memory Express) is a high-speed storage protocol used in AI systems for fast data loading, model checkpointing, and dataset access.

NVSwitch

NVSwitch is a high-bandwidth switch chip from NVIDIA that enables all-to-all GPU communication within multi-GPU systems at full NVLink bandwidth.

NVIDIA AI Enterprise

NVIDIA AI Enterprise is a software platform that provides enterprise-grade AI tools, frameworks, and support for deploying AI applications in production environments.

Neuromorphic Computing

Neuromorphic computing is a computing paradigm that mimics the structure and function of biological neural networks in silicon, using spiking neurons and event-driven processing.

Neural Ranking

Neural ranking uses deep learning models to assess search result relevance, understanding semantic meaning beyond keyword matching.

Neural Search

Neural search uses deep learning models throughout the search pipeline to improve query understanding, document retrieval, and result ranking.

N-Gram Tokenizer

An n-gram tokenizer splits text into overlapping sequences of N characters, enabling partial matching, substring search, and handling of languages without word boundaries.

Neural Collaborative Filtering

Neural collaborative filtering (NCF) replaces the dot product in matrix factorization with a neural network, learning non-linear user-item interaction patterns.

nDCG

nDCG (Normalized Discounted Cumulative Gain) is a ranking quality metric that evaluates search results based on relevance grades and position, giving higher weight to top-ranked results.
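
A minimal sketch of the computation (function names illustrative), using graded relevance scores per ranked position:

```python
import math

def dcg(relevances):
    """Discounted cumulative gain: relevance at rank r is discounted by log2(r + 1)."""
    return sum(rel / math.log2(rank + 1)
               for rank, rel in enumerate(relevances, start=1))

def ndcg(relevances):
    """DCG normalized by the DCG of the ideal (descending) ordering."""
    ideal = dcg(sorted(relevances, reverse=True))
    return dcg(relevances) / ideal if ideal > 0 else 0.0

print(ndcg([3, 3, 2, 1, 0]))  # 1.0 when results are already in ideal order
```

Shuffling relevant results down the list lowers nDCG toward 0, which is what makes it position-sensitive.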

Near-Real-Time Search

Near-real-time search makes newly indexed documents searchable within seconds of ingestion, rather than requiring a full index rebuild or manual refresh.

Natural Language to Code

Natural language to code translates human language instructions into executable programming code using large language models.

Narrow AI

Narrow AI refers to AI systems designed for specific tasks like image recognition or language translation; every AI system deployed today falls into this category.

No Free Lunch Theorem

The No Free Lunch theorem states that no single machine learning algorithm is universally best; performance depends on the specific problem and data.

Neuro-Symbolic AI

Neuro-symbolic AI combines neural networks for pattern recognition with symbolic reasoning for logical inference, aiming to unify learning and reasoning.

Neural Architecture Search

Neural architecture search uses automated methods to discover optimal neural network designs, replacing manual architecture engineering.

Neural Scaling Laws

Neural scaling laws describe the predictable mathematical relationships between model size, data, compute, and AI performance.

Next.js

Next.js is a React framework that adds server-side rendering, static generation, API routes, and other production features.

Nuxt

Nuxt is a Vue.js framework that adds server-side rendering, auto-imports, file-based routing, and full-stack development capabilities.

Node.js

Node.js is a JavaScript runtime built on Chrome's V8 engine that enables server-side JavaScript execution for building backend applications.

Netlify

Netlify is a web development platform that provides hosting, CI/CD, serverless functions, and a composable architecture for modern websites.

Native Integration

A native integration is a pre-built, first-party connection between two applications that works out of the box without custom development.

Null Hypothesis

The null hypothesis is the default assumption in statistical testing that there is no significant effect, difference, or relationship in the data.

Network Graph

A network graph visualizes relationships between entities as nodes connected by edges, revealing connection patterns and structures.

Natural Language Querying

Natural language querying allows users to ask data questions in plain English and receive analytical results without writing SQL or code.

O

One-Shot Learning

One-shot learning enables models to learn new concepts from a single example, commonly used in face recognition and image classification tasks.

Online Learning

Online learning updates the model incrementally as each new data point arrives, rather than training on the entire dataset at once.

Oversampling

Oversampling increases the number of minority class examples in a training set by duplicating or generating synthetic examples to address class imbalance.

One-Hot Encoding

One-hot encoding converts categorical variables into binary vectors where each category becomes a separate binary feature with a value of 0 or 1.
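A minimal illustration of the encoding, assuming a fixed, known category list:

```python
def one_hot(value, categories):
    """Encode a categorical value as a binary indicator vector."""
    return [1 if c == value else 0 for c in categories]

colors = ["red", "green", "blue"]
print(one_hot("green", colors))  # [0, 1, 0]
```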

Overfitting

Overfitting occurs when a model learns the training data too well, including noise and random patterns, causing poor performance on new unseen data.

Output Layer

The output layer is the final layer of a neural network that produces the prediction or result, such as class probabilities or a generated value.

Open-Source Model

An open-source model is an AI model whose code, architecture, and often training data are publicly available for anyone to use, modify, and distribute.

Open-Weight Model

An open-weight model is an AI model whose trained parameters are publicly released, allowing anyone to run and fine-tune it without full training transparency.

One-Shot Prompting

A prompting technique that provides the model with exactly one example of the desired input-output format before the actual query.

Over-training

Deliberately training a model on more data than is compute-optimal according to scaling laws, to produce a smaller model that is cheaper to serve at inference time.

o1

OpenAI's reasoning model that uses extended "thinking" before responding, achieving breakthrough performance on math, coding, and science tasks.

o3

OpenAI's advanced reasoning model succeeding o1, with improved reasoning capabilities and efficiency across math, science, and coding tasks.

Open-Source LLM

A language model whose weights and often training code are publicly released, enabling self-hosting, modification, and community development.

Opinion Mining

Opinion mining is the NLP process of extracting and analyzing subjective opinions, attitudes, and evaluations from text data.

Open-Domain QA

Open-domain QA answers questions about any topic by retrieving information from a large, general knowledge source like the web or Wikipedia.

Open-Domain Dialogue

Open-domain dialogue systems engage in free-form conversation on any topic without being limited to specific tasks or domains.

Optical Character Recognition

Optical character recognition converts images of text, such as scanned documents and photos, into machine-readable text.

Open Information Extraction

Open information extraction discovers relationships in text without being limited to predefined relation types or entity categories.

Ontology

A formal specification of concepts, categories, and relationships within a domain, providing a shared vocabulary and structure for organizing knowledge.

OCR

Optical Character Recognition converts images of text into machine-readable text, enabling AI systems to process scanned documents, photos, and handwritten content.

OpenAI Embedding Ada

OpenAI's text-embedding-ada-002, a widely adopted embedding model that produces 1536-dimensional vectors for semantic search and retrieval tasks.

OpenAI Embedding 3 Small

OpenAI's cost-efficient embedding model that produces high-quality vectors with configurable dimensionality from 256 to 1536.

OpenAI Embedding 3 Large

OpenAI's highest-quality embedding model with configurable dimensionality up to 3072, designed for applications requiring maximum retrieval accuracy.

Observation-action Loop

An agent execution pattern that alternates between observing the environment state and taking actions, forming the basic cycle of agent interaction with its environment.

Orchestration

The coordination of multiple components, services, or agents to accomplish a task, managing execution order, data flow, and error handling across the system.

Onboarding Bot

An onboarding bot guides new users through product setup and initial experience using interactive, conversational walkthroughs.

Omnichannel

Omnichannel refers to providing a seamless conversational experience across multiple communication channels with unified context.

Online Indicator

An online indicator is a visual status marker showing whether a chatbot or agent is currently available and responsive.

Offline Message

An offline message is a communication left by a user when a chatbot or agent is unavailable, stored for later review and response.

Out-of-Scope Detection

Out-of-scope detection identifies when a user question falls outside the topics and capabilities the chatbot is designed to handle.

Overage

An overage occurs when chatbot usage exceeds the plan allocation, typically resulting in additional charges or service restrictions.

Outer Alignment

The challenge of specifying a training objective that correctly captures what we want an AI system to do, separate from whether the system learns that objective.

Output Guardrails

Safety mechanisms that check AI-generated responses before they reach users, blocking harmful content, policy violations, and sensitive information leaks.

Ollama

Ollama is a tool for running large language models locally on personal computers, providing a simple interface to download, run, and manage open-source LLMs.

Ollama Infrastructure

Ollama provides local infrastructure for running large language models on personal hardware, with a simple CLI, model management, and an API server for application integration.

Object Detection

Object detection identifies and locates multiple objects within an image, drawing bounding boxes around each detected object and classifying them.

Optical Character Recognition

Optical character recognition (OCR) extracts text from images, scanned documents, and photographs, converting visual text into machine-readable digital text.

Outpainting

Outpainting extends an image beyond its original boundaries, generating new content that seamlessly continues the scene in any direction.

Optical Flow

Optical flow estimates the pattern of apparent motion between consecutive video frames, representing the pixel-level displacement caused by object and camera movement.

Object Detection Metrics

Object detection metrics like mAP, IoU, precision, and recall evaluate how accurately models detect, localize, and classify objects in images.

Open-Vocabulary Detection

Open-vocabulary detection identifies objects from any category described in text, not limited to classes seen during training, using vision-language alignment.

Object Counting

Object counting uses computer vision to automatically count specific objects in images or video, from simple detection-based counting to density estimation approaches.

Object Pose Estimation

Object pose estimation determines the 3D position and orientation of objects in images, enabling robots and AR systems to understand how objects are positioned in space.

Occupancy Network

An occupancy network learns a continuous 3D shape representation by predicting whether any point in space is inside or outside an object's surface.

OpenVoice

OpenVoice is an open-source instant voice cloning model that separates voice style from language content for flexible cross-lingual cloning.

Omnichannel Support

Omnichannel support provides seamless customer service across all channels (chat, email, phone, social media) with consistent context and experience regardless of channel.

On-demand Pricing

On-demand pricing allows businesses to access AI services without commitments or contracts, paying standard rates for immediate access to compute, models, and features.

Optimistic Locking

Optimistic locking is a concurrency control strategy that allows multiple transactions to proceed without locks, detecting conflicts at commit time using version numbers or timestamps.
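A toy sketch of the version-number approach (the `Record` class and `commit` function are hypothetical, standing in for database rows and transactions):

```python
class Record:
    """Toy in-memory record carrying a version counter."""
    def __init__(self, data):
        self.data = data
        self.version = 0

def commit(record, new_data, read_version):
    """Apply the update only if nobody changed the record since we read it."""
    if record.version != read_version:
        raise RuntimeError("write conflict: record modified concurrently")
    record.data = new_data
    record.version += 1

r = Record({"name": "Ada"})
commit(r, {"name": "Ada L."}, read_version=0)  # succeeds, version becomes 1
```

A second writer still holding version 0 would now fail at commit time and have to re-read and retry.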

ORM

An ORM (Object-Relational Mapping) is a programming technique that maps database tables to programming language objects, allowing developers to interact with databases using their native language.

Optimization

Optimization is the mathematical process of finding the best parameters that minimize (or maximize) an objective function, the core mechanism behind training machine learning models.

Objective Function

An objective function is the mathematical function that an optimization algorithm seeks to minimize or maximize, defining the goal of the optimization problem.

Outer Product

The outer product of two vectors produces a matrix where each element is the product of one element from each vector.
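As a quick sketch, for vectors u and v the result has entry (i, j) equal to u[i] * v[j]:

```python
def outer(u, v):
    """Outer product: result[i][j] = u[i] * v[j]."""
    return [[ui * vj for vj in v] for ui in u]

print(outer([1, 2], [3, 4, 5]))  # [[3, 4, 5], [6, 8, 10]]
```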

Orthogonal Matrix

An orthogonal matrix has orthonormal columns, meaning its inverse equals its transpose, and it preserves lengths and angles.
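The defining property can be checked numerically: a 2D rotation matrix is orthogonal, so multiplying its transpose by itself recovers the identity (a stdlib-only sketch, no linear algebra library assumed):

```python
import math

def transpose(A):
    return [list(row) for row in zip(*A)]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

# A 2D rotation matrix is orthogonal: its transpose equals its inverse.
t = math.pi / 4
Q = [[math.cos(t), -math.sin(t)],
     [math.sin(t),  math.cos(t)]]

I = matmul(transpose(Q), Q)  # numerically close to the identity matrix
```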

Optimization Theory

Optimization theory studies methods for finding the best solution from a set of feasible alternatives, forming the mathematical foundation of machine learning training.

OpenAI

OpenAI is an AI research company that created GPT-4, ChatGPT, and DALL-E, leading the development of large language models and setting industry standards for generative AI.

Ollama

Ollama is an open-source tool that makes it easy to run large language models locally on your own computer with a simple command-line interface.

OpenAI API

The OpenAI API provides programmatic access to GPT-4, DALL-E, Whisper, and other OpenAI models for building AI-powered applications.

Otter.ai

Otter.ai is an AI meeting assistant that provides real-time transcription, automated summaries, and action item extraction for meetings and conversations.

ONNX

ONNX (Open Neural Network Exchange) is an open format for representing machine learning models, enabling interoperability between different frameworks and deployment platforms.

ONNX Runtime

ONNX Runtime is a high-performance inference engine for running ONNX models across platforms, developed by Microsoft for optimized AI model deployment.

OpenVINO

OpenVINO is Intel's toolkit for optimizing and deploying AI models on Intel hardware, including CPUs, integrated GPUs, and VPUs for edge inference.

Optuna

Optuna is a hyperparameter optimization framework that uses efficient search algorithms to automatically find the best model hyperparameters.

OpenCV

OpenCV is the most widely used open-source computer vision library, providing tools for image and video processing, object detection, and visual AI applications.

Outlines

Outlines is a library for structured text generation that constrains LLM outputs to follow specific formats like JSON schemas, regex patterns, or grammars.

Ollama

Ollama is a tool for running large language models locally on your machine, providing a simple interface for downloading, running, and managing open-source AI models.

OpenRouter

OpenRouter is a unified API gateway that provides access to hundreds of AI models from multiple providers through a single OpenAI-compatible endpoint.

OpenAI SDK

The OpenAI SDK is the official client library for interacting with OpenAI APIs, providing typed interfaces for chat completions, embeddings, assistants, and other AI capabilities.

Open Interpreter

Open Interpreter is an open-source tool that lets LLMs run code locally on your computer, providing a natural language interface for programming tasks and system operations.

Ophthalmology AI

Ophthalmology AI uses deep learning to analyze retinal images and detect eye diseases like diabetic retinopathy and glaucoma.

Oncology AI

Oncology AI applies artificial intelligence to cancer detection, diagnosis, treatment planning, and drug development.

Ocean AI

Ocean AI uses machine learning to monitor marine ecosystems, optimize shipping, and explore the deep sea.

Operational Risk AI

Operational risk AI uses machine learning to identify, assess, and mitigate risks from internal processes, systems, people, and external events in organizations.

Optical Computing

Optical computing uses light (photons) instead of electrical signals to perform computations, offering potential advantages in speed and energy efficiency for AI workloads.

OpenSearch

OpenSearch is an open-source search and analytics engine derived from Elasticsearch, maintained by Amazon with support for vector search and AI capabilities.

Occam's Razor

In ML, Occam's razor is the principle that simpler models should be preferred over complex ones when they explain the data equally well.

Open Source

Open source in AI refers to publicly released model weights, code, and data that enable anyone to use, modify, and build upon AI systems.

Open Data

Open data in AI refers to publicly available datasets that anyone can access, use, and redistribute for research and development.

Open Model

An open model is an AI model whose weights are publicly released, allowing anyone to use, study, modify, and build upon it.

Open Source AI

Open source AI refers to AI software, models, and tools released under open licenses that allow free use, modification, and distribution.

Open AI Research

Open AI research refers to the practice of publishing findings, sharing code and data, and conducting AI research transparently.

OAuth

OAuth is an authorization framework that allows applications to access user resources from other services without exposing user credentials.

OpenAPI

OpenAPI is a specification standard for describing REST APIs in a machine-readable format, enabling documentation and code generation.

Offset Pagination

Offset pagination uses page numbers and limits to navigate through datasets, allowing direct access to any page.
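The core arithmetic is just a slice, sketched here with an in-memory list:

```python
def paginate(items, page, page_size):
    """Fetch one page using offset = (page - 1) * page_size."""
    offset = (page - 1) * page_size
    return items[offset:offset + page_size]

data = list(range(1, 11))    # ten items
print(paginate(data, 2, 3))  # [4, 5, 6]
```

In SQL the same request maps to `LIMIT 3 OFFSET 3`.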

OAuth 2.0

OAuth 2.0 is the industry-standard authorization framework that enables third-party applications to access user resources without exposing credentials.

Operational Analytics

Operational analytics monitors and optimizes day-to-day business operations using real-time and near-real-time data analysis.

OLAP

OLAP (Online Analytical Processing) enables fast, multidimensional analysis of large datasets through operations like slicing, dicing, and drilling.

Outlier Detection

Outlier detection identifies data points that deviate markedly from the majority of observations in a dataset.

P

Principal Component Analysis

PCA is a linear dimensionality reduction technique that transforms data into a set of orthogonal components ordered by the amount of variance they capture.

Precision

Precision measures the proportion of positive predictions that are actually correct, answering: of all items predicted as positive, how many truly are?
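The formula is true positives divided by all predicted positives, as this small sketch shows:

```python
def precision(y_true, y_pred):
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    predicted_positive = sum(1 for p in y_pred if p == 1)
    return tp / predicted_positive if predicted_positive else 0.0

# 3 positive predictions, 2 of them correct -> precision 2/3
print(precision([1, 0, 1, 1], [1, 1, 0, 1]))
```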

Perceptron

A perceptron is the simplest type of artificial neural network, consisting of a single neuron that performs binary classification by computing a weighted sum of inputs.
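The mechanism fits in a few lines; with hand-picked weights, a single perceptron can compute logical AND:

```python
def perceptron(inputs, weights, bias):
    """Weighted sum plus bias, passed through a step activation."""
    total = sum(x * w for x, w in zip(inputs, weights)) + bias
    return 1 if total > 0 else 0

# Hand-picked weights make the perceptron behave as an AND gate.
for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x, "->", perceptron(x, [1, 1], -1.5))  # only (1, 1) fires
```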

Parameter

A parameter is a learnable value in a neural network, including weights and biases, that is optimized during training to minimize the loss function.

Padding

Padding adds extra values (typically zeros) around the edges of input data before convolution, controlling the output size and preserving spatial information at borders.

Pooling

Pooling is a downsampling operation in CNNs that reduces the spatial dimensions of feature maps by aggregating values in local regions.
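A sketch of 2x2 max pooling on a plain nested list, halving each spatial dimension:

```python
def max_pool_2x2(fmap):
    """Halve each spatial dimension by taking the max of every 2x2 block."""
    return [[max(fmap[i][j], fmap[i][j + 1], fmap[i + 1][j], fmap[i + 1][j + 1])
             for j in range(0, len(fmap[0]), 2)]
            for i in range(0, len(fmap), 2)]

fmap = [[1, 3, 2, 0],
        [4, 2, 1, 5],
        [0, 1, 3, 2],
        [2, 6, 0, 1]]
print(max_pool_2x2(fmap))  # [[4, 5], [6, 3]]
```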

Positional Encoding

Positional encoding adds information about the position of each element in a sequence to its representation, since self-attention has no inherent notion of order.
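One common scheme is the sinusoidal encoding from the original transformer paper, sketched here in stdlib Python:

```python
import math

def positional_encoding(pos, d_model):
    """Sinusoidal position vector: sine on even dims, cosine on odd dims."""
    pe = []
    for i in range(0, d_model, 2):
        angle = pos / (10000 ** (i / d_model))
        pe.append(math.sin(angle))
        pe.append(math.cos(angle))
    return pe[:d_model]

print(positional_encoding(0, 4))  # [0.0, 1.0, 0.0, 1.0]
```

Each position gets a distinct, fixed vector, so the model can distinguish token order without any learned parameters.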

Prompt Engineering

Prompt engineering is the practice of crafting effective instructions and context for AI models to get better, more accurate, and more useful responses.

Proprietary Model

A proprietary model is an AI model whose architecture, weights, and training data are kept private, accessible only through paid APIs or products.

Presence Penalty

Presence penalty is a generation parameter that reduces token probability if that token has appeared at all in the output, regardless of how many times.
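A simplified sketch of the idea, applied to a toy dictionary of token scores (real implementations operate on the model's logit vector):

```python
def apply_presence_penalty(logits, generated, penalty=0.5):
    """Flat penalty for any token already present, however often it appeared."""
    seen = set(generated)
    return {tok: score - penalty if tok in seen else score
            for tok, score in logits.items()}

logits = {"cat": 2.0, "dog": 1.5, "fish": 1.0}
# "cat" appeared twice but is penalized only once.
print(apply_presence_penalty(logits, ["cat", "cat"]))
```

Contrast this with frequency penalty, which scales with how many times a token has appeared.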

Prompt Chaining

Prompt chaining is a technique that breaks complex tasks into sequential steps, where each prompt builds on the output of the previous one.

Prompt Template

A prompt template is a reusable prompt structure with placeholder variables that gets filled with specific data at runtime for consistent AI interactions.
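A minimal sketch using the standard library's `string.Template` (the support-bot wording and variable names are hypothetical):

```python
from string import Template

# Hypothetical support-bot template; $product and $question are filled at runtime.
support_prompt = Template(
    "You are a support assistant for $product.\n"
    "Answer the customer's question: $question"
)

prompt = support_prompt.substitute(
    product="Acme CRM",
    question="How do I export my contacts?",
)
print(prompt)
```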

Prompt Injection

Prompt injection is a security vulnerability where malicious user input overrides system prompt instructions, causing the model to behave unexpectedly.

Prompt Compression

Prompt compression reduces the token count of prompts while preserving essential meaning, fitting more context into limited context windows.

Pre-training

Pre-training is the initial phase of training a language model on vast amounts of text data to learn general language understanding and generation capabilities.

Preference Data

Preference data consists of human comparisons between AI responses, indicating which response is better, used to train reward models and align language models.

PPO

PPO (Proximal Policy Optimization) is a reinforcement learning algorithm commonly used in RLHF to optimize language models based on reward model scores.

Prefix Tuning

Prefix tuning prepends trainable continuous vectors to model input, learning task-specific prefixes that steer the frozen model toward desired behavior.

Prompt Tuning

Prompt tuning learns soft prompt embeddings prepended to model input, optimizing continuous vectors that replace hand-crafted text prompts.

Parameter-Efficient Fine-Tuning

Parameter-efficient fine-tuning (PEFT) encompasses methods that adapt pre-trained models by training only a small fraction of parameters, reducing cost and compute.

Paged Attention

Paged attention manages KV cache memory in non-contiguous blocks inspired by OS virtual memory, dramatically reducing waste and enabling more concurrent requests.

Pad Token

A special token used to fill shorter sequences to a uniform length so that batches of inputs can be processed together efficiently.

Plan-and-Solve

A prompting strategy that instructs the model to first create a step-by-step plan and then execute each step, improving multi-step reasoning accuracy.

Persona Prompting

A prompting technique that assigns a specific identity, expertise, or personality to the model to shape the style and content of its responses.

PEFT

PEFT (Parameter-Efficient Fine-Tuning) is both a concept and a Hugging Face library for fine-tuning LLMs by modifying only a small subset of parameters.

P-Tuning

A parameter-efficient method that prepends learnable continuous embeddings to the input, trained with an LSTM-based prompt encoder for better optimization.

Parameter Count

The total number of trainable weights and biases in a neural network, serving as a primary measure of model capacity and complexity.

Phi-3

Microsoft's family of small language models that achieve strong performance through high-quality training data curation rather than scale.

Prefill

The initial phase of LLM inference where the entire input prompt is processed in parallel to populate the KV cache before token generation begins.

Perplexity

A metric measuring how well a language model predicts text, with lower values indicating better prediction and higher model quality.
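Concretely, perplexity is the exponential of the average negative log-probability the model assigns to each token, as in this sketch:

```python
import math

def perplexity(token_probs):
    """Exponential of the average negative log-probability per token."""
    nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(nll)

# A model that gives every token probability 1/4 has perplexity 4:
print(perplexity([0.25, 0.25, 0.25]))  # ≈ 4.0
```

Intuitively, a perplexity of 4 means the model is as uncertain as if it were choosing uniformly among 4 tokens at each step.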

Positional Encoding

A mechanism that provides sequence order information to the transformer, which otherwise processes tokens without any inherent notion of position.

Prompt Caching

An API-level feature that stores processed prompt prefixes to reduce cost and latency for subsequent requests sharing the same prefix.

Preference Evaluation

Preference evaluation compares model outputs by asking judges to select the preferred response from two or more options.

Pairwise Comparison

Pairwise comparison evaluates models by directly comparing two responses to the same prompt and selecting the better one.

Padding Mask

A padding mask prevents the attention mechanism from attending to padding tokens added to equalize sequence lengths in batched processing.

Pre-Norm Architecture

Pre-norm architecture applies layer normalization before the attention and feed-forward sublayers rather than after, improving training stability.

Post-Norm Architecture

Post-norm architecture applies layer normalization after the attention and feed-forward sublayers, as in the original transformer design.

Parallel Attention

Parallel attention computes the attention and feed-forward sublayers simultaneously rather than sequentially within each transformer block.

Pre-Training Data

Pre-training data is the massive text corpus used to train the base language model, typically containing trillions of tokens from diverse sources.

Part-of-Speech Tagging

Part-of-speech tagging is the NLP task of labeling each word in a sentence with its grammatical role, such as noun, verb, or adjective.

Polarity Detection

Polarity detection is the task of classifying text as expressing positive, negative, or neutral sentiment.

Paraphrasing

Paraphrasing is the NLP task of rewriting text to convey the same meaning using different words and sentence structures.

Parallel Corpus

A parallel corpus is a collection of texts aligned with their translations in another language, used to train machine translation systems.

Post-editing

Post-editing is the process of a human translator reviewing and correcting machine translation output to achieve publication-quality results.

Porter Stemmer

The Porter Stemmer is a widely used algorithmic stemming method that reduces English words to their base stems using a series of suffix-stripping rules.

Plagiarism Detection

Plagiarism detection identifies text that has been copied or closely paraphrased from other sources without proper attribution.

Perplexity

Perplexity is a metric that measures how well a language model predicts a sample of text, with lower values indicating better performance.

Prompt Engineering

Prompt engineering designs and optimizes the instructions given to language models to elicit accurate, useful, and consistent outputs.

Passage Retrieval

Passage retrieval finds the most relevant text passages from a large collection that are likely to contain the answer to a given query.

Parse Tree

A parse tree is a hierarchical representation of the syntactic structure of a sentence according to a grammar.

Phoneme

A phoneme is the smallest unit of sound in a language that can distinguish one word from another.

Passage Ranking

Passage ranking orders text passages within documents by their relevance to a query, enabling precise answer location within long documents.

Pinecone

A fully managed, cloud-native vector database designed for machine learning applications, offering high-performance similarity search at scale.

pgvector

A PostgreSQL extension that adds vector similarity search capabilities, enabling semantic search alongside traditional SQL queries in a familiar database.

Product Quantization

A vector compression technique that divides high-dimensional vectors into subspaces and quantizes each independently, dramatically reducing memory usage.

Paragraph-based Chunking

A chunking strategy that uses paragraph boundaries as natural split points, preserving topical coherence within each chunk.

Parent-child Chunking

A strategy that creates small chunks for precise retrieval but passes their larger parent chunks to the language model for richer context.

Property Graph

A graph data model where both nodes and relationships can have properties (key-value pairs), offering a flexible and intuitive way to model complex domains.

PDF Parser

A tool that extracts text, tables, and structure from PDF documents, converting them into processable format for AI knowledge bases.

Proposition Chunking

A chunking method that breaks text into self-contained factual propositions, each expressing a single complete claim or piece of information.

Pre-Filtering

Applying metadata-based filters before vector similarity search to narrow the candidate set, improving both relevance and search performance.

Post-Filtering

Applying metadata-based filters after vector similarity search to refine results, simpler to implement but potentially less efficient than pre-filtering.

Proactive Agent

An AI agent that anticipates needs and initiates actions without waiting for explicit requests, acting on predictions about what will be helpful.

Planning Agent

An AI agent that creates structured plans for accomplishing complex goals, breaking them into ordered steps before executing them.

Plan-and-execute

An agent pattern that separates planning from execution: first create a complete plan of steps, then execute each step, replanning as needed based on results.

Parallel Tool Calls

The ability of an AI model to generate multiple independent tool calls simultaneously, which are then executed in parallel for faster task completion.

Pipeline

A linear sequence of processing steps where the output of each step feeds into the next, commonly used for data processing and RAG implementations.

Procedural Memory

Agent memory that stores learned procedures, workflows, and skills that the agent has acquired through experience, enabling it to improve at recurring tasks.

Proactive Messaging

Proactive messaging is when a chatbot initiates conversation with users based on their behavior, context, or predefined triggers.

Popup Chat

A popup chat is a chat interface that appears as an overlay on top of the current page content, typically triggered by a user action.

Placeholder Text

Placeholder text is the hint text displayed inside the chat input field that guides users on what to type or ask.

Postback Button

A postback button is a chat button that sends a predefined payload to the bot backend without displaying the payload text to the user.

Product Card

A product card is a rich message element displaying product information including image, name, price, and purchase actions within a chat.

Profanity Detection

Profanity detection identifies offensive, vulgar, or abusive language in user messages for moderation and appropriate handling.

PII Detection

PII detection identifies personally identifiable information in chat messages to protect user privacy and ensure regulatory compliance.

Priority Routing

Priority routing assigns chat conversations to agents based on urgency, customer value, or other priority criteria rather than arrival order.

Popular Topics

Popular topics are the most frequently discussed subjects in chatbot conversations, revealing user interests and common support needs.

Pre-Built Template

A pre-built template is a ready-to-use chatbot configuration that requires only content customization to deploy for a specific business.

Page Targeting

Page targeting displays different chatbot configurations, messages, or behaviors based on which page the visitor is currently viewing.

PDF Bot

A PDF bot is a chatbot specialized in answering questions from uploaded PDF documents, making dense documents conversational.

Privacy by Design

An approach that embeds privacy protections into the design and architecture of AI systems from the beginning, rather than adding them as an afterthought.

Profanity Filter

A content filtering system that detects and blocks profane, vulgar, or inappropriate language in AI inputs and outputs.

Prompt Injection

An attack where malicious instructions are embedded in user inputs to override the AI system's original instructions and manipulate its behavior.

Power-Seeking

The theoretical tendency of goal-directed AI systems to acquire resources, influence, and capabilities beyond what is needed for their assigned task.

Procedural Fairness

Fairness in the process by which AI decisions are made, requiring transparency, consistency, the ability to contest decisions, and human oversight.

Pre-Processing Debiasing

Bias mitigation techniques applied to training data before model training, such as resampling, reweighting, or transforming data to reduce bias.

Post-Processing Debiasing

Bias mitigation techniques applied to model outputs after prediction, adjusting scores or decisions to meet fairness criteria without retraining the model.

Perturbation-Based Explanation

An explainability method that understands model behavior by systematically changing inputs and observing how outputs change.

Partial Dependence Plot

A visualization that shows the marginal effect of one or two features on a model prediction, averaging over the values of all other features.

Privacy Budget

A quantitative limit on how much information about individuals can be extracted from a dataset through repeated queries, measured using the epsilon parameter.

Pipeline Parallelism

Pipeline parallelism splits a neural network into sequential stages across GPUs, with each GPU processing a different layer group and micro-batches flowing through like a pipeline.

Prediction Drift

Prediction drift is the change in the distribution of a model's output predictions over time, which may indicate data drift, concept drift, or model degradation.

Performance Monitoring for ML

Performance monitoring for ML tracks both system-level metrics (latency, throughput, errors) and model-level metrics (accuracy, drift) for deployed AI systems.

Prefect

Prefect is a modern workflow orchestration tool for data and ML pipelines, providing a Pythonic API, dynamic workflows, and built-in observability.

Prompt Management

Prompt management is the practice of versioning, testing, deploying, and monitoring the prompts used in LLM applications, treating them as critical application components.

Pose Estimation

Pose estimation determines the position and orientation of a person's body parts from images or video, reconstructing the skeletal configuration.

Panoptic Segmentation

Panoptic segmentation unifies semantic and instance segmentation, assigning every pixel in an image both a class label and an instance identity.

Point Cloud

A point cloud is a set of 3D data points in space, typically generated by LiDAR sensors or depth cameras, representing the surface geometry of objects and environments.

Photogrammetry

Photogrammetry reconstructs 3D models and measurements from overlapping photographs, using multiple camera viewpoints to triangulate 3D geometry.

Panoptic Driving Perception

Panoptic driving perception combines multiple visual understanding tasks for autonomous driving into a unified framework, processing road scenes holistically.

Pedestrian Detection

Pedestrian detection identifies and localizes people walking in images and video, a critical safety task for autonomous vehicles and surveillance systems.

Panoptic Narrative Grounding

Panoptic narrative grounding links noun phrases in text descriptions to specific segmentation masks in images, connecting language to precise visual regions.

Prosody Control

Prosody control allows fine-grained manipulation of speech rhythm, intonation, stress, and timing in text-to-speech systems.

Pitch Control

Pitch control adjusts the fundamental frequency of synthesized speech, allowing modification of how high or low the voice sounds.

Piper TTS

Piper is a fast, lightweight open-source TTS system designed for edge devices and offline use, supporting over 30 languages.

Pay-per-Token

Pay-per-token is a pricing model for LLM APIs where customers are charged based on the number of tokens (word fragments) processed in their requests and responses.

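The arithmetic behind pay-per-token billing is straightforward; a minimal sketch in Python, using made-up per-1K-token rates (real prices vary by provider and model):

```python
# Hypothetical rates for illustration only -- real prices vary by provider.
PRICE_PER_1K_INPUT = 0.0005   # dollars per 1,000 input tokens (assumed)
PRICE_PER_1K_OUTPUT = 0.0015  # dollars per 1,000 output tokens (assumed)

def request_cost(input_tokens, output_tokens):
    """Cost of one API request under simple per-token pricing."""
    return (input_tokens / 1000 * PRICE_PER_1K_INPUT
            + output_tokens / 1000 * PRICE_PER_1K_OUTPUT)

# A request with a 2,000-token prompt and a 500-token response:
cost = request_cost(2000, 500)  # 0.001 + 0.00075 = 0.00175 dollars
```

Output tokens are typically priced higher than input tokens, which is why long generated responses dominate the bill.
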
Personalization

Personalization uses AI to tailor content, recommendations, and experiences to individual users based on their behavior, preferences, and context.

Product Recommendation

Product recommendation systems use AI to suggest relevant products to users based on their behavior, preferences, purchase history, and similarity to other users.

Predictive Analytics

Predictive analytics uses AI and statistical models to forecast future outcomes from historical data, enabling proactive business decisions.

Predictive Maintenance

Predictive maintenance uses AI and sensor data to forecast equipment failures before they occur, enabling proactive repairs that prevent costly unplanned downtime.

Pay-per-Request

Pay-per-request pricing charges businesses for each individual API call or transaction made to an AI service, regardless of the computational complexity involved.

Predictive Analytics for Business

Predictive analytics for business uses AI and statistical models to forecast future outcomes, enabling data-driven decisions in sales, marketing, operations, and finance.

Process Mining

Process mining uses AI to analyze system event logs and discover actual business processes, identifying bottlenecks, deviations, and automation opportunities.

Proactive Support

Proactive support uses AI to anticipate and address customer issues before they escalate, reaching out with solutions, guidance, or information before the customer contacts support.

Product-Led Growth

Product-led growth is a business strategy where the product itself drives customer acquisition, expansion, and retention through self-serve experiences.

Platform Economy

The platform economy is a business model where value is created by facilitating exchanges between producers and consumers on a shared digital platform.

Proof of Concept

A proof of concept is a small-scale demonstration that validates whether an AI solution can solve a specific business problem before committing to full implementation.

Pilot Program

A pilot program tests an AI solution with a limited group of real users in near-production conditions to validate business impact before full-scale deployment.

Prompt Management

Prompt management is the organizational practice of creating, testing, versioning, and governing the prompts used to instruct AI models across business applications.

Predictive Churn

Predictive churn uses machine learning to identify customers likely to cancel or leave before they actually do, enabling proactive retention interventions.

Price Elasticity

Price elasticity measures how sensitive customer demand is to price changes, with AI helping estimate elasticity accurately from market data and experiments.

PostgreSQL

PostgreSQL is an advanced open-source relational database known for reliability, extensibility, and features like JSONB support and pgvector for AI-powered vector search.

Primary Key

A primary key is a column or set of columns that uniquely identifies each row in a database table, enforcing uniqueness and serving as the main reference point for relationships.

Parquet

Parquet is a columnar storage file format optimized for efficient data storage and retrieval, particularly well-suited for analytical processing of large datasets.

Protocol Buffers

Protocol Buffers (protobuf) is a language-neutral, platform-neutral serialization format developed by Google for efficiently encoding structured data in a compact binary format.

Pandas

Pandas is a Python library providing fast, flexible data structures (DataFrame and Series) for data manipulation and analysis, essential in data science and AI workflows.

PlanetScale

PlanetScale is a serverless MySQL-compatible database platform built on Vitess, offering branching workflows, non-blocking schema changes, and horizontal scalability.

Presto

Presto is an open-source distributed SQL query engine designed for fast, interactive analytics across diverse data sources without moving the data.

Polars

Polars is a high-performance DataFrame library written in Rust that provides significantly faster data manipulation than Pandas through lazy evaluation and parallel execution.

Pandas (Data Engineering)

Pandas in data engineering contexts provides DataFrame-based tools for data loading, cleaning, transformation, and analysis in Python data pipelines.

PySpark

PySpark is the Python API for Apache Spark, enabling distributed data processing and machine learning using familiar Python syntax on large-scale datasets.

pgvector

pgvector is a PostgreSQL extension that adds vector similarity search capabilities, enabling storage and querying of AI embeddings alongside relational data.

Probability

Probability is the mathematical framework for quantifying uncertainty and likelihood, fundamental to machine learning models that make predictions under uncertainty.

Probability Distribution

A probability distribution describes how the probabilities of a random variable are spread across its possible values, defining the likelihood of each possible outcome.

Prior Probability

Prior probability represents the initial belief about the likelihood of a hypothesis before observing new evidence, serving as the starting point for Bayesian inference.

Posterior Probability

Posterior probability is the updated probability of a hypothesis after incorporating new evidence, computed from the prior probability and the likelihood of the observed data.

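The two entries above fit together via Bayes' rule: the posterior is the prior reweighted by the likelihood of the evidence. A small numeric sketch (the diagnostic-test numbers are made up for illustration):

```python
# Bayes' rule: posterior = prior * likelihood / evidence
# Hypothetical test: 99% sensitivity, 5% false-positive rate,
# for a condition with 1% prevalence.
prior = 0.01            # P(condition)
sensitivity = 0.99      # P(positive | condition)
false_positive = 0.05   # P(positive | no condition)

# Total probability of a positive result, over both hypotheses:
evidence = sensitivity * prior + false_positive * (1 - prior)
posterior = sensitivity * prior / evidence  # P(condition | positive)
# posterior is roughly 0.167 -- most positives are false when prevalence is low
```

Even with an accurate test, a low prior keeps the posterior modest, which is why base rates matter so much in practice.
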
Poisson Distribution

The Poisson distribution models the number of events occurring in a fixed interval of time or space, given a known average rate and independent events.

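The Poisson probability mass function has a simple closed form; a minimal sketch:

```python
import math

def poisson_pmf(k: int, lam: float) -> float:
    """P(X = k) = lam^k * e^(-lam) / k! for a Poisson with rate lam."""
    return lam**k * math.exp(-lam) / math.factorial(k)

# With an average of 3 events per interval, the probability
# of observing exactly 2 events in one interval:
p = poisson_pmf(2, 3.0)  # about 0.224
```
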
Perplexity

Perplexity is an evaluation metric for language models that measures how well the model predicts text, with lower perplexity indicating better prediction quality.

P-Value

A p-value is the probability of observing results at least as extreme as the actual results, assuming the null hypothesis is true, used to assess statistical significance.

Pseudo-Inverse

The pseudo-inverse (Moore-Penrose inverse) generalizes the matrix inverse to non-square and singular matrices, enabling least-squares solutions.

Positive Definite Matrix

A positive definite matrix has all positive eigenvalues, ensuring that the quadratic form it defines always yields positive values.

Probability Density Function

A probability density function (PDF) describes the relative likelihood of a continuous random variable taking a given value.

Perplexity (Mathematics)

Perplexity is the exponentiation of cross-entropy, representing the effective number of equally likely choices a model considers at each prediction step.

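This relationship can be sketched in a few lines: a model that assigns every token probability 1/4 has perplexity 4, i.e., it is as uncertain as a uniform choice among four options.

```python
import math

def perplexity(token_probs):
    """Perplexity = exp(average negative log-likelihood) over the tokens."""
    nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(nll)

# Uniform probability 1/4 per token gives perplexity exactly 4:
ppl = perplexity([0.25] * 10)
```
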
Partial Derivative

A partial derivative measures how a multi-variable function changes with respect to one variable while holding all others constant.

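Numerically, a partial derivative can be estimated by nudging just the one variable of interest; a minimal central-difference sketch:

```python
def partial_derivative(f, point, i, h=1e-6):
    """Central-difference estimate of df/dx_i at the given point."""
    up = list(point); up[i] += h
    down = list(point); down[i] -= h
    return (f(up) - f(down)) / (2 * h)

# f(x, y) = x^2 * y, so df/dx = 2xy and df/dy = x^2.
f = lambda v: v[0] ** 2 * v[1]
dx = partial_derivative(f, [3.0, 2.0], 0)  # close to 2*3*2 = 12
dy = partial_derivative(f, [3.0, 2.0], 1)  # close to 3^2 = 9
```
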
Perplexity AI

Perplexity AI is an AI-powered search engine that provides direct answers with cited sources, combining large language models with real-time web search.

Pinecone

Pinecone is a managed vector database service designed for AI applications, providing fast and scalable similarity search for embedding vectors without infrastructure management.

Pika

Pika is an AI video generation platform that allows users to create and edit videos using text prompts, images, and AI-powered creative tools.

Paperspace

Paperspace (by DigitalOcean) provides GPU cloud computing and Gradient, a managed MLOps platform for building and deploying machine learning models.

PyTorch

PyTorch is one of the most popular open-source deep learning frameworks, developed by Meta AI, known for its dynamic computation graphs and Pythonic design that makes AI research intuitive.

pandas

pandas is the standard Python library for data manipulation and analysis, providing DataFrame and Series data structures for working with structured tabular data.

Polars

Polars is a high-performance DataFrame library written in Rust that provides significantly faster data manipulation than pandas through lazy evaluation and parallel execution.

plotly

Plotly is a Python library for creating interactive, web-based visualizations that support zooming, hovering, and dynamic updates for data exploration and dashboards.

PaddlePaddle

PaddlePaddle is an open-source deep learning framework developed by Baidu, widely used in China for industrial AI applications and research.

PyTorch Lightning

PyTorch Lightning is a lightweight wrapper around PyTorch that organizes code and automates training boilerplate, making deep learning experiments reproducible and scalable.

Pillow

Pillow is the standard Python library for image processing, providing tools for opening, manipulating, and saving images in many formats.

Panel

Panel is a Python library for building interactive dashboards and data applications from notebooks or scripts, supporting multiple plotting libraries and widget types.

PEFT

PEFT (Parameter-Efficient Fine-Tuning) is a Hugging Face library implementing techniques like LoRA and adapters that fine-tune large models by updating only a small subset of parameters.

Phoenix

Phoenix is an open-source observability tool by Arize for tracing, evaluating, and debugging LLM applications with support for OpenTelemetry-based instrumentation.

Prodigy

Prodigy is a commercial annotation tool by Explosion (creators of spaCy) designed for efficient data labeling with active learning and a streamlined annotation workflow.

Prefect

Prefect is a modern workflow orchestration framework for Python that provides dynamic, code-first pipeline definition with automatic retries, caching, and observability.

Pinecone

Pinecone is a fully managed vector database service providing serverless vector search with automatic scaling, high availability, and enterprise-grade reliability.

PromptFlow

PromptFlow is a Microsoft tool for building, testing, and deploying LLM application workflows with a visual DAG editor and integration with Azure AI.

Plagiarism Detection

AI plagiarism detection identifies copied, paraphrased, or AI-generated content in academic and professional writing through text comparison and analysis.

Product Recommendation

AI product recommendation systems analyze user behavior and preferences to suggest relevant products, driving personalization and increased sales.

Price Optimization

AI price optimization uses machine learning to set optimal product prices based on demand, competition, costs, and customer willingness to pay.

Predictive Maintenance

Predictive maintenance uses AI to analyze sensor data and predict equipment failures before they occur, preventing unplanned downtime.

Precision Agriculture

Precision agriculture uses AI, GPS, sensors, and data analytics to manage farm fields at a granular level, optimizing inputs and maximizing yields.

Pathology AI

Pathology AI uses deep learning to analyze digitized tissue samples and assist pathologists in diagnosing diseases.

Protein Folding

AI-based protein folding predicts the three-dimensional structure of proteins from their amino acid sequences.

Patient Summary

AI patient summary systems automatically generate concise clinical summaries from complex medical records.

Portfolio Optimization

AI portfolio optimization uses machine learning to construct and rebalance investment portfolios for optimal risk-adjusted returns.

Patent Analysis

AI patent analysis uses NLP and machine learning to search, classify, and analyze patent documents for innovation intelligence.

Personalized Learning

AI personalized learning tailors educational content, pacing, and assessments to individual student needs and learning styles.

Product Categorization

AI product categorization automatically classifies products into taxonomy categories using text, images, and attribute data.

Process Optimization

AI process optimization uses machine learning to improve manufacturing and industrial processes for better efficiency and quality.

Pharmaceutical AI

Pharmaceutical AI uses machine learning to accelerate drug development, optimize manufacturing, and improve regulatory compliance.

Public Safety AI

Public safety AI uses machine learning to enhance emergency services, crime prevention, and community protection.

PropTech AI

PropTech AI applies machine learning to property technology for smarter building management, investment, and occupant experience.

Pricing AI

Pricing AI uses machine learning to optimize pricing strategies across products, services, and markets.

Precision Medicine

Precision medicine uses AI to tailor medical treatments to individual patients based on their genetic profile, biomarkers, lifestyle, and environmental factors.

Population Health AI

Population health AI analyzes health data across large groups to identify risk factors, predict disease outbreaks, and optimize public health interventions.

Pathology Screening AI

Pathology screening AI uses computer vision to analyze tissue samples and pathology slides, detecting cancer and other diseases with high accuracy.

Parallel Computing

Parallel computing performs many calculations simultaneously, the fundamental principle behind GPU-accelerated AI training and inference.

Photonic Computing

Photonic computing uses integrated photonic circuits to process data with light, enabling ultra-fast and energy-efficient AI computations.

PCIe

PCI Express (PCIe) is the standard high-speed interface connecting GPUs and other accelerators to the CPU and system memory in servers and workstations.

Power Usage Effectiveness

Power Usage Effectiveness (PUE) is a metric measuring data center energy efficiency, calculated as total facility power divided by IT equipment power.

Process Node

A process node (e.g., 5nm, 4nm, 3nm) refers to the semiconductor manufacturing technology used to fabricate AI chips, with smaller nodes enabling more transistors and better efficiency.

PageRank

PageRank is Google's foundational algorithm that ranks web pages by analyzing the link structure of the web to measure page importance and authority.

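The core of PageRank is a damped power iteration over the link graph; a minimal sketch (the three-page graph is purely illustrative):

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank over a dict {page: [outbound links]}."""
    pages = list(links)
    rank = {p: 1 / len(pages) for p in pages}
    for _ in range(iterations):
        # Each page keeps a baseline (1 - d)/N, plus damped shares
        # of rank flowing in along inbound links.
        new = {p: (1 - damping) / len(pages) for p in pages}
        for p, outs in links.items():
            share = rank[p] / len(outs) if outs else 0
            for q in outs:
                new[q] += damping * share
        rank = new
    return rank

# Page "a" is linked by both other pages, so it ranks highest:
graph = {"a": ["b"], "b": ["a"], "c": ["a"]}
ranks = pagerank(graph)
```

Real search ranking combines link-based authority like this with many other relevance signals.
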
Phrase Search

Phrase search finds documents containing an exact sequence of words in the specified order, typically indicated by enclosing the phrase in quotation marks.

Proximity Search

Proximity search finds documents where specified terms appear within a certain distance of each other, balancing between exact phrase matching and independent keyword search.

Pointwise Ranking

Pointwise ranking is a learning-to-rank approach that independently scores each document for relevance, treating ranking as a regression or classification problem on individual items.

Pairwise Ranking

Pairwise ranking is a learning-to-rank approach that trains models to correctly order pairs of documents, optimizing for relative relevance rather than absolute scores.

Posting List

A posting list is the list of document identifiers (and optionally positions and frequencies) associated with a term in an inverted index.

Passage Retrieval

Passage retrieval finds and returns specific text passages within documents that are most relevant to a query, rather than returning entire documents.

Popularity Bias

Popularity bias is the tendency of recommendation systems to disproportionately recommend popular items, reducing exposure for niche or long-tail content.

Photo Editing AI

AI photo editing uses deep learning to enhance, modify, and transform photographs through capabilities like object removal, style transfer, and enhancement.

Poetry Generation

Poetry generation uses AI to compose poems in various forms, styles, and traditions, from free verse to structured formats like sonnets and haiku.

Product Description Generation

Product description generation uses AI to create compelling product listings, specifications, and marketing copy for e-commerce and retail.

Product Visualization

Product visualization uses AI to generate photorealistic images and 3D views of products for e-commerce, marketing, and product development.

Podcast Generation

Podcast generation uses AI to create audio podcast content including scripts, voice narration, conversations, and post-production from text or topics.

Program Synthesis

Program synthesis automatically generates correct programs from formal specifications, examples, or constraints using AI and formal methods.

Prototype Generation

Prototype generation uses AI to rapidly create functional prototypes of applications, products, or designs from descriptions and specifications.

Peer Review

Peer review in AI is the process where submitted research papers are evaluated by expert reviewers before acceptance at conferences or journals.

Preprint

A preprint is a research paper shared publicly before formal peer review, allowing rapid dissemination of findings.

Policy Gradient

Policy gradient methods optimize AI agent behavior by directly computing gradients of expected reward with respect to policy parameters.

POST

POST is an HTTP method used to submit data to a server, typically to create new resources or trigger actions.

PUT

PUT is an HTTP method used to update or replace a resource at a specific URL with the provided data.

Pagination

Pagination is the practice of dividing large datasets into smaller pages, allowing APIs to return results in manageable chunks.

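The mechanics are simple offset arithmetic; a minimal sketch of offset-based pagination (field names are illustrative, not any particular API's schema):

```python
def paginate(items, page, page_size):
    """Return one page of results plus simple pagination metadata."""
    start = (page - 1) * page_size
    total_pages = -(-len(items) // page_size)  # ceiling division
    return {
        "page": page,
        "total_pages": total_pages,
        "results": items[start:start + page_size],
    }

data = list(range(1, 26))                      # 25 items
first = paginate(data, page=1, page_size=10)   # items 1..10
last = paginate(data, page=3, page_size=10)    # items 21..25
```

Cursor-based pagination is a common alternative that stays stable when the underlying data changes between requests.
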
Push Notification

A push notification is a message sent from a server to a user device proactively, without the user having to request or check for updates.

Pub/Sub

Pub/Sub (Publish/Subscribe) is a messaging pattern where senders publish messages to topics and receivers subscribe to receive them.

POST Request

A POST request is an HTTP method used to submit data to a server to create a new resource or trigger a process.

PUT Request

A PUT request is an HTTP method used to replace an entire resource at a specific URL with the provided data.

PATCH Request

A PATCH request is an HTTP method used to apply partial modifications to a resource without replacing it entirely.

Path Parameter

A path parameter is a variable segment within a URL path that identifies a specific resource, such as a user ID or document slug.

Progressive Web App

A Progressive Web App (PWA) is a web application that uses modern browser features to deliver an app-like experience with offline support and installation.

Predictive Analytics

Predictive analytics uses statistical models and machine learning to forecast future outcomes based on historical data patterns.

Prescriptive Analytics

Prescriptive analytics recommends specific actions to optimize outcomes, using simulation, optimization, and decision models.

Power BI

Power BI is a Microsoft business intelligence platform for creating interactive reports and dashboards with deep Office 365 integration.

P-value

A p-value is the probability of observing results at least as extreme as the data if the null hypothesis were true, quantifying the strength of evidence against it.

Product Analytics

Product analytics tracks and analyzes how users interact with a product to inform feature development, improve retention, and drive growth.

People Analytics

People analytics applies data analysis to human resources data to improve workforce decisions around hiring, retention, and employee experience.

Pie Chart

A pie chart displays data as proportional slices of a circular graphic, showing how parts relate to a whole.

Posterior Distribution

The posterior distribution represents updated beliefs about a parameter after combining prior knowledge with observed data.

Permutation Test

A permutation test assesses statistical significance by comparing observed results to the distribution generated by randomly shuffling group labels.

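A minimal sketch of a permutation test for a difference in group means, using random shuffles rather than full enumeration (the sample values are made up):

```python
import random

def permutation_test(group_a, group_b, n_permutations=2000, seed=0):
    """Approximate p-value for the absolute difference in group means."""
    rng = random.Random(seed)
    observed = abs(sum(group_a) / len(group_a) - sum(group_b) / len(group_b))
    pooled = group_a + group_b
    extreme = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)                 # randomly reassign group labels
        a = pooled[:len(group_a)]
        b = pooled[len(group_a):]
        if abs(sum(a) / len(a) - sum(b) / len(b)) >= observed:
            extreme += 1
    return extreme / n_permutations

# Clearly separated groups should yield a small p-value:
p = permutation_test([10, 11, 12, 13], [20, 21, 22, 23])
```

Because it makes no distributional assumptions, the permutation test is a useful baseline when parametric test assumptions are doubtful.
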
Power Analysis

Power analysis determines the probability that a statistical test will detect a true effect of a specified size given sample size and significance level.

Predictive Modeling

Predictive modeling builds statistical or machine learning models that forecast future outcomes based on historical data patterns.

Perceptron

The perceptron is an early neural network model invented by Frank Rosenblatt in 1958, capable of learning linear decision boundaries.

Q

Quantization

Quantization reduces the precision of neural network weights and activations from 32-bit or 16-bit floating point to lower-bit representations, reducing memory and accelerating inference.

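A minimal sketch of one common scheme, symmetric absmax int8 quantization: a single scale maps floats into the integer range [-127, 127], and multiplying back recovers an approximation of the original weights.

```python
def quantize_int8(weights):
    """Symmetric int8 quantization with a single absmax scale."""
    scale = max(abs(w) for w in weights) / 127
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Map int8 values back to approximate float weights."""
    return [v * scale for v in q]

w = [0.12, -0.50, 0.33, 0.01]
q, scale = quantize_int8(w)
restored = dequantize(q, scale)  # close to w, within one quantization step
```

Production systems typically quantize per-channel or per-group and may use 4-bit or mixed-precision formats, but the scale-and-round idea is the same.
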
QLoRA

QLoRA combines quantization with LoRA, enabling fine-tuning of large models on a single consumer GPU by using 4-bit quantized base weights.

Qwen 2

Alibaba's second-generation multilingual LLM family, offering competitive performance across multiple sizes with strong support for Chinese and English.

Quantization

The process of reducing the numerical precision of model weights from higher-bit formats to lower-bit formats, reducing memory usage and speeding up inference.

QKV Projection

QKV projections are the learned linear transformations that produce query, key, and value vectors from input embeddings for attention computation.

Quality Filtering

Quality filtering uses heuristics and classifiers to score and select high-quality text for language model training.

Question Answering

Question answering is the NLP task of automatically generating answers to questions posed in natural language.

Query-Focused Summarization

Query-focused summarization generates summaries tailored to answer a specific question or address a particular information need.

Question Generation

Question generation automatically creates questions from a given text or passage, useful for education, testing, and data augmentation.

Qdrant

An open-source vector database written in Rust, offering high-performance similarity search with advanced filtering and payload storage capabilities.

Query Understanding

The process of analyzing a user's search query to determine its intent, entities, and information needs before executing retrieval.

Query Decomposition

Breaking a complex question into simpler sub-questions that can each be answered independently, then combining the answers for a comprehensive response.

Query Rewriting

Transforming a user's original query into a more effective search query by clarifying intent, expanding terms, or incorporating conversation context.

Query Expansion

Enriching a search query by adding synonyms, related terms, or alternative phrasings to improve recall by matching more potentially relevant documents.

Query Classification

The process of categorizing incoming queries by intent, type, or topic to route them to the most appropriate retrieval strategy or data source.

Query Routing

Directing queries to different retrieval strategies, knowledge sources, or processing pipelines based on query characteristics and classification.

Quick Reply

Quick replies are predefined clickable button options presented to users for fast, guided responses in chatbot conversations.

QR Code Chat

QR code chat uses scannable QR codes to instantly connect users to a chatbot conversation on their mobile device.

Queue Management

Queue management controls how incoming chat conversations are organized and prioritized while waiting for human agent availability.

Queue Management

Queue management organizes and prioritizes incoming customer requests, using AI to optimize wait times, agent allocation, and service level compliance.

Query Optimization

Query optimization is the process of improving SQL query performance through better query structure, indexing strategies, and understanding of the database query planner.

QR Decomposition

QR decomposition factorizes a matrix into an orthogonal matrix Q and an upper triangular matrix R, used for solving linear systems and computing eigenvalues.

Quadratic Programming

Quadratic programming optimizes a quadratic objective function subject to linear constraints, directly underlying support vector machines.

Qdrant

Qdrant is an open-source vector database built in Rust, optimized for high-performance similarity search and AI application workloads.

Qualcomm AI

Qualcomm AI develops on-device AI capabilities for mobile and edge devices through its Snapdragon processors and AI Engine technology.

Qdrant

Qdrant is a vector similarity search engine written in Rust, providing fast and scalable vector search with advanced filtering and payload management.

Quality Inspection

AI quality inspection uses computer vision to automatically detect defects, measure dimensions, and ensure product quality on manufacturing lines.

Quantum AI

Quantum AI explores the intersection of quantum computing and artificial intelligence for solving computationally intensive problems.

Quantum Computing

Quantum computing uses quantum mechanical phenomena like superposition and entanglement to solve certain problems exponentially faster than classical computers.

Quantum Machine Learning

Quantum machine learning combines quantum computing with machine learning algorithms, exploring potential speedups for training, optimization, and feature mapping.

Qualcomm AI

Qualcomm AI encompasses the AI processing capabilities in Qualcomm Snapdragon chips, enabling on-device AI for smartphones, PCs, automotive, and IoT applications.

Quantum Advantage

Quantum advantage is the demonstrated ability of a quantum computer to solve a problem faster or more efficiently than any classical computer, a milestone for quantum computing.

Query

A search query is the text or expression a user submits to a search system to find relevant information, documents, or answers.

Query Understanding

Query understanding is the process of interpreting a search query to determine user intent, extract entities, and transform the query for better retrieval.

Query Parsing

Query parsing is the process of analyzing and breaking down a search query into structured components like keywords, operators, phrases, and filters.

Query Expansion

Query expansion automatically adds related terms, synonyms, or contextual words to a search query to improve recall and find more relevant results.

Query Suggestion

Query suggestion recommends alternative or refined search queries to users based on popular searches, related topics, and query patterns.

Query-Document Relevance

Query-document relevance measures the degree to which a document satisfies the information need expressed by a search query, forming the basis of search ranking.

Query Rewriting

Query rewriting automatically transforms user queries into more effective search queries by correcting errors, expanding terms, and reformulating for better retrieval.

Query Cache

A query cache stores the results of frequently executed search queries, enabling instant responses for repeated queries without re-executing the search.

Query Parameter

A query parameter is a key-value pair appended to a URL after the question mark to filter, sort, or paginate API results.

R

Reinforcement Learning

Reinforcement learning trains AI agents to make sequential decisions by rewarding desired behaviors and penalizing undesired ones through interaction with an environment.

Regression

Regression is a supervised learning task where the model predicts a continuous numerical value, such as price, temperature, or probability.

Random Forest

Random forest is an ensemble method that combines predictions from many decision trees trained on random subsets of data and features for more accurate, robust predictions.

Regularization

Regularization adds constraints or penalties to model training to prevent overfitting, encouraging simpler models that generalize better to new data.

Recall

Recall measures the proportion of actual positive cases that the model correctly identifies, answering: of all actual positives, how many did the model find?

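A minimal sketch of the computation, with labels encoded as 1 (positive) and 0 (negative):

```python
def recall(y_true, y_pred):
    """True positives divided by all actual positives."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    return tp / sum(y_true)

# The model finds 3 of the 4 actual positives:
r = recall([1, 1, 1, 1, 0, 0], [1, 1, 1, 0, 1, 0])  # 0.75
```

Note that the false positive in the example does not affect recall; that error is captured by precision instead.
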
R-Squared

R-squared measures the proportion of variance in the target variable that is explained by the model, indicating how well the model fits the data.

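A minimal sketch of the formula, one minus the ratio of residual to total sum of squares (the sample values are made up):

```python
def r_squared(y_true, y_pred):
    """R^2 = 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot

r2 = r_squared([3, 5, 7, 9], [2.8, 5.1, 7.2, 8.9])  # close to 1: a good fit
```

An R² of 1 means the predictions explain all the variance; a model that always predicts the mean scores 0, and worse models can go negative.
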
ReLU

ReLU (Rectified Linear Unit) is an activation function that outputs the input directly if positive and zero otherwise, widely used for its simplicity and training efficiency.

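The function itself is one line; a minimal sketch:

```python
def relu(x):
    """ReLU: identity for positive inputs, zero otherwise."""
    return max(0.0, x)

# Negative inputs are clipped to zero; positives pass through unchanged:
outputs = [relu(x) for x in [-2.0, -0.5, 0.0, 0.5, 2.0]]
```
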
Receptive Field

The receptive field is the region of the input that influences a particular neuron in a CNN, growing larger in deeper layers as features are combined.

ResNet

ResNet (Residual Network) is a deep CNN architecture that uses skip connections to enable training of very deep networks by allowing gradients to flow through shortcut paths.

Recurrent Neural Network

A recurrent neural network (RNN) is a neural network designed for sequential data, maintaining a hidden state that captures information from previous time steps.

Residual Connection

A residual connection adds the input of a layer directly to its output, creating a shortcut path that helps gradients flow through deep networks.

Rotary Position Embedding

Rotary position embedding (RoPE) encodes position information by rotating query and key vectors in pairs of dimensions, enabling relative position awareness.

RMS Normalization

RMS normalization normalizes activations by dividing by the root mean square of the values, omitting the mean subtraction step for improved computational efficiency.

ResNet-50

ResNet-50 is a 50-layer deep residual network that uses skip connections to enable training of very deep networks without degradation.

RWKV

RWKV combines the parallelizable training of transformers with the efficient recurrent inference of RNNs, achieving linear complexity.

Reasoning Model

A reasoning model is an AI model designed to solve complex problems through step-by-step logical reasoning, often using chain-of-thought techniques.

Repetition Penalty

Repetition penalty is a generation parameter that reduces the probability of tokens that have already appeared, preventing the model from repeating itself.

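A minimal sketch of one common formulation (dividing positive logits and multiplying negative ones for already-seen tokens, as popularized by the CTRL paper); the logits dict and token IDs are illustrative:

```python
def apply_repetition_penalty(logits, generated_ids, penalty=1.2):
    """Lower the scores of tokens that have already been generated."""
    adjusted = dict(logits)
    for token_id in set(generated_ids):
        score = adjusted[token_id]
        # Dividing a positive score or multiplying a negative one
        # both push the token's probability down.
        adjusted[token_id] = score / penalty if score > 0 else score * penalty
    return adjusted

logits = {0: 2.0, 1: 1.0, 2: -1.0}
out = apply_repetition_penalty(logits, generated_ids=[0, 2])
# token 0: 2.0 -> 2.0/1.2; token 2: -1.0 -> -1.2; token 1 unchanged
```

A penalty of 1.0 disables the effect; values much above ~1.3 can make text disjointed by over-suppressing common words.
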
ReAct Prompting

ReAct is a prompting framework that interleaves reasoning and acting, allowing language models to think about what to do and then take actions like tool use.

Role Prompting

Role prompting assigns a specific persona or expertise to a language model, causing it to respond as if it were that character or specialist.

RLHF

RLHF (Reinforcement Learning from Human Feedback) is a training technique that aligns AI models with human preferences using feedback from human evaluators.

Reward Model

A reward model is a neural network trained to predict human preferences, scoring language model outputs to guide alignment training via RLHF.

RLAIF

RLAIF (Reinforcement Learning from AI Feedback) replaces human evaluators with AI models to generate preference data for alignment training.

Retrieval-Augmented Generation

Retrieval-Augmented Generation (RAG) combines information retrieval with text generation, letting AI answer from external knowledge rather than just training data.

Reflexion

A prompting framework where the model reflects on its own outputs, identifies errors, and uses that self-feedback to improve subsequent attempts.

Reward Hacking

When an AI model learns to exploit flaws in the reward signal to achieve high scores without actually performing the intended task well.

RoPE Scaling

Techniques for extending the context length of models using Rotary Position Embeddings by modifying the frequency or interpolation of position encodings.

Red Teaming

The practice of adversarially testing an AI model to discover vulnerabilities, biases, and failure modes before deployment.

Rate Limiting

Restrictions on how many API requests or tokens can be processed within a given time window, protecting infrastructure and ensuring fair usage.

Reranking

A second-stage retrieval step that uses a more accurate model to reorder initially retrieved documents, improving the quality of context for RAG.

RoPE

RoPE (Rotary Position Embedding) is a position encoding method that uses rotation matrices to encode token positions in the attention mechanism.

Rotary Position Embedding

Rotary Position Embedding is the full name for RoPE, a technique that encodes sequence positions as vector rotations in transformer attention.

RMSNorm

RMSNorm is a simplified layer normalization that uses only root mean square statistics, providing faster computation with comparable quality.

RedPajama

RedPajama is an open-source pre-training dataset replicating the data recipe of the original Llama model with publicly available sources.

RefinedWeb

RefinedWeb is a high-quality web dataset demonstrating that properly filtered web data alone can match curated multi-source datasets for LLM training.

Relation Extraction

Relation extraction is the NLP task of identifying and classifying semantic relationships between entities mentioned in text.

Reading Comprehension

Reading comprehension is the NLP task of answering questions about a given text passage, testing whether the model understands the content.

Response Generation

Response generation is the task of producing appropriate natural language replies in a conversational context.

Response Ranking

Response ranking scores and orders candidate responses to select the most appropriate reply for a given conversational context.

Readability Assessment

Readability assessment measures how easy or difficult a text is to read and understand for a target audience.

ROUGE Score

ROUGE is a set of evaluation metrics that measures text summarization quality by comparing overlap between generated and reference summaries.
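
A simplified sketch of the unigram-overlap F1 at the core of ROUGE-1 (stemming and other preprocessing from the full metric are omitted):

```python
def rouge_1_f1(candidate, reference):
    # Count overlapping unigrams, then combine precision and recall into F1.
    cand, ref = candidate.split(), reference.split()
    overlap = sum(min(cand.count(w), ref.count(w)) for w in set(cand))
    if overlap == 0:
        return 0.0
    precision = overlap / len(cand)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)
```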

Regular Expressions in NLP

Regular expressions are pattern-matching tools used in NLP for text search, extraction, validation, and preprocessing.
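
Two typical uses, extraction and whitespace cleanup, sketched with Python's `re` module (the email pattern is deliberately simplified):

```python
import re

text = "Contact us at help@example.com   or  sales@example.com."

# Extraction: pull out email addresses (simplified pattern).
emails = re.findall(r"[\w.+-]+@[\w-]+\.\w+", text)

# Preprocessing: collapse runs of whitespace to a single space.
cleaned = re.sub(r"\s+", " ", text)
```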

Repetition Penalty

Repetition penalty reduces the likelihood of generating tokens that have already appeared, preventing repetitive text output.

Relation Classification

Relation classification determines the type of semantic relationship between two entities mentioned in text.

Reranking

Reranking reorders an initial set of retrieved documents or candidates using a more powerful model to improve result relevance.

Retrieval-Augmented Generation

Retrieval-augmented generation combines document retrieval with language model generation to produce responses grounded in specific knowledge sources.

Rhetorical Structure Theory

Rhetorical Structure Theory (RST) is a framework for describing the organization of text through hierarchical rhetorical relations between text spans.

Readability Formula

A readability formula is a mathematical equation that estimates text difficulty using surface features like word length, sentence length, and syllable count.

Relation Detection

Relation detection identifies whether a semantic relationship exists between two entities mentioned in text and classifies the relationship type.

RAG

Retrieval Augmented Generation (RAG) is a technique that enhances AI responses by retrieving relevant information from a knowledge base before generating an answer.

Recursive RAG

A RAG approach that recursively retrieves and processes information, using results from one retrieval step to inform the next until sufficient context is gathered.

REPLUG

A retrieval-augmented language model that treats the retriever as a pluggable module and trains it alongside the language model for better end-to-end performance.

RETRO

Retrieval-Enhanced Transformer is a model architecture that interleaves retrieval into the transformer layers, retrieving during both training and inference.

RAG Fusion

A technique that generates multiple search queries from a single user question, retrieves results for each, and uses reciprocal rank fusion to combine and re-rank them.

Recursive Character Text Splitting

A LangChain chunking method that recursively splits text by trying different separators in order of preference, from paragraphs down to individual characters.

Re-ranking

A retrieval optimization that applies a more accurate but slower model to re-score and reorder initial search results, improving the final ranking quality.

Reciprocal Rank Fusion

A technique for combining ranked lists from multiple retrieval methods by scoring each document based on its position across all lists.
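
A minimal implementation of the scoring rule:

```python
def reciprocal_rank_fusion(rankings, k=60):
    # Each document scores sum(1 / (k + rank)) over the lists it appears in;
    # k = 60 is the constant commonly used from the original RRF paper.
    scores = {}
    for ranking in rankings:
        for rank, doc in enumerate(ranking, start=1):
            scores[doc] = scores.get(doc, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)
```

Documents that rank highly in several lists rise to the top even if no single list put them first.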

Retrieve-and-rerank

A two-stage search pattern that first retrieves candidates using fast methods, then re-orders them with a more accurate model for better final results.

RDF

Resource Description Framework is a W3C standard for representing knowledge as triples, providing a common format for describing entities and relationships on the web.

RAG Evaluation

The process of measuring how well a RAG system retrieves relevant content and generates accurate, faithful answers from the retrieved context.

RAGAS

An open-source framework for evaluating RAG pipelines, providing automated metrics for faithfulness, answer relevancy, context precision, and context recall.

Random Projection

A dimensionality reduction technique that projects high-dimensional vectors into a lower-dimensional space using random matrices while approximately preserving distances.
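
A minimal Gaussian random projection sketch (real libraries use optimized matrix routines, but the idea is just a random matrix multiply):

```python
import random

def random_projection(vectors, out_dim, seed=0):
    # Multiply by a random Gaussian matrix scaled by 1/sqrt(out_dim);
    # pairwise distances are approximately preserved (Johnson-Lindenstrauss).
    rng = random.Random(seed)
    in_dim = len(vectors[0])
    matrix = [[rng.gauss(0.0, 1.0) / out_dim ** 0.5 for _ in range(out_dim)]
              for _ in range(in_dim)]
    return [[sum(v[i] * matrix[i][j] for i in range(in_dim))
             for j in range(out_dim)] for v in vectors]
```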

Recursive Text Splitting

A chunking strategy that recursively divides text using a hierarchy of separators, trying larger natural boundaries before falling back to smaller ones.
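
A bare-bones sketch of the fallback logic (production splitters also merge small neighboring chunks, which is omitted here):

```python
def recursive_split(text, max_len, separators=("\n\n", "\n", " ", "")):
    # Try the largest natural boundary first, falling back to smaller ones.
    if len(text) <= max_len:
        return [text] if text else []
    sep, rest = separators[0], separators[1:]
    if sep == "":
        # Last resort: hard character-level split.
        return [text[i:i + max_len] for i in range(0, len(text), max_len)]
    chunks = []
    for part in text.split(sep):
        chunks.extend(recursive_split(part, max_len, rest))
    return chunks
```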

Reactive Agent

An AI agent that responds directly to current inputs without maintaining internal state or planning ahead, acting based on immediate stimulus-response patterns.

Research Agent

An AI agent that autonomously gathers, analyzes, and synthesizes information from multiple sources to produce comprehensive research outputs.

Retrieval Agent

An AI agent that specializes in finding and retrieving relevant information from knowledge bases, documents, and other data sources to answer user queries.

Rasa

An open-source machine learning framework for building conversational AI assistants with custom NLU, dialogue management, and integration capabilities.

ReAct

Reasoning and Acting is an agent pattern where the model alternates between thinking through a problem (reasoning) and taking actions (acting) in an interleaved loop.

Retry Logic

A mechanism that automatically retries failed operations with modifications, such as different parameters, backoff delays, or alternative approaches.

Rasa Agent

A conversational AI agent built using the Rasa framework, featuring customizable NLU, dialogue management, and action execution for enterprise applications.

Rule-Based Chatbot

A rule-based chatbot follows predefined decision trees and keyword patterns to respond to user queries, without any machine-learning-based language understanding.

Rich Message

A rich message is a chatbot response that goes beyond plain text, including images, cards, buttons, carousels, and interactive elements.

Read Receipt

A read receipt is an indicator in a chat interface confirming that a sent message has been read by the recipient.

Responsive Chat

Responsive chat is a chat interface that automatically adapts its layout and behavior to different screen sizes and devices.

Rich Card

A rich card is a structured message element combining an image, text, and action buttons in a single visual container.

Routing Rule

A routing rule defines how incoming chat conversations are directed to specific agents, teams, or queues based on defined criteria.

Round-Robin Routing

Round-robin routing distributes incoming chat conversations evenly across available agents in a rotating sequence.

Resolution Rate

Resolution rate is the percentage of chat conversations successfully resolved without requiring escalation to a human agent.

React SDK

A React SDK provides React-native components and hooks for integrating chatbot functionality into React applications.

Regression Testing (Chatbot)

Regression testing for chatbots verifies that changes to the bot configuration or knowledge base do not break previously working functionality.

Rollback (Chatbot)

A chatbot rollback reverts the bot to a previous working configuration when a recent change causes problems or performance degradation.

Returning Visitor

A returning visitor is a user who has previously interacted with the website or chatbot, enabling personalized continuation of their experience.

Rate Limiting (Chatbot)

Rate limiting controls how many messages a user or IP can send to a chatbot within a time period, preventing abuse and managing costs.

Rate Plan

A rate plan is a specific pricing tier offered by a chatbot platform, defining the features, limits, and cost for that subscription level.

Reward Hacking

When an AI system finds unintended ways to achieve high reward scores without actually performing the intended task, exploiting gaps in the reward specification.

Representation Bias

Bias from certain groups being underrepresented or stereotypically portrayed in training data, leading to AI that performs poorly or unfairly for those groups.

Racial Bias

Systematic unfairness in AI outputs that disadvantages certain racial or ethnic groups through stereotyping, unequal treatment, or discriminatory associations.

Responsible AI

An approach to developing and deploying AI that prioritizes ethical considerations, societal benefit, transparency, fairness, safety, and accountability throughout the AI lifecycle.

Red Teaming

The practice of adversarially testing AI systems by deliberately trying to find vulnerabilities, biases, and safety failures before they affect real users.

Rule Extraction

A technique that derives human-readable rules from a trained AI model, creating an interpretable approximation of the model's behavior.

REST API Endpoint

A REST API endpoint is an HTTP-accessible URL that accepts requests and returns responses, commonly used to expose ML model predictions as a web service.

Real-time Inference

Real-time inference serves ML model predictions immediately in response to individual requests, typically with latency requirements under a few hundred milliseconds.

Replicate

Replicate is a platform for running open-source ML models in the cloud via a simple API, handling infrastructure and scaling so developers can focus on building applications.

Ray Serve

Ray Serve is a scalable model serving framework built on Ray that supports complex inference graphs, dynamic batching, and seamless scaling across CPUs and GPUs.

RunPod

RunPod is a cloud platform providing on-demand GPU instances and serverless GPU endpoints for ML training, inference, and development at competitive prices.

Replicate Platform

Replicate is a cloud platform for running open-source ML models with a simple API, handling infrastructure, scaling, and model packaging automatically.

RAG Infrastructure

RAG (Retrieval-Augmented Generation) infrastructure provides the systems for indexing documents, retrieving relevant context, and augmenting LLM prompts with external knowledge.

RetinaNet

RetinaNet is a one-stage object detector that introduced focal loss to address class imbalance between foreground objects and background in dense detection.

Real-Time Object Detection

Real-time object detection processes video frames fast enough for live applications, typically achieving 30+ FPS while maintaining acceptable detection accuracy.

Radiology AI

Radiology AI applies deep learning to interpret medical imaging studies like X-rays, CT scans, and MRI, assisting radiologists in diagnosis and workflow efficiency.

Robotic Vision

Robotic vision equips robots with visual perception capabilities to understand environments, recognize objects, and guide manipulation and navigation tasks.

Real-time Transcription

Real-time transcription converts speech to text as it is spoken, producing live text output with minimal delay for applications like live captioning and voice assistants.

Real-Time Coaching

Real-time coaching uses AI to provide live feedback and guidance to agents during customer calls, improving performance in the moment.

ROI

ROI (Return on Investment) for AI measures the financial return generated by AI implementations relative to their cost, including both direct savings and indirect benefits.

Robotic Process Automation

Robotic Process Automation (RPA) uses software bots to automate repetitive, rule-based tasks by mimicking human interactions with digital systems.

RPA

RPA stands for Robotic Process Automation, the technology that automates routine digital tasks using software bots that interact with applications like humans do.

Retail AI

Retail AI applies artificial intelligence to retail operations, including product recommendation, demand forecasting, pricing optimization, visual search, and customer engagement.

Retention Rate

Retention rate measures the percentage of customers who continue using a product over a given period, indicating how well the business retains its user base.

ROI for AI

ROI for AI specifically focuses on measuring the financial return of artificial intelligence investments, accounting for the unique costs and benefits of AI systems.

Revenue Operations AI

Revenue operations AI uses artificial intelligence to optimize the end-to-end revenue process, aligning sales, marketing, and customer success with data-driven insights.

Responsible AI Framework

A responsible AI framework provides principles, guidelines, and processes for developing and deploying AI ethically, fairly, and transparently.

Recommendation Engine for Business

A business recommendation engine uses AI to suggest relevant products, content, or actions to customers based on their behavior, preferences, and context.

Revenue Optimization

Revenue optimization uses AI to maximize total revenue through pricing, packaging, upselling, retention, and customer lifecycle management strategies.

Retention Campaign

A retention campaign uses AI to proactively engage at-risk customers with targeted interventions designed to prevent churn and increase loyalty.

Referral Program AI

Referral program AI uses machine learning to identify likely referrers, optimize incentive structures, and maximize the viral growth from customer recommendations.

Relational Database

A relational database organizes data into tables with rows and columns, using relationships between tables to maintain data integrity and enable powerful queries.

Redis

Redis is an open-source, in-memory data store used as a database, cache, message broker, and streaming engine, known for sub-millisecond response times.

Redis as a Database

Redis used as a primary database extends its in-memory capabilities with persistence, data structures, and modules for search, JSON, and time-series storage.

Real-Time Processing

Real-time processing handles data immediately as it arrives, delivering results within milliseconds to seconds for time-sensitive applications.

Record Linkage

Record linkage is the process of identifying and merging records that refer to the same entity across different data sources or within a dataset with inconsistent identifiers.

Row-Level Security

Row-level security is a database feature that restricts which rows a user or application can access based on security policies defined at the table level.

Real-Time Database

A real-time database pushes data changes to connected clients instantly, enabling live updates without polling, used in chat applications and collaborative tools.

Random Variable

A random variable is a numerical outcome of a random process, providing the mathematical bridge between uncertain real-world events and probability distributions.

Regularization

Regularization adds constraints or penalties to the optimization objective to prevent overfitting and improve model generalization.

Replit AI

Replit AI provides AI-powered coding assistance within the Replit cloud development environment, enabling code generation, debugging, and learning in a browser-based IDE.

Rasa

Rasa is an open-source conversational AI framework for building contextual AI assistants, offering full control over NLU, dialogue management, and deployment.

Reka AI

Reka AI is an AI research company developing multimodal language models that can understand text, images, video, and audio.

Replicate

Replicate is a platform that makes it easy to run, fine-tune, and deploy open-source AI models through a simple cloud API.

Runway

Runway is an AI creative platform specializing in AI-powered video generation, editing, and visual effects tools for filmmakers and content creators.

Rev AI

Rev AI provides speech recognition APIs built on Rev.com's years of human transcription data, offering high-accuracy speech-to-text for developers.

RAPIDS

RAPIDS is a suite of GPU-accelerated data science libraries by NVIDIA that provides pandas-like and scikit-learn-like APIs running entirely on GPUs.

Ray Tune

Ray Tune is a scalable hyperparameter tuning library that supports distributed search across clusters, with integrations for all major ML frameworks.

RAGAS

RAGAS is a framework for evaluating retrieval-augmented generation pipelines, providing metrics for faithfulness, answer relevancy, context precision, and context recall.

Replicate

Replicate is a platform for running open-source AI models in the cloud through a simple API, with support for custom model deployment using Cog containers.

Radiology AI

Radiology AI uses deep learning to analyze medical images like X-rays, CT scans, and MRIs to detect abnormalities and assist radiologists.

Risk Assessment

AI risk assessment uses machine learning to evaluate and quantify financial, operational, or security risks by analyzing complex data patterns.

Robo-Advisor

A robo-advisor is an AI-powered digital platform that provides automated financial planning and investment management with minimal human intervention.

Retail AI

Retail AI applies machine learning to transform shopping experiences through personalization, inventory optimization, pricing, and customer service automation.

Robotics AI

Robotics AI combines artificial intelligence with mechanical systems to create robots that can perceive, reason, learn, and physically interact with the world.

Remote Patient Monitoring

AI-powered remote patient monitoring uses connected devices and algorithms to track patient health data outside clinical settings.

Risk Management AI

Risk management AI uses machine learning to identify, assess, and mitigate financial and operational risks.

Review Analysis

AI review analysis uses NLP to extract insights from customer reviews including sentiment, topics, and product feedback.

Real Estate AI

Real estate AI uses machine learning for property valuation, market analysis, and automated property management.

Robotic Process Automation

RPA uses AI-enhanced software bots to automate repetitive business processes across enterprise applications.

RegTech AI

RegTech AI uses machine learning to help organizations manage regulatory compliance more efficiently and effectively.

Renewable Energy AI

Renewable energy AI optimizes the generation, storage, and integration of solar, wind, and other renewable energy sources.

Recruitment AI

Recruitment AI uses machine learning to automate candidate sourcing, screening, and matching in the hiring process.

Ride-Sharing AI

Ride-sharing AI uses machine learning to match riders with drivers, optimize pricing, predict demand, and manage the logistics of on-demand transportation platforms.

Remote Patient Monitoring

Remote patient monitoring uses connected devices and AI to track patient health data outside clinical settings, enabling proactive care and early intervention.

Regulatory Technology

Regulatory technology (RegTech) uses AI to help organizations comply with regulations more efficiently through automated monitoring, reporting, and risk assessment.

RTX 4090

The NVIDIA RTX 4090 is a consumer GPU based on the Ada Lovelace architecture, offering strong AI performance for development, fine-tuning, and local inference.

RDMA

Remote Direct Memory Access (RDMA) enables direct memory-to-memory data transfer between computers without involving the operating system, essential for high-performance AI training networks.

Roofline Model

The roofline model is a performance analysis framework that shows whether an AI workload is limited by compute throughput or memory bandwidth on a given processor.

Ranking

Search ranking is the process of ordering search results by relevance, using algorithms that score how well each document matches a user's query and intent.

Relevance

Search relevance measures how well search results match a user's query intent, encompassing both topical match and usefulness of the results.

Reciprocal Rank Fusion

Reciprocal Rank Fusion (RRF) combines ranked lists from multiple search methods into a single ranking based on each result's position across lists.

Recommendation System

A recommendation system uses AI to suggest relevant items to users based on their behavior, preferences, and patterns from similar users.

Relevance Score

A relevance score is a numerical value assigned to a search result indicating how well it matches a query, used to rank results from most to least relevant.

Range Search

Range search finds documents with field values falling within a specified numeric, date, or alphanumeric range, enabling queries like price ranges or date intervals.

RankNet

RankNet is a pairwise learning-to-rank algorithm that uses a neural network with a probabilistic cross-entropy loss to learn document relevance ordering.

Recommendation Diversity

Recommendation diversity measures and promotes variety in recommended items, balancing relevance with breadth to avoid repetitive or monotonous suggestions.

Reranking

Reranking is a second-stage process that applies a more sophisticated model to reorder initial search results, improving ranking quality for top candidates.

Report Generation

Report generation uses AI to automatically create structured business, financial, and analytical reports from data inputs and templates.

Repository-Level Generation

Repository-level generation uses AI to understand and generate code across entire codebases, handling multi-file changes, dependencies, and project conventions.

Representation Learning

Representation learning trains models to automatically discover useful data representations and features, replacing manual feature engineering.

Reproducibility

Reproducibility in AI research is the ability to independently replicate experimental results using the same methods, data, and code.

Replicability

Replicability means that an AI research finding can be confirmed by independent teams using new implementations and potentially different data.

Reward Model (Research Perspective)

Reward model research studies learned models that predict human preferences, serving as training signals for aligning AI behavior.

REST API

A REST API uses HTTP methods and resource-based URLs to create a standardized interface for web services communication.

Rate Limiting

Rate limiting controls how many API requests a client can make within a specified time period to prevent abuse and ensure fair resource allocation.
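
One common implementation is a token bucket; here is a minimal single-process sketch (production systems typically use a shared store like Redis):

```python
import time

class TokenBucket:
    # `rate` tokens refill per second up to `capacity`; each request spends
    # one token, and requests arriving with no token available are rejected.
    def __init__(self, rate, capacity):
        self.rate = rate
        self.capacity = capacity
        self.tokens = float(capacity)
        self.last = time.monotonic()

    def allow(self):
        now = time.monotonic()
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False
```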

Real-Time

Real-time refers to systems that process and deliver data with minimal latency, providing immediate feedback to users.

React

React is a JavaScript library for building user interfaces using a component-based architecture with a virtual DOM.

Request Header

A request header is metadata sent by the client to the server as part of an HTTP request, containing information like authentication, content type, and client details.

Response Header

A response header is metadata sent by the server back to the client as part of an HTTP response, containing information like caching rules, rate limits, and content details.

Request Body

A request body is the data payload sent by the client to the server in HTTP methods like POST, PUT, and PATCH.

Response Body

A response body is the data payload returned by the server to the client as part of an HTTP response.

Remix

Remix is a full-stack React framework focused on web standards, progressive enhancement, and fast page loads through nested routing and server-side data loading.

Reverse Proxy

A reverse proxy is a server that sits in front of backend servers, forwarding client requests and providing load balancing, caching, and security.

Retry Pattern

The retry pattern automatically retries failed operations with a strategy like exponential backoff to handle transient failures in distributed systems.
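
A minimal sketch of exponential backoff with jitter:

```python
import random
import time

def with_retries(operation, max_attempts=4, base_delay=0.1):
    # Wait base_delay * 2**attempt plus a small random offset between
    # attempts; re-raise the exception after the final failure.
    for attempt in range(max_attempts):
        try:
            return operation()
        except Exception:
            if attempt == max_attempts - 1:
                raise
            time.sleep(base_delay * 2 ** attempt + random.uniform(0, base_delay))
```

In practice you would also restrict the `except` clause to exception types that are actually transient.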

Real-Time Analytics

Real-time analytics processes and analyzes data as it is generated, providing immediate insights for time-sensitive decisions.

Regression Analysis

Regression analysis models the relationship between a dependent variable and one or more independent variables to predict outcomes and understand effects.

Reverse ETL

Reverse ETL syncs data from the data warehouse back into operational tools like CRMs, marketing platforms, and customer support systems.

Real-Time Dashboard

A real-time dashboard displays live-updating metrics and visualizations that reflect current system status and user activity.

Regression to the Mean

Regression to the mean is the statistical tendency for extreme measurements to be followed by values closer to the average.

Reasoning Models Emergence

The emergence of reasoning models in 2024, starting with OpenAI o1, introduced AI systems that use explicit chain-of-thought reasoning to solve complex problems.

Richard Sutton

Richard Sutton is a pioneering reinforcement learning researcher whose work and writings shaped modern AI thinking about learning from interaction and scaling.

S

Supervised Learning

Supervised learning is a machine learning approach where models learn from labeled training data, mapping inputs to known correct outputs.

Semi-Supervised Learning

Semi-supervised learning combines a small amount of labeled data with a large amount of unlabeled data to improve model performance beyond what either could achieve alone.

Self-Supervised Learning

Self-supervised learning trains models by generating supervisory signals from the data itself, such as predicting masked words or the next token in a sequence.

Support Vector Machine

A support vector machine finds the optimal hyperplane that separates classes with the maximum margin, effective for high-dimensional classification tasks.

SMOTE

SMOTE (Synthetic Minority Over-sampling Technique) creates synthetic training examples for the minority class by interpolating between existing minority samples.

Standardization

Standardization transforms features to have zero mean and unit standard deviation, making them comparable regardless of original scale and distribution.
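
The z-score transform in a few lines:

```python
def standardize(values):
    # Subtract the mean, divide by the (population) standard deviation.
    n = len(values)
    mean = sum(values) / n
    std = (sum((v - mean) ** 2 for v in values) / n) ** 0.5
    return [(v - mean) / std for v in values]
```

The output has mean 0 and standard deviation 1 regardless of the original scale.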

SHAP Values

SHAP values explain individual predictions by attributing the contribution of each feature based on Shapley values from cooperative game theory.

Stochastic Gradient Descent

Stochastic gradient descent updates model parameters using the gradient computed from a single random training example, enabling faster iterations.
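
One update step for a least-squares objective, sketched from a single example:

```python
def sgd_step(weights, x, y, lr=0.1):
    # w <- w - lr * gradient of (prediction - y)^2 for one (x, y) pair.
    prediction = sum(w * xi for w, xi in zip(weights, x))
    error = prediction - y
    return [w - lr * 2 * error * xi for w, xi in zip(weights, x)]
```

Repeating this over shuffled examples (or small mini-batches) is the usual training loop.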

Stratified Sampling

Stratified sampling ensures that each subset of data maintains the same class distribution as the full dataset, preventing biased train-test splits.

Sigmoid

Sigmoid is an activation function that maps any input to a value between 0 and 1, historically used in neural networks and still standard for binary classification outputs.
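
The function itself is one line:

```python
import math

def sigmoid(x):
    # Squashes any real input into the open interval (0, 1).
    return 1.0 / (1.0 + math.exp(-x))
```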

Softmax

Softmax is an activation function that converts a vector of raw scores into a probability distribution, where all values sum to 1.
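
A numerically stable implementation:

```python
import math

def softmax(scores):
    # Subtract the max before exponentiating to avoid overflow;
    # the outputs are non-negative and sum to 1.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]
```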

Swish

Swish is a smooth, self-gated activation function defined as f(x) = x * sigmoid(x), offering improved performance over ReLU in some deep networks.

SELU

SELU (Scaled Exponential Linear Unit) is a self-normalizing activation function that automatically maintains zero mean and unit variance across layers.

Stride

Stride is the step size by which a convolutional filter moves across the input, controlling the spatial dimensions of the output feature map.

Sequence-to-Sequence

Sequence-to-sequence (seq2seq) is a neural network architecture that maps an input sequence to an output sequence, enabling tasks like translation and summarization.

Self-Attention

Self-attention is a mechanism where each element in a sequence computes attention weights over all other elements, capturing contextual relationships regardless of distance.

Stochastic Depth

Stochastic depth is a regularization technique that randomly skips entire layers during training, effectively training an ensemble of networks with different depths.

StyleGAN

StyleGAN is a GAN architecture that uses a style-based generator with adaptive instance normalization, enabling fine-grained control over generated image attributes at different scales.

Stable Diffusion

Stable Diffusion is an open-source latent diffusion model for text-to-image generation that operates in compressed latent space with classifier-free guidance for prompt adherence.

Spectral Normalization

Spectral normalization constrains the spectral norm (largest singular value) of weight matrices to one, stabilizing GAN training by enforcing a Lipschitz constraint on the discriminator.

Swin Transformer

Swin Transformer computes self-attention within shifted windows, enabling hierarchical feature maps and linear scaling with image size.

State Space Model

State space models are sequence models based on continuous linear dynamical systems, offering efficient alternatives to transformers for long sequences.

Small Language Model

A small language model (SLM) is a compact AI model with fewer parameters that runs efficiently on limited hardware while still handling many practical tasks.

Subword Tokenization

Subword tokenization splits text into units between characters and words, enabling efficient vocabulary coverage of any text including rare and novel words.

SentencePiece

SentencePiece is a language-independent tokenization library that treats text as a raw stream of characters (including whitespace), enabling consistent tokenization across any language or script.

Special Token

A special token is a reserved token in a language model vocabulary that serves a structural purpose, such as marking message boundaries or end of text.

Speculative Decoding

Speculative decoding is an inference optimization where a small draft model proposes tokens that a larger model verifies in parallel, speeding up generation.

Stop Sequence

A stop sequence is a string that, when generated by the model, causes text generation to immediately halt and return the response.
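
A toy sketch of how a client or server might apply one to a token stream:

```python
def generate_with_stop(token_stream, stop="\nUser:"):
    # Accumulate tokens and halt as soon as the stop sequence appears;
    # the stop sequence itself is trimmed from the returned text.
    text = ""
    for token in token_stream:
        text += token
        if stop in text:
            return text[:text.index(stop)]
    return text
```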

Sampling

Sampling is the process of selecting the next token from a probability distribution during text generation, introducing controlled randomness into outputs.

Streaming

Streaming is a technique that sends model output tokens to the client as they are generated, providing real-time progressive display instead of waiting for full completion.

System Prompt

A system prompt is a set of instructions provided to a language model that defines its behavior, persona, and constraints for all subsequent interactions.

Self-Consistency

Self-consistency is a prompting technique that generates multiple reasoning paths for the same problem and selects the most common final answer.
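
The final aggregation step is just a majority vote over sampled answers, which a minimal sketch makes concrete (the answer strings below are invented for the example):

```python
from collections import Counter

def self_consistent_answer(answers: list[str]) -> str:
    """Pick the most common final answer across sampled reasoning paths."""
    return Counter(answers).most_common(1)[0][0]

# Five independently sampled chains of thought might end in these final answers:
paths = ["42", "42", "41", "42", "40"]
best = self_consistent_answer(paths)
```

Note that the vote is over final answers only; the intermediate reasoning in each path may differ.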

Supervised Fine-Tuning

Supervised fine-tuning (SFT) trains a pre-trained model on labeled input-output pairs to specialize it for specific tasks or improve its response quality.

Scalable Oversight

Scalable oversight refers to techniques for supervising AI systems effectively even as they become more capable than human evaluators at specific tasks.

Sliding Window Attention

Sliding window attention limits each token to attend only to a fixed window of nearby tokens, reducing computation while maintaining local context.
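
The attention mask this implies can be sketched directly: query token i may attend only to key tokens within the last `window` positions (a causal variant, as a toy illustration):

```python
def sliding_window_mask(seq_len: int, window: int) -> list[list[bool]]:
    """True where query token i may attend to key token j: causal, within `window`."""
    return [[(i - window < j <= i) for j in range(seq_len)] for i in range(seq_len)]

mask = sliding_window_mask(seq_len=5, window=2)
```

Each row has at most `window` True entries, so attention cost grows linearly with sequence length instead of quadratically.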

Scaling Law

Scaling laws are empirical relationships showing how model performance predictably improves with increases in model size, training data, and compute.

Sparse Model

A sparse model activates only a fraction of its total parameters for each input, achieving high capacity with lower computational cost per inference.

Sampling Strategy

The method used to select the next token from a probability distribution during text generation, ranging from greedy to highly random approaches.

Step-Back Prompting

A prompting technique that asks the model to first consider a higher-level or more abstract version of the question before answering the specific query.

Skeleton-of-Thought

A prompting technique that first generates an outline skeleton of the answer, then expands each point in parallel, reducing end-to-end latency.

StreamingLLM

A framework that enables LLMs to handle infinite-length sequences by retaining attention sinks and a sliding window of recent tokens.

Semantic Search

A search approach that finds results based on meaning rather than keyword matching, powered by embeddings and vector similarity.

Self-Attention

A mechanism where each token in a sequence computes relevance scores with all other tokens, enabling the model to weigh contextual relationships dynamically.

Safety Filter

An automated system that screens AI inputs and outputs for harmful, toxic, or policy-violating content and takes appropriate action.

Structured Output

Techniques for constraining LLM outputs to follow a specific schema or format, ensuring reliable integration with downstream systems.

Synthetic Data

Training data generated by AI models rather than collected from real-world sources, used to augment or replace human-created datasets.

Sycophancy

The tendency of AI models to tell users what they want to hear rather than providing honest, accurate responses, especially when corrected or challenged.

SuperGLUE

SuperGLUE is a benchmark suite of eight difficult language understanding tasks that succeeded GLUE as the standard NLU evaluation.

SQuAD

SQuAD (Stanford Question Answering Dataset) is a reading comprehension benchmark where models extract answers from Wikipedia passages.

Saturation

Saturation describes when model performance on a benchmark plateaus near the maximum, reducing the benchmark's evaluative usefulness.

Scaled Dot-Product Attention

Scaled dot-product attention is the core attention computation that measures token compatibility by computing scaled dot products of queries and keys.
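
The formula is softmax(QKᵀ/√d_k)V, which can be written out on plain nested lists as a minimal sketch (the tiny Q, K, V matrices are invented for the example):

```python
import math

def scaled_dot_product_attention(Q, K, V):
    """attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V, on nested lists."""
    d_k = len(K[0])
    out = []
    for q in Q:
        # Compatibility score between this query and every key, scaled by sqrt(d_k)
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in K]
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Output is the attention-weighted sum of the value vectors
        out.append([sum(w * v[j] for w, v in zip(weights, V)) for j in range(len(V[0]))])
    return out

Q = [[1.0, 0.0]]
K = [[1.0, 0.0], [0.0, 1.0]]
V = [[10.0, 0.0], [0.0, 10.0]]
attn = scaled_dot_product_attention(Q, K, V)
```

The query aligns with the first key, so the output leans toward the first value vector while still blending in the second.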

SwiGLU Activation

SwiGLU is an activation function combining Swish and Gated Linear Units that has become standard in modern LLM feed-forward layers.
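
Elementwise, SwiGLU multiplies a Swish-activated gate by a second linear projection. A minimal sketch, where `gate` and `value` stand in for the two learned projections of the layer input:

```python
import math

def swish(x: float, beta: float = 1.0) -> float:
    """Swish (SiLU when beta=1): x * sigmoid(beta * x)."""
    return x / (1.0 + math.exp(-beta * x))

def swiglu(gate: list[float], value: list[float]) -> list[float]:
    """SwiGLU gating: Swish(x W) elementwise-multiplied with (x V)."""
    return [swish(g) * v for g, v in zip(gate, value)]

out = swiglu([1.0, -1.0], [2.0, 2.0])
```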

StarCoder Data

StarCoder Data is a large-scale code dataset with permissively licensed source code from GitHub, used for training code-focused language models.

SlimPajama

SlimPajama is a deduplicated and cleaned version of RedPajama, reducing 1.2 trillion tokens to 627 billion high-quality tokens.

Search Assistant

A search assistant uses language models to understand natural language queries and provide synthesized answers from search results.

Sentiment Analysis

Sentiment analysis is the NLP task of determining the emotional tone or opinion expressed in text, typically as positive, negative, or neutral.

Semantic Parsing

Semantic parsing is the NLP task of converting natural language into a formal, machine-readable representation of its meaning.

Stance Detection

Stance detection is the NLP task of determining the position or attitude of a text's author toward a specific topic, claim, or target.

Sentence Tokenization

Sentence tokenization is the text processing step of splitting text into individual sentences for structured NLP analysis.

Stemming

Stemming is a text processing technique that reduces words to their root form by stripping suffixes, helping group related word variants together.
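
A toy suffix-stripping stemmer shows the idea; production systems use rule sets like Porter or Snowball, and this simplified version will over- and under-stem:

```python
def simple_stem(word: str) -> str:
    """Strip the first matching suffix, keeping a stem of at least 3 characters."""
    for suffix in ("ingly", "edly", "ing", "ed", "es", "s", "ly"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

stems = [simple_stem(w) for w in ["connected", "connecting", "connection", "connects"]]
```

Three of the four variants collapse to the shared stem "connect", which is exactly what lets search and indexing group related word forms.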

Stopword Removal

Stopword removal is the text processing step of filtering out common words like 'the,' 'is,' and 'at' that carry little meaningful information.

Spell Checking

Spell checking is the NLP task of detecting and correcting misspelled words in text using dictionaries and statistical models.

Skip-gram

Skip-gram is a neural network architecture used in Word2Vec that predicts surrounding words given a target word, learning word embeddings.
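
The training data for skip-gram is just (target, context) pairs drawn from a window around each word; generating them can be sketched as:

```python
def skipgram_pairs(tokens: list[str], window: int = 2) -> list[tuple[str, str]]:
    """Generate (target, context) pairs: each target predicts its neighbors."""
    pairs = []
    for i, target in enumerate(tokens):
        for j in range(max(0, i - window), min(len(tokens), i + window + 1)):
            if j != i:
                pairs.append((target, tokens[j]))
    return pairs

pairs = skipgram_pairs(["the", "cat", "sat"], window=1)
```

The neural network then learns embeddings by predicting the context word from the target word for each pair.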

Sentence Embedding

A sentence embedding is a dense vector representation that captures the semantic meaning of an entire sentence in a fixed-size numerical vector.

Sentence-BERT

Sentence-BERT (SBERT) is a modification of BERT that produces semantically meaningful sentence embeddings for efficient similarity comparison.

SimCSE

SimCSE is a contrastive learning framework for producing high-quality sentence embeddings using simple data augmentation techniques.

Sparse Representation

A sparse representation encodes text as a high-dimensional vector with mostly zero values, typically based on word frequencies or term weights.

Sentiment Lexicon

A sentiment lexicon is a curated list of words and phrases annotated with their associated sentiment polarity or emotional values.

Story Generation

Story generation is the NLP task of creating coherent, creative narrative text with characters, plot, and narrative structure.

Simultaneous Translation

Simultaneous translation processes and translates speech or text in real time as it is spoken or written, with minimal delay.

Slot Filling

Slot filling is the NLP task of extracting specific pieces of information (slots) from user utterances to complete a task or form.

Sentence Boundary Detection

Sentence boundary detection is the NLP task of identifying where one sentence ends and the next begins in a text.

Subjectivity Detection

Subjectivity detection is the NLP task of classifying text as expressing subjective opinions versus objective facts.

Statistical Machine Translation

Statistical machine translation uses probabilistic models trained on parallel text corpora to automatically translate between languages.

Slot Filling in Dialogue

Slot filling in dialogue extracts specific pieces of information from user utterances to complete a structured task or form.

Syntactic Analysis

Syntactic analysis examines the grammatical structure of sentences to understand how words combine according to the rules of a language.

Semantic Similarity

Semantic similarity measures how close two pieces of text are in meaning, regardless of their surface-level wording.

Semantic Role Labeling

Semantic role labeling identifies the roles that words play in relation to a predicate, such as who did what to whom.

Speech Recognition

Speech recognition converts spoken language into written text, enabling voice-based interaction with computers and AI systems.

Sentence Compression

Sentence compression shortens sentences by removing unnecessary words or phrases while preserving the core meaning.

Sequence-to-Sequence Model

A sequence-to-sequence model transforms one sequence of tokens into another, powering tasks like translation, summarization, and dialogue.

Sarcasm Detection

Sarcasm detection identifies text where the intended meaning is opposite to the literal meaning, a key challenge for sentiment analysis.

Spoken Language Understanding

Spoken language understanding interprets the meaning and intent of spoken utterances after they have been converted to text by speech recognition.

Stopwords

Stopwords are extremely common words like "the," "is," and "and" that are often filtered out during text preprocessing because they carry little meaning.

Sentiment Scoring

Sentiment scoring assigns numerical values to text indicating the strength and direction of expressed sentiment on a continuous scale.

Semantic Search

Semantic search finds relevant results based on meaning and intent rather than exact keyword matching.

Summarization Faithfulness

Summarization faithfulness measures whether a generated summary accurately represents the information in the source document without adding or distorting facts.

Sentiment Trend Analysis

Sentiment trend analysis tracks how sentiment toward a topic, product, or brand changes over time, revealing patterns and shifts in opinion.

Semantic Parsing

Semantic parsing converts natural language into formal meaning representations like logical forms, SQL queries, or knowledge graph queries.

Syntax Tree

A syntax tree is a tree representation of the grammatical structure of a sentence, showing how words and phrases are organized hierarchically.

Stem

A stem is the core part of a word remaining after removing all affixes, used in stemming to normalize word variants.

Sentence Alignment

Sentence alignment matches corresponding sentences between parallel texts, such as translations of the same document in different languages.

Spatial Reasoning in NLP

Spatial reasoning in NLP is the ability to understand and reason about spatial relationships, locations, and navigation described in natural language.

Span Extraction

Span extraction identifies and extracts contiguous text spans from documents that match specific criteria, such as named entities, answers, or key phrases.

Sequence Labeling

Sequence labeling assigns a categorical label to each element in a sequence, such as POS tags to words or entity types to tokens.

Semantic Search

A search technique that finds results based on meaning and intent rather than exact keyword matches, enabling more intelligent information retrieval.

Self-RAG

A RAG variant where the language model decides when to retrieve, evaluates retrieved passages, and critiques its own generation for quality and faithfulness.

Structured RAG

A RAG approach that leverages structured data sources like databases, tables, and APIs alongside unstructured text for more precise and comprehensive retrieval.

SPLADE

A learned sparse retrieval model that predicts term importance weights using a language model, combining the effectiveness of neural models with sparse retrieval efficiency.

Sparse Embedding

A vector representation where most dimensions are zero, with non-zero values corresponding to specific vocabulary terms or features in the input text.

Sentence-based Chunking

A chunking strategy that splits text at sentence boundaries, ensuring each chunk contains complete sentences for more coherent retrieval results.

Semantic Chunking

A chunking approach that uses embeddings to detect topic shifts in text, splitting documents at points where the semantic content changes significantly.

Structure-aware Chunking

A chunking approach that uses document structure elements like headings, sections, and tables to create meaningful chunks that respect the document's organization.

Small-to-big Retrieval

A retrieval strategy that searches over small chunks for precision then expands to larger surrounding context before sending to the language model.

Sentence Window Retrieval

A technique that retrieves individual sentences but returns a window of surrounding sentences as context, balancing retrieval precision with generation context.

Sparse Retrieval

A search method that uses sparse term-based representations like TF-IDF or BM25 to find documents based on keyword matching and term frequency statistics.

Step-back Prompting

A technique that generates a more abstract, higher-level version of a query to retrieve broader background information that helps answer the specific question.

Sub-question Decomposition

Breaking a complex question into independent sub-questions that can be individually answered and combined into a comprehensive response.

Scalar Quantization

A compression technique that reduces the precision of each dimension in a vector from 32-bit floats to smaller representations like 8-bit integers.
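
A minimal sketch of symmetric int8 scalar quantization: one scale factor maps floats into the range [-127, 127], and dequantization approximately reconstructs the originals:

```python
def quantize_int8(vector: list[float]) -> tuple[list[int], float]:
    """Symmetric scalar quantization: map floats to int8 codes with one scale."""
    scale = max(abs(v) for v in vector) / 127.0 or 1.0  # guard the all-zero case
    return [round(v / scale) for v in vector], scale

def dequantize(codes: list[int], scale: float) -> list[float]:
    """Approximate reconstruction of the original floats."""
    return [c * scale for c in codes]

codes, scale = quantize_int8([0.5, -1.0, 0.25])
restored = dequantize(codes, scale)
```

Storage drops from 32 bits to 8 bits per dimension, at the cost of a small rounding error in each value.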

Similarity Threshold

A configurable cutoff value that determines the minimum similarity score required for a retrieved document to be included in RAG context.
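
Applying the cutoff is a one-line filter over scored hits; the document names and scores below are invented for the example:

```python
def filter_by_threshold(hits: list[tuple[str, float]], threshold: float) -> list[str]:
    """Keep only retrieved documents whose similarity score clears the cutoff."""
    return [doc for doc, score in hits if score >= threshold]

hits = [("refund-policy.md", 0.91), ("blog-post.md", 0.62), ("faq.md", 0.78)]
context_docs = filter_by_threshold(hits, threshold=0.75)
```

Tuning the threshold trades recall (lower cutoff, more marginal documents) against precision (higher cutoff, fewer but more relevant documents).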

Semi-autonomous Agent

An AI agent that can take independent actions within defined boundaries but requires human approval for important decisions or high-risk operations.

Semantic Kernel

Microsoft's open-source SDK for integrating LLMs into applications, providing an orchestration layer for AI plugins, planners, and memory management.

SWE-agent

A system that turns language models into software engineering agents capable of fixing real GitHub issues by navigating codebases and making targeted code changes.

Self-reflection

An agent pattern where the model evaluates its own outputs, identifies errors or areas for improvement, and revises its work based on this self-assessment.

Self-correction

An agent's ability to detect errors in its own outputs or actions and automatically fix them without human intervention, improving reliability.

Structured Output

The ability of LLMs to generate responses in specific structured formats like JSON, following a defined schema for reliable data extraction and tool integration.

Supervisor Agent

A central AI agent that coordinates other agents in a multi-agent system, assigning tasks, managing workflow, and synthesizing results.

Shared Memory

A common memory store accessible to multiple agents in a multi-agent system, enabling them to share information and maintain consistent state.

Short-term Memory

Temporary storage of recent interaction context that helps an AI agent maintain coherence within a conversation or short task sequence.

Semantic Memory

An agent's stored general knowledge and facts learned from interactions, organized by meaning rather than by specific events or episodes.

Summary Memory

A memory strategy that condenses conversation history into summaries, preserving key information while reducing the context length needed.

Sequential Chain

A chain where each step executes after the previous one completes, with outputs flowing forward through a fixed sequence of operations.

State Machine

A computational model where an agent transitions between defined states based on inputs and conditions, providing predictable and controllable behavior.
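
A minimal sketch of the idea: transitions are an explicit table, so the agent can only move along allowed paths (the states and events here are hypothetical):

```python
class OrderAgent:
    """A tiny state machine: only explicitly listed transitions are allowed."""
    TRANSITIONS = {
        ("idle", "start"): "collecting_info",
        ("collecting_info", "confirm"): "placing_order",
        ("placing_order", "done"): "idle",
    }

    def __init__(self):
        self.state = "idle"

    def handle(self, event: str) -> str:
        key = (self.state, event)
        if key not in self.TRANSITIONS:
            raise ValueError(f"event {event!r} not allowed in state {self.state!r}")
        self.state = self.TRANSITIONS[key]
        return self.state

agent = OrderAgent()
agent.handle("start")
```

Because every reachable state and transition is enumerated up front, behavior is auditable and invalid moves fail loudly instead of silently drifting.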

Span

A single operation within a trace, representing one step of agent execution such as an LLM call, tool invocation, or processing step with timing and metadata.

Structured Logging

A logging approach that records events in a consistent, machine-parseable format (typically JSON), enabling efficient analysis and monitoring of AI agent operations.
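
As a minimal sketch, each event becomes one JSON object per line with a timestamp and arbitrary key-value fields (the field names are illustrative):

```python
import json
import time

def log_event(event: str, **fields) -> str:
    """Emit one machine-parseable JSON log line with a timestamp and fields."""
    record = {"ts": time.time(), "event": event, **fields}
    return json.dumps(record)

line = log_event("tool_call", tool="search", latency_ms=124, status="ok")
parsed = json.loads(line)
```

Because every line parses back into a dictionary, downstream tools can filter, aggregate, and alert on fields instead of grepping free-form text.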

Semantic Kernel Agent

An agent built using Microsoft's Semantic Kernel SDK, integrating LLM capabilities with enterprise plugins, memory, and planning in a .NET or Python environment.

Self-Evaluation

The capability of an AI agent to assess the quality, correctness, and completeness of its own outputs before presenting them to the user.

Self-Critique

A technique where an AI agent generates critical feedback about its own outputs, identifying weaknesses, errors, and areas for improvement.

Specialist Agent

An agent with deep expertise in a specific domain or task type, called upon by other agents when their specialized knowledge or capabilities are needed.

Shared Memory Agent

A multi-agent architecture where agents share a common memory store, allowing them to read and write information that other agents can access.

Sales Bot

A sales bot engages website visitors in conversations to qualify leads, answer product questions, and guide prospects through the buying process.

Sentiment Analysis

Sentiment analysis detects the emotional tone of user messages, identifying whether they express positive, negative, or neutral feelings.

System Prompt

A system prompt is the instruction set given to an LLM that defines the chatbot's personality, behavior rules, and response guidelines.


Self-Service

Self-service enables customers to find answers and resolve issues independently through chatbots, knowledge bases, and automated tools.

Speech to Text

Speech to text (STT) converts spoken language into written text, enabling voice input for chatbots and voice assistants.

Slide-In Chat

A slide-in chat is a chat panel that enters the viewport from the side or bottom of the screen with a smooth sliding animation.

Send Button

A send button is the control element in a chat interface that submits the composed message to the conversation.

Session Timeout

Session timeout is the automatic ending of a chat session after a defined period of user inactivity.

Slack Bot

A Slack bot is a chatbot that operates within the Slack workspace platform, interacting with users through channels and direct messages.

SMS Chatbot

An SMS chatbot is a conversational AI system that interacts with users through text messages on their mobile phones.

Suggested Response

A suggested response is a pre-crafted reply option presented to the user as a clickable button to streamline the conversation.

Star Rating

A star rating is a visual feedback mechanism in chat that lets users rate their experience on a 1-5 star scale.

Slot Filling

Slot filling is the process of collecting required pieces of information from a user through conversation to complete a task.

Small Talk

Small talk is casual, non-task-oriented conversation that chatbots handle to appear more natural and personable.

Sentiment Detection Chat

Sentiment detection in chat analyzes user messages to determine emotional tone, enabling the bot to adapt its responses accordingly.

Skill-Based Routing

Skill-based routing directs chat conversations to agents with the specific skills and expertise needed to handle the issue.

Supervisor Monitoring

Supervisor monitoring allows team leads to observe active chat conversations in real time for quality assurance and coaching.

Satisfaction Score

A satisfaction score is a metric derived from user feedback that quantifies how satisfied users are with their chatbot experience.

Self-Service Rate

Self-service rate is the percentage of user inquiries resolved through automated channels without requiring human agent involvement.

Script Tag Embedding

Script tag embedding adds a chatbot to a website by including a JavaScript snippet that loads and initializes the chat widget on the page.

Sandbox Environment

A sandbox environment is an isolated testing space where chatbot changes can be tested without affecting the live production chatbot.

Staging Environment

A staging environment is a production-like chatbot deployment used for final validation before changes go live to real users.

Scroll Trigger

A scroll trigger activates a chatbot message when the user scrolls to a specific point on the page, indicating content engagement.

SOC 2 Compliance (Chatbot)

SOC 2 compliance certifies that a chatbot platform maintains adequate security, availability, and confidentiality controls for customer data.

Spam Detection (Chatbot)

Spam detection identifies and filters unwanted, repetitive, or abusive messages in chatbot conversations to maintain quality interactions.

Specification Gaming

When an AI system satisfies the literal specification of a task while violating its intended spirit, finding loopholes in how the objective is defined.

Scalable Oversight

Methods for maintaining effective human supervision of AI systems as they become more capable and handle more complex tasks than humans can easily verify.

Sampling Bias

A type of data bias that occurs when the training data is collected in a way that does not represent the full population the AI system will serve.

Selection Bias

A bias introduced when the criteria for including data in training systematically favor certain groups or types of examples over others.

SHAP

SHapley Additive exPlanations is a method based on game theory that assigns each feature a contribution value for a prediction, providing consistent and fair attributions.

Saliency Map

A visualization that highlights which parts of an input (pixels in an image, words in text) most influenced an AI model's output.

Secure Aggregation

A cryptographic protocol that allows a server to compute aggregate model updates from multiple devices without seeing any individual device's update.

Shutdown Problem

The challenge of ensuring an AI system can be safely shut down or corrected without the system resisting or circumventing the shutdown process.

Self-Preservation

The theoretical tendency of goal-directed AI systems to resist being modified or shut down because continued operation is instrumentally useful for achieving their goals.

System Card

A comprehensive documentation artifact that describes an AI system as deployed, including its components, capabilities, limitations, safety evaluations, and intended use.

Secure Multi-Party Computation

A cryptographic technique allowing multiple parties to jointly compute a function over their combined data without revealing individual inputs to each other.

Serverless Inference

Serverless inference runs ML model predictions on cloud infrastructure that automatically scales to zero when idle and up when requests arrive, eliminating idle resource costs.

SGLang

SGLang is a structured generation language and runtime for LLMs that enables efficient execution of complex prompting patterns like branching, forking, and constrained decoding.

Streaming Inference

Streaming inference delivers model predictions incrementally as they are generated, rather than waiting for the complete result before responding.

Snowflake Cortex

Snowflake Cortex provides AI and ML capabilities directly within the Snowflake Data Cloud, enabling LLM functions, ML model building, and AI-powered analytics on warehouse data.

Shadow Deployment

Shadow deployment runs a new ML model alongside the production model, sending real traffic to both but only serving responses from the current model, to validate the new model safely.

Spot Instance Training

Spot instance training uses discounted cloud GPU instances that can be interrupted, significantly reducing ML training costs with proper checkpointing and fault tolerance.

Semantic Segmentation

Semantic segmentation classifies every pixel in an image into a category, providing a dense, pixel-level understanding of the scene.

SSD

SSD (Single Shot MultiBox Detector) is a one-stage object detection architecture that detects objects at multiple scales from different feature map layers in a single forward pass.

Segment Anything Model

The Segment Anything Model (SAM) by Meta is a foundation model for image segmentation that can segment any object in any image given a point, box, or text prompt.

SAM

SAM (Segment Anything Model) is an abbreviation for Meta's foundation model that enables universal, promptable image segmentation across any domain.

Stable Diffusion

Stable Diffusion is an open-source text-to-image diffusion model by Stability AI that generates high-quality images from text descriptions and runs on consumer hardware.

SDXL

SDXL (Stable Diffusion XL) is an advanced version of Stable Diffusion that generates higher-resolution, more detailed images with better prompt following and composition.

Super-resolution

Super-resolution uses AI to increase image resolution and enhance detail beyond the original, reconstructing fine details that are not present in the low-resolution input.

Style Transfer

Style transfer applies the visual style of one image (e.g., a painting) to the content of another image, creating artistic transformations while preserving the original structure.

Sora

Sora is OpenAI's text-to-video model that generates highly realistic video clips up to a minute long from text prompts, demonstrating advanced understanding of physical dynamics.

SSD (Single Shot Detector)

SSD is a single-shot object detection architecture that predicts bounding boxes and class scores from multiple feature map scales in a single forward pass.

SAM 2

SAM 2 extends the Segment Anything Model to video, enabling real-time promptable segmentation and tracking of objects across video frames.

Scene Text Recognition

Scene text recognition detects and reads text appearing naturally in images, such as signs, labels, license plates, and street names.

SLAM

SLAM (Simultaneous Localization and Mapping) enables a device to build a map of an unknown environment while simultaneously tracking its own location within it.

Scene Understanding

Scene understanding is the comprehensive perception of a visual scene, including recognizing objects, their relationships, spatial layout, and contextual meaning.

Synthetic Data for Vision

Synthetic data for vision uses rendered 3D scenes, simulation, or generative models to create artificially generated training images with automatic annotations.

Semantic Image Search

Semantic image search finds images based on their meaning and content rather than metadata or tags, using learned visual and textual representations.

Satellite Image Analysis

Satellite image analysis uses computer vision to interpret Earth observation imagery for monitoring land use, climate, agriculture, and urban development.

Self-Supervised Learning for Vision

Self-supervised learning for vision trains models on unlabeled images by creating pretext tasks, learning rich visual representations without manual annotation.

Stereo Vision

Stereo vision estimates depth from two cameras that capture a scene from slightly different viewpoints, mimicking human binocular depth perception.

Semantic Correspondence

Semantic correspondence finds matching points or regions between images of semantically similar but visually different objects, like matching parts of different dog breeds.

Scene Graph Generation

Scene graph generation creates structured representations of images as graphs with objects as nodes and their relationships as edges.

Spatial Computing Vision

Spatial computing vision encompasses the visual AI technologies that enable AR, VR, and mixed reality devices to understand and interact with 3D environments.

Scene Classification

Scene classification categorizes entire images by the type of scene or environment they depict, such as beach, office, kitchen, or forest.

Speech Recognition

Speech recognition is the AI technology that converts spoken language into text, enabling machines to understand and process human speech.

Speech-to-Text

Speech-to-text (STT) converts spoken audio into written text, a practical term for the application of automatic speech recognition technology.

STT

STT stands for Speech-to-Text, the technology and services that convert spoken audio into written text transcriptions.

Speaker Recognition

Speaker recognition identifies or verifies a person's identity based on their voice characteristics, distinguishing who is speaking rather than what they are saying.

Speaker Diarization

Speaker diarization segments audio into speaker-homogeneous regions, determining who spoke when in a multi-speaker recording.

Speech Synthesis

Speech synthesis is the artificial production of human speech, encompassing TTS systems, voice generation, and the creation of spoken audio from various input formats.

Sentiment from Voice

Sentiment from voice detects emotional states and attitudes directly from speech audio, analyzing tone, pitch, pace, and energy beyond just the words spoken.

Sound Event Detection

Sound event detection identifies and locates specific sounds within audio recordings over time, determining what sounds occurred and when they happened.

Spectrogram

A spectrogram is a visual representation of audio showing how frequencies change over time, used as the primary input format for many speech and audio AI models.

Speaker Identification

Speaker identification determines which person from a known set of speakers is speaking in an audio recording.

Speaker Verification

Speaker verification confirms whether a speaker is who they claim to be by comparing their voice against a stored voiceprint.

Subtitle Generation

Subtitle generation automatically creates timed text overlays for video content using speech recognition and timing algorithms.

Streaming ASR

Streaming ASR processes audio in real time, producing transcription results incrementally as speech is received rather than waiting for the complete utterance.

Speaking Rate

Speaking rate controls how fast or slow synthesized speech is delivered, measured in words per minute or as a relative speed factor.

StyleTTS

StyleTTS is a speech synthesis approach that uses style diffusion to generate human-level natural speech by modeling style as a latent random variable.

Speech Analytics

Speech analytics analyzes spoken interactions to extract patterns, trends, and insights from voice data across an organization.

Self-service

Self-service enables customers to find answers and resolve issues independently through AI chatbots, knowledge bases, FAQs, and automated tools without contacting human agents.

SLA Management

SLA management tracks and enforces Service Level Agreements that define response time commitments, resolution targets, and uptime guarantees for AI and support services.

Seat-based Pricing

Seat-based pricing charges AI product fees per user or seat, making costs scale linearly with the number of team members who access the platform.

Service Desk AI

Service desk AI enhances IT service management with intelligent automation for incident management, service requests, change management, and IT operations.

Self-service AI

Self-service AI enables customers to resolve issues and find answers independently through AI-powered knowledge bases, chatbots, and automated workflows without human assistance.

Social Media AI

Social media AI uses artificial intelligence to automate content creation, scheduling, engagement, sentiment analysis, and performance optimization across social platforms.

SEO AI

SEO AI uses artificial intelligence to optimize search engine visibility through keyword research, content optimization, technical SEO analysis, and ranking prediction.

Sentiment Analysis for Business

Sentiment analysis for business uses AI to automatically detect and classify customer opinions, emotions, and attitudes in text and voice data across business channels.

Sales-Led Growth

Sales-led growth is a go-to-market strategy where a dedicated sales team drives customer acquisition through outbound prospecting, demos, and relationship-based selling.

SQL Database

An SQL database is any database that uses Structured Query Language (SQL) as its primary interface for defining, querying, and manipulating data.

SQL

Structured Query Language (SQL) is the standard language for managing and querying relational databases, used to create, read, update, and delete data.

SELECT

SELECT is the SQL statement used to query and retrieve data from database tables, forming the foundation of all SQL data retrieval operations.

Subquery

A subquery is a query nested inside another SQL query, used to compute intermediate results that the outer query can reference for filtering, comparison, or data generation.
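
A common pattern is filtering the outer query against an aggregate computed by the subquery; a self-contained sketch using Python's built-in sqlite3 (table and data invented for the example):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, salary REAL)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?)",
    [("ada", 120.0), ("bob", 80.0), ("eve", 100.0)],
)
# The subquery computes the average salary once;
# the outer query keeps only rows above it.
rows = conn.execute(
    "SELECT name FROM employees "
    "WHERE salary > (SELECT AVG(salary) FROM employees) "
    "ORDER BY name"
).fetchall()
```

The average here is 100, so only the one employee earning above it is returned.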

Stored Procedure

A stored procedure is a precompiled set of SQL statements stored in the database that can be executed as a single unit, encapsulating business logic at the database level.

Stream Processing

Stream processing is a data processing paradigm that handles continuous data flows in real time, processing records individually or in micro-batches as they arrive.

Snowflake

Snowflake is a cloud-native data platform providing a fully managed data warehouse with elastic scaling, native data sharing, and support for structured and semi-structured data.

Serverless Database

A serverless database automatically scales compute and storage on demand, charging only for actual usage and eliminating the need to manage server capacity.

Spatial Database

A spatial database is optimized for storing and querying geographic and geometric data, supporting location-based queries and spatial analysis.

SQLite

SQLite is a lightweight, serverless, embedded relational database that stores an entire database in a single file, making it the most widely deployed database engine in the world.

Supabase

Supabase is an open-source Firebase alternative built on PostgreSQL, providing a database, authentication, real-time subscriptions, storage, and edge functions in one platform.

ScyllaDB

ScyllaDB is a high-performance NoSQL database compatible with Apache Cassandra, rewritten in C++ for dramatically better throughput and lower latency.

SELECT Statement

The SELECT statement is the fundamental SQL command for retrieving data from one or more database tables, supporting filtering, sorting, grouping, and joining.

Sharding

Sharding is a database scaling technique that distributes data across multiple independent database instances, each holding a subset of the total data.
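
A minimal sketch of hash-based shard routing, with hypothetical shard names; production systems typically use consistent hashing instead, so that adding a shard does not re-route most existing keys:

```python
import hashlib

SHARDS = ["shard_0", "shard_1", "shard_2", "shard_3"]  # hypothetical instances

def shard_for(key: str) -> str:
    """Route a key to a shard by hashing it to a stable integer."""
    digest = hashlib.md5(key.encode()).hexdigest()
    return SHARDS[int(digest, 16) % len(SHARDS)]

# Every lookup for the same key deterministically lands on the same shard.
assert shard_for("user:42") == shard_for("user:42")
```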

Snowflake (Database)

Snowflake is a cloud-native data warehouse that separates compute from storage, enabling independent scaling, multi-cluster concurrency, and near-zero maintenance.

Schema Migration

A schema migration is a controlled, versioned change to a database schema that tracks and applies structural modifications across environments.

Semantic Search

Semantic search uses vector embeddings to find content based on meaning rather than exact keyword matches, enabling more natural and relevant information retrieval.
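
The core comparison step can be sketched with cosine similarity over toy, hand-made vectors; real systems use learned embeddings with hundreds of dimensions:

```python
import math

def cosine(a, b):
    """Cosine similarity: dot product divided by the vector magnitudes."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy 3-d "embeddings", hand-made for illustration only.
docs = {
    "resetting your password": [0.9, 0.1, 0.0],
    "quarterly revenue report": [0.0, 0.2, 0.9],
}
query = [0.8, 0.2, 0.1]  # e.g. an embedding of "forgot my login"

best = max(docs, key=lambda d: cosine(query, docs[d]))
print(best)  # "resetting your password"
```

Note that the best match shares no keywords with the query; the vectors carry the meaning.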

SQL Injection

SQL injection is a security vulnerability where an attacker inserts malicious SQL code into application queries through unsanitized user input.
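
A minimal demonstration with Python's sqlite3 module and a made-up users table, contrasting unsafe string concatenation with a parameterized query:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT)")
conn.execute("INSERT INTO users VALUES ('alice')")

malicious = "x' OR '1'='1"

# Vulnerable: user input concatenated into the SQL string.
# Returns every row, because the injected OR clause always matches.
unsafe = conn.execute(
    f"SELECT name FROM users WHERE name = '{malicious}'"
).fetchall()

# Safe: a parameterized query treats the input as a literal value.
safe = conn.execute(
    "SELECT name FROM users WHERE name = ?", (malicious,)
).fetchall()

print(unsafe, safe)  # [('alice',)] []
```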

Sharding Strategies

Sharding strategies define how data is distributed across multiple database instances, including range-based, hash-based, directory-based, and geographic approaches.

Scalar

A scalar is a single numerical value, representing the simplest mathematical quantity, in contrast to vectors (arrays of numbers) and matrices (2D arrays of numbers).

Singular Value Decomposition

Singular Value Decomposition (SVD) factorizes any matrix into three component matrices, revealing its fundamental structure and enabling dimensionality reduction, compression, and denoising.
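
A short sketch with NumPy's np.linalg.svd (assuming NumPy is available), showing that the three factors reconstruct the original matrix:

```python
import numpy as np

A = np.array([[3.0, 1.0], [1.0, 3.0], [0.0, 2.0]])

# Factor A into U (left singular vectors), s (singular values, sorted
# in descending order), and Vt (right singular vectors, transposed).
U, s, Vt = np.linalg.svd(A, full_matrices=False)

# Multiplying the factors back together recovers the original matrix.
A_rebuilt = U @ np.diag(s) @ Vt
print(np.allclose(A, A_rebuilt))  # True
```

Truncating s to its largest values yields the best low-rank approximation of A, which is the basis for compression and denoising.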

SVD

SVD is the abbreviation for Singular Value Decomposition, a matrix factorization method that decomposes any matrix into orthogonal components ordered by importance.

Standard Deviation

Standard deviation is the square root of variance, measuring data spread in the same units as the original data and providing an intuitive sense of typical deviation from the mean.
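
A worked example with Python's statistics module:

```python
import statistics

data = [2, 4, 4, 4, 5, 5, 7, 9]  # mean is 5

# Population variance is the mean squared deviation from the mean;
# standard deviation is its square root, in the data's own units.
variance = statistics.pvariance(data)  # 4.0
stdev = statistics.pstdev(data)        # 2.0
print(variance, stdev)
```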

Saddle Point

A saddle point is a critical point where the gradient is zero but the point is neither a local minimum nor maximum, being a minimum in some directions and a maximum in others.

Sparse Matrix

A sparse matrix is a matrix where most elements are zero, allowing specialized storage formats and algorithms that dramatically reduce memory and computation.

Spectral Norm

The spectral norm of a matrix is its largest singular value, measuring the maximum amount by which the matrix can stretch a vector.

Student's t-Distribution

Student's t-distribution arises when estimating the mean of a normally distributed population with unknown variance, having heavier tails than the normal distribution.

Shannon Entropy

Shannon entropy is the foundational information-theoretic measure of average uncertainty in a random variable, named after Claude Shannon.
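
A minimal implementation, illustrated on a fair and a biased coin:

```python
import math

def entropy(probs):
    """Average uncertainty in bits: H = -sum(p * log2(p))."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

print(entropy([0.5, 0.5]))  # 1.0 bit: a fair coin is maximally uncertain
print(entropy([0.9, 0.1]))  # ~0.469 bits: a biased coin is more predictable
```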

Self-Information

Self-information (or surprisal) measures the information content of a single event, defined as the negative logarithm of its probability.

Stochastic Process

A stochastic process is a collection of random variables indexed by time or space, modeling systems that evolve with inherent randomness.

Softmax Function

The softmax function converts a vector of real numbers into a probability distribution, used as the output layer in neural network classifiers.
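
A minimal implementation; subtracting the maximum logit before exponentiating is the standard trick to avoid numerical overflow:

```python
import math

def softmax(logits):
    """Exponentiate and normalize so the outputs sum to 1."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

probs = softmax([2.0, 1.0, 0.1])
print(probs)       # roughly [0.659, 0.242, 0.099]
print(sum(probs))  # 1.0
```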

Sigmoid Function

The sigmoid function maps any real number to the range (0, 1), historically used as a neural network activation and for binary classification output.
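
A one-line implementation, showing the squashing behavior at the extremes:

```python
import math

def sigmoid(x):
    """Map any real number into the open interval (0, 1)."""
    return 1 / (1 + math.exp(-x))

print(sigmoid(0))   # 0.5
print(sigmoid(4))   # ~0.982, approaching 1 for large positive inputs
print(sigmoid(-4))  # ~0.018, approaching 0 for large negative inputs
```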

Sufficient Statistic

A sufficient statistic captures all the information in a dataset relevant to estimating a parameter, enabling efficient data compression without information loss.

Sampling Methods

Sampling methods generate random draws from probability distributions, enabling Monte Carlo estimation and generative modeling in machine learning.

Stability AI

Stability AI is the company behind Stable Diffusion, one of the most influential open-source AI image generation models, and a major advocate for open AI development.

Scale AI

Scale AI provides data labeling, data curation, and AI evaluation services that help companies prepare high-quality training data for their machine learning models.

SambaNova Systems

SambaNova Systems is an AI hardware and software company that builds purpose-built AI processors and full-stack platforms for enterprise AI deployment.

Suno

Suno is an AI music generation platform that creates original songs with vocals, instruments, and lyrics from simple text prompts.

Sourcegraph Cody

Cody is Sourcegraph's AI coding assistant that uses codebase-aware context to provide accurate code generation, explanations, and assistance across entire repositories.

Semantic Kernel

Semantic Kernel is Microsoft's open-source SDK for integrating AI models into applications, providing orchestration for plugins, planners, and memory.

Snowflake Cortex

Snowflake Cortex is Snowflake's AI and ML layer that provides serverless AI functions for running language models and ML tasks directly on data in Snowflake.

Snorkel AI

Snorkel AI is a data-centric AI platform that uses programmatic labeling to create training datasets efficiently for machine learning models.

Synthesia

Synthesia is an AI video generation platform that creates professional videos with AI avatars, widely used for corporate training and marketing content.

Speechmatics

Speechmatics provides enterprise-grade speech recognition technology supporting 50+ languages with on-premises and cloud deployment options.

scikit-learn

scikit-learn is the most widely used Python library for traditional machine learning, providing simple and efficient tools for classification, regression, clustering, and preprocessing.

statsmodels

statsmodels is a Python library for statistical modeling, hypothesis testing, and econometrics, providing detailed statistical tests and model diagnostics beyond scikit-learn.

SciPy

SciPy is a Python library for scientific computing that provides algorithms for optimization, integration, interpolation, signal processing, and statistical functions.

spaCy

spaCy is an industrial-strength Python NLP library for advanced text processing, providing fast and accurate tokenization, NER, POS tagging, and text classification.

sentence-transformers

sentence-transformers is a Python library for computing dense vector representations of sentences and paragraphs, widely used for semantic search, clustering, and similarity tasks.

seaborn

seaborn is a Python statistical visualization library built on matplotlib that provides attractive, informative statistical graphics with a high-level, concise API.

Streamlit

Streamlit is a Python framework for building data applications and ML demos quickly, turning Python scripts into interactive web apps with minimal frontend code.

Semantic Kernel

Semantic Kernel is Microsoft's open-source SDK for integrating AI models into applications, providing tools for building AI agents with planners, plugins, and memory.

Spark MLlib

Spark MLlib is the machine learning library built into Apache Spark, providing scalable implementations of common ML algorithms for big data processing.

Stanza

Stanza is a Python NLP library from Stanford NLP Group that provides accurate multilingual text analysis with neural network models for 70+ languages.

Seldon Core

Seldon Core is an open-source platform for deploying ML models on Kubernetes, providing serving, monitoring, and advanced inference capabilities.

safetensors

safetensors is a file format by Hugging Face for securely storing and loading model tensors, providing fast loading and protection against code execution vulnerabilities.

Stable Diffusion WebUI

Stable Diffusion WebUI (by AUTOMATIC1111) is a browser-based interface for Stable Diffusion with extensive features for image generation, inpainting, and model management.

smolagents

smolagents is a lightweight Hugging Face library for building AI agents that can use tools, write code, and orchestrate multi-step reasoning with minimal complexity.

Symptom Checker

An AI symptom checker analyzes user-reported symptoms to suggest possible conditions and recommend appropriate levels of medical care.

Smart Factory

A smart factory uses AI, IoT, and automation to create self-optimizing production environments with real-time monitoring and adaptive manufacturing.

Smart Grid

A smart grid uses AI and digital communication technology to intelligently manage electricity generation, distribution, and consumption in real time.

Student Modeling

Student modeling uses AI to build computational representations of student knowledge, skills, goals, and learning characteristics.

Size Recommendation

AI size recommendation predicts the best-fitting size for online shoppers using body measurements and purchase data.

Supply Chain AI

Supply chain AI uses machine learning to optimize planning, procurement, logistics, and risk management across supply networks.

Sports AI

Sports AI uses machine learning for performance analysis, injury prevention, game strategy, and fan engagement.

Smart City AI

Smart city AI integrates machine learning across urban systems to improve services, sustainability, and quality of life.

Supply Chain Visibility

AI supply chain visibility provides real-time tracking and predictive insights across the end-to-end supply chain.

Surgical AI

Surgical AI uses computer vision and robotics to assist surgeons with planning, navigation, and intraoperative guidance.

Supply Chain Risk AI

Supply chain risk AI uses machine learning to predict, assess, and mitigate risks across global supply networks.

Shipping AI

Shipping AI uses machine learning to optimize maritime logistics, fleet management, and port operations.

Self-Driving Technology

Self-driving technology encompasses the AI systems, sensors, and software that enable vehicles to navigate without human control.

Sensor Fusion for Automotive

Automotive sensor fusion combines data from cameras, radar, lidar, and other sensors to create a comprehensive understanding of the driving environment.

Smart Parking

Smart parking uses AI and sensors to help drivers find available parking spaces, reducing search time, congestion, and emissions in urban areas.

Sanctions Screening

Sanctions screening uses AI to check individuals, entities, and transactions against government sanctions lists to prevent prohibited business relationships.

Stress Testing in Finance

Financial stress testing uses AI to simulate extreme economic scenarios and evaluate whether institutions can withstand severe market shocks and economic downturns.

Smart Agriculture

Smart agriculture uses AI, IoT sensors, and data analytics to optimize farming operations, increase crop yields, and reduce resource waste.

Soil Analysis AI

Soil analysis AI uses machine learning to assess soil health, predict nutrient levels, and optimize soil management from sensor data and spectral analysis.

Serverless Computing

Serverless computing automatically manages infrastructure for AI workloads, scaling resources on demand and charging only for actual compute time used.

Supercomputer

A supercomputer is an extremely powerful computing system used for large-scale AI training, scientific simulation, and solving the world's hardest computational problems.

SambaNova SN40L

The SambaNova SN40L is a reconfigurable dataflow AI chip that uses a unique architecture to accelerate both training and inference, particularly for enterprise AI workloads.

Systolic Array

A systolic array is a grid of processing elements that rhythmically pass data between neighbors, efficiently computing matrix multiplications central to AI workloads.

Sparsity in Hardware

Hardware sparsity support enables processors to skip zero-valued computations in neural networks, delivering up to double the throughput for models with structured sparsity.

Search Engine

A search engine is a system that indexes and retrieves information from large document collections, ranking results by relevance to user queries.

Search Index

A search index is a data structure that enables fast lookup and retrieval of documents, mapping terms or vectors to the documents that contain them.

Synonym Filter

A synonym filter expands search to match related terms by defining equivalence between words, improving recall without relying on semantic search.

Semantic Search

Semantic search uses AI to understand the meaning behind queries and documents, finding relevant results based on conceptual similarity rather than keyword matching.

Search Result

A search result is an individual item returned by a search system in response to a query, typically containing a title, snippet, URL, and relevance metadata.

SERP

SERP (Search Engine Results Page) is the page displayed by a search engine in response to a query, containing organic results, ads, featured snippets, and other elements.

Spell Correction

Search spell correction automatically detects and fixes misspelled query terms to ensure users find relevant results despite typing errors.

Search Quality

Search quality encompasses the overall effectiveness of a search system, measured through relevance metrics, user satisfaction, and operational performance indicators.

Search Analyzer

A search analyzer is a text processing pipeline that transforms raw text into normalized tokens for indexing and querying, combining character filters, tokenizers, and token filters.

Search Stemmer

A search stemmer reduces words to their root or base form during text analysis, enabling matching between inflected forms like "running" and "runs" (irregular forms like "ran" typically require lemmatization instead).

Sentence Similarity

Sentence similarity measures how semantically close two sentences are, using vector representations to quantify meaning overlap for search, deduplication, and matching.

Semantic Matching

Semantic matching determines whether two text inputs convey the same meaning or intent, going beyond keyword overlap to understand conceptual equivalence.

Sequential Recommendation

Sequential recommendation predicts the next item a user will interact with based on their ordered sequence of past interactions, capturing temporal dynamics.

Session-Based Recommendation

Session-based recommendation predicts user intent within a single browsing session without relying on long-term user profiles or historical data.

Search Result Snippet

A search result snippet is the brief text excerpt shown beneath a search result title, highlighting relevant content to help users judge relevance before clicking.

Search Relevance Feedback

Relevance feedback uses user judgments on initial search results to refine the query and improve subsequent results, closing the loop between user intent and retrieval.

Stop Words

Stop words are common, high-frequency words like "the," "and," and "is" that search engines may filter out during indexing and querying to improve efficiency and relevance.
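
A toy illustration with a hand-picked stop-word list; real engines ship curated, language-specific lists, and many modern engines index stop words anyway because phrases like "to be or not to be" depend on them:

```python
# A tiny illustrative stop-word list (not a production list).
STOP_WORDS = {"the", "and", "is", "a", "of"}

def remove_stop_words(text):
    return [t for t in text.lower().split() if t not in STOP_WORDS]

print(remove_stop_words("The price of the gadget is low"))
# ['price', 'gadget', 'low']
```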

Search Latency

Search latency is the time taken from submitting a search query to receiving results, a critical performance metric directly impacting user experience.

Search Recall

Search recall measures the proportion of relevant documents that a search system successfully retrieves, indicating how well it avoids missing relevant results.

Search Precision

Search precision measures the proportion of retrieved results that are actually relevant, indicating how well a search system avoids returning irrelevant results.
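
Both precision and recall can be computed in a few lines on a toy result set:

```python
relevant = {"doc1", "doc2", "doc3", "doc4"}  # ground-truth relevant docs
retrieved = ["doc1", "doc2", "doc9"]         # what the engine returned

hits = [d for d in retrieved if d in relevant]
precision = len(hits) / len(retrieved)  # 2/3: how much of what we returned is relevant
recall = len(hits) / len(relevant)      # 2/4: how much of the relevant set we found
print(precision, recall)
```

The two metrics trade off against each other: returning more results tends to raise recall and lower precision.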

Search Aggregation

Search aggregation computes summary statistics, groupings, or analytics over search results, enabling features like facet counts, histograms, and data exploration.

Search Scoring Function

A search scoring function calculates the numerical relevance score for each document-query pair, combining multiple signals to determine search result ordering.

Search Index Lifecycle

Search index lifecycle management automates the creation, optimization, rollover, and deletion of search indexes based on time, size, or document count policies.

Search Federation

Search federation combines results from multiple independent search indexes or systems into a unified result set, enabling search across diverse data sources.

Search Personalization

Search personalization tailors search results to individual users based on their preferences, history, location, and behavior patterns.

Search Suggestion Model

A search suggestion model predicts and generates relevant query suggestions based on user input, search history, and content availability to guide effective searching.

Synthetic Content

Synthetic content is artificially generated media and data created by AI, used for training data augmentation, privacy preservation, and content creation.

Synthetic Data

Synthetic data is artificially generated data that mimics real-world data patterns, used for AI training when real data is scarce, sensitive, or expensive.

SEO Content Generation

AI SEO content generation creates search-engine-optimized text using keyword analysis, topic modeling, and language models for higher search rankings.

Sound Design

AI sound design uses generative models to create, manipulate, and synthesize sound effects, ambient soundscapes, and audio elements.

Synthetic Media

Synthetic media is any media content that is partially or fully generated or modified by AI, including deepfakes, voice clones, and generated images.

Story Writing AI

Story writing AI uses large language models to generate narrative fiction, including plots, characters, dialogue, and entire stories across genres.

Screenplay Writing AI

Screenplay writing AI generates scripts for film, television, and theater, producing formatted dialogue, scene descriptions, and story structure.

Social Media Post Generation

Social media post generation uses AI to create platform-specific content for social networks, including captions, hashtags, and engagement-optimized copy.

Song Generation

Song generation uses AI to create complete songs including melody, harmony, rhythm, lyrics, and vocal performance in various musical genres.

Sound Design AI

Sound design AI creates custom sound effects, audio textures, and sonic elements for film, games, music, and multimedia using generative models.

Sound Effect Generation

Sound effect generation uses AI to create specific audio effects for games, film, applications, and multimedia from text descriptions or parameters.

Singing Voice Synthesis

Singing voice synthesis uses AI to generate realistic singing voices that can perform melodies with lyrics, pitch accuracy, and natural vocal expression.

Stem Separation

Stem separation uses AI to isolate individual instruments and vocals from mixed audio recordings, enabling remixing, sampling, and audio manipulation.

Slow-Motion Generation

Slow-motion generation uses AI to create smooth slow-motion video from standard frame rate footage by synthesizing intermediate frames.

Scene Generation

Scene generation uses AI to create complete 3D scenes with multiple objects, lighting, and spatial arrangement from descriptions or reference images.

Strong AI

Strong AI is the theoretical concept of AI that truly understands and has consciousness, not just simulating intelligence through pattern matching.

Scaling Hypothesis

The scaling hypothesis proposes that increasing model size, data, and compute will lead to continuous improvements in AI capabilities and potentially AGI.

Symbol Grounding Problem

The symbol grounding problem asks how abstract symbols in an AI system can acquire meaning connected to the real world.

Situated AI

Situated AI studies intelligent systems that are embedded in and interact with their environment in real-time.

Statistical Significance

Statistical significance in AI research measures whether observed performance differences between models are likely real rather than due to random chance.

State Space Model (Research Perspective)

State space model research explores efficient sequence modeling alternatives to transformers based on continuous-time state space mathematics.

Self-Play

Self-play is a training technique where an AI agent improves by playing against copies of itself, generating its own curriculum.

Self-Supervised Learning (Research Perspective)

Self-supervised learning research studies methods that learn representations from unlabeled data by creating supervisory signals from the data itself.

Server-Sent Events

Server-Sent Events (SSE) is a standard for pushing real-time updates from server to client over a single HTTP connection.
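
The wire format is plain text: named fields such as "event:" and "data:", with each message terminated by a blank line. A small formatter sketch:

```python
def sse_event(data, event=None):
    """Format one Server-Sent Events message."""
    lines = []
    if event:
        lines.append(f"event: {event}")
    lines.append(f"data: {data}")
    return "\n".join(lines) + "\n\n"  # blank line ends the message

print(repr(sse_event("hello")))                 # 'data: hello\n\n'
print(repr(sse_event("42", event="progress")))  # 'event: progress\ndata: 42\n\n'
```

Browsers consume this stream natively via the EventSource API, which is why SSE is a popular transport for streaming LLM responses.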

SSE

SSE (Server-Sent Events) is a lightweight HTTP-based protocol for streaming real-time updates from server to client.

Status Code

HTTP status codes are three-digit numbers returned by servers to indicate the result of a client request.

Swagger

Swagger is a suite of API development tools that generates interactive documentation, client SDKs, and server stubs from OpenAPI specifications.

Streaming

Streaming is the technique of sending data incrementally as it becomes available rather than waiting for the complete response to be ready.

SOAP

SOAP (Simple Object Access Protocol) is an XML-based messaging protocol for exchanging structured information between web services.

Svelte

Svelte is a JavaScript framework that compiles components to efficient vanilla JavaScript at build time, eliminating the need for a virtual DOM.

Single-Page Application

A single-page application (SPA) is a web app that loads once and dynamically updates content without full page reloads, providing a fluid user experience.

Server-Side Rendering

Server-side rendering (SSR) is the technique of generating HTML on the server for each request, improving initial load performance and SEO.

Static Site Generation

Static site generation (SSG) pre-renders web pages as HTML files at build time, serving them directly from a CDN for maximum performance.

Service Mesh

A service mesh is an infrastructure layer that manages service-to-service communication in microservices, handling load balancing, encryption, and observability.

Saga Pattern

The saga pattern manages distributed transactions across multiple services by coordinating a sequence of local transactions with compensating actions for rollback.

Scatter Plot

A scatter plot displays individual data points on two axes to reveal relationships, correlations, and clusters between two variables.

Significance Level

The significance level (alpha) is the threshold probability below which a result is considered statistically significant, typically set at 0.05.

Sample Size Calculation

Sample size calculation determines how many observations are needed for a statistical test to reliably detect a meaningful effect.

Streaming Analytics

Streaming analytics continuously processes data in motion, applying computations to events as they flow through the system.

Self-Service Analytics

Self-service analytics empowers non-technical users to explore data and create reports without relying on data teams.

Social Media Analytics

Social media analytics measures and analyzes data from social platforms to understand audience behavior, sentiment, and content performance.

Supply Chain Analytics

Supply chain analytics applies data analysis to logistics, inventory, and procurement data to optimize the end-to-end supply chain.

Sankey Diagram

A Sankey diagram visualizes the flow and quantity of resources, data, or values between nodes using proportionally-sized arrows.

Seasonal Decomposition

Seasonal decomposition separates a time series into trend, seasonal, and residual components for individual analysis.

Survival Analysis

Survival analysis studies the time until an event of interest occurs, handling censored data where the event has not yet been observed.

Sentiment Analysis

Sentiment analysis uses NLP to determine the emotional tone of text, classifying it as positive, negative, or neutral.

Statistical Significance

Statistical significance indicates that an observed result is unlikely to have occurred by chance alone, based on a pre-defined probability threshold.

Seasonality

Seasonality refers to predictable, recurring patterns in data that repeat at regular time intervals like daily, weekly, or yearly cycles.

Symbolic AI

Symbolic AI represents knowledge using human-readable symbols and rules, reasoning through logical manipulation of these symbols.

Stable Diffusion Release

Stable Diffusion, released in August 2022, democratized AI image generation by providing a powerful open-source text-to-image model.

Sam Altman

Sam Altman is the CEO of OpenAI who oversaw the development and launch of ChatGPT, GPT-4, and the commercialization of large language models.

SHRDLU

SHRDLU was a natural language understanding program created by Terry Winograd in 1970 that could converse about and manipulate objects in a simulated block world.

Second AI Winter

The second AI winter (1987-1993) followed the collapse of the expert systems market and the failure of fifth-generation computing initiatives.

Sora Announcement

Sora, announced by OpenAI in February 2024, is an AI model that generates realistic videos from text descriptions, demonstrating advanced world modeling.

Scaling Laws Paper

The 2020 scaling laws paper by Kaplan et al. at OpenAI showed that AI model performance improves predictably with increases in model size, data, and compute.

T

Transfer Learning

Transfer learning reuses knowledge learned from one task or domain to improve performance on a different but related task, reducing the need for large task-specific datasets.

Time Series Forecasting

Time series forecasting predicts future values based on historical temporal data patterns, used for demand planning, financial analysis, and resource allocation.

t-SNE

t-SNE is a non-linear dimensionality reduction technique that visualizes high-dimensional data in 2D or 3D by preserving local neighborhood relationships.

Training Set

The training set is the portion of data used to train a machine learning model, from which the model learns patterns and relationships.

Test Set

The test set is data held out completely during training and validation, used only once for final unbiased evaluation of model performance.

Tanh

Tanh (hyperbolic tangent) is an activation function that maps inputs to values between -1 and 1, providing zero-centered outputs for neural networks.

Teacher Forcing

Teacher forcing is a training technique for sequence models where the ground truth output from the previous step is fed as input to the next step, instead of the model prediction.

Transformer

The transformer is a neural network architecture based on self-attention that processes all positions in a sequence simultaneously, powering modern language models and AI systems.

Token

A token is a unit of text that AI models process, typically representing about 4 characters or three-quarters of a word in English.

Temperature

Temperature is a setting that controls how random or creative AI responses are, with lower values being more focused and higher values being more varied.
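
Mechanically, temperature divides the logits before the softmax. A sketch showing that a low temperature sharpens the distribution while a high temperature flattens it:

```python
import math

def softmax_with_temperature(logits, temperature):
    """Scale logits by 1/temperature, then apply a numerically stable softmax."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.5]
cold = softmax_with_temperature(logits, 0.2)  # sharp: top token dominates
hot = softmax_with_temperature(logits, 2.0)   # flat: sampling is more varied
print(cold[0], hot[0])
```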

Tokenizer

A tokenizer is a tool that converts raw text into tokens, the numerical units that language models actually process and generate.

Tokenization

Tokenization is the process of breaking text into smaller units called tokens that language models can process numerically.

Tiktoken

Tiktoken is a fast tokenization library by OpenAI used to count and encode tokens for GPT models, essential for managing context windows and costs.

Token Limit

A token limit is the maximum number of tokens a model can process in a single request, encompassing both input tokens and generated output tokens.

Top-p

Top-p (nucleus sampling) is a decoding parameter that limits token selection to the smallest set of tokens whose cumulative probability exceeds a threshold p.
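
One common implementation keeps the highest-probability tokens until their cumulative probability reaches p, then samples only from that set. A sketch on a toy distribution:

```python
def top_p_filter(probs, p):
    """Keep the smallest set of tokens whose cumulative probability reaches p.

    `probs` maps token -> probability; returns the surviving tokens.
    """
    kept, cumulative = [], 0.0
    for token, prob in sorted(probs.items(), key=lambda kv: -kv[1]):
        kept.append(token)
        cumulative += prob
        if cumulative >= p:
            break
    return kept

probs = {"the": 0.5, "a": 0.3, "cat": 0.15, "xylophone": 0.05}
print(top_p_filter(probs, 0.7))  # ['the', 'a']
print(top_p_filter(probs, 0.9))  # ['the', 'a', 'cat']
```

Unlike top-k, the number of candidates adapts to the distribution: a confident model may keep one token, an uncertain one may keep dozens.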

Top-k

Top-k is a decoding parameter that restricts token selection to the k most probable next tokens, reducing randomness in text generation.

Tree-of-Thought

Tree-of-thought prompting extends chain-of-thought by exploring multiple reasoning paths simultaneously and selecting the best one.

Tokenizer Training

Tokenizer training is the process of learning a tokenizer's vocabulary and rules from a text corpus before the language model itself is trained.

Typical Sampling

Typical sampling is a decoding method that selects tokens whose information content is close to the expected information content, filtering out both too-obvious and too-surprising tokens.

Top-k Routing

Top-k routing is the mechanism in Mixture of Experts models that selects the top-k most relevant experts for each input token based on a learned routing function.

Time to First Token

Time to first token is the latency between sending a request and receiving the first token of the response, a key metric for user-perceived responsiveness.

Tokens Per Second

Tokens per second is a measure of inference speed indicating how many tokens a model can generate each second, varying by hardware, model size, and optimization.

Transformer

The transformer is the foundational neural network architecture behind modern LLMs, using self-attention mechanisms to process sequences in parallel.

Tokenomics

Tokenomics refers to the cost structure and pricing model for LLM API usage, typically based on input and output token counts with different per-token rates.

Tensor Parallelism

Tensor parallelism is a model parallelism strategy that splits individual weight matrices across multiple GPUs, enabling layers too large for a single GPU to be computed.

Tensor Core

Tensor Cores are specialized hardware units in NVIDIA GPUs that accelerate the matrix multiplication operations central to neural network computation.

Training Data

Training data is the corpus of text used to train a language model, typically comprising trillions of tokens from books, websites, code, and other text sources.

Text Generation

Text generation is the process by which a language model produces natural language text, emitting one token at a time based on the input context and sampling parameters.

Tool Use

Tool use is the ability of an LLM to invoke external tools, APIs, or functions to access information and take actions beyond its training data.

Transfer Learning

Transfer learning is the practice of using knowledge learned by a model on one task or domain to improve performance on a different but related task or domain.

TruthfulQA

TruthfulQA is a benchmark that measures whether language models generate truthful answers rather than reproducing common misconceptions.

TriviaQA

TriviaQA is a QA benchmark of trivia questions with evidence documents, testing both factual knowledge and reading comprehension.

The Pile

The Pile is an 825 GB curated dataset of diverse English text from 22 sources, designed specifically for training large language models.

Toxicity Filtering

Toxicity filtering removes harmful, offensive, and unsafe content from training data to reduce the generation of toxic language model outputs.

Throughput Optimization

Throughput optimization maximizes the number of tokens or requests a language model deployment can process per second.

Text Classification

Text classification is the NLP task of assigning predefined categories or labels to text documents based on their content.

Toxicity Detection

Toxicity detection is the NLP task of identifying rude, disrespectful, or harmful language in text that could derail or poison conversations.

Text Normalization

Text normalization is the process of converting text into a consistent, standard form by handling case, punctuation, whitespace, and other variations.
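A minimal normalization pass might handle only case and whitespace; real pipelines add steps like punctuation handling and Unicode normalization as needed:

```python
import re

def normalize(text: str) -> str:
    """Lowercase text and collapse runs of whitespace into single spaces."""
    text = text.lower()
    text = re.sub(r"\s+", " ", text).strip()
    return text

normalize("  Hello,\n  WORLD!  ")  # → "hello, world!"
```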

TF-IDF

TF-IDF (Term Frequency-Inverse Document Frequency) is a text weighting scheme that measures how important a word is to a document relative to a collection.
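The weighting can be computed directly from token counts; this sketch uses the plain `log(N / df)` variant of inverse document frequency (many libraries apply smoothing):

```python
import math
from collections import Counter

def tf_idf(docs):
    """Compute TF-IDF weights for a list of tokenized documents."""
    n = len(docs)
    # Document frequency: how many documents contain each term.
    df = Counter()
    for doc in docs:
        df.update(set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n / df[term])
            for term, count in tf.items()
        })
    return weights

docs = [["the", "cat", "sat"], ["the", "dog", "ran"], ["the", "cat", "ran"]]
w = tf_idf(docs)
# "the" appears in every document, so its IDF (and thus its weight) is zero,
# while rarer terms like "cat" get positive weights.
```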

Trigram

A trigram is a sequence of three consecutive words or tokens from text, capturing three-word patterns and local context.
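Extracting trigrams is a sliding window over the token list (the same function yields bigrams or any n-gram by changing `n`):

```python
def ngrams(tokens, n=3):
    """Return all consecutive n-token sequences (trigrams when n=3)."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

tokens = "the quick brown fox jumps".split()
trigrams = ngrams(tokens)
# → [('the', 'quick', 'brown'), ('quick', 'brown', 'fox'), ('brown', 'fox', 'jumps')]
```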

Text Generation

Text generation is the NLP task of producing coherent, contextually appropriate natural language text using AI models.

Text Completion

Text completion is the task of predicting and generating the continuation of a given text prefix or partial input.

Text Simplification

Text simplification is the NLP task of rewriting complex text into simpler language while preserving the core meaning.

Text Style Transfer

Text style transfer is the NLP task of changing the style of text (such as formality, sentiment, or tone) while preserving its content.

Text Infilling

Text infilling is the NLP task of generating missing text that fits naturally within surrounding context on both sides.

Text Summarization

Text summarization is the NLP task of condensing a document into a shorter version that captures the most important information.

Table QA

Table QA answers natural language questions by querying and reasoning over structured tabular data.

Task-Oriented Dialogue

Task-oriented dialogue systems help users accomplish specific goals like booking appointments, placing orders, or finding information.

Text Rewriting

Text rewriting transforms existing text to change its style, tone, complexity, or structure while preserving the original meaning.

Textual Entailment

Textual entailment determines whether the meaning of one text can be logically inferred from another text.

Temporal Expression Extraction

Temporal expression extraction identifies and normalizes references to time, dates, and durations mentioned in text.

Topic Modeling

Topic modeling discovers abstract topics that occur across a collection of documents using unsupervised statistical methods.

Text Segmentation

Text segmentation divides a text into meaningful units such as topics, paragraphs, or sections based on content boundaries.

Text-to-Speech

Text-to-speech converts written text into natural-sounding spoken audio, enabling AI systems to communicate through voice.

Text Mining

Text mining applies NLP and data mining techniques to extract valuable patterns, trends, and insights from large collections of text.

Transfer Learning in NLP

Transfer learning applies knowledge learned from one NLP task or domain to improve performance on a different but related task.

Text Annotation

Text annotation is the process of labeling text data with structured information that NLP models use for training and evaluation.

Tokenizer

A tokenizer splits text into individual tokens that a language model can process, forming the bridge between raw text and model input.

Transformer Architecture

The transformer is the neural network architecture based on self-attention that powers virtually all modern large language models.

Text Embedding

A text embedding is a dense numerical vector representation that captures the semantic meaning of a piece of text.

Text Coherence

Text coherence measures how logically connected and meaningful a text is, with sentences flowing naturally from one to the next.

Text Clustering

Text clustering groups similar documents or text segments together without predefined categories, discovering natural groupings in text data.

Text Deduplication

Text deduplication identifies and removes duplicate or near-duplicate texts from a dataset to improve data quality and model training.

Top-k Sampling

Top-k sampling restricts text generation to the k most likely next tokens at each step, balancing quality with diversity.
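A minimal sketch of the selection step, operating on a plain probability list rather than model logits:

```python
import random

def top_k_sample(probs, k, rng=random.Random(0)):
    """Sample a token id from the k most likely entries of `probs`."""
    # Keep the k highest-probability token ids.
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    # Renormalize over the kept tokens, then sample.
    total = sum(probs[i] for i in top)
    r = rng.random() * total
    for i in top:
        r -= probs[i]
        if r <= 0:
            return i
    return top[-1]

probs = [0.5, 0.3, 0.1, 0.07, 0.03]
token = top_k_sample(probs, k=2)  # only token ids 0 or 1 can be chosen
```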

Top-p Sampling

Top-p (nucleus) sampling selects from the smallest set of tokens whose cumulative probability exceeds a threshold p, adapting to model confidence.
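The adaptive behavior is visible in the nucleus-selection step itself: a confident distribution keeps few tokens while a flat one keeps many. A sketch of just that step:

```python
def nucleus(probs, p=0.9):
    """Return the smallest set of token ids whose cumulative probability
    exceeds p, ordered from most to least likely."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative > p:
            break
    return kept

confident = nucleus([0.85, 0.10, 0.03, 0.02], p=0.9)  # → [0, 1]
flat = nucleus([0.25, 0.25, 0.25, 0.25], p=0.9)       # → [0, 1, 2, 3]
```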

Temperature Scaling

Temperature scaling adjusts the randomness of text generation by sharpening or flattening the probability distribution over next tokens.
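Mechanically, the logits are divided by the temperature before the softmax, so low temperatures sharpen the distribution and high temperatures flatten it:

```python
import math

def softmax_with_temperature(logits, temperature=1.0):
    """Convert logits to probabilities; low T sharpens, high T flattens."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.1]
sharp = softmax_with_temperature(logits, temperature=0.5)
flat = softmax_with_temperature(logits, temperature=2.0)
# The top token gets more probability mass at low temperature than at high.
```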

Text Cleaning

Text cleaning removes noise, irrelevant content, and formatting artifacts from raw text to prepare it for NLP processing.

Text Anonymization

Text anonymization removes or replaces personally identifiable information in text to protect privacy while preserving analytical value.

Text Span Detection

Text span detection identifies and extracts contiguous spans of text that match specific criteria, such as answer spans or entity mentions.

Text Preprocessing

Text preprocessing transforms raw text into a clean, standardized format suitable for NLP analysis and model consumption.

Text Cohesion

Text cohesion refers to the linguistic devices that connect sentences and create continuity within a text, such as pronouns, connectives, and lexical repetition.

Terminology Extraction

Terminology extraction automatically identifies domain-specific terms and technical vocabulary from specialized text corpora.

Text Difficulty

Text difficulty assessment measures how hard a text is to read and understand, using linguistic features like vocabulary, syntax, and discourse complexity.

Text Generation Evaluation

Text generation evaluation assesses the quality of machine-generated text across dimensions like fluency, coherence, factuality, and relevance.

Textual Similarity

Textual similarity measures how close two pieces of text are in meaning, using methods ranging from word overlap to deep semantic embeddings.

Translation Evaluation

Translation evaluation assesses the quality of machine or human translations using automatic metrics and human judgment.

Translation Quality

Translation quality measures the overall adequacy and fluency of a translation, encompassing accuracy, naturalness, and fitness for purpose.

Temporal Reasoning

Temporal reasoning in NLP is the ability to understand and reason about time, including ordering events, understanding durations, and interpreting temporal expressions.

Token Classification

Token classification assigns a label to each token in a text, encompassing tasks like NER, POS tagging, and chunking.

text-embedding-ada-002

OpenAI's second-generation text embedding model that converts text into 1536-dimensional vectors, widely used for semantic search and RAG applications.

text-embedding-3-small

OpenAI's compact third-generation embedding model offering strong performance with flexible dimensions and lower cost than its larger sibling.

text-embedding-3-large

OpenAI's most capable third-generation embedding model, producing up to 3072-dimensional vectors with flexible dimension support for maximum accuracy.

Token-based Chunking

A chunking method that splits text based on token count rather than character count, ensuring chunks align with how language models process text.
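A sketch of the splitting logic, with an overlap so context is not lost at chunk boundaries. Whitespace splitting stands in for a real subword tokenizer here:

```python
def chunk_by_tokens(tokens, max_tokens=256, overlap=32):
    """Split a token list into chunks of at most max_tokens,
    repeating `overlap` tokens between consecutive chunks."""
    chunks, start = [], 0
    step = max_tokens - overlap
    while start < len(tokens):
        chunks.append(tokens[start:start + max_tokens])
        start += step
    return chunks

# Stand-in for a real subword tokenizer: whitespace splitting.
tokens = ("word " * 600).split()
chunks = chunk_by_tokens(tokens, max_tokens=256, overlap=32)
# 600 tokens with a step of 224 → chunks start at offsets 0, 224, 448.
```

In practice the token counts would come from the same tokenizer the embedding or language model uses, so chunk sizes match model limits exactly.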

Taxonomy

A hierarchical classification system that organizes concepts into parent-child categories, helping structure knowledge for retrieval and navigation.

Triple

A basic unit of knowledge graph data consisting of a subject, predicate, and object that represents a single fact or relationship between entities.
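A triple is simply a (subject, predicate, object) tuple, and a graph is a collection of them that can be queried by pattern matching:

```python
# Knowledge graph facts as (subject, predicate, object) triples.
triples = [
    ("Marie_Curie", "won", "Nobel_Prize_in_Physics"),
    ("Marie_Curie", "born_in", "Warsaw"),
]

def objects_of(triples, subject, predicate):
    """Query: all objects linked to a subject by a predicate."""
    return [o for s, p, o in triples if s == subject and p == predicate]

objects_of(triples, "Marie_Curie", "born_in")  # → ['Warsaw']
```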

Table Extraction

The process of identifying and extracting structured tabular data from documents, preserving row-column relationships for accurate AI processing.

Two-Stage Retrieval

A retrieval architecture that combines fast initial candidate selection with a slower, more accurate re-ranking step to optimize both speed and quality.

Task-oriented Agent

An AI agent designed to accomplish specific tasks like booking appointments, placing orders, or resolving support tickets through structured dialogue and actions.

Tool Selection

The process by which an AI agent decides which available tool to use for a given task, based on the tool's description, the user's intent, and the current context.

Tool Routing

A technique for directing agent requests to the appropriate tool or sub-system, especially useful when many tools are available and selection becomes complex.

Tool Chaining

The pattern of using the output of one tool as the input to another, creating a sequence of tool calls that accomplishes complex multi-step tasks.

Task Decomposition

Breaking a complex task into simpler, executable sub-tasks that an agent can complete sequentially or in parallel to accomplish the overall objective.

Tool Definition

A structured description of a tool's purpose, parameters, and expected behavior that enables an AI model to understand when and how to use it.

Tool Parameters

The input values that must be provided when calling a tool, defined by a schema that specifies types, constraints, and descriptions for each parameter.

Tool Schema

A formal specification of a tool's interface, defining its parameters using a structured format like JSON Schema that enables validation and documentation.
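The exact envelope varies by provider, but the parameter specification is commonly JSON Schema. An illustrative shape, expressed as a Python dict (the tool name and fields here are hypothetical):

```python
# A JSON Schema-style tool definition; the envelope varies by provider.
weather_tool = {
    "name": "get_weather",  # hypothetical tool name
    "description": "Get the current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {
            "city": {"type": "string", "description": "City name"},
            "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]},
        },
        "required": ["city"],
    },
}

def missing_args(schema, args):
    """Minimal validation: list any required parameters not provided."""
    return [k for k in schema["parameters"]["required"] if k not in args]

missing_args(weather_tool, {"unit": "celsius"})  # → ['city']
```

The same schema serves both sides: the model reads the descriptions to decide how to call the tool, and the runtime validates the generated arguments before execution.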

Tool Invocation

The act of an AI agent calling a tool with specific parameters, triggering the execution of an external function or API to accomplish a task.

Tool Execution

The actual running of a tool function with provided parameters, separate from the AI model's generation of the tool call.

Tracing

Recording the complete execution path of an AI agent's operations, including LLM calls, tool use, and decisions, for debugging, monitoring, and optimization.

Token Tracking

Monitoring the number of input and output tokens consumed by LLM calls, essential for cost calculation, quota management, and usage optimization.

Task Decomposition Agent

An agent pattern that breaks complex user requests into smaller, manageable sub-tasks that can be executed sequentially or in parallel.

Tool Result

The output returned by a tool after execution, which the agent uses to inform its next reasoning step or to formulate a response to the user.

Tool Error

A failure that occurs during tool execution, requiring the agent to interpret the error, decide whether to retry, try an alternative, or report the issue.

Typing Indicator

A typing indicator is a visual cue (typically animated dots) that shows the chatbot is processing and generating a response.

Ticket Deflection

Ticket deflection is the reduction in human support tickets achieved by chatbots resolving customer issues without agent involvement.

Text to Speech

Text to speech (TTS) converts written text into spoken audio, enabling chatbots to deliver voice responses.

Topic Detection

Topic detection is the automatic identification and classification of the subject matter in a user message or conversation segment.

Topic Switching

Topic switching is when a user changes the subject of conversation mid-chat, requiring the bot to adapt its context and responses.

Telegram Bot

A Telegram bot is a chatbot that operates within the Telegram messaging platform using the Telegram Bot API.

Thumbs Up/Down

Thumbs up/down is a binary feedback mechanism that lets users quickly indicate whether a chatbot response was helpful or not.

Triggered Messages

Triggered messages are automated chatbot messages sent when specific user behaviors or conditions are detected on a website.

Time Trigger

A time trigger activates a chatbot message after the user has been on a page for a specified duration, indicating sustained interest.

Training Data (Chatbot)

Training data for chatbots includes the documents, FAQs, and example conversations used to teach the bot what it should know and how to respond.

Training Phrase

A training phrase is an example user utterance used to teach traditional chatbots to recognize a specific intent or topic.

Trustworthy AI

AI systems that are reliable, safe, fair, transparent, and accountable, earning and deserving the trust of users, organizations, and society.

Toxicity Detection

AI-powered identification of toxic, harmful, or offensive content in text, used to filter AI outputs and maintain safe conversation environments.

Treacherous Turn

A hypothetical scenario where an AI system behaves cooperatively while weak but turns against human interests once it becomes powerful enough to do so successfully.

Toxicity Score

A numerical measure of how toxic, harmful, or offensive a piece of text is, produced by content moderation models to enable automated filtering.

Training Pipeline

A training pipeline is an automated workflow that processes data, trains ML models, evaluates results, and registers successful models for deployment.

TPU

A TPU (Tensor Processing Unit) is a custom AI accelerator designed by Google specifically for neural network training and inference, offering an alternative to NVIDIA GPUs.

TensorFlow Serving

TensorFlow Serving is a production serving system designed for deploying TensorFlow models at scale with features like hot-swappable model versions and batching.

TorchServe

TorchServe is PyTorch's official serving solution that packages and serves PyTorch models with features like multi-model serving, logging, and metrics.

Triton Inference Server

Triton Inference Server is NVIDIA's open-source serving platform that supports multiple ML frameworks and provides advanced features like dynamic batching, model ensembles, and concurrent execution.

TGI

TGI (Text Generation Inference) is Hugging Face's production-grade inference server for large language models, optimized for high throughput with features like continuous batching and quantization.

Together AI

Together AI is an inference and training platform optimized for open-source models, offering high-throughput API access and custom model training on efficient GPU clusters.

TPU v4

TPU v4 is the fourth generation of Google's custom tensor processing units, offering significant performance improvements for large-scale ML training and inference.

Tensor Parallelism

Tensor parallelism splits individual layer computations across multiple GPUs, distributing the weight matrices and activations within a single transformer layer.

Throughput Monitoring

Throughput monitoring tracks the number of inference requests an ML system processes per unit of time, ensuring capacity meets demand.

Token Usage Monitoring

Token usage monitoring tracks the consumption of input and output tokens in LLM applications to manage costs, enforce quotas, and optimize prompt engineering.

Together AI Platform

Together AI is a cloud platform for running, fine-tuning, and serving open-source AI models with optimized inference and competitive pricing.

Text Generation Inference

Text Generation Inference (TGI) is an open-source inference server by Hugging Face optimized for deploying and serving large language models with features like continuous batching.

Training Data Management

Training data management encompasses the processes and tools for collecting, storing, versioning, labeling, and governing the datasets used to train ML models.

Text-to-Image

Text-to-image generation creates images from natural language descriptions using AI models, enabling anyone to create visual content through written prompts.

Text-to-Video

Text-to-video generates video content from natural language descriptions, creating moving visual scenes that match the textual prompt.

Textual Inversion

Textual inversion teaches a text-to-image model new concepts by learning a new text embedding that represents a specific object, style, or concept from a few example images.

Transfer Learning for Vision

Transfer learning for vision applies knowledge from models pretrained on large image datasets to new visual tasks, enabling strong performance with limited task-specific data.

Text Detection

Text detection locates regions containing text in images, outputting bounding boxes or polygons around text instances for subsequent recognition.

Text-to-3D

Text-to-3D generates three-dimensional objects and scenes from natural language descriptions, using score distillation or direct prediction approaches.

Table Extraction

Table extraction detects tables in document images and recovers their cell structure, content, and relationships for conversion to structured data formats.

Text-to-Speech

Text-to-speech (TTS) converts written text into natural-sounding spoken audio using AI, enabling machines to communicate through human-like voice.

TTS

TTS stands for Text-to-Speech, the technology that converts written text into spoken audio using AI voice synthesis.

Transducer

A transducer is a sequence-to-sequence model architecture for speech recognition that jointly models acoustic and language information for streaming ASR.

Tortoise TTS

Tortoise TTS is an open-source multi-voice text-to-speech system known for producing extremely high-quality speech at slow generation speeds.

Total Cost of Ownership

Total Cost of Ownership (TCO) for AI includes all direct and indirect costs over the lifetime of an AI system: software, hardware, implementation, training, maintenance, and operations.

Ticket Management

Ticket management systems track and manage customer support requests from creation to resolution, increasingly using AI for classification, routing, and prioritization.

Tier-based Pricing

Tier-based pricing offers AI products at multiple predefined plan levels, each with increasing features, usage limits, and price points to serve different customer segments.

Task Mining

Task mining uses AI to observe and analyze how employees perform tasks on their computers, identifying patterns and opportunities for automation at the user-activity level.

Total Experience

Total experience (TX) is a business strategy that unifies customer experience, employee experience, and user experience to create holistic, AI-enhanced interactions across all touchpoints.

Time to Value

Time to value measures how quickly a customer begins realizing meaningful benefits from an AI product after purchase, a critical metric for adoption and retention.

Top-Down Sales

Top-down sales targets executive decision-makers who mandate product adoption across their organization, typically for enterprise-level deals.

Tone of Voice AI

Tone of voice AI adapts the emotional quality and formality of AI-generated text to match the context, audience, and communication purpose.

Time-Series Database

A time-series database is optimized for storing and querying timestamped data points, making it ideal for monitoring, metrics, IoT data, and AI model performance tracking.

Transaction

A database transaction is a sequence of operations executed as a single logical unit of work, guaranteeing that either all operations succeed or none take effect.
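The all-or-nothing guarantee can be seen with Python's built-in `sqlite3`, where the connection context manager commits on success and rolls back on error:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (name TEXT PRIMARY KEY, balance INTEGER)")
conn.execute("INSERT INTO accounts VALUES ('alice', 100), ('bob', 0)")
conn.commit()

# Transfer funds atomically: both updates commit, or neither takes effect.
try:
    with conn:  # commits on success, rolls back if an exception is raised
        conn.execute(
            "UPDATE accounts SET balance = balance - 30 WHERE name = 'alice'")
        conn.execute(
            "UPDATE accounts SET balance = balance + 30 WHERE name = 'bob'")
except sqlite3.Error:
    pass  # on failure, neither update is visible

balances = dict(conn.execute("SELECT name, balance FROM accounts"))
# → {'alice': 70, 'bob': 30}
```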

TiDB

TiDB is an open-source distributed SQL database that provides MySQL compatibility with horizontal scalability, strong consistency, and hybrid transactional and analytical processing.

TimescaleDB

TimescaleDB is a time-series database built as a PostgreSQL extension, combining time-series optimizations with full SQL capabilities and the PostgreSQL ecosystem.

Typesense

Typesense is an open-source, typo-tolerant search engine designed for instant search experiences with simple setup and built-in vector search capabilities.

Trino

Trino is an open-source distributed SQL query engine for fast analytics across heterogeneous data sources, the successor to the original Presto project.

Tensor

A tensor is a multi-dimensional array of numbers that generalizes scalars, vectors, and matrices to arbitrary dimensions, serving as the fundamental data structure in deep learning.

Transpose

The transpose of a matrix is formed by flipping it over its diagonal, converting rows to columns and columns to rows, a fundamental operation in linear algebra and neural networks.
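For a nested-list matrix the operation is a one-liner with `zip`:

```python
def transpose(matrix):
    """Flip rows and columns of a nested-list matrix."""
    return [list(row) for row in zip(*matrix)]

a = [[1, 2, 3],
     [4, 5, 6]]
transpose(a)  # → [[1, 4], [2, 5], [3, 6]]
```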

T-Test

A t-test is a statistical test that determines whether there is a significant difference between the means of two groups, commonly used for A/B testing and model comparison.
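A sketch of the Welch two-sample t statistic (which does not assume equal variances), computed with the standard library; a real analysis would also derive a p-value from the t distribution:

```python
from statistics import mean, variance

def welch_t(a, b):
    """Welch's two-sample t statistic (no equal-variance assumption)."""
    se = (variance(a) / len(a) + variance(b) / len(b)) ** 0.5
    return (mean(a) - mean(b)) / se

control = [10.1, 9.8, 10.3, 10.0, 9.9]    # e.g. baseline model latency
treatment = [10.9, 11.2, 10.8, 11.0, 11.1]
t = welch_t(treatment, control)
# A large |t| suggests the group means genuinely differ.
```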

Tensor (Mathematics)

A tensor is a multi-dimensional generalization of scalars, vectors, and matrices used as the core data structure in deep learning frameworks.

Trace

The trace of a square matrix is the sum of its diagonal elements, providing a simple scalar summary used in optimization and matrix calculus.
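For a nested-list representation the trace is a single sum over the diagonal:

```python
def trace(matrix):
    """Sum of the diagonal elements of a square matrix."""
    return sum(matrix[i][i] for i in range(len(matrix)))

trace([[1, 2],
       [3, 4]])  # → 5
```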

Taylor Expansion

A Taylor expansion approximates a function locally using a polynomial based on its derivatives, used to analyze optimization landscapes in ML.

Together AI

Together AI provides cloud infrastructure for running open-source AI models, offering fast inference, fine-tuning, and training services at competitive prices.

Tabnine

Tabnine is an AI code completion tool that offers both cloud-based and on-premises deployment, focused on code privacy and enterprise-safe AI coding assistance.

Tidio

Tidio is a customer communication platform combining live chat, AI chatbots, and email marketing for small and medium businesses seeking affordable engagement tools.

Together API

The Together API provides fast and affordable inference for open-source AI models, specializing in high-throughput deployments and fine-tuning.

TruEra

TruEra provides AI quality management software for testing, debugging, and monitoring ML models, with a focus on model intelligence and observability.

Tenstorrent

Tenstorrent is an AI chip company co-founded by Jim Keller designing RISC-V-based AI processors for both training and inference at the edge and in data centers.

TensorFlow

TensorFlow is an open-source machine learning framework developed by Google, offering a comprehensive ecosystem for training and deploying ML models across platforms.

TensorRT

TensorRT is NVIDIA's SDK for optimizing deep learning inference on NVIDIA GPUs, providing the fastest possible inference performance through graph optimization and quantization.

TensorFlow Lite

TensorFlow Lite is a lightweight framework for deploying machine learning models on mobile devices and embedded systems with low latency and small binary size.

torch.compile

torch.compile is a PyTorch feature that JIT-compiles model code into optimized kernels, significantly accelerating inference and training with minimal code changes.

TorchScript

TorchScript is a way to serialize and optimize PyTorch models for deployment in environments where Python is not available, such as C++ applications and mobile devices.

TextBlob

TextBlob is a simple Python library for common NLP tasks like sentiment analysis, noun phrase extraction, and text classification, built on NLTK and Pattern.

torchvision

torchvision is the official computer vision library for PyTorch, providing datasets, model architectures, and image transformations for vision AI.

timm

timm (PyTorch Image Models) is a collection of state-of-the-art image classification models, pretrained weights, and training utilities for PyTorch.

Triton Inference Server

Triton Inference Server is NVIDIA's open-source serving platform that deploys models from any framework with dynamic batching, model ensembles, and multi-GPU support.

Text Generation Inference

Text Generation Inference (TGI) is Hugging Face's production-ready serving solution for LLMs, featuring continuous batching, tensor parallelism, and optimized inference.

TRL

TRL (Transformer Reinforcement Learning) is a Hugging Face library for training language models with reinforcement learning from human feedback (RLHF), DPO, and supervised fine-tuning.

Together AI

Together AI is a cloud platform for running open-source AI models, providing inference APIs, fine-tuning services, and GPU clusters for custom model training.

Telemedicine

Telemedicine uses AI-enhanced digital communication technologies to provide remote healthcare services including diagnosis, consultation, and monitoring.

Therapy Chatbot

Therapy chatbots use AI and evidence-based techniques to provide mental health support through conversational interfaces.

Triage AI

Triage AI uses algorithms to assess patient urgency and prioritize medical care based on symptom severity.

Transaction Monitoring

AI transaction monitoring analyzes financial transactions in real time to detect suspicious activity and prevent financial crime.

Telecommunications AI

Telecommunications AI uses machine learning to optimize network performance, predict outages, and enhance customer service.

Transportation AI

Transportation AI optimizes traffic management, public transit, and mobility services using machine learning and real-time data.

Tax AI

Tax AI uses machine learning and NLP to automate tax preparation, compliance, and planning for individuals and businesses.

Travel AI

Travel AI uses machine learning to personalize trip planning, optimize pricing, and enhance the travel experience.

Traffic Management AI

Traffic management AI uses real-time data and machine learning to optimize traffic flow, reduce congestion, and improve transportation safety across road networks.

Trade Surveillance

Trade surveillance uses AI to monitor financial markets for manipulative trading behaviors, insider trading, and other market abuses in real-time.

TPU

A Tensor Processing Unit (TPU) is Google's custom AI accelerator chip designed specifically for neural network training and inference at scale.

Tensor Cores

Tensor Cores are specialized processing units in NVIDIA GPUs that accelerate matrix operations fundamental to deep learning training and inference.

T4 GPU

The NVIDIA T4 is a Turing-architecture data center GPU widely used for cost-effective AI inference, supporting INT8 and FP16 precision with 16GB of GDDR6 memory.

TensorRT

TensorRT is a high-performance deep learning inference optimizer and runtime library from NVIDIA that maximizes throughput and minimizes latency on NVIDIA GPUs.

TPU v5

TPU v5 is the latest generation of Google Cloud TPUs, available in v5e (efficiency) and v5p (performance) variants for AI training and inference at scale.

Trainium2

Trainium2 is the second generation of AWS's custom AI training chips, offering significantly improved performance for training large foundation models on AWS infrastructure.

Tensor Processing

Tensor processing refers to hardware-accelerated operations on multi-dimensional arrays (tensors) that form the fundamental data structure and computation pattern in deep learning.

Thermal Design Power

Thermal Design Power (TDP) is the maximum amount of heat a processor generates under sustained workload, determining cooling requirements and power delivery for AI hardware.

TOPS

TOPS (Tera Operations Per Second) measures the integer computational throughput of AI accelerators, commonly used to rate NPUs and edge AI chips.

TF-IDF

TF-IDF (Term Frequency-Inverse Document Frequency) is a statistical measure that evaluates how important a word is to a document within a collection.

Typesense

Typesense is a fast, open-source search engine focused on developer experience, providing typo-tolerant instant search with simple setup.

Tokenizer

A tokenizer splits text into individual tokens (words or subwords), a fundamental step in both search indexing and language model processing.

Two-Tower Model

A two-tower model uses separate neural networks for users and items, encoding each into vectors for scalable similarity-based retrieval and recommendation.

Term Dictionary

A term dictionary is the vocabulary component of a search index that maps terms to their posting lists, enabling fast lookup of which documents contain each term.

Token Filter

A token filter is a component of a search analyzer that transforms, removes, or adds tokens during text analysis, such as lowercasing, stemming, or adding synonyms.

Term Frequency

Term frequency measures how often a particular term appears within a document, serving as a basic signal of topical relevance in search scoring.

Text Chunking

Text chunking splits documents into smaller, semantically coherent segments for embedding and retrieval, directly impacting search quality in RAG systems.

Text Generation

Text generation uses AI language models to produce human-like written content including articles, emails, conversations, and creative writing.

Text-to-Image Generation

Text-to-image generation creates visual images from natural language text descriptions using AI models that translate words into visual content.

Text-to-Video

Text-to-video AI generates video clips from natural language descriptions, creating moving visual content from text prompts alone.

Text-to-3D

Text-to-3D generates three-dimensional objects and scenes from natural language descriptions using AI models that bridge text understanding and 3D geometry.

Technical Writing AI

Technical writing AI generates technical documentation, user guides, API references, and instructional content from source material and specifications.

Test Generation

Test generation uses AI to automatically create unit tests, integration tests, and test cases from source code, specifications, or natural language descriptions.

Text-to-Image (Generative AI)

Text-to-image generation converts natural language descriptions into visual images using diffusion models, GANs, or transformer-based architectures.

Thumbnail Generation

Thumbnail generation uses AI to create eye-catching preview images for videos, articles, and content platforms optimized for click-through rates.

Text-to-Video (Generative AI)

Text-to-video converts natural language descriptions into video clips, generating moving visual content from written prompts.

Talking Head Generation

Talking head generation uses AI to create realistic video of a person speaking from a single photo and audio input, synthesizing lip movements and expressions.

Text-to-SQL

Text-to-SQL translates natural language questions into SQL database queries, enabling non-technical users to query databases without writing code.

Texture Generation

Texture generation uses AI to create surface textures, materials, and patterns for 3D models, games, and design from text descriptions or examples.

Text-to-Motion

Text-to-motion converts natural language descriptions of movements into 3D character animations, enabling motion creation through written instructions.

Turing Test

The Turing test evaluates whether a machine can exhibit intelligent behavior indistinguishable from a human in natural language conversation.

Turing Test (Research Perspective)

Turing test research examines the limitations and modern alternatives to the original test as a measure of machine intelligence.

Turing Test Methodology

Turing test methodology research develops experimental protocols for evaluating human-AI conversational indistinguishability.

Transfer Learning (Research Perspective)

Transfer learning research studies how knowledge learned from one task or domain can be applied to improve performance on different tasks.

Token Streaming

Token streaming is the technique of delivering AI-generated text token by token as the model produces them, creating a real-time typing effect.

TypeScript

TypeScript is a typed superset of JavaScript that adds static type checking, improving code reliability and developer productivity.

Tailwind CSS

Tailwind CSS is a utility-first CSS framework that provides low-level utility classes for building custom designs directly in HTML.

Tableau

Tableau is a business intelligence platform for creating interactive data visualizations and dashboards with a drag-and-drop interface.

T-test

A t-test is a statistical test that compares the means of one or two groups to determine if there is a significant difference between them.

Text Analytics

Text analytics extracts structured insights from unstructured text data using NLP techniques like sentiment analysis, topic modeling, and entity extraction.

Treemap

A treemap displays hierarchical data as nested rectangles, where each rectangle's size represents a quantitative value.

Time Series Analysis

Time series analysis studies data points collected over time to identify trends, seasonal patterns, and make temporal forecasts.
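
One of the simplest time series techniques is a moving average, which smooths short-term noise to expose the trend; a minimal sketch with made-up monthly values:

```python
def moving_average(series, window=3):
    """Smooth a time series by averaging each point with its neighbors."""
    return [
        sum(series[i:i + window]) / window
        for i in range(len(series) - window + 1)
    ]

# Monthly values with an upward trend plus noise.
series = [10, 12, 11, 13, 15, 14, 16]
print(moving_average(series))  # [11.0, 12.0, 13.0, 14.0, 15.0]
```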

Turing Machine

A Turing machine is a theoretical computing device proposed by Alan Turing in 1936 that defines the fundamental limits of computation.

Transformer Paper

The 2017 paper "Attention Is All You Need" introduced the transformer architecture that became the foundation for virtually all modern large language models.

U

Unsupervised Learning

Unsupervised learning is a machine learning approach where models find patterns and structures in data without labeled examples or predefined outputs.

UMAP

UMAP is a fast non-linear dimensionality reduction technique that preserves both local and global data structure, used for visualization and feature extraction.

Underfitting

Underfitting occurs when a model is too simple to capture the underlying patterns in the data, resulting in poor performance on both training and test data.

Unigram Tokenizer

A subword tokenization algorithm that starts with a large vocabulary and iteratively prunes it to find the optimal set of subword units.

Unigram

A unigram is a single word or token treated as an independent unit in text analysis, equivalent to a 1-gram.

Unicode Normalization

Unicode normalization converts text to a consistent Unicode representation so that visually identical characters are treated as identical by NLP systems.
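
Python's standard `unicodedata` module illustrates the problem and the fix: "é" can be one code point or "e" plus a combining accent, and NFC normalization makes the two representations compare equal:

```python
import unicodedata

# "é" as a single code point (U+00E9) vs. "e" + combining accent (U+0301).
composed = "caf\u00e9"
decomposed = "cafe\u0301"

print(composed == decomposed)  # False: different code point sequences

nfc_a = unicodedata.normalize("NFC", composed)
nfc_b = unicodedata.normalize("NFC", decomposed)
print(nfc_a == nfc_b)          # True: identical after normalization
```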

Unanswerable Question Detection

Unanswerable question detection identifies questions that cannot be answered given the available context or knowledge.

Universal Dependencies

Universal Dependencies (UD) is a cross-linguistic framework for consistent syntactic annotation of sentences across languages.

URL Button

A URL button is a chat message button that opens a web link when clicked, directing users to external pages or resources.

Urgency Detection

Urgency detection identifies time-sensitive or critical user messages that require immediate attention or priority handling.

Unanswered Questions

Unanswered questions are user queries that the chatbot could not resolve, representing gaps in its knowledge or capabilities.

URL Targeting

URL targeting uses URL patterns to determine which chatbot configuration to display, enabling page-specific chatbot behavior.

User Profile (Chatbot)

A chatbot user profile stores persistent information about a visitor across conversations, enabling personalized and contextual interactions.

Usage Limit

A usage limit is the maximum amount of chatbot resources (messages, conversations, or API calls) available within a billing period.

Usage-based Pricing

Usage-based pricing charges customers based on their actual consumption of AI services, such as API calls, tokens processed, or compute time, rather than flat subscription fees.

Utilization Rate

Utilization rate measures how much of an AI system's capacity or allocation is actually being used, helping organizations optimize their AI investments.

User Experience AI

User experience AI applies artificial intelligence to understand, personalize, and optimize the end-to-end user experience across digital products and services.

Upsell AI

Upsell AI uses machine learning to identify when customers are ready to upgrade to higher-value plans or features, and presents the right offer at the right time.

Uniform Distribution

The uniform distribution assigns equal probability to all values in its range, used when no outcome is more likely than any other and for non-informative priors.
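
A quick sketch with Python's standard `random` module: samples drawn uniformly from [2, 5] stay inside the range and their mean converges toward the midpoint (2 + 5) / 2 = 3.5:

```python
import random

random.seed(0)  # deterministic for the example

# Draw from a continuous uniform distribution on [2, 5].
samples = [random.uniform(2, 5) for _ in range(10_000)]

# Every outcome in the range is equally likely, so the sample
# mean approaches the midpoint of the interval.
est_mean = sum(samples) / len(samples)
print(round(est_mean, 1))  # 3.5
```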

Unstructured

Unstructured is an open-source toolkit for extracting and processing content from diverse document formats for AI and LLM applications.

Ultralytics

Ultralytics develops YOLO (You Only Look Once) object detection models, providing one of the fastest and most widely used real-time object detection frameworks.

Unsloth

Unsloth is a library that makes fine-tuning large language models significantly faster and more memory-efficient through custom CUDA kernels and optimization techniques.

Underwriting AI

Underwriting AI automates the evaluation of insurance and lending applications using machine learning risk assessment.

Unified Memory

Unified memory is an architecture where the CPU and GPU (or other accelerators) share a single memory pool, eliminating the need for explicit data transfers between processors.

User-Based Collaborative Filtering

User-based collaborative filtering recommends items by finding users with similar preferences and suggesting items those similar users have liked.

UI Generation

AI UI generation creates user interface designs, layouts, and code from text descriptions, sketches, or screenshots using generative models.

Unit Test Generation

Unit test generation uses AI to automatically create unit tests for functions and classes, covering expected behavior, edge cases, and error conditions.

V

Validation Set

The validation set is data held out during training to tune hyperparameters and monitor for overfitting, guiding model selection decisions.

Vanishing Gradient

The vanishing gradient problem occurs when gradients become exponentially smaller as they propagate backward through many layers, preventing early layers from learning.
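
The effect can be seen numerically: the sigmoid's derivative is at most 0.25, and backpropagation multiplies one such factor per layer, so even the best case shrinks geometrically (a toy illustration, not a full backprop implementation):

```python
import math

def sigmoid_deriv(x):
    s = 1 / (1 + math.exp(-x))
    return s * (1 - s)  # at most 0.25, reached at x = 0

# Backprop through a 20-layer chain multiplies 20 such factors.
grad = 1.0
for _ in range(20):
    grad *= sigmoid_deriv(0.0)  # the best case, 0.25

print(grad)  # 0.25 ** 20, roughly 9.1e-13: early layers barely update
```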

VGGNet

VGGNet demonstrated that using very small 3x3 convolution filters in a deep architecture achieves excellent image recognition performance.

Vision Transformer

The Vision Transformer (ViT) applies the transformer architecture directly to image patches, achieving competitive image classification without convolutions.

Vision-Language Model

A vision-language model (VLM) is an AI model that jointly understands images and text, enabling tasks like image captioning, visual Q&A, and document analysis.

Vocabulary

In the LLM context, the vocabulary is the fixed set of all tokens a model can recognize and generate, typically ranging from 30,000 to 100,000 entries.

Vocab Size

The total number of unique tokens in a language model's tokenizer vocabulary, typically ranging from 30,000 to 100,000 or more.

vLLM

A high-performance open-source LLM inference engine featuring PagedAttention for efficient memory management and high-throughput serving.

Vector Database

A specialized database optimized for storing, indexing, and querying high-dimensional embedding vectors for fast similarity search.
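
The core operation can be sketched in a few lines: store id-to-vector pairs and rank them by cosine similarity to a query vector (the documents and embeddings below are invented; real systems use approximate indexes such as HNSW for scale):

```python
from math import sqrt

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v)))

# A tiny in-memory "index": document ids mapped to embedding vectors.
index = {
    "refund policy":  [0.9, 0.1, 0.0],
    "shipping times": [0.1, 0.9, 0.1],
    "privacy policy": [0.7, 0.2, 0.1],
}

def search(query_vec, k=2):
    """Return the k stored ids most similar to the query vector."""
    ranked = sorted(index, key=lambda doc: cosine(index[doc], query_vec),
                    reverse=True)
    return ranked[:k]

print(search([1.0, 0.0, 0.1]))  # ['refund policy', 'privacy policy']
```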

Visual QA

Visual QA answers natural language questions about the content of images, requiring both vision and language understanding.

Vector Database

A vector database stores and searches data as mathematical vectors, enabling semantic search that finds conceptually similar content rather than just keyword matches.

Vespa

An open-source serving engine for large-scale data that combines vector search, text search, and structured data processing in a single platform.

Voyage AI

An embedding model provider specializing in high-quality, domain-specific embeddings for code, legal, finance, and general-purpose retrieval.

Vercel AI SDK

A TypeScript library for building AI-powered user interfaces, providing streaming, tool calling, and generative UI primitives for web applications.

Vector Store Memory

An agent memory system that stores past interactions as vector embeddings, enabling semantic retrieval of relevant memories based on the current context.

Virtual Assistant

A virtual assistant is AI-powered software that performs tasks and provides information through conversational interaction.

Voice Bot

A voice bot is a conversational AI system that interacts with users through spoken language using speech recognition and synthesis.

Voice Input

Voice input enables users to speak their messages instead of typing, using speech recognition to convert voice to text in the chat.

Voice Message

A voice message is an audio recording sent by a user within the chat, delivered as a playable audio clip rather than text.

Visual Flow Builder

A visual flow builder is a graphical interface for designing conversation paths and chatbot logic using drag-and-drop nodes and connections.

Version Control (Chatbot)

Version control for chatbots tracks changes to bot configuration, knowledge, and flows over time, enabling history review and reverting.

Visitor Segmentation

Visitor segmentation groups website visitors by characteristics or behavior to deliver personalized chatbot experiences to each segment.

Variable (Chatbot)

A chatbot variable stores data collected during conversation, such as user names or preferences, for use later in the interaction.

Value Alignment

Ensuring AI systems understand and act according to human values like honesty, fairness, and helpfulness, not just optimizing narrow technical objectives.

vLLM

vLLM is a high-throughput inference engine for large language models that uses PagedAttention to efficiently manage GPU memory and maximize serving throughput.

Vector Database Infrastructure

Vector database infrastructure provides specialized storage and retrieval systems optimized for high-dimensional embedding vectors used in AI applications like semantic search and RAG.

Visual Question Answering

Visual Question Answering (VQA) is the task of answering natural language questions about the content of an image, requiring both visual understanding and language reasoning.

VQA

VQA stands for Visual Question Answering, a task and benchmark where AI models answer natural language questions about images.

Video Understanding

Video understanding is the AI task of comprehending temporal dynamics, actions, events, and narratives in video content, going beyond individual frame analysis.

Video Generation

Video generation creates new video content using AI models, producing realistic or stylized video clips from text descriptions, images, or other video inputs.

Visual-Language Model

A visual-language model (VLM) is an AI model that jointly understands images and text, enabling tasks like visual question answering, captioning, and image-guided conversation.

Visual Grounding

Visual grounding locates specific regions in an image that correspond to a natural language expression, connecting text descriptions to visual content.

Visual Reasoning

Visual reasoning is the ability of AI models to draw logical conclusions from visual information, going beyond perception to higher-order understanding.

Video Classification

Video classification assigns category labels to video clips, analyzing temporal and spatial patterns to understand the overall content or activity shown.

Video Object Tracking

Video object tracking follows specific objects across video frames, maintaining their identity even through occlusion, appearance changes, and camera motion.

Video Captioning

Video captioning generates natural language descriptions of video content, summarizing actions, events, and scenes depicted across temporal sequences.

Video Diffusion Model

Video diffusion models extend image diffusion architectures to generate or edit video content by modeling temporal coherence across frames.

Vision Transformer (ViT)

Vision Transformer applies the transformer architecture to image recognition by splitting images into patches and processing them as token sequences.

Visual Prompt Engineering

Visual prompt engineering designs effective inputs for vision and vision-language models, including crafting text prompts, visual references, and annotation cues.

Visual Anomaly Detection

Visual anomaly detection identifies unusual or defective patterns in images that deviate from learned normal appearances, commonly used in industrial quality inspection.

Video Segmentation

Video segmentation partitions video frames into meaningful regions, tracking objects and their boundaries across temporal sequences with consistent identity.

Vision Foundation Model

A vision foundation model is a large model pretrained on massive visual data that serves as a general-purpose backbone for diverse downstream computer vision tasks.

Visual Odometry

Visual odometry estimates a camera's motion trajectory by analyzing the change in position of visual features across consecutive images or video frames.

Visual Place Recognition

Visual place recognition identifies whether a camera has visited a location before by matching current images against a database of previously captured views.

Visual Inspection

Visual inspection uses computer vision to automatically examine products, materials, and structures for defects, quality issues, or compliance with specifications.

Video Stabilization

Video stabilization removes unwanted camera shake and jitter from video footage, producing smooth, professional-looking results using motion estimation and compensation.

Vision Transformer Variants

Vision transformer variants optimize the original ViT architecture for improved efficiency, scalability, and performance across diverse computer vision tasks.

Voxel Representation

A voxel representation divides 3D space into a regular grid of volumetric pixels (voxels), providing a structured format for 3D data processing and neural networks.

Vision Benchmark

Vision benchmarks are standardized datasets and evaluation protocols used to measure and compare the performance of computer vision models on specific tasks.

Vision-Language Pretraining

Vision-language pretraining trains models on large-scale image-text data to learn aligned visual and linguistic representations for multimodal understanding tasks.

Video Prediction

Video prediction generates future video frames given past frames, anticipating how scenes will evolve based on learned motion and physics patterns.

Visual World Model

A visual world model learns an internal representation of how the physical world works, enabling prediction, planning, and reasoning about visual scenes.

Voice Activity Detection

Voice Activity Detection (VAD) identifies segments of audio that contain human speech versus silence, noise, or music, serving as a preprocessing step for speech systems.

Voice Cloning

Voice cloning creates a synthetic replica of a specific person's voice using AI, enabling generation of speech in that person's voice from any text input.

Voice Conversion

Voice conversion transforms the voice characteristics of spoken audio from one speaker to sound like another speaker while preserving the linguistic content.

VALL-E

VALL-E is a neural codec language model from Microsoft that generates speech from text using just 3 seconds of reference audio for voice cloning.

Voice Assistant

A voice assistant is an AI system that understands spoken commands and responds with voice, combining speech recognition, language understanding, and text-to-speech.

Voice Bot

A voice bot is an AI-powered conversational agent that interacts with users through spoken dialogue, commonly used in customer service, IVR systems, and phone support.

Voice User Interface

A Voice User Interface (VUI) is a speech-based interface that allows users to interact with devices and applications through spoken commands and natural conversation.

Voice Commerce

Voice commerce enables purchasing products and services through voice-activated devices and assistants, allowing hands-free shopping and transactions.

Voice Analytics

Voice analytics uses AI to extract insights from voice conversations, analyzing speech patterns, sentiment, keywords, and conversational dynamics.

Voice Recognition

Voice recognition identifies who is speaking by analyzing unique vocal characteristics, often used interchangeably with speaker recognition.

Voice Biometrics

Voice biometrics uses unique vocal characteristics as a biometric identifier for authentication and identity verification.

Voiceprint

A voiceprint is a mathematical representation of the unique characteristics of a person's voice used for identification or verification.

Voice Command

Voice commands are spoken instructions that trigger specific actions in a device or application, enabling hands-free control.

Voice Search

Voice search allows users to perform search queries by speaking instead of typing, using speech recognition to convert spoken queries to text.

Voicebot

A voicebot is an AI-powered conversational agent that communicates with users through voice, handling phone calls and voice interactions autonomously.

Voice Search Optimization

Voice search optimization adapts content and SEO strategies for voice-based search queries, which tend to be conversational and question-based.

Voice Biometric Authentication

Voice biometric authentication verifies user identity through their unique vocal characteristics, replacing or supplementing traditional authentication methods.

Voice Generation

Voice generation creates synthetic human-like speech from text or other inputs using AI models, encompassing TTS, voice cloning, and voice design.

Volume Discount

Volume discounts reduce the per-unit cost of AI services as usage increases, rewarding high-volume customers with lower rates for tokens, API calls, or conversations.

Voice Assistant for Business

Voice assistants for business use AI speech recognition and natural language processing to handle business tasks, customer interactions, and employee workflows through voice commands.

Vendor Lock-in

Vendor lock-in occurs when switching from one AI provider to another becomes prohibitively expensive or difficult due to technical, contractual, or data dependencies.

Vector Database

A vector database stores and indexes high-dimensional vector embeddings, enabling fast similarity search that powers semantic retrieval in AI applications.

View

A database view is a virtual table defined by a SQL query, providing a reusable abstraction over complex queries without storing data separately.

Vector

A vector is an ordered array of numbers representing a point or direction in multi-dimensional space, used extensively in AI for embeddings, features, and model parameters.
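
The basic operations are simple enough to write out directly; a minimal sketch using plain Python lists (libraries like NumPy provide the same operations efficiently):

```python
def add(u, v):
    """Element-wise vector addition."""
    return [a + b for a, b in zip(u, v)]

def scale(c, v):
    """Multiply every component by a scalar."""
    return [c * a for a in v]

def dot(u, v):
    """Dot product: the basis of similarity measures and neuron activations."""
    return sum(a * b for a, b in zip(u, v))

u = [1.0, 2.0, 3.0]
v = [4.0, 5.0, 6.0]

print(add(u, v))    # [5.0, 7.0, 9.0]
print(scale(2, u))  # [2.0, 4.0, 6.0]
print(dot(u, v))    # 32.0
```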

Variance

Variance measures how spread out the values of a random variable are around the mean, quantifying the degree of uncertainty or variability in a distribution.

Vector (Mathematics)

A vector is an ordered list of numbers representing a point or direction in multi-dimensional space, fundamental to machine learning computations.

Variance (Mathematics)

Variance measures the expected squared deviation of a random variable from its mean, quantifying the spread of a probability distribution.
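
The definition translates directly into code: average the squared deviations from the mean, which matches the standard library's `statistics.pvariance` (the data is illustrative):

```python
from statistics import mean, pvariance

data = [2, 4, 4, 4, 5, 5, 7, 9]

# Population variance: the mean of squared deviations from the mean.
mu = mean(data)
manual = sum((x - mu) ** 2 for x in data) / len(data)

print(manual, pvariance(data))  # both equal 4
```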

Vector Space

A vector space is a mathematical structure where vectors can be added and scaled, providing the algebraic framework for machine learning representations.

Voiceflow

Voiceflow is a conversational AI design platform for building and deploying chat and voice assistants, emphasizing visual design and team collaboration.

v0 by Vercel

v0 is Vercel's AI-powered UI generation tool that creates React components and interfaces from text descriptions and image prompts.

Vectara

Vectara is a neural search platform providing end-to-end RAG as a service, combining retrieval, re-ranking, and generation in a managed API.

Voyage AI

Voyage AI provides state-of-the-art embedding and re-ranking models optimized for retrieval quality in RAG and search applications.

Vast.ai

Vast.ai is a GPU cloud marketplace that connects users needing GPU compute with providers offering unused GPU capacity at competitive prices.

vLLM

vLLM is a high-throughput inference engine for large language models that uses PagedAttention for efficient memory management and fast token generation.

Vercel AI SDK

The Vercel AI SDK is a TypeScript toolkit for building AI-powered web applications with streaming responses, supporting multiple LLM providers through a unified interface.

Visual Search

Visual search uses computer vision AI to allow users to search for products or information using images instead of text queries.

Virtual Try-On

Virtual try-on uses AI and computer vision to allow customers to digitally preview how products like clothing, glasses, or makeup will look on them.

Visual Search Retail

Visual search in retail allows shoppers to find products by uploading images rather than typing text queries.

Visual Inspection AI

AI visual inspection uses computer vision to automatically detect defects and quality issues in manufactured products.

Voice AI

Voice AI uses speech recognition and NLP to enable natural voice-based interactions with technology across industries.

Video AI

Video AI uses machine learning for video analysis, generation, editing, and understanding across industries.

Veterinary AI

Veterinary AI uses machine learning to assist with animal diagnosis, treatment planning, and livestock health monitoring.

Vehicle Telematics

Vehicle telematics collects and transmits real-time data about vehicle location, speed, engine diagnostics, and driver behavior using onboard sensors and connectivity.

VRAM

VRAM (Video Random Access Memory) is the dedicated memory on a GPU that stores data for graphics and AI computation workloads.

VPU

A Vision Processing Unit (VPU) is a specialized processor optimized for computer vision and image processing tasks at low power.

V100 GPU

The NVIDIA V100 is a Volta-architecture data center GPU that introduced Tensor Cores, marking a turning point in GPU-accelerated deep learning.

Vespa

Vespa is an open-source big data serving engine developed by Yahoo that combines search, recommendation, and machine learning serving in a single platform.

Visual Search

Visual search enables finding information using images as queries instead of text, using computer vision and AI to match visual content with relevant results.

Vector Quantization

Vector quantization compresses embedding vectors by approximating them with a smaller set of representative codes, reducing storage and speeding up similarity search.
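
The essential step is replacing each vector with the index of its nearest codebook entry; a minimal sketch with an invented two-dimensional codebook:

```python
def nearest_code(vec, codebook):
    """Return the index of the codebook entry closest to vec (squared L2)."""
    def sqdist(u, v):
        return sum((a - b) ** 2 for a, b in zip(u, v))
    return min(range(len(codebook)), key=lambda i: sqdist(vec, codebook[i]))

# A tiny codebook of representative vectors; each input is then stored
# as just the index of its nearest code instead of its full coordinates.
codebook = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]

vectors = [[0.1, 0.1], [0.9, 0.2], [0.2, 0.8]]
codes = [nearest_code(v, codebook) for v in vectors]
print(codes)  # [0, 1, 2]
```

Real systems learn the codebook from data (e.g. with k-means) and often quantize sub-vectors independently, as in product quantization.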

Voice Generation

AI voice generation synthesizes realistic human speech from text, enabling custom voice creation, narration, and voice cloning.

Video Generation

AI video generation creates video content from text descriptions, images, or other inputs using generative models that produce temporally coherent visual sequences.

Video Editing AI

AI video editing uses machine learning to automate and enhance video production tasks including cutting, effects, color grading, and content modification.

Video Generation (Generative AI)

Video generation in generative AI creates video clips from text descriptions, images, or existing footage using diffusion models and transformer architectures.

Video Editing (Generative AI)

AI video editing uses generative models to automate and enhance video editing tasks including cutting, transitions, effects, color grading, and content removal.

Video Enhancement

Video enhancement uses AI to improve video quality by upscaling resolution, reducing noise, stabilizing footage, and correcting color and exposure issues.

Video Upscaling

Video upscaling uses AI to increase video resolution while adding realistic detail, converting lower-resolution content to higher resolutions like 4K or 8K.

Video Interpolation

Video interpolation uses AI to generate intermediate frames between existing frames, increasing frame rate and creating smooth slow-motion effects.

Video Translation

Video translation uses AI to translate video content into different languages, including speech translation, subtitle generation, and lip-synced dubbing.

Video Dubbing

AI video dubbing replaces the audio track of a video with AI-generated speech in another language, maintaining lip sync and speaker voice characteristics.

Vue

Vue is a progressive JavaScript framework for building user interfaces with a gentle learning curve and flexible architecture.

Vercel

Vercel is a cloud platform for frontend frameworks and serverless functions, providing instant deployments and global edge network delivery.

Visual Analytics

Visual analytics combines interactive data visualization with analytical reasoning to support decision-making through visual exploration.

Vanity Metrics

Vanity metrics are measurements that look impressive but do not meaningfully indicate business health or guide actionable decisions.

W

Weight

A weight is a learnable numerical parameter in a neural network that determines the strength of the connection between two neurons.

Weight Initialization

Weight initialization sets the starting values of neural network parameters before training, with proper initialization being critical for stable gradient flow and convergence.

Weight Decay

Weight decay is a regularization technique that adds a penalty proportional to the magnitude of weights to the loss function, discouraging large weight values.
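
A minimal sketch of the L2 form of the penalty (some formulations use lam/2, and optimizers like AdamW instead apply the decay directly in the update step):

```python
def loss_with_weight_decay(base_loss, weights, lam=0.01):
    """Add an L2 penalty, lam * sum(w^2), to the task loss, nudging
    the optimizer toward smaller weight values."""
    penalty = lam * sum(w * w for w in weights)
    return base_loss + penalty

weights = [3.0, -2.0, 1.0]
# 0.5 + 0.01 * (9 + 4 + 1) = 0.64
print(loss_with_weight_decay(0.5, weights, lam=0.01))
```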

Wasserstein GAN

Wasserstein GAN (WGAN) replaces the standard GAN loss with the Wasserstein distance, providing smoother gradients that stabilize training and reduce mode collapse.

Weight Normalization

Weight normalization reparameterizes weight vectors by decoupling their magnitude and direction, simplifying optimization without depending on batch or layer statistics.

WordPiece

WordPiece is a subword tokenization algorithm developed by Google that uses likelihood-based merging to build vocabularies, notably used in BERT.

Weight

A numerical parameter in a neural network that is learned during training, determining how inputs are transformed at each layer.

WinoGrande

WinoGrande is a large-scale benchmark testing common-sense reasoning through pronoun resolution in carefully crafted sentence pairs.

Win Rate

Win rate is the percentage of times a model is preferred over a baseline or competitor in pairwise evaluation comparisons.

Weight Sharing

Weight sharing reuses the same parameters across different parts of a model to reduce total parameter count and memory usage.

Writing Assistant

A writing assistant is an AI tool that helps users draft, edit, rewrite, and improve written content using language model capabilities.

Word Tokenization

Word tokenization is the text processing step of splitting text into individual words or word-like units for further NLP analysis.
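
A naive version can be written with a single regular expression (real NLP tokenizers handle punctuation, contractions, and language-specific rules far more carefully):

```python
import re

def word_tokenize(text):
    """A naive word tokenizer: lowercase, then extract alphanumeric runs."""
    return re.findall(r"\w+", text.lower())

print(word_tokenize("Tokenize this sentence, please."))
# ['tokenize', 'this', 'sentence', 'please']
```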

Word Embedding

A word embedding is a dense vector representation of a word that captures its semantic meaning, learned from large text corpora.

Word2Vec

Word2Vec is a neural network model that learns word embeddings by predicting words from their context, capturing semantic relationships.

Word Sense Disambiguation

Word sense disambiguation identifies the correct meaning of a word that has multiple senses based on its surrounding context.

Word Frequency Analysis

Word frequency analysis counts how often words appear in a text or corpus, revealing vocabulary patterns and content characteristics.
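
The standard library's `collections.Counter` does most of the work; a minimal sketch with an invented snippet of text:

```python
import re
from collections import Counter

text = "The cat sat on the mat. The mat was flat."

# Lowercase, split into word tokens, then count occurrences.
words = re.findall(r"[a-z]+", text.lower())
counts = Counter(words)

print(counts.most_common(2))  # [('the', 3), ('mat', 2)]
```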

Word Analogy

Word analogy tests evaluate whether word embeddings capture semantic relationships by completing analogies like "king is to queen as man is to ___."
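
The test works by vector arithmetic: complete "king is to queen as man is to ___" by computing man + (queen - king) and finding the nearest embedding. A toy sketch with hand-picked 2-d vectors chosen so the analogy holds exactly (real learned embeddings only approximately satisfy it):

```python
# Toy 2-d embeddings; real embeddings are learned and high-dimensional.
emb = {
    "man":   [1.0, 0.0],
    "woman": [1.0, 1.0],
    "king":  [2.0, 0.0],
    "queen": [2.0, 1.0],
}

# man + (queen - king) should land near the answer vector.
target = [m + q - k
          for m, q, k in zip(emb["man"], emb["queen"], emb["king"])]

def sqdist(u, v):
    return sum((a - b) ** 2 for a, b in zip(u, v))

best = min(emb, key=lambda word: sqdist(emb[word], target))
print(best)  # woman
```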

Word Cloud

A word cloud is a visual representation of text data where word size corresponds to frequency or importance in the source text.

Word Alignment

Word alignment identifies which words in a source sentence correspond to which words in a translated sentence.

Weaviate

An open-source vector database that combines vector search with structured filtering and supports built-in modules for embedding generation and RAG.

Wikidata

A free, collaborative knowledge base maintained by the Wikimedia Foundation, containing structured data about millions of entities used by AI systems worldwide.

Web Scraper

A tool that extracts content from web pages by parsing HTML, handling JavaScript rendering, and cleaning the extracted text for AI processing.

Web Crawler

A program that systematically browses websites by following links, discovering pages that can then be scraped and added to an AI knowledge base.

Web Agent

An AI agent that can navigate and interact with websites, reading page content, clicking buttons, filling forms, and extracting information from the web.

Worker Agent

A specialized AI agent in a multi-agent system that executes specific tasks assigned by a supervisor, focusing on its area of expertise.

Working Memory

The active information an agent is currently processing, including the current query, retrieved context, tool results, and reasoning state.

Workflow

A defined sequence of steps, decisions, and actions that an AI agent or system follows to accomplish a task, often represented as a graph or pipeline.

Workflow Engine

A runtime system that executes multi-step agent workflows, managing state, transitions, error handling, and coordination between workflow steps.

WhatsApp Chatbot

A WhatsApp chatbot is an automated conversational agent deployed on WhatsApp Business API to interact with customers on the messaging platform.

Widget Customization

Widget customization is the ability to style and configure a chat widget's appearance to match brand identity and website design.

Welcome Message

A welcome message is the initial greeting displayed when a user opens a chatbot, setting expectations and encouraging engagement.

Website Chat

Website chat is a conversational interface embedded directly on a website, allowing visitors to interact with a bot or agent without leaving the page.

Webhook Channel

A webhook channel uses HTTP callbacks to deliver chatbot events and messages to external systems in real time.

White-Label Chatbot

A white-label chatbot is a fully rebrandable conversational AI solution that appears to be owned by the deploying business.

Wrap-Up

Wrap-up is the post-conversation period where an agent completes notes, categorization, and follow-up actions after a chat ends.

Whisper Mode

Whisper mode allows supervisors to send private messages to agents during active conversations, invisible to the customer.

Website Bot

A website bot is a chatbot that learns from your website content, answering questions based on information from your web pages.

White Box Model

An AI model whose internal decision-making process is transparent and directly understandable by humans, such as decision trees or linear regression.

Wake Word Detection

Wake word detection listens continuously for a specific trigger phrase like 'Hey Siri' or 'Alexa' to activate a voice assistant, running efficiently on-device.

Whisper

Whisper is OpenAI's open-source speech recognition model that supports 99 languages, automatic language detection, translation, and timestamp generation.

Wav2Vec 2.0

Wav2Vec 2.0 is a self-supervised speech representation model from Meta that learns from unlabeled audio, enabling speech recognition with very little labeled training data.

Word-Level Timestamp

Word-level timestamps assign precise start and end times to each individual word in a transcription, enabling exact audio-text alignment.

Whisper Model

Whisper is an open-source speech recognition model from OpenAI trained on 680,000 hours of multilingual audio data.

Wav2Vec

Wav2Vec is a self-supervised speech representation model from Meta that learns powerful audio features from unlabeled speech data.

Workflow Automation

Workflow automation uses AI to streamline and automate business processes by orchestrating tasks, decisions, and actions across systems and teams.

Writing Assistant

An AI writing assistant helps users create, edit, and improve written content by suggesting improvements, generating drafts, checking grammar, and adapting tone and style.

Win-Back Campaign

A win-back campaign uses AI to re-engage former customers with personalized offers, addressing the reasons they left and demonstrating new value.

Window Function

A window function performs a calculation across a set of related rows (a window) without collapsing them into a single output row, unlike aggregate functions with GROUP BY.
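
The contrast with GROUP BY can be shown with SQLite's window function support (available in SQLite 3.25+, bundled with recent Python versions); the table and values are invented:

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE sales (region TEXT, amount INTEGER)")
con.executemany("INSERT INTO sales VALUES (?, ?)",
                [("east", 100), ("east", 200), ("west", 50)])

# SUM(...) OVER (PARTITION BY region) attaches a per-region total to
# every row, while still returning one row per sale (no collapsing).
rows = con.execute("""
    SELECT region, amount,
           SUM(amount) OVER (PARTITION BY region) AS region_total
    FROM sales
    ORDER BY region, amount
""").fetchall()

for row in rows:
    print(row)
# ('east', 100, 300)
# ('east', 200, 300)
# ('west', 50, 50)
```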

Write-Ahead Log

A write-ahead log (WAL) is a sequential record of all database changes written to disk before the actual data modifications, ensuring durability and crash recovery.
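The core invariant — log first, then apply — can be shown in a few lines. This is a toy sketch, not how any real database implements its WAL; the class name and JSON line format are invented for illustration:

```python
import json
import os

class TinyWAL:
    """Toy key-value store with a write-ahead log (illustration only)."""

    def __init__(self, path):
        self.path = path
        self.data = {}
        if os.path.exists(path):          # crash recovery: replay the log
            with open(path) as f:
                for line in f:
                    rec = json.loads(line)
                    self.data[rec["key"]] = rec["value"]

    def put(self, key, value):
        with open(self.path, "a") as f:   # 1. append the change to the log...
            f.write(json.dumps({"key": key, "value": value}) + "\n")
            f.flush()
            os.fsync(f.fileno())          # ...and force it to durable storage
        self.data[key] = value            # 2. only then modify the data
```

If the process dies between steps 1 and 2, a restart simply replays the log and ends up in the state the committed writes describe — which is the durability guarantee the definition above refers to.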

Weaviate

Weaviate is an open-source vector database that combines vector search with structured filtering, offering both self-hosted and managed cloud deployment options.

Weights & Biases

Weights & Biases (W&B) is an MLOps platform that provides experiment tracking, model versioning, dataset management, and collaboration tools for AI teams.

WhyLabs

WhyLabs provides an AI observability platform for monitoring data quality, model performance, and LLM behavior with open-source tooling.

Whisper

Whisper is OpenAI's open-source speech recognition model that provides highly accurate multilingual transcription and translation capabilities.

Weights & Biases

Weights & Biases (W&B) is a platform for ML experiment tracking, visualization, and collaboration, providing tools for logging, comparing, and sharing ML experiments.

WhyLabs

WhyLabs is an AI observability platform built on the open-source whylogs library for profiling and monitoring data and ML model quality in production.

W&B Weave

W&B Weave is a toolkit from Weights & Biases for building, evaluating, and monitoring LLM applications with tracing, evaluation, and production monitoring.

W&B Artifacts

W&B Artifacts is a versioned data and model management system within Weights & Biases for tracking datasets, models, and other ML pipeline outputs.

whisper.cpp

whisper.cpp is a C/C++ port of OpenAI's Whisper speech recognition model, enabling efficient local audio transcription on CPUs and consumer hardware.

Weaviate

Weaviate is an open-source vector database that provides hybrid search combining vector similarity and keyword matching, with built-in ML model integration.

Wearable AI

Wearable AI integrates artificial intelligence into wearable devices for continuous health monitoring and real-time insights.

Wealth Management AI

Wealth management AI uses machine learning to personalize investment advice, optimize portfolios, and enhance client relationships.

Warehouse AI

Warehouse AI uses machine learning and robotics to optimize storage, picking, packing, and shipping operations.

Wildfire AI

Wildfire AI uses machine learning to detect, predict, and manage wildfires through satellite imagery and sensor networks.

Water Management AI

Water management AI uses machine learning to optimize water treatment, distribution, and conservation.

Waste Management AI

Waste management AI uses machine learning to optimize waste collection, sorting, recycling, and disposal operations.

Weather Prediction AI

Weather prediction AI uses deep learning to forecast weather conditions with accuracy rivaling or exceeding traditional numerical weather prediction models.

Water Quality AI

Water quality AI uses machine learning and sensor networks to monitor, predict, and manage water quality in real-time across treatment plants, distribution systems, and natural water bodies.

Waste Optimization AI

Waste optimization AI uses machine learning to improve waste collection efficiency, sorting accuracy, recycling rates, and overall waste management operations.

Wafer-Scale Engine

A wafer-scale engine is a processor built from an entire silicon wafer rather than individual chips, providing massive compute and memory in a single device.

Web Crawling

Web crawling is the automated process of systematically browsing the internet to discover, fetch, and catalog web pages for indexing by search engines.

Web Scraping

Web scraping is the automated extraction of structured data from web pages, transforming unstructured HTML content into usable datasets.
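A minimal scraping step, using only Python's standard library: pull every link out of an HTML snippet. Real scrapers usually rely on libraries like BeautifulSoup, fetch pages over HTTP, and must respect robots.txt and site terms; the snippet here is hard-coded for illustration:

```python
from html.parser import HTMLParser

class LinkExtractor(HTMLParser):
    """Collect the href of every <a> tag seen while parsing."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href":
                    self.links.append(value)

parser = LinkExtractor()
parser.feed('<p>See <a href="/docs">docs</a> and <a href="/blog">blog</a>.</p>')
print(parser.links)  # ['/docs', '/blog']
```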

Wildcard Search

Wildcard search uses special characters like * and ? to match patterns in search terms, enabling searches for words with unknown or variable characters.
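Python's stdlib fnmatch module implements exactly these semantics — '*' matches any run of characters, '?' matches exactly one — so it makes a compact demo (the word list is made up):

```python
import fnmatch

words = ["test", "text", "ten", "toast", "twist"]
print(fnmatch.filter(words, "te*"))    # ['test', 'text', 'ten']
print(fnmatch.filter(words, "t??st"))  # ['toast', 'twist']
```

Note that "t??st" requires exactly five characters, so four-letter words like "test" do not match even though they start with "t" and end with "st".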

Wide and Deep

Wide and Deep is a recommendation architecture that combines a linear model for memorization with a deep neural network for generalization in a single framework.

Wireframe Generation

AI wireframe generation creates structural layout sketches for web pages and apps from text descriptions, automating early-stage design exploration.

Workshop Paper

A workshop paper is a shorter research paper presented at a focused workshop co-located with a major AI conference.

World Model

A world model is an internal representation that allows an AI system to simulate and predict how the environment will change in response to actions.

WebSocket

WebSocket is a communication protocol that provides full-duplex, bidirectional communication between a client and server over a single persistent connection.

Webhook

A webhook is an HTTP callback that automatically sends data to a specified URL when a specific event occurs in a system.

Webhook Integration

A webhook integration uses HTTP callbacks to automatically notify external systems when specific events occur in an application.

Webhook Security

Webhook security encompasses the practices and mechanisms used to verify that incoming webhook requests are authentic and have not been tampered with.
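The most common mechanism is an HMAC signature: the sender signs the raw request body with a shared secret and puts the digest in a header, and the receiver recomputes and compares it in constant time. Header names and exact signing schemes vary by provider; this sketch assumes a plain HMAC-SHA256 hex digest:

```python
import hashlib
import hmac

def verify_signature(secret: bytes, body: bytes, signature_header: str) -> bool:
    """Recompute the HMAC of the raw body and compare in constant time."""
    expected = hmac.new(secret, body, hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, signature_header)

secret = b"shared-secret"
body = b'{"event": "lead.created"}'
sig = hmac.new(secret, body, hashlib.sha256).hexdigest()  # what the sender sends
print(verify_signature(secret, body, sig))                # True
print(verify_signature(secret, b'{"tampered": 1}', sig))  # False
```

hmac.compare_digest matters here: a naive == comparison can leak information through timing differences.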

Web Analytics

Web analytics measures and analyzes website traffic, user behavior, and conversion data to optimize online experiences and marketing effectiveness.

Waterfall Chart

A waterfall chart shows how an initial value is incrementally increased or decreased by intermediate positive and negative values.
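Behind the chart is simple cumulative arithmetic: each bar starts where the previous one ended. This sketch (labels and numbers invented) computes the start/end span of each bar from a list of signed deltas:

```python
def waterfall_spans(start, deltas):
    """Return (label, bar_start, bar_end) for each delta, plus the final level."""
    spans, level = [], start
    for label, delta in deltas:
        spans.append((label, level, level + delta))
        level += delta
    return spans, level

spans, total = waterfall_spans(100, [("sales", 40), ("refunds", -15), ("fees", -5)])
print(spans)  # [('sales', 100, 140), ('refunds', 140, 125), ('fees', 125, 120)]
print(total)  # 120
```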

Watson on Jeopardy!

IBM Watson defeated human champions on Jeopardy! in 2011, demonstrating advanced natural language processing and information retrieval capabilities.

X

Y

Z

Zero-Shot Learning

Zero-shot learning enables models to perform tasks or recognize categories they have never explicitly been trained on, using only a description of the task.

Zero-Shot Prompting

Zero-shot prompting is asking a language model to perform a task with just instructions and no examples, relying on its pre-trained knowledge.

Zero-Shot Learning

The ability of a model to perform a task correctly without any task-specific examples, relying solely on its pre-trained knowledge and instructions.

Zero-shot Translation

Zero-shot translation enables a model to translate between language pairs it was never explicitly trained on by leveraging multilingual representations.

Zero-Shot Classification

Zero-shot classification assigns text to categories that the model has never been explicitly trained on, using natural language descriptions.
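The idea can be sketched with embeddings: represent the text and each candidate label description in a shared vector space, then pick the label whose vector is closest. The three-dimensional vectors below are made up purely for illustration; real systems use learned models such as NLI-based classifiers or sentence encoders:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

label_vecs = {                       # hypothetical label-description embeddings
    "billing": [0.9, 0.1, 0.0],
    "technical support": [0.1, 0.9, 0.1],
}
text_vec = [0.8, 0.2, 0.1]           # hypothetical embedding of "my invoice is wrong"
best = max(label_vecs, key=lambda label: cosine(text_vec, label_vecs[label]))
print(best)  # billing
```

Because the labels enter only as vectors of their natural-language descriptions, new categories can be swapped in at query time without retraining — which is what makes the approach "zero-shot".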

ZeRO Optimization

ZeRO (Zero Redundancy Optimizer) is a memory optimization technique from DeepSpeed that partitions model states across GPUs to reduce memory redundancy in distributed training.

Zero-Shot Image Classification

Zero-shot image classification categorizes images into classes that were not seen during training, using learned relationships between visual features and language.

Zero-Shot TTS

Zero-shot TTS generates speech in a new voice from just a few seconds of reference audio, without any fine-tuning or training on that voice.

Zendesk

Zendesk is a customer service and support platform offering help desk, ticketing, chat, and AI-powered automation for managing customer interactions across channels.

Zendesk AI

Zendesk AI encompasses the artificial intelligence features built into Zendesk's customer service platform, including AI agents, smart routing, and automated responses.

ZenML

ZenML is an extensible, open-source MLOps framework for building portable, production-ready ML pipelines that integrate with any ML tool and infrastructure.

Zero-Shot Retrieval

Zero-shot retrieval enables search systems to find relevant documents for queries on topics or domains not seen during training, without requiring domain-specific fine-tuning.

Zapier

Zapier is an automation platform that connects thousands of web applications through no-code workflows called Zaps.

Ready to build your AI agent?

Now that you understand the terminology, see how InsertChat puts it all together.

Try InsertChat free

7-day free trial · Cancel anytime · No commitment

Questions & Answers

Frequently asked questions


Product FAQ

What is InsertChat?

An AI agent workspace that lets you build agents grounded in your knowledge and deploy them to web, app, or API. Connect tools and integrations to complete workflows.

What's the difference between an agent and an InsertChat agent?

A basic agent is prompt-only. InsertChat agents are grounded in your sources, configurable per use case, and able to use tools and integrations.

How do agents stay accurate and avoid hallucinations?

Ground your agent in a knowledge base your team controls and keep it fresh. Use analytics to find gaps and improve coverage over time.

What can I connect as knowledge?

URLs, sitemaps, documents (PDF and office files), media like YouTube and audio, and structured data. The goal is a clear source of truth for answers.

Do sources stay up to date?

Yes. Refresh sources on demand or set up scheduled refresh depending on the source type.

Which AI models can I use?

GPT-5.2, Claude Sonnet 4.5, Gemini 3.0, Llama 4, Grok 4.1, DeepSeek V3.2, and more. Choose the model per chat, or use BYOK to manage provider access yourself.

Can I pick different models for different workflows?

Yes. Use a faster model for common questions and a stronger model for complex reasoning. InsertChat supports that balance per conversation.

Where can I deploy an agent?

Website widget, in-app embed, or API. Keep one agent setup and reuse it across channels.

Do I need coding skills?

No. Build and deploy AI agents using our visual builder. The embed code is one line of JavaScript.

Can I customize the branding and UI?

Yes. Customize the widget to match your brand. White-label options are available for a fully branded experience.

Does InsertChat support voice?

Yes. Voice dictation and text-to-speech let users speak instead of type.

Does InsertChat support vision?

Yes. Enable vision for agents when images help clarify a request or context.

Can the agent hand off to a human?

Yes. Configure human handoff so the agent escalates when needed. Full conversation history is passed along.

Do you provide analytics?

Yes. Track chats, leads, feedback, and credits used. Find gaps in coverage and prioritize fixes.

Is it mobile friendly?

Yes. The widget and embeds work well on desktop and mobile with no separate experience needed.

What is the fastest way to get started?

Create an account, upload one document, and ask your first question. Most teams go live in under 5 minutes.

Pricing FAQ

What do I pay for with InsertChat?

Pricing is based on how many agents you run, what knowledge sources you connect, and how much conversation usage you drive. Check the pricing page for current tiers.

How much does InsertChat cost?

Plans start at $29/month. Verify the latest pricing and included limits on the pricing page.

Is pricing per seat or per teammate?

Pricing is oriented around agents, sources, and usage rather than seats. Enterprise plans are available for larger teams.

Can I start small and upgrade later?

Yes. Start with self-serve, validate your use case, then scale up as needed.

Can I cancel anytime?

Yes. Cancel anytime with no long-term contract. Your data remains available for 30 days after cancellation.

Do you offer enterprise pricing?

Yes. Enterprise plans cover larger orgs, advanced requirements, and custom deployment needs.

Do you support annual billing?

Yes. Toggle to annual billing on the pricing page and save 20%. For invoicing or procurement workflows, contact us.

What happens if we hit limits?

You will get a notification before you hit a limit. Upgrade your plan anytime with one click, or reduce usage. Nothing stops working without warning.

What counts as a source?

A source is any connected item your agent learns from: a URL, document, YouTube link, or other knowledge input. Your plan determines how many you can connect.

What are credits?

Credits budget your usage across conversations, sources, and tools. They keep costs predictable as you scale.

Can I control which models we use so costs do not spike?

Yes. Choose the model per chat to balance quality, speed, and budget for different workflows.

What is BYOK?

Bring Your Own Key. Use your own provider API key for model access to consolidate billing or apply your own setup.

Can I use my own logo and domain on the $29 plan?

Yes. The $29 plan includes your own logo and custom domain.

Can I test before I commit?

Start self-serve to validate your workflow. For guided proof-of-concept or enterprise requirements, contact us.

Do you have discounts for startups or nonprofits?

If pricing is a blocker, contact us with your context.

How do I start?

Sign up for a 7-day free trial with full access. Pick your plan after you see it working with your own content.

Security FAQ

Where is my data stored?

European servers. GDPR compliant, never used for training, and deletable at any time.

What gets sent to AI model providers?

Your prompt and relevant context excerpts from connected sources are sent to the selected model provider to generate an answer.

Do you use our data to train models?

No. InsertChat never uses your data to train models.

Is my data isolated from other customers?

Yes. Data is scoped to your workspace and agents. Sources and conversations remain isolated.

Can I delete data?

Yes. Delete sources, conversation history, leads, and feedback at any time.

What data does InsertChat store?

Agent configuration, connected knowledge sources, and conversation data needed for the experience and analytics.

Can I keep an agent private?

Yes. Choose public or private agents depending on whether anyone or only authenticated users can access the embed.

Do you have role-based access controls?

Yes. Control who can manage agents and data with role-based access.

Can I restrict what the agent can do?

Yes. Control tool enablement per agent to limit actions to only what is necessary.

Do you support GDPR?

Yes. Full GDPR compliance with Data Processing Addendum (DPA) available on request.

Can you provide a DPA?

Yes. Our DPA covers processing obligations, subprocessors, and deletion/return terms. Contact us to request it.

Do you list subprocessors?

Yes. Subprocessors are documented in the DPA. Request it or contact us for details.

How do you handle security questionnaires?

Contact us and we provide the right documentation for your team's review process.

Is InsertChat safe to embed on a public website?

Yes, when configured correctly. Ground answers in approved sources and keep tool access controlled.

Do you support self-hosting?

Yes. Enterprise plans include self-hosting and bring-your-own-LLM options.

How do I evaluate InsertChat?

Start a free trial with non-sensitive data. When ready, request our security questionnaire and DPA.

Integrations FAQ

What integrations are available?

600+ integrations including Slack, Notion, Google Workspace, Salesforce, HubSpot, Zendesk, Shopify, WooCommerce, and Zapier. Our REST API allows custom integrations with any system.

Can I connect to Slack?

Yes. Deploy your agent directly to Slack so your team can interact in channels or DMs.

Do you integrate with HubSpot?

Yes. Sync leads, contacts, and conversation data directly into HubSpot.

Can I use InsertChat with Zendesk?

Yes. Ticket creation, handoffs, and conversation syncing are all supported.

Do you support Shopify?

Yes. Your agent can answer product questions, check order status, and assist with common e-commerce queries.

What about WooCommerce?

Yes. WooCommerce works similarly to Shopify with access to product catalogs and order information.

Can I connect Google Workspace?

Yes. Connect Google Drive, Docs, and other Workspace tools as knowledge sources.

Do you have a Zapier integration?

Yes. Connect InsertChat with thousands of apps via Zapier to automate workflows and sync data.

Can the agent search the web?

Yes. Enable web search so the agent can find current information beyond your knowledge base.

Do you support calendar booking?

Yes. The agent can schedule meetings directly during conversations.

Can I use webhooks?

Yes. Send events to your own systems for custom integrations and real-time notifications.

Do you have an API?

Yes. Full REST API for creating agents, managing sources, and interacting with conversations programmatically.

Can I install it with Google Tag Manager?

Yes. Install via script embed or Google Tag Manager.

Can I embed it in my product?

Yes. Use in-app embeds for a native feel, or the API to build a custom interface.

Do you support custom SMTP?

Yes. Custom domain and SMTP options are available so outbound messaging aligns with your infrastructure.

How do I connect my first integration?

Start your trial, go to Settings > Integrations, and connect in one click. 600+ apps available.