Glossary

AI glossary for content assistants

Plain-English definitions of 13,917 AI terms for branded assistant teams.

Plain EnglishRAGLLMs

Start for Free

Search glossary terms

13,917 glossary pages match your filters.

Glossary

13,917 terms. Open one for definitions and related concepts.

Web Scraping

Web scraping is the automated extraction of structured data from web pages, transforming unstructured HTML content into usable datasets.

Open page

Relevance Score

A relevance score is a numerical value assigned to a search result indicating how well it matches a query, used to rank results from most to least relevant.

Open page

Search Result

A search result is an individual item returned by a search system in response to a query, typically containing a title, snippet, URL, and relevance metadata.

Open page

SERP

SERP (Search Engine Results Page) is the page displayed by a search engine in response to a query, containing organic results, ads, featured snippets, and other elements.

Open page

Query Understanding

Query understanding is the process of interpreting a search query to determine user intent, extract entities, and transform the query for better retrieval.

Open page

Query Parsing

Query parsing is the process of analyzing and breaking down a search query into structured components like keywords, operators, phrases, and filters.

Open page

Query Expansion

Query expansion automatically adds related terms, synonyms, or contextual words to a search query to improve recall and find more relevant results.

Open page

Query Suggestion

Query suggestion recommends alternative or refined search queries to users based on popular searches, related topics, and query patterns.

Open page

Spell Correction

Search spell correction automatically detects and fixes misspelled query terms to ensure users find relevant results despite typing errors.

Open page

Filtered Search

Filtered search narrows search results by applying constraints on specific fields or attributes, such as date ranges, categories, prices, or status values.

Open page

Phrase Search

Phrase search finds documents containing an exact sequence of words in the specified order, typically indicated by enclosing the phrase in quotation marks.

Open page

Proximity Search

Proximity search finds documents where specified terms appear within a certain distance of each other, balancing between exact phrase matching and independent keyword search.

Open page

Wildcard Search

Wildcard search uses special characters like * and ? to match patterns in search terms, enabling searches for words with unknown or variable characters.

Open page

Range Search

Range search finds documents with field values falling within a specified numeric, date, or alphanumeric range, enabling queries like price ranges or date intervals.

Open page

Geospatial Search

Geospatial search finds documents or records based on geographic location, supporting queries like finding items within a radius or inside a geographic boundary.

Open page

Pointwise Ranking

Pointwise ranking is a learning-to-rank approach that independently scores each document for relevance, treating ranking as a regression or classification problem on individual items.

Open page

Pairwise Ranking

Pairwise ranking is a learning-to-rank approach that trains models to correctly order pairs of documents, optimizing for relative relevance rather than absolute scores.

Open page

Listwise Ranking

Listwise ranking is a learning-to-rank approach that optimizes the entire ranked list at once, directly maximizing ranking metrics like nDCG.

Open page

RankNet

RankNet is a pairwise learning-to-rank algorithm that uses a neural network with a probabilistic cross-entropy loss to learn document relevance ordering.

Open page

LambdaRank

LambdaRank extends RankNet by weighting pairwise gradients by the change in ranking metrics, directly optimizing for measures like nDCG.

Open page

LambdaMART

LambdaMART combines LambdaRank gradients with gradient boosted decision trees, producing one of the most effective learning-to-rank algorithms in practice.

Open page

BERT Ranking

BERT ranking uses BERT language models to understand the semantic relationship between queries and documents, dramatically improving search relevance over keyword-based methods.

Open page

Query-Document Relevance

Query-document relevance measures the degree to which a document satisfies the information need expressed by a search query, forming the basis of search ranking.

Open page

Click-Through Rate in Search

Click-through rate (CTR) in search measures the percentage of users who click on a search result, serving as an implicit indicator of result relevance and quality.

Open page

Dwell Time

Dwell time is the duration a user spends on a page after clicking a search result before returning to the search results, indicating content satisfaction.

Open page

Search Quality

Search quality encompasses the overall effectiveness of a search system, measured through relevance metrics, user satisfaction, and operational performance indicators.

Open page

Apache Lucene

Apache Lucene is an open-source full-text search library written in Java that provides indexing and search capabilities used as the foundation for Elasticsearch and Solr.

Open page

Vespa

Vespa is an open-source big data serving engine developed by Yahoo that combines search, recommendation, and machine learning serving in a single platform.

Open page

Forward Index

A forward index maps documents to their contained terms and attributes, complementing the inverted index by enabling document-level lookups and attribute access.

Open page

Posting List

A posting list is the list of document identifiers (and optionally positions and frequencies) associated with a term in an inverted index.

Open page

Term Dictionary

A term dictionary is the vocabulary component of a search index that maps terms to their posting lists, enabling fast lookup of which documents contain each term.

Open page

Search Analyzer

A search analyzer is a text processing pipeline that transforms raw text into normalized tokens for indexing and querying, combining character filters, tokenizers, and token filters.

Open page

Token Filter

A token filter is a component of a search analyzer that transforms, removes, or adds tokens during text analysis, such as lowercasing, stemming, or adding synonyms.

Open page

Character Filter

A character filter preprocesses raw text before tokenization in a search analyzer, handling tasks like stripping HTML, normalizing characters, or mapping special patterns.

Open page

Search Stemmer

A search stemmer reduces words to their root or base form during text analysis, enabling matching between different word forms like "running," "runs," and "ran."

Open page

N-Gram Tokenizer

An n-gram tokenizer splits text into overlapping sequences of N characters, enabling partial matching, substring search, and handling of languages without word boundaries.

Open page

Edge N-Gram

Edge n-gram tokenization generates character sequences starting from the beginning of each token, commonly used to implement autocomplete and prefix-matching search features.

Open page

Dense Passage Retrieval

Dense passage retrieval (DPR) uses dual-encoder neural networks to encode queries and passages as dense vectors for efficient semantic similarity search.

Open page

Sentence Similarity

Sentence similarity measures how semantically close two sentences are, using vector representations to quantify meaning overlap for search, deduplication, and matching.

Open page

Semantic Matching

Semantic matching determines whether two text inputs convey the same meaning or intent, going beyond keyword overlap to understand conceptual equivalence.

Open page

Zero-Shot Retrieval

Zero-shot retrieval enables search systems to find relevant documents for queries on topics or domains not seen during training, without requiring domain-specific fine-tuning.

Open page

Late Interaction

Late interaction is a retrieval architecture that encodes queries and documents independently but uses token-level interaction for scoring, balancing efficiency with accuracy.

Open page

Multi-Vector Search

Multi-vector search represents documents using multiple embedding vectors rather than a single vector, capturing richer semantic information for more accurate retrieval.

Open page

Cross-Lingual Search

Cross-lingual search enables finding relevant documents in one language using queries written in a different language, bridging language barriers in information retrieval.

Open page

Multilingual Search

Multilingual search enables a single search system to handle queries and documents in multiple languages, providing relevant results regardless of the language used.

Open page

Visual Search

Visual search enables finding information using images as queries instead of text, using computer vision and AI to match visual content with relevant results.

Open page

Passage Retrieval

Passage retrieval finds and returns specific text passages within documents that are most relevant to a query, rather than returning entire documents.

Open page

User-Based Collaborative Filtering

User-based collaborative filtering recommends items by finding users with similar preferences and suggesting items those similar users have liked.

Open page

Page 131 of 290. Showing 48 of 13,917 matching glossary pages.

Turn owned content into answers

Use InsertChat to launch a branded assistant visitors can ask directly.

Start for Free

7-day free trial · No card required

Interactive FAQ

Try the FAQ like a visitor.

Open product, pricing, security, integration, and free-tool questions in the same chat your visitors use.

InsertChat

Interactive FAQ

Hey. Pick a question below and see how InsertChat turns FAQs into clear, source-backed answers.

Just now

0 of 21 questions explored Instant FAQ answers

Product FAQ

What is InsertChat?

InsertChat is a white-label AI assistant for your website. Train it, brand it, publish it, and learn from visitor questions.

How does InsertChat use my website content?

Connect approved pages, docs, videos, FAQs, policies, and other sources. InsertChat turns them into source-backed answers and next steps.

Can I control the assistant's tone and sources?

Yes. Choose its sources, tone, welcome message, and prompts so it stays on brand.

How does InsertChat stay accurate?

Answers use approved content and source links. Analytics show unclear or missing answers so you can improve coverage.

Can it collect leads or route support questions?

Yes. InsertChat can collect details, qualify intent, add context, and send chats to the right inbox, CRM, workflow, or person.

Can I control how the assistant behaves?

Yes. Control prompts, model choice, tool access, and the branded assistant experience so behavior stays consistent.

Which AI models can I use?

InsertChat supports multiple model providers. Choose each assistant's model for quality, speed, and cost, or use BYOK.

Can I pick different models for different workflows?

Yes. Use a faster model for common questions and a stronger model for complex reasoning. InsertChat supports that balance per conversation.

Where can I deploy an assistant?

Use a widget, embed, full-page assistant, custom domain, in-app embed, or API. Reuse one setup across surfaces.

Do I need coding skills?

No. Build and deploy AI assistants using our visual builder. The embed code is one line of JavaScript.

Can I customize the branding and UI?

Yes. Customize the assistant name, logo, colors, welcome message, suggested prompts, tone, domain, and white-label presentation.

Can I use my own domain?

Yes. Custom domains are supported, typically via enterprise options.

Does InsertChat support voice?

Yes. Voice dictation and text-to-speech let users speak instead of type.

Does InsertChat support vision?

Yes. Enable vision for assistants when images help clarify a request or context.

What tools and integrations are supported?

Zendesk, HubSpot, Shopify, WooCommerce, calendar booking, web search, Perplexity, and webhooks for your own systems.

Can I control which tools the assistant is allowed to use?

Yes. Tool access is controlled per assistant so you enable only what you need.

Can the agent hand off to a human?

Yes. Configure human handoff so the agent escalates when needed. Full conversation history is passed along.

Do you provide analytics?

Yes. Track chats, leads, feedback, top questions, unanswered questions, most-used sources, and content gaps.

Is it mobile friendly?

Yes. The widget and embeds work well on desktop and mobile with no separate experience needed.

What's the fastest path to a successful deployment?

Start with one assistant and a small set of high-value sources. Iterate using real questions from analytics.

What is the fastest way to get started?

Create an account. Connect one key source. Ask a test question, brand the assistant, then publish it on one page.