Connected knowledge

Use PolyAI’s Raven LLM for best results!

Connected Knowledge lets you connect and manage external knowledge sources so your agent can reference accurate, up-to-date information when responding to customer queries. It is designed for teams with information spread across documentation sites, help desk systems, PDFs, and internal materials that change over time.

What is Connected Knowledge?

Connected Knowledge is a fast way to expose external knowledge to your agent. You can connect URLs, files, or platform integrations, keep them synced, and toggle availability across environments or variants. Connected Knowledge does not replace Managed Topics. They work together, and each offers different levels of control and complexity. Both use RAG (retrieval-augmented generation) to match user queries.

Why use multiple knowledge sources?

Your organization may have knowledge living in:

websites
documents such as PDFs, CSV, JSON, and other internal reference files
existing knowledge bases, help desk systems or applications, such as Zendesk or Gladly

Connected Knowledge brings these together, keeps them updated, and lets you reuse them across projects without rewriting content.

How Connected Knowledge differs from the Managed Topics

Both the Connected Knowledge and Managed Topics tabs expose information to your agent. They differ in the following ways: Connected Knowledge

A connection layer for external knowledge.
Fast to set up and simple to manage.
Ideal for FAQ-style agents and large volumes of continuously updated content.
No prompting skill required.
Cannot: trigger actions, flows, SMS, hand-offs, or other agentic functions.
Cannot: specify utterances or control when/why the agent uses specific pieces of information.

Managed Topics

A curated library of topics and prompts.
Offers fine-grained control over utterances, behaviours, and what the agent says.
Can trigger functions, flows, and other agentic actions.
Requires more time, expertise, and maintenance — but enables anything beyond a simple FAQ bot.

Feature	Pain points solved	Use cases
Connected Knowledge	Helps teams avoid maintaining another curated knowledge base and gives non-technical users a simple, fast way to connect and manage external data.	Best for FAQ bots, quickly incorporating external knowledge, and controlling data access across environments or variants.
Managed Topics	Solves the need for actions, functions, flows, SMS triggers, and offers precise, curated control over agent utterances.	Ideal for agentic behaviour, structured and stable knowledge, and projects requiring complex logic or fine-grained control.

Add a new source

Go to Build → Connected Knowledge
Select New source
Choose one of:
- Upload files
- Add URL
- Zendesk
- Gladly
- Salesforce (coming soon)
- Notion (coming soon)
Complete the required details and click Add

Your agent will begin Syncing the content. Once ready, the source appears in the list.

Supported source types

Source Type	Details
Upload files — Text & structured data	`.txt`, `.csv`, `.json`, `.xml`, `.md`, `.html`, `.rtf`
Upload files — PDF	`.pdf`
Upload files — Microsoft Office	`.docx`, `.doc`, `.docm`, `.xlsx`, `.xls`, `.xlsm`, `.pptx`, `.ppt`, `.pptm`, `.msg`
Upload files — OpenDocument	`.odt`, `.ods`, `.odp`
Upload files — Email files	`.eml`
Upload files — E-books	`.epub`
URL scraping	Public documentation pages and help center articles
Zendesk (beta)	Help Center content with API sync
Gladly (beta)	Knowledge source sync
Salesforce (coming soon)	Salesforce knowledge sync
Notion (coming soon)	Notion workspace sync

What exactly gets scraped when I upload a URL?

Connected Knowledge uses a third-party scraper that traverses linked pages, but with limits:

Depth → Only one level below the initial URL.
Breadth → A maximum of 10 embedded pages.

If your page contains more than 10 links, not all will be scraped. In that case, upload additional URLs individually or use integrations like Zendesk/Gladly for complete coverage.

We recommend to connect applications such as Zendesk, over relying on websites where possible!

Keeping content fresh

After external content changes:

click Update to re-scrape files or URLs
or use the Sync icon per source

If a URL requires login or credentials change, syncing may fail. Update access and retry.

Group and manage sources

Group sources by product line, team, region, or document type. Sort by newest, oldest, type, or name. Each source offers:

Sync
Rename
Move to group
Remove

Why isn’t my agent using the sources I connected?

Several factors affect retrieval:

Data structure

Connected Knowledge splits content into 2000-character chunks with 500-character overlap. Very large documents or widely separated related sections may struggle more with relevance. What to do:

Restructure documents into smaller, tighter pieces.
Repeat key headings or terms.
Or curate the material as a managed topic for guaranteed usage.

Update state

Two updates must be current:

Source Update → keeps the data in each source fresh
Agent Update → applies knowledge connection changes to the agent

Both can be triggered manually. Agent updates also run automatically every few minutes.

Environments, variants, saved changes

Each source must be enabled in the correct environment and variant. Any edits must be saved before leaving the page.

Conflicting information?

If the Managed Topics and Connected Knowledge contain conflicting data, the Managed Topics wins. Content from Managed Topics is always prioritised

Tips & tricks

Use PolyAI’s Raven LLM for best results — it paraphrases structured and unstructured content more naturally.
Connected Knowledge results receive a small bias boost to encourage use; this can be tuned if needed.
Connected Knowledge data and Managed Topics are merged at runtime.
- Any system-prompt style guidance applies to both.

Introduction

Analytics

Build

Channels

Configure

Deployments

Troubleshoot

Legal

Connected knowledge

What is Connected Knowledge?

Why use multiple knowledge sources?

How Connected Knowledge differs from the Managed Topics

Add a new source

Supported source types

What exactly gets scraped when I upload a URL?

Keeping content fresh

Group and manage sources

Why isn’t my agent using the sources I connected?

Data structure

Update state

Environments, variants, saved changes

Conflicting information?

Tips & tricks

Introduction

Analytics

Build

Channels

Configure

Deployments

Troubleshoot

Legal

​What is Connected Knowledge?

​Why use multiple knowledge sources?

​How Connected Knowledge differs from the Managed Topics

​Add a new source

​Supported source types

​What exactly gets scraped when I upload a URL?

​Keeping content fresh

​Group and manage sources

​Why isn’t my agent using the sources I connected?

​Data structure

​Update state

​Environments, variants, saved changes

​Conflicting information?

​Tips & tricks

What is Connected Knowledge?

Why use multiple knowledge sources?

How Connected Knowledge differs from the Managed Topics

Add a new source

Supported source types

What exactly gets scraped when I upload a URL?

Keeping content fresh

Group and manage sources

Why isn’t my agent using the sources I connected?

Data structure

Update state

Environments, variants, saved changes

Conflicting information?

Tips & tricks