Confluence Integration

Connect Atlassian Confluence to Rapidflare to ingest wiki pages and documentation from your team's knowledge base, enabling your AI Agent to answer questions using your internal documentation.

Overview

The Confluence integration allows you to:

  • Sync spaces - Connect one or more Confluence spaces
  • Preserve hierarchy - Maintain page parent-child relationships
  • Include attachments - Optionally ingest attached files
  • Track versions - Detect updated pages automatically

Supported Content

Content Type | Support | Notes
Pages | Full | Full content with formatting
Blog Posts | Full | Treated as regular pages
Attachments | Full | PDFs, documents, images with text
Comments | Partial | Can be included optionally
Page Labels | Full | Used as metadata for filtering

Setting Up Confluence

Step 1: Add a New Source

  1. Navigate to Sources in your admin dashboard
  2. Click Add Source
  3. Select Confluence

Step 2: Authenticate

For Confluence Cloud:

  1. Click Connect to Confluence
  2. Sign in with your Atlassian account
  3. Grant Rapidflare read-only access
  4. Select your Confluence site if you have multiple

For Confluence Data Center/Server:

  1. Generate an API token (called a personal access token in Confluence Data Center/Server) from your Confluence profile settings
  2. Enter your Confluence URL and API token
  3. Test the connection
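
If you want to check a Data Center/Server token yourself before entering it, the sketch below shows one way to do it. It is not Rapidflare's connection test. It assumes the token is a personal access token sent as a Bearer header and that listing a single space through the Confluence REST API is an acceptable probe; the function names and the example host are hypothetical.

```python
import urllib.request


def build_connection_test_request(base_url: str, token: str) -> urllib.request.Request:
    """Build a GET request that lists one space, as a cheap connectivity check."""
    # Strip a trailing slash so the joined URL has exactly one separator.
    url = base_url.rstrip("/") + "/rest/api/space?limit=1"
    request = urllib.request.Request(url, method="GET")
    # Assumption: the token is a personal access token, sent as a Bearer token.
    request.add_header("Authorization", f"Bearer {token}")
    request.add_header("Accept", "application/json")
    return request


def connection_ok(request: urllib.request.Request, timeout: float = 10.0) -> bool:
    """Send the request; True on HTTP 200, False on any HTTP or network error."""
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # URLError, HTTPError, and socket timeouts are all OSError subclasses.
        return False
```

A call such as `connection_ok(build_connection_test_request("https://confluence.example.com", token))` returning False usually points to a wrong URL, an expired token, or a network restriction.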

Step 3: Configure Spaces

  • Space Selection - Choose which spaces to include
  • Page Filters - Optionally filter by labels or parent pages
  • Include Attachments - Toggle attachment ingestion
  • Exclude Patterns - Skip pages matching certain titles or labels
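
To make the interaction between these options concrete, here is a minimal sketch of how such filters could be combined. The `SpaceConfig` class, its field names, and the order of checks are assumptions for illustration only; they do not describe Rapidflare's actual configuration schema or matching rules.

```python
from dataclasses import dataclass, field
from fnmatch import fnmatch


@dataclass
class SpaceConfig:
    """Hypothetical container for the Step 3 options."""
    space_keys: set[str]
    required_labels: set[str] = field(default_factory=set)
    exclude_title_patterns: list[str] = field(default_factory=list)
    exclude_labels: set[str] = field(default_factory=set)
    include_attachments: bool = False


def should_ingest(space_key: str, title: str, labels: set[str], config: SpaceConfig) -> bool:
    """Return True if a page passes space selection, label filters, and exclusions."""
    # Space Selection: the page must live in a selected space.
    if space_key not in config.space_keys:
        return False
    # Page Filters: if labels are required, the page needs at least one of them.
    if config.required_labels and not (labels & config.required_labels):
        return False
    # Exclude Patterns (labels): any excluded label rejects the page.
    if labels & config.exclude_labels:
        return False
    # Exclude Patterns (titles): case-insensitive glob match on the title.
    lowered = title.lower()
    return not any(fnmatch(lowered, p.lower()) for p in config.exclude_title_patterns)
```

In this sketch exclusions always win over inclusions, so a page that carries both a required label and an excluded label is skipped.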

Step 4: Initial Ingestion

Rapidflare will:

  1. Connect to your Confluence instance
  2. Enumerate pages in selected spaces
  3. Download page content and attachments
  4. Extract and index all content
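
The enumeration step can be pictured as a paginated walk over each space. The sketch below assumes an API that accepts `start` and `limit` parameters and returns a `results` list, which is how the Confluence REST content endpoints paginate. The `fetch` callable is a hypothetical stand-in for the real HTTP call, and nothing here reflects Rapidflare's internal implementation.

```python
from typing import Callable, Iterator

# A fetcher takes (start, limit) and returns one batch shaped like {"results": [...]}.
PageFetcher = Callable[[int, int], dict]


def enumerate_pages(fetch: PageFetcher, limit: int = 25) -> Iterator[dict]:
    """Yield every page by requesting batches until an empty batch comes back."""
    start = 0
    while True:
        batch = fetch(start, limit).get("results", [])
        if not batch:
            # An empty batch means there is nothing left to enumerate.
            return
        yield from batch
        # Advance by what was actually returned, since a server may cap the limit.
        start += len(batch)
```

Stopping on an empty batch costs one extra request per space, but it stays correct when the server returns fewer items than the requested limit.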

Content Processing

Page Content

Confluence pages are processed with:

  • Full HTML content extraction
  • Table formatting preservation
  • Macro expansion where possible
  • Internal link tracking

Attachments

Attached files are processed based on type:

  • PDFs - Full text extraction
  • Office Documents - Content and tables extracted
  • Images - Stored for display in responses

Metadata

Each page includes metadata for context:

  • Page title and URL
  • Space name and key
  • Labels and categories
  • Last modified date
  • Author information
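
The metadata fields above map naturally onto a small record. The sketch below shows one possible mapping from a Confluence REST content response, assuming the response was requested with the space, version, and labels expanded. The `PageMetadata` class is hypothetical, and the JSON keys it reads (`space.key`, `version.when`, `version.by.displayName`, `metadata.labels.results`, `_links.webui`) should be treated as assumptions to verify against your Confluence version.

```python
from dataclasses import dataclass


@dataclass
class PageMetadata:
    """Hypothetical record holding the per-page metadata listed above."""
    title: str
    url: str
    space_key: str
    space_name: str
    labels: list[str]
    last_modified: str
    author: str


def metadata_from_content(content: dict, base_url: str) -> PageMetadata:
    """Build a PageMetadata from a content dict, tolerating missing keys."""
    space = content.get("space", {})
    version = content.get("version", {})
    label_results = content.get("metadata", {}).get("labels", {}).get("results", [])
    return PageMetadata(
        title=content.get("title", ""),
        # The web UI link is relative, so prefix it with the site base URL.
        url=base_url.rstrip("/") + content.get("_links", {}).get("webui", ""),
        space_key=space.get("key", ""),
        space_name=space.get("name", ""),
        labels=[item["name"] for item in label_results if "name" in item],
        last_modified=version.get("when", ""),
        author=version.get("by", {}).get("displayName", ""),
    )
```

Missing keys fall back to empty values in this sketch, so a partially expanded response still produces a usable record.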

Best Practices

Space Organization

  • Create dedicated spaces for AI-ready content
  • Use labels consistently to categorize content
  • Archive outdated pages rather than leaving them active

Content Quality

  • Keep pages focused on single topics
  • Use clear, descriptive titles
  • Update pages regularly to maintain accuracy
  • Add labels to help with categorization

Access Control

  • Ensure the authenticating account has access to target spaces
  • Consider using a service account for stable access
  • Review space permissions periodically

Troubleshooting

Pages Not Appearing

  • Verify the page is in a connected space
  • Check that your account can view the page
  • Ensure the page isn't restricted or in a personal space
  • Look for exclusion patterns that might match

Authentication Issues

  • For Cloud: Re-authenticate through the Atlassian OAuth flow
  • For Data Center/Server: Verify your API token is valid and hasn't expired
  • Check that your account hasn't been deactivated

Attachment Errors

  • Very large attachments may time out during extraction
  • Some file formats may not be supported
  • Check the ingestion logs for specific error messages

Stale Content

  • Trigger a manual refresh from the source settings
  • Check whether the page was updated in Confluence after the last sync
  • Verify the sync schedule is active
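
When diagnosing stale content, it can help to think of change detection as a comparison of page version numbers. The sketch below assumes each page has an integer version that increases on every edit, as Confluence page versions do. The function name and the two input dictionaries (page ID to version) are hypothetical and are not part of Rapidflare's interface.

```python
def find_stale_pages(indexed: dict[str, int], current: dict[str, int]) -> dict[str, list[str]]:
    """Compare indexed versions with current ones and classify page IDs."""
    # Updated: present on both sides, but the current version is newer.
    updated = sorted(
        page_id for page_id, version in current.items()
        if page_id in indexed and version > indexed[page_id]
    )
    # Added: exists in Confluence but has never been indexed.
    added = sorted(page_id for page_id in current if page_id not in indexed)
    # Removed: indexed earlier but no longer present in Confluence.
    removed = sorted(page_id for page_id in indexed if page_id not in current)
    return {"updated": updated, "added": added, "removed": removed}
```

If a page you expect to be refreshed does not show up under "updated" in a comparison like this, its version in Confluence has not changed since the last sync.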