Notion

🚀

Enhanced

Direct integration with Langfuse tracing

Notion is a collaboration platform that combines note-taking, knowledge management, and project management. This module provides three different loaders to process Notion content: Database, Page, and Folder loaders.

Notion Database Loader

Notion Database Node

The Database loader extracts content from Notion databases, treating each row as a separate document.

Features

  • Load database rows as documents
  • Extract properties as metadata
  • Support property headers
  • Handle concurrent loading
  • Process content with text splitters
  • Customize metadata extraction

Required Parameters

  • Connect Credential: Notion API credentials
  • Database Id: The unique identifier of the Notion database

Notion Page Loader

Notion Page Node

The Page loader extracts content from Notion pages, including all child pages as separate documents.

Features

  • Load page content as documents
  • Process child pages recursively
  • Extract page properties
  • Handle page hierarchy
  • Support text splitting
  • Customize metadata extraction

Required Parameters

  • Connect Credential: Notion API credentials
  • Page Id: The 32-character hex identifier from the page URL

Notion Folder Loader

Notion Folder Node

The Folder loader processes exported and unzipped Notion content from a local folder.

Features

  • Process exported Notion content
  • Handle multiple pages
  • Support local file system
  • Extract page content
  • Maintain document structure
  • Support text splitting
  • Customize metadata extraction

Required Parameters

  • Notion Folder: Path to the exported and unzipped Notion folder

Common Features

All Notion loaders support:

Optional Parameters

  • Text Splitter: A text splitter to process the extracted content
  • Additional Metadata: JSON object with additional metadata
  • Omit Metadata Keys: Comma-separated list of metadata keys to omit

Outputs

  • Document: Array of document objects containing metadata and pageContent
  • Text: Concatenated string from pageContent of documents

Authentication

API Authentication (Database & Page Loaders)

  • Requires Notion Integration Token
  • API rate limiting handled automatically
  • Support for workspace-level access
  • Secure credential management

Local Access (Folder Loader)

  • No authentication required
  • Direct file system access
  • Process offline content
  • Handle exported data

Document Structure

Each document contains:

  • pageContent: Extracted text content
  • metadata:
    • source: Original source (URL or file path)
    • title: Page or database title
    • properties: Notion properties
    • Additional custom metadata

Notes

  • API loaders require Notion integration setup
  • Folder loader needs exported content
  • Rate limiting handled automatically
  • Memory-efficient processing
  • Error handling for invalid inputs
  • Support for large datasets
  • Flexible output formats
  • Metadata customization