Data Ingestion Methods
LarkupRAG supports a variety of ingestion methods:Direct Text Input
Direct Text Input
Paste text snippets or write content directly into the UI.
File Upload
File Upload
Upload PDFs, markdown files, and standard text documents directly from your computer.
Web Scraping
Web Scraping
Extract content directly from URLs using Firecrawl, or perform programmatic Google searches via Serper.dev.

Local Web Scraping
You can perform advanced web scraping locally, directly extracting clean data from target websites.Setting up Firecrawl Locally
You can run Firecrawl locally to avoid external API limits. To do this, simply point your configuration to your local Firecrawl instance instead of the hosted API. To unlock premium web scraping features, or if using a hosted instance, configure the following in your.env file or directly in the UI:
Custom Proxy Configuration
If your organization requires routing traffic through a proxy (e.g., for secure scraping or bypassing strict rate limits), you can easily set proxy configurations directly in your.env file.

