Gets the setting (schedule, URL, etc.) for web crawl data sources.
Required roles: All
API key for authentication
Ingestion setting ID for the web crawl data source
Successful retrieval of web crawl setting
Web crawl ingestion setting ID
Name of the web crawl ingestion setting
Start URL of the web crawl
Maximum crawl depth
When true, only HTML files will be downloaded
Whether to use a headless browser for crawling
Path filters for crawling
Content patterns for filtering
Maximum number of files to crawl
File extensions to include (e.g. ".pdf", ".docx")
Recurrence rule (RFC 5545 RRULE)