Skip to main content

Documentation Index

Fetch the complete documentation index at: https://www.octoparse.com/docs/llms.txt

Use this file to discover all available pages before exploring further.

Octoparse combines task building, extraction execution, reliability tools, monitoring, and data export in one platform for web data collection. This page gives a high-level view of the capabilities available across the Platform section.

Task building

Use Octoparse to create extraction workflows without writing scraper code.

Templates

Start from prebuilt workflows for common websites and data collection scenarios.

Auto-detect

Let Octoparse identify page data automatically and generate a starting workflow.

No-code builder

Build custom workflows by selecting elements and adding actions visually.

Refine data

Clean, rename, format, or extract parts of field values before export.

Task running

After a task is built and tested, run it in the environment that matches your workflow.
CapabilityUse it for
Local extractionTesting, debugging, and running tasks on your own device
Cloud extractionScheduled and unattended extraction
Standard modeRegular cloud task execution
Boost modeHigher-speed or higher-concurrency cloud execution when available
SchedulingRecurring runs at defined intervals

Reliability tools

Websites can change, load content dynamically, require sessions, or block automated behavior. Octoparse includes tools that help improve task stability.

Anti-blocking

Understand common blocking mechanisms and Octoparse reliability options.

Proxy

Use proxy settings when IP rotation or location-specific access is needed.

Captcha

Learn how CAPTCHA affects scraping tasks and what options are available.

Auto-login & cookies

Handle websites that require login, sessions, or cookies.

Monitoring

Monitoring tools help you understand whether a task ran successfully and where issues occurred. Use dashboards, logs, and event records to check:
  • Run status
  • Task progress
  • Errors and warnings
  • Cloud run history
  • Output availability

Data export

Octoparse can export extracted data to files and connected destinations. Common destinations include:
  • CSV, Excel, JSON, HTML, and XML files
  • Google Sheets
  • Databases such as MySQL, PostgreSQL, SQL Server, and Oracle
  • Cloud storage such as Amazon S3, Google Drive, and Dropbox

Governance and collaboration

For team workflows, Octoparse also supports collaboration and security-related capabilities, such as managing task access, account settings, and shared workflows.
Available capabilities may depend on your Octoparse plan, task type, and whether the task is run locally or in the cloud.