Octoparse combines task building, extraction execution, reliability tools, monitoring, and data export in one platform for web data collection. This page gives a high-level view of the capabilities available across the Platform section.
Task building
Use Octoparse to create extraction workflows without writing scraper code.
Templates
Start from prebuilt workflows for common websites and data collection scenarios.
Auto-detect
Let Octoparse identify page data automatically and generate a starting workflow.
No-code builder
Build custom workflows by selecting elements and adding actions visually.
Refine data
Clean, rename, format, or extract parts of field values before export.
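To make the refinement operations concrete, here is a minimal Python sketch of equivalent transformations applied to one record. It is purely illustrative and is not how Octoparse implements them; in Octoparse these steps are configured in the task rather than written as code. The `refine` helper and the field names (`Title `, `price_text`) are hypothetical.

```python
import re

def refine(record: dict) -> dict:
    """Apply clean, extract, format, and rename steps to one scraped record.

    Hypothetical helper for illustration only; field names are assumptions.
    """
    # Clean: collapse runs of whitespace and trim both ends.
    title = re.sub(r"\s+", " ", record["Title "]).strip()

    # Extract: pull the numeric part out of a string such as "Price: $1,299.00".
    match = re.search(r"\d[\d,]*(?:\.\d+)?", record["price_text"])

    # Format: drop thousands separators and convert to a number.
    price = float(match.group().replace(",", "")) if match else None

    # Rename: emit tidy field names for export.
    return {"title": title, "price": price}

# Made-up sample record.
row = refine({"Title ": "  Wireless   Mouse \n", "price_text": "Price: $1,299.00"})
print(row)
```

Keeping each step separate mirrors the way the operations are listed above: a value is cleaned first, then the relevant part is extracted and formatted, and the field is renamed last.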
Task running
After a task is built and tested, run it in the environment that matches your workflow.

| Capability | Use it for |
|---|---|
| Local extraction | Testing, debugging, and running tasks on your own device |
| Cloud extraction | Scheduled and unattended extraction |
| Standard mode | Regular cloud task execution |
| Boost mode | Higher-speed or higher-concurrency cloud execution when available |
| Scheduling | Recurring runs at defined intervals |
Reliability tools
Websites can change, load content dynamically, require sessions, or block automated behavior. Octoparse includes tools that help improve task stability.
Anti-blocking
Understand common blocking mechanisms and Octoparse reliability options.
Proxy
Use proxy settings when IP rotation or location-specific access is needed.
Captcha
Learn how CAPTCHA affects scraping tasks and what options are available.
Auto-login & cookies
Handle websites that require login, sessions, or cookies.
Monitoring
Monitoring tools help you understand whether a task ran successfully and where issues occurred. Use dashboards, logs, and event records to check:
- Run status
- Task progress
- Errors and warnings
- Cloud run history
- Output availability
Data export
Octoparse can export extracted data to files and connected destinations. Common destinations include:
- CSV, Excel, JSON, HTML, and XML files
- Google Sheets
- Databases such as MySQL, PostgreSQL, SQL Server, and Oracle
- Cloud storage such as Amazon S3, Google Drive, and Dropbox
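Once a run has been exported to a file, a quick sanity check can confirm the output is usable before it feeds a downstream process. The sketch below assumes a CSV export with a header row and uses only Python's standard library. The `summarize_export` helper, the column names, and the sample data are hypothetical and say nothing about Octoparse's own export format beyond it being CSV.

```python
import csv
import io

def summarize_export(csv_file, required_fields):
    """Count total rows and rows missing any required field in a CSV export.

    Hypothetical helper; `required_fields` are the column names you expect.
    """
    reader = csv.DictReader(csv_file)
    total = 0
    incomplete = 0
    for row in reader:
        total += 1
        # A field counts as missing if the column is absent or blank.
        if any(not (row.get(name) or "").strip() for name in required_fields):
            incomplete += 1
    return {"rows": total, "incomplete": incomplete}

# Inline sample standing in for an exported file (made-up data).
sample = io.StringIO("title,price\nWireless Mouse,19.99\nUSB Hub,\n")
summary = summarize_export(sample, ["title", "price"])
print(summary)
```

For a real export, pass a file object opened with `open(path, newline="", encoding="utf-8")` in place of the inline sample.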
Governance and collaboration
For team workflows, Octoparse also supports collaboration and security-related capabilities, such as managing task access, account settings, and shared workflows.

Available capabilities may depend on your Octoparse plan, task type, and whether the task is run locally or in the cloud.