Auto-detect helps you create a task faster by letting Octoparse scan a web page, identify repeated data patterns, and generate a starting extraction workflow. It is most useful for pages with structured lists, tables, search results, product grids, directories, or similar repeated items.Documentation Index
Fetch the complete documentation index at: https://www.octoparse.com/docs/llms.txt
Use this file to discover all available pages before exploring further.
When to use Auto-detect
Use Auto-detect when:- The page has repeated items such as products, listings, reviews, or search results
- You want a quick starting workflow
- You are not sure which elements to select manually
- You want Octoparse to suggest fields and pagination logic
- You plan to review and adjust the generated workflow afterward
How Auto-detect works
What to review after detection
After Auto-detect generates a workflow, check:| Area | What to verify |
|---|---|
| Fields | Are the correct values captured? |
| Field names | Are column names clear and meaningful? |
| Pagination | Does the task move to the next page correctly? |
| Detail pages | Does the workflow open item details when needed? |
| Duplicates | Are repeated or unwanted elements included? |
| Missing values | Are some rows missing important fields? |
When manual editing is needed
Manual adjustments may be needed when:- The page layout is irregular
- Important fields are outside the detected list
- The website loads content dynamically
- Pagination is not detected correctly
- The page requires login, filters, popups, or user interaction
- You need to extract data from detail pages
Related pages
No-code builder
Adjust or build workflows manually after Auto-detect.
Refine data
Clean and reformat extracted fields before export.