Amazon is one of the most valuable and difficult e-commerce sources to scrape. Product pages contain price, seller, reviews, variants, rankings, images, and specifications, but the site is dynamic, localized, heavily monitored, and different Amazon marketplaces can expose different layouts. Start by deciding which Amazon page type you need. Search results, product details, and reviews are separate workflows.Documentation Index
Fetch the complete documentation index at: https://www.octoparse.com/docs/llms.txt
Use this file to discover all available pages before exploring further.
Page types
| Page type | Best for | Typical fields |
|---|---|---|
| Search results | Product discovery and ranking | Title, URL, image, price, rating, review count, sponsored flag, ASIN |
| Category pages | Category research and assortment | Product cards, rank position, category URL, price, rating |
| Product detail pages | Deep product intelligence | Title, brand, ASIN, price, seller, bullets, description, specs, variants, images, BSR |
| Review pages | Sentiment and quality analysis | Review title, rating, text, date, reviewer, verified purchase, helpful count |
| Seller pages | Marketplace monitoring | Seller name, storefront URL, seller rating, offer coverage |
Step 1: Pick the input
Use keywords when you need discovery:Step 2: Collect listing data
Search and category pages are useful for breadth. Capture:- Product title
- Product URL
- ASIN when available
- Price
- Rating
- Review count
- Image URL
- Sponsored/organic signal when visible
- Search keyword or category
- Result position
- Timestamp
Step 3: Visit product detail pages
Detail pages provide the fields needed for serious analysis:- Brand
- Feature bullets
- Product description
- Specifications table
- Variants
- Seller and fulfillment signals
- Best Sellers Rank
- Stock or delivery hints
- Full image set
- Coupon or promotion signals
Step 4: Scrape reviews separately
Review extraction is its own workflow. Sort order, pagination, language, star filters, and region can change the output. Useful review fields:- Rating
- Review title
- Review text
- Review date
- Reviewer display name
- Verified purchase flag
- Helpful vote count
- Product variant
- Review URL
Technical challenges
Amazon scraping commonly runs into:- Dynamic layouts and A/B tests
- Regional price and availability differences
- CAPTCHA and bot-detection challenges
- Variant-specific fields
- Missing or hidden seller data
- Review pagination limits
- Sponsored results mixed with organic results