firecrawl

Scrape and extract content from web pages using the Firecrawl API

$ Install

git clone https://github.com/vm0-ai/vm0-cookbooks /tmp/vm0-cookbooks && cp -r /tmp/vm0-cookbooks/208-competitor-research/claude-competitor-research/skills/firecrawl ~/.claude/skills/vm0-cookbooks

// tip: Run this command in your terminal to install the skill

SKILL.md


name: firecrawl
description: Scrape and extract content from web pages using Firecrawl API

Firecrawl Skill

This skill uses Firecrawl to scrape web pages and extract their content as clean markdown. It handles JavaScript-rendered pages and provides structured content extraction.

When to Use

Use this skill when:

You need to extract content from a web page
You want to read company profiles from Crunchbase, Wellfound, or LinkedIn
You need to scrape pricing pages, feature lists, or reviews
The page requires JavaScript rendering

How to Use

Prerequisites

FIRECRAWL_API_KEY environment variable must be set
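A missing key otherwise only shows up as a 401 after the request is sent, so it can help to check for it first. The following is a minimal sketch of such a check; the function name `require_firecrawl_key` is hypothetical and not part of the skill.

```shell
# Hypothetical helper (not part of the skill): fail early with a clear
# message when FIRECRAWL_API_KEY is unset or empty.
require_firecrawl_key() {
  if [ -z "${FIRECRAWL_API_KEY:-}" ]; then
    echo "FIRECRAWL_API_KEY is not set" >&2
    return 1
  fi
}
```

One way to use it is to chain the check in front of the scrape: `require_firecrawl_key && scrape.sh "https://example.com/page"`.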

Basic Usage

scrape.sh "https://example.com/page"

Parameters

Parameter	Required	Default	Description
url	Yes	-	The URL of the page to scrape

Examples

# Scrape a Crunchbase profile
scrape.sh "https://www.crunchbase.com/organization/notion"

# Scrape a pricing page
scrape.sh "https://notion.so/pricing"

# Scrape a Trustpilot review page
scrape.sh "https://www.trustpilot.com/review/notion.so"

Output

Results are saved to /tmp/data/scraped_[timestamp].json:

{
  "success": true,
  "data": {
    "markdown": "# Page Title\n\nPage content in markdown format...",
    "metadata": {
      "title": "Page Title",
      "description": "Page description",
      "sourceURL": "https://example.com/page"
    }
  }
}
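To work with the saved result, one option is to locate the newest output file and pull out the `data.markdown` field. The sketch below assumes `jq` is installed and that files follow the `scraped_[timestamp].json` naming shown above. Both function names are hypothetical helpers, not part of the skill.

```shell
# Hypothetical helpers (not part of the skill).

# Print the path of the most recently modified scrape result.
# The directory defaults to /tmp/data, where the skill saves its output.
latest_scrape_file() {
  dir="${1:-/tmp/data}"
  ls -t "$dir"/scraped_*.json 2>/dev/null | head -n 1
}

# Print the markdown body of a result file, or nothing if "success" is false.
# Assumes jq is available on the PATH.
scrape_markdown() {
  jq -r 'if .success then .data.markdown else empty end' "$1"
}
```

For example, `scrape_markdown "$(latest_scrape_file)" > page.md` would write the latest page's markdown to a file.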

Guidelines

Pages with heavy JavaScript may take longer to scrape
Pages that require authentication will fail
Images, SVGs, and videos are removed from the markdown output for cleaner text
Only the main content is extracted (navigation and footers are excluded)

Error Handling

401 Unauthorized: Check that FIRECRAWL_API_KEY is set correctly
403 Forbidden: The page may be blocking scraping
500 Error: The page may be temporarily unavailable; retry later
Timeout: Very large pages may time out; try a more specific URL
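
For transient failures such as a 500 error or a timeout, a small retry wrapper is one way to automate the "retry later" advice. This is a sketch only: the `retry` function is hypothetical, and it assumes that `scrape.sh` exits with a non-zero status when a scrape fails, which this document does not confirm.

```shell
# Hypothetical helper (not part of the skill).
# Usage: retry MAX_ATTEMPTS DELAY_SECONDS COMMAND [ARGS...]
# Runs COMMAND until it succeeds or MAX_ATTEMPTS is reached,
# sleeping DELAY_SECONDS between attempts.
retry() {
  max_attempts="$1"
  delay="$2"
  shift 2
  attempt=1
  while :; do
    # Stop as soon as the command succeeds.
    "$@" && return 0
    # Give up once the attempt budget is spent.
    if [ "$attempt" -ge "$max_attempts" ]; then
      return 1
    fi
    sleep "$delay"
    attempt=$((attempt + 1))
  done
}
```

For example, `retry 3 10 scrape.sh "https://example.com/page"` would try up to three times with ten seconds between attempts. Retrying is unlikely to help with 401 or 403 responses, since those point to a key or access problem rather than a temporary one.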