Python scraper based on AI
-
Updated
Jun 25, 2026 - Python
Python scraper based on AI
Fast, local-first web content extraction for LLMs. Scrape, crawl, extract structured data — all from Rust. CLI, REST API, and MCP server.
High-performance web crawler API optimized for LLMs. Turn any search or website into clean Markdown using remote browsers. Firecrawl alternative
Fast, lightweight Firecrawl/Tavily alternative in Rust. Web scraper, crawler & search API with MCP server for AI agents. Drop-in Firecrawl-compatible API (/scrape, /crawl, /search). 2.3x faster than Tavily, 1.5x faster than Firecrawl in 1K-URL benchmarks. 6 MB RAM, single binary. Self-host or use managed cloud.
Open‑source alternative to Perplexity Comet, director.ai and firecrawl combined
AI-native web scraper. Single binary with a bundled Claude Code skill. MIT-licensed alternative to Firecrawl.
🕷️ A lightweight Model Context Protocol (MCP) server that exposes Crawl4AI web scraping and crawling capabilities as tools for AI agents. Similar to Firecrawl's API but self-hosted and free. Perfect for integrating web scraping into your AI workflows with OpenAI Agents SDK, Cursor, Claude Code, and other MCP-compatible tools.
Firecrawl Skill: Scrape, Search, Crawl, Browse. Token-efficient scraping skill, optimized for Claude Code, Codex, Gemini.
The web data layer for AI agents — fetch, search, crawl, extract, screenshot, and monitor the web with 50+ domain extractors and MCP.
AnyCrawl — Coolify v4 Template
LLM-friendly web crawler & scraper with a dedicated Reddit engine, built on Crawl4AI — Open WebUI compatible
🔍 Control browsers using natural language commands with multi-AI model support for seamless automation and task management.
Open-source web retrieval & research agent built for AI agents. Works with LangChain, LlamaIndex, CrewAI, OpenAI, MCP, and any REST agent. Supports scrape, search, crawl, map, extract, and research.
Self-hosted web scraper — convert any URL to agent-friendly markdown/JSON. Anti-bot bypass + autonomous browser agents. Free alternative to Firecrawl.
Self-hosted web scraping and Markdown extraction for AI agents
Transform Web Content into LLM-Ready Data
Self-hosted Firecrawl alternative. Scrape, crawl, extract with multi-LLM (Claude/GPT/Ollama), schedule, diff. One docker compose up.
Universal free OSS web-scraping cascade — 9-tier replacement for Firecrawl/WebFetch with site-specific APIs, archive proxies, OCR, headless Playwright, and logged-in browser fallback. MIT.
🔥 The Web Scraping API That Doesn't Rip You Off — Turn any website into LLM-ready markdown. 10x cheaper than Firecrawl.
Free self-hosted Firecrawl alternative — Crawl4AI MCP plugin for Claude Code
Add a description, image, and links to the firecrawl-alternative topic page so that developers can more easily learn about it.
To associate your repository with the firecrawl-alternative topic, visit your repo's landing page and select "manage topics."