Mailto links, regex passes, Cloudflare decoding, JSON-LD parsing, obfuscation handling, and contact-page crawl. Every layer fails gracefully into the next. Whatever's reachable from the surface, Forge finds it.
Generates candidate addresses (info@, contact@, admin@) and verifies them via SMTP RCPT TO without ever sending mail. Clean validation, no spam flags, no inbox damage.
Identifies WordPress, Shopify, React, Next.js, Stripe, Intercom, Google Analytics, Tailwind, and more on every target site. Lead scoring without manual tagging — the platform tells you what the prospect runs.
Ollama-backed inference generates business summaries, industry classification, health scores, and pain points. No per-token cost. No API key. No data leaving the box.


Built-in connectors to FCC ULS (3M telecom licenses), NPI Registry (7M healthcare providers), and SAM.gov (900K federal contractors). Nightly batch ingest, dedup, and match — included, not bolted on.
A Haiku audit agent spot-checks enrichments. Field validators catch garbage. COALESCE writes never overwrite existing data. Every batch is rollback-able. Build mistakes don't become data debt.
Checkpoint-based resume after crashes, hourly watchdog process, and graceful degradation when individual extraction layers fail. The pipeline keeps running while you sleep.
First-class Model Context Protocol server. Claude (or any MCP client) can call discover, enrich, search, export, and stats tools directly. The dataset becomes an agent's working memory.