Gemcraft

Companies.sk — B2B prospect generation with three-tier enrichment fallbacks. B2B prospect generation with three-tier enrichment fallbacks.

Python 3BeautifulSoup4 · lxmlrequestsconcurrent.futuresopenpyxlJSON resumeno paid APIs

A two-stage B2B lead generator that scrapes the FinReg.sk Slovak business registry filtered by region and employee count, then enriches each company with website / email / phone / HR-email by chaining three sources: foaf.sk (Slovak company DB) → DuckDuckGo HTML search → direct contact-page scraping at common URL patterns.

Hard parts. Web-scraping reliability across three different source HTML structures. Rate-limiting without getting IP-blocked. Entity-type filtering (regex skips schools, hospitals, government offices, churches). Pagination state across multi-page FinReg results.