Gemcraft
Companies.sk: B2B prospect generation with three-tier enrichment fallbacks.
Python 3 · BeautifulSoup4 · lxml · requests · concurrent.futures · openpyxl · JSON resume · no paid APIs
A two-stage B2B lead generator that scrapes the FinReg.sk Slovak business registry filtered by region and employee count, then enriches each company with website / email / phone / HR-email by chaining three sources: foaf.sk (Slovak company DB) → DuckDuckGo HTML search → direct contact-page scraping at common URL patterns.
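The three-source chain can be pictured as a simple fallback loop. This is a minimal sketch, not the project's actual code: the `Contact` fields, the `enrich` function, and the three stub sources are hypothetical names standing in for the foaf.sk, DuckDuckGo, and contact-page tiers. Each tier only fills fields that earlier tiers left empty, and a tier that raises is skipped rather than aborting the run.

```python
from dataclasses import dataclass, fields
from typing import Callable, Iterable, List, Optional


@dataclass
class Contact:
    # Hypothetical record shape; field names are assumptions for this sketch.
    website: Optional[str] = None
    email: Optional[str] = None
    phone: Optional[str] = None
    hr_email: Optional[str] = None

    def missing(self) -> List[str]:
        """Names of the fields that are still empty."""
        return [f.name for f in fields(self) if getattr(self, f.name) is None]


# A source takes a company name and returns a dict of whatever it found.
Source = Callable[[str], Optional[dict]]


def enrich(company: str, sources: Iterable[Source]) -> Contact:
    """Try each source in order, filling only the fields still missing."""
    contact = Contact()
    for source in sources:
        if not contact.missing():
            break  # everything is filled, so later tiers are not called
        try:
            found = source(company) or {}
        except Exception:
            continue  # a failing tier must not abort the whole chain
        for name in contact.missing():
            value = found.get(name)
            if value:
                setattr(contact, name, value)
    return contact


# Stub tiers for illustration only; real tiers would do HTTP requests.
def from_foaf(company: str) -> dict:
    return {"website": "https://example.sk"}


def from_search(company: str) -> dict:
    raise TimeoutError("search tier unavailable")


def from_contact_page(company: str) -> dict:
    # The website here is ignored because an earlier tier already set it.
    return {"website": "https://other.example", "email": "info@example.sk"}


result = enrich("Example s.r.o.", [from_foaf, from_search, from_contact_page])
print(result)
```

Earlier tiers win on conflicts in this sketch, which matches the ordering in the description (registry data first, search second, direct scraping last).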
Hard parts. Web-scraping reliability across three different source HTML structures. Rate-limiting without getting IP-blocked. Entity-type filtering (regex skips schools, hospitals, government offices, churches). Pagination state across multi-page FinReg results.
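The entity-type filter mentioned above could look roughly like the following. This is a sketch under assumptions: the Slovak word stems and the `is_excluded` helper are illustrative choices, not the project's actual pattern, and a real list would need tuning against registry data.

```python
import re
import unicodedata

# Illustrative Slovak stems for schools, hospitals, government offices and
# churches. These are assumptions for the sketch, not the project's list.
EXCLUDED_STEMS = [
    r"škol\w*",        # škola, školy (school)
    r"gymnázi\w*",     # gymnázium (grammar school)
    r"univerzit\w*",   # univerzita, univerzitná (university)
    r"nemocnic\w*",    # nemocnica (hospital)
    r"úrad\w*",        # úrad (government office)
    r"ministerstv\w*", # ministerstvo (ministry)
    r"cirkev\w*",      # cirkev (church)
    r"farnos\w*",      # farnosť (parish)
]

# \b anchors each stem at the start of a word, so compound names such as
# "Autoškola" are not matched by the "škol" stem.
EXCLUDED_RE = re.compile(r"\b(?:" + "|".join(EXCLUDED_STEMS) + r")", re.IGNORECASE)


def is_excluded(name: str) -> bool:
    """Return True if the entity name looks like a non-prospect institution."""
    # Normalize to NFC so accented letters compare equal regardless of
    # how the scraped page encoded them.
    normalized = unicodedata.normalize("NFC", name)
    return EXCLUDED_RE.search(normalized) is not None


names = [
    "Základná škola Bratislava",
    "Univerzitná nemocnica Martin",
    "Obecný úrad Lozorno",
    "Kovostroj s.r.o.",
    "Autoškola Novák s.r.o.",
]
prospects = [n for n in names if not is_excluded(n)]
print(prospects)
```

Stem matching is deliberately broad, so it will also drop some legitimate companies whose names start with one of these stems; whether that trade-off is acceptable depends on the target list.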