🚀 We're Hiring: Data Engineering Lead
📍 Location: Lahore, Pakistan (On-site)
💼 Employment Type: Full-Time (Contract Based)
We're looking for an experienced Data Engineering Lead to own the architecture, implementation, and long-term reliability of a large-scale data integration ecosystem spanning APIs, regulatory portals, commercial data providers, and web-scraped sources.
🔹 Key Responsibilities
• Lead feasibility assessment of approximately 200 external data sources and classify them as Build-Now, Build-with-Caveats, or Defer/Replace.
• Design and implement scalable integration architecture across APIs, bulk-file ingestion, government data sources, and large-scale web scraping.
• Develop and maintain complexity-tiered integration models with realistic effort and maintenance estimates.
• Build resilient web scrapers with anti-bot handling, proxy rotation, schema drift detection, and automated monitoring.
🔹 Required Qualifications
✔ 3+ years of production Data Engineering experience, including senior-level ownership of data-intensive systems.
✔ Deep expertise in web scraping, anti-bot strategies, proxy management, headless browsers, and dynamic website extraction.
✔ Strong experience integrating REST APIs, bulk data feeds, and semi-structured government data sources.
✔ Advanced proficiency in Python and SQL with strong data modeling skills.
✔ Experience with Airflow, Dagster, Prefect, or similar orchestration platforms.
✔ Hands-on experience with monitoring, alerting, schema drift detection, and maintaining long-lived pipelines.
✔ Strong communication skills with the ability to estimate engineering effort and communicate technical trade-offs to stakeholders.
If you're excited about building data infrastructure at scale and solving challenging integration problems, we'd love to hear from you.
📩 Apply now or send your resume to mariam.imtanan@napollo.net