About the Project
We develop systems for collecting and processing data from web resources, including complex and protected sites. We work with headless browsers, anti-bot protections, and large volumes of data.
Responsibilities
- Development and maintenance of parsers and crawlers in Python
- Automation of browsers using Selenium
- Working with Puppeteer (via Node.js or integrations)
- Development and refinement of solutions based on CEF / CefSharp
- Bypassing anti-bot protections (Cloudflare, anti-fraud systems, JS challenges), strictly within the law
- Optimization of parser speed, stability, and scalability
- Working with proxies, IP rotation, fingerprint management
- Integration of parsers with APIs and data storage systems
- Supporting and refactoring existing solutions
- Maintaining technical documentation
Requirements
Mandatory:
- 2+ years of commercial development experience in Python
- Proficiency with Selenium
- Experience using Puppeteer
- Practical experience with CEF / CefSharp
- Understanding of how browsers and JavaScript work
- Experience parsing dynamic websites
- Knowledge of HTTP, cookies, headers, sessions
- Experience working with proxies (HTTP / SOCKS)
- Git, Linux
Nice to have:
- Playwright
- Undetected Chrome
- Stealth solutions
- asyncio, aiohttp
- Experience with captcha services
- Docker
- Task queues (RabbitMQ, Kafka, Redis)
- Experience working with large volumes of data
Working Conditions
- Full-time, five-day work week (5/2) with flexible hours
- Interesting projects and tasks, with opportunities for professional development
- Fully remote work: we are not tied to a location and hire the strongest people wherever they are
- Generous sharing of knowledge and experience
- Reasonable management and a non-bureaucratic approach to discussing work processes
- Final salary is agreed with the successful candidate
- Engagement is possible under a contract for services, a contract, or a civil-law agreement (GPC)