Yandex Crowd is Yandex's large-scale infrastructure service. We implement crowdsourcing to scale business processes such as data labeling, content moderation, field research, testing, and we build internal product functions: customer service, telemarketing, localization, and documentation.
We launched the Yandex Tasks crowdsourcing platform, participated in the launch of YandexGPT, the launch of Yandex Search in Kazakhstan, teaching Alice the Arabic language and even whispering — and in many other Yandex projects.
We are looking for a Crowd Solutions Architect (CSA) to join the Data Labeling Crowdsourcing Service. You will create, support, and develop processes related to data labeling for Yandex services. Your main task is to implement high-quality data labeling, taking into account the specifics of the subject area and the product for which the data is being labeled, whether for ML or business process automation. You will be involved in both technical implementation and managerial tasks — in most cases, the distribution is close to 50:50.
What tasks await you
- Interact with the customer, gather requirements for a data labeling project, decompose the task and design the solution
- Create a work plan and form the project budget
- Build data collection and processing pipelines
- Prepare experiments and research, extract, transform, and clean data
- Evaluate results: labeling speed and the quality of the obtained data
- Define key project performance metrics and the methodology for their calculation
- Build dashboards to analyze or track labeling project metrics and processes overall
- Improve the quality of projects
- Interact with performers of data labeling tasks
- Organize interaction with related teams
We expect that you
- Are familiar with Python, Java, or Groovy, can write scripts for processing or transforming data in one of these languages
- Understand the basics of data visualization
- Have processed large files (JSON, TSV, CSV)
- Can write and apply SQL queries
- Can switch between different tasks and work directions
- Have interacted with customers, can manage expectations, estimate deadlines, lead task discussions, and record results
- Are ready to work with rapidly developing services under changing requirements
It will be a plus if you
- Know what an API (Rest API) is, have interacted with services via API
- Understand machine learning principles
- Have applied mathematical statistics at work
- Have set up and launched projects in Toloka
Working conditions
- A strong and friendly team you can grow with
- Complex tasks for services with millions of users
- The ability to influence the process and result
- Market-level salary and above
- Bonuses every six months for everyone who successfully passes the review
- Flexible working hours
- Mortgage programs: at 3% for 10 years or interest-free for three years
- Extended voluntary health insurance program, 80% payment for VHI for spouses and children
- Compensation for food expenses on the office premises
- Gym, fitness room, yoga at the office
- Free parking for employees