Data Scientist Middle+
Remote | 270 - 300 k RUB Gross
🔎 Data Scientist Middle+ ❇️ | TopSelection agency
💰 Salary: 270-300K RUB/month Gross
🎯 Location/Remote: Russia
🕰 Project Duration: 3 months +
📄 Employment type: Individual Entrepreneur only
🏛 Project: Large retail company.
Required skills:
- At least 4 years of experience in a similar role;
Algorithms and Data Structures:
- Ability to evaluate algorithm complexity;
- Knowledge of algorithms and data structures from a standard course;
- Familiarity with evaluating the performance of standard library data structures;
- Familiarity with extended algorithm set: probabilistic, external memory algorithms, graph algorithms.
Probability Theory and Mathematical Statistics:
- Limit theorems and the ability to apply them;
- Experience in hypothesis testing, knowledge of bootstrap and A/B tests, as well as variance reduction and identification of causal relationships.
Machine Learning (ML):
- Understanding of ML algorithms and approaches;
- Ability to effectively adapt them to solve tasks as needed;
- Ability to build models from standard components;
- Understanding the sequence of stages in ML solution development and readiness to make changes to project architecture;
- Understanding which business metrics for the model need to be monitored;
- Ability to propose proxy metrics and loss functions related to business metrics;
- Ability to push a simple model to pilot, implement a method from a paper, embed a component into an existing pipeline.
- Familiarity with less common ML methods (e.g., Bayesian models/PGM/VBI, RL, etc.).
Model Operations (MLOps):
- Ability to fit a model once, check if it's still valid upon request, update and deploy any necessary software in a cluster;
- Ability in model versioning, data versioning, and monitoring model performance and data quality.
SQL:
- Simple queries (selections, groupings, window functions);
- Experience in query optimization.
Programming and Engineering:
- Ability to translate ideas into clean code that can be both executed and read;
- Proficiency in using git;
- Ability to write tests;
- Ability to understand others' code and debug it effectively;
- Basic understanding of all architecture components.
Work with Databases and Big Data (DB & BD):
- Industrial experience working with multiple systems;
- Knowledge of the internal structure of systems used in work;
- Experience in optimizing computations/queries/transactions/ACID/Indexes.
- Ability to decompose understandable tasks;
- Independently find solutions or use existing approaches for understandable tasks.
Tasks:
- Developing code and tests in python, participating in team development, participating in code reviews;
- Working on and modifying the design of existing components as necessary;
- Preparing and validating training data samples for the implementation and maintenance of mathematical models for big data products;
- Calculating A/B test results, designing A/B tests;
- Implementing modern machine learning and data analysis methods in big data products;
- Deploying models and services within the company's environment (hadoop/k8s/airflow);
- Writing SQL queries, optimizing queries (spark, in particular);
- Proposing ideas for improving the model/approach;
- Proposing proxy metrics and loss functions related to business metrics;
- Controlling the compliance of big data product managers' work results in big data product releases;
- Technical implementation of data-science solutions on the department's products;
- Formulating data requirements for developing mathematical models as part of big data product releases;
- Analyzing the subject area to improve model quality and formulating proposals to achieve the goals of big data projects and products.
📲 How to apply:
Send your resume to @AllaDemHR