Data Engineer for the Integration team
About the team
The Data-loadka team creates and maintains self-service tools for loading data into the company's analytical platform. The core of the load process is mapping data into 6NF tables using the Anchor Modeling methodology, on which the mart layer is built. As the main calculation engine, we use Trino / Vertica and are actively developing custom Trino plugins for data loading.
You will:
- redesign the architecture of the data loading tool for the DWH (6NF);
- maintain infrastructure components;
- adapt Trino plugins to the company's needs;
- work on the infrastructure for ELT migrations from Vertica to Trino.
We expect you to:
- have experience with Lakehouse storage and/or columnar databases: loading data into S3-like storage and optimizing queries based on it;
- write in Python or another programming language;
- want to work in a platform team without strict product requirements.
It would be great if you:
- have experience developing data processing pipelines;
- have worked with Trino, ClickHouse, Vertica;
- know Java.
Working with us means:
- the opportunity to implement your ideas in a project with a multi-million audience;
- a talented team ready to support your initiatives;
- powerful hardware, additional monitors, and everything needed for productive work;
- a transparent bonus system, a decent salary — the amount will be discussed during the interview;
- a personal learning budget that can be spent on books, courses, and conferences;
- health care: from day one, you will have VHI with dental coverage, and a therapist and masseuse are available at the office;
- remote work from anywhere in the world or a great office two minutes from the Belorusskaya metro station: a panoramic view of the city center, spaces for focused work, and relaxation zones.