Description
At SberData, we are building a centralized data repository for all of Sber. It covers over 800 data sources and 100+ PB of information, handles data requests and delivery within 15 minutes, and runs on a modern technology stack for data processing, including proprietary DBMS builds based on Hadoop and Greenplum.
Team Objectives:
- Organizing and executing incremental data loads from automated source systems
- Organizing and executing archival data reloads
- Analyzing and resolving incidents and problems related to data loading
- Analyzing and optimizing resource usage related to data loading.
Responsibilities
Candidate Responsibilities:
- Supporting data warehouses on the Hadoop platform
- Organizing and monitoring data loading from source systems
- Planning and conducting deployments
- Analyzing and investigating incidents.
Requirements
What's important to us:
- University degree in a technical field
- Experience in support or administration
- Basic knowledge of Linux and the command line
- Knowledge of SQL
- Experience working with generative AI models; experience building AI agents and using them at work is an advantage.
Nice to have:
- RHEL administration experience
- Knowledge of ITIL/ITSM
- Knowledge of one programming language: Java, Bash, or Python
- Experience with Hadoop: HDFS, Hive, Spark.
Conditions
Working Conditions:
- Office at Lenin Ave., 17
- Work schedule: 2 days on/2 days off, in 12-hour shifts from 12:00 PM (noon) to 12:00 AM (midnight) and from 12:00 AM to 12:00 PM; a corporate taxi is provided for night shifts
- Over 400 educational programs from SberUniversity for professional and career development
- Extended voluntary health insurance (DMS), discounted insurance for family members, and a corporate pension program
- Free SberPrime+ subscription and discounts on products from partner companies
- Referral bonus for recommending friends to join the Sber team.