Description
In the SberWorks tribe at Sber, the DataLake team is responsible for Hadoop-based data storage and retrieval: we develop and maintain the cold storage.
We are now launching a new service within the Productivity Platform - a hot storage for fast search and retrieval: platform-wide full-text search, semantic (vector) search over embeddings, context assembly for RAG, and serving relevant fragments to AI assistants.
The Productivity Platform aims to give the Bank's teams a single seamless path through the production process, reducing production costs through maximum automation and the use of artificial intelligence at every stage.
The DataLake is the "lifeblood" of the platform, and your role in this project will be key.
The first stage of selection for this vacancy is a conversation with an AI recruiter. After you apply, you will receive an email invitation to an initial interview with the AI recruiter in Telegram. The conversation takes about 10 minutes and is meant to clarify missing details and speed up the review of your application. The AI recruiter is still new, so we ask for your understanding. Your experience and participation will help make it more convenient and useful!
Responsibilities
- gathering and formalizing requirements for hot/cold storage from stakeholders
- eliciting requirements from data consumers and providers, including during source system analysis (the task is to determine precisely what DataLake data they need and translate those needs into technical requirements that ensure successful integration and data exchange)
- designing the logical data model for the detailed layer of the storage (developing a data structure that will ensure high performance and reliability of the system)
- developing technical documentation that describes the algorithms for transforming data from source systems into the data warehouse model
- interacting with the development team and providing analytical support during development (collaborating with developers, ensuring requirements are interpreted correctly and issues are resolved promptly as they arise)
- managing requirements, and maintaining and coordinating documentation.
Requirements
- 4+ years of experience as a systems analyst
- experience with Elasticsearch / OpenSearch and with SQL (PostgreSQL, Hive)
- experience working on data warehouse projects for LLMs
- experience designing integrations; knowledge of REST, Kafka, and JSON
- experience with UML and BPMN notations
- understanding of non-functional requirements.
Nice to have:
- skills in working with generative AI models; experience creating AI agents and using them in your work
- experience using GigaChat, Kandinsky, and equivalents in products
- hands-on proficiency in using AI for analysis, generation, and automation.
Conditions
- hybrid work schedule after the probationary period
- office location: SberCity Business Center on Kutuzovsky Prospekt (Kutuzovskaya metro station)
- annual salary review, annual bonus
- employment under the Labor Code of the Russian Federation
- social package: voluntary health insurance (VHI), gym, sports and cultural events, the opportunity to study at the best corporate university in Russia, participation in industry conferences
- flexible mortgage discount, equal to 1/3 of the Central Bank's key rate.