Server Hardware Operations Manager
About the Team
The team is responsible for the company's hardware infrastructure: server hardware, its placement, commissioning, support, and development in the data center. This is an internal infrastructure product evolving towards HWaaS — a predictable and standardized service for providing hardware to internal customers.
Your Responsibilities:
- Manage a Data Center team and be responsible for its results;
- Ensure the operation of server hardware, including commissioning, allocation, repairs, upgrades, and decommissioning;
- Validate server configurations, technical proposals, and specifications in the procurement process;
- Organize the work of contractors, Remote Hands, and collaboration with adjacent departments at the intersection of responsibility zones regarding site readiness, infrastructure changes, server hardware operation, and management tool development (writing basic technical specifications);
- Ensure the opening of new halls within your area of responsibility: racks, structured cabling systems, equipment placement, and server infrastructure readiness;
- Define requirements for structured cabling systems inside racks and between racks, as well as prepare or validate technical specifications for contractors;
- Develop technical specifications for equipment, components, consumables, and spare parts for procurement; participate in validating contracts and specifications for equipment and consumables received during procurement; participate in claims work as a technical expert;
- Plan the procurement of consumables, spare parts, and components for upgrades, ensure their availability and controlled consumption;
- Control payments for services and work within your area of responsibility;
- Ensure the achievement of the team's OKRs.
We Expect You to Have:
- Experience managing a technical, operational, or infrastructure team;
- Deep understanding of server hardware architecture and the operating principles of its key components;
- Understanding of the physical and logical operation of server nodes and their interaction with firmware, BMC, and the operating system;
- Hands-on experience in operating server hardware: commissioning, diagnostics, repair, component replacement, upgrades, and decommissioning;
- Deep understanding of BMC operation principles, out-of-band management, and Redfish; ability to use this understanding for diagnostics, operation, configuration validation, and technical risk assessment;
- Experience in technical validation of server configurations, components, and vendor proposals;
- Understanding of related infrastructure domains: servers, networks, basic data center engineering infrastructure;
- Ability to effectively collaborate with adjacent technical teams, including network engineers;
- Understanding of the principles of building and operating structured cabling systems inside racks and between racks;
- Experience in preparing or validating technical requirements, technical specifications, and specifications;
- Experience organizing and overseeing the work of contractors, external performers, and Remote Hands;
- Willingness to take responsibility for the technical quality of solutions and the team's operational results.
It Would Be Great if You:
- Have worked in a large-scale infrastructure;
- Have experience interacting with infrastructure process automation;
- Have worked with contractors and in commercial data centers.
Working with Us Means:
- The opportunity to implement your ideas in a project with a multi-million user audience;
- A talented team ready to support your initiatives;
- Powerful hardware, additional monitors, and everything needed for productive work;
- A transparent bonus system, a decent salary — we'll discuss the amount during the interview;
- A personal training budget that can be spent on books, courses, and conferences;
- Health care: from day one, you'll have voluntary health insurance including dentistry, with a therapist and massage therapist available in the office.