#ML #Production #ONNX #Remote

Software Engineer ML (Production / Speech & Audio)

Company: International Product IT Project (VoIP / Cloud Telephony) Employment: Full-time Format: Hybrid (office in Tashkent for 2-3 months → then full remote) Salary: starting from $3500 to $5000 (discussed individually)

We are a product team creating an intelligent cloud telephony ecosystem for the US and Canadian markets. Our product is a fault-tolerant platform with millions in traffic turnover. ML here is not a supporting feature, but the foundation of the system, operating in real-time. We are looking for an engineer who thoroughly understands the internal architecture of audio models and is ready to be responsible for their operation in a high-load production environment.

What you will be doing:

Developing the AMD (Answering Machine Detection) system: retraining and tuning models for real-time call classification (distinguishing human from answering machines/IVR).
Full-cycle development: from collecting and "dirty" labeling audio data to deploying and calibrating thresholds in production.
Integration into the Core product: porting ML components into the backend infrastructure (C# / SIP / RTP stack) via ONNX Runtime.
Latency optimization: fighting for milliseconds in audio streaming conditions.
Deep Analysis: identifying errors and analyzing complex edge cases in real call scenarios.
Research (R&D): experimenting with noise reduction, VAD, and new speech processing architectures.

Our stack: Python, C# wav2vec 2.0, Whisper, HuggingFace Transformers MFCC, embeddings, spectrograms ONNX / ONNX Runtime, Quantization SIP / RTP, Windows / Linux

We expect:

2+ years of experience in ML in production (when your model was actually working with users).
Practical experience with Speech/Audio: understanding how audio features and modern sound processing architectures work.
Engineering approach (QA-mindset): you are genuinely interested in delving into data anomalies and stress-testing the system.
Understanding of classic and modern approaches: Fine-tuning, Transfer Learning, and the ability to work with metrics (Precision/Recall, ROC-AUC, Calibration).
Ability to work end-to-end: from raw file to optimized inference.

What's important:

Engineering autonomy: we value those who find problems themselves and bring solutions to production.
Background: we highly welcome candidates coming to ML from Backend or QA; we value code and testing culture.
Readiness for dynamics: the project is growing, there are many tasks, and they directly impact the business.

Will be a plus:

Experience in the Speech/Audio domain (ASR, VAD, Audio Classification).
Understanding of VoIP specifics and data streaming processing.
Experience with MLOps and model monitoring tools.

Conditions:

Mandatory offline onboarding in Tashkent (2-3 months) for product immersion, followed by full remote work.
Real production tasks in an international high-load product.
Opportunity for professional growth and compensation review as tasks become more complex.
Work in a team with strong engineering expertise and no bureaucracy.

📩 CV via Telegram: @haas_maru

Contacts

Similar vacancies

ML Engineer (Speech/Audio)

Applied ML Engineer

AI Engineer (Agents)

ML Developer (Middle/Senior)

ML Engineer, Voice AI

Senior Machine Learning Engineer

Machine Learning Engineer (TTS)

AI Engineer (Audio)

AI engineer

ML Engineer [AI Team]

ML/AI Engineer

AI/ML Engineer

Software Engineer ML (Production / Speech & Audio)

Key Skills

Details

Average salary for this role