#ML #Production #ONNX #Remote
Software Engineer ML (Production / Speech & Audio)
Company: International Product IT Project (VoIP / Cloud Telephony)
Employment: Full-time
Format: Hybrid (2-3 months in the Tashkent office, then fully remote)
Salary: $3,500 to $5,000 (negotiated individually)
We are a product team building an intelligent cloud telephony ecosystem for the US and Canadian markets. Our product is a fault-tolerant platform with millions in traffic turnover. ML here is not a supporting feature but the foundation of the system, and it runs in real time. We are looking for an engineer who thoroughly understands the internal architecture of audio models and is ready to take responsibility for running them in a high-load production environment.
What you will be doing:
- Developing the AMD (Answering Machine Detection) system: retraining and tuning models for real-time call classification (distinguishing humans from answering machines and IVR).
- Full-cycle development: from collecting and hands-on ("dirty") labeling of audio data to deploying models and calibrating thresholds in production.
- Integration into the Core product: porting ML components into the backend infrastructure (C# / SIP / RTP stack) via ONNX Runtime.
- Latency optimization: fighting for every millisecond in streaming audio.
- Deep Analysis: identifying errors and analyzing complex edge cases in real call scenarios.
- Research (R&D): experimenting with noise reduction, VAD, and new speech processing architectures.
Our stack:
- Python, C#
- wav2vec 2.0, Whisper, HuggingFace Transformers
- MFCC, embeddings, spectrograms
- ONNX / ONNX Runtime, Quantization
- SIP / RTP, Windows / Linux
We expect:
- 2+ years of experience with ML in production (your models actually served real users).
- Practical experience with Speech/Audio: understanding how audio features and modern sound processing architectures work.
- Engineering approach (QA-mindset): you are genuinely interested in delving into data anomalies and stress-testing the system.
- Understanding of classic and modern approaches (fine-tuning, transfer learning) and the ability to work with metrics (Precision/Recall, ROC-AUC, Calibration).
- Ability to work end-to-end: from a raw audio file to optimized inference.
What's important:
- Engineering autonomy: we value those who find problems themselves and bring solutions to production.
- Background: we highly welcome candidates coming to ML from Backend or QA; we value code and testing culture.
- Comfort with a fast pace: the project is growing, there are many tasks, and they directly impact the business.
Will be a plus:
- Experience in the Speech/Audio domain (ASR, VAD, Audio Classification).
- Understanding of VoIP specifics and streaming data processing.
- Experience with MLOps and model monitoring tools.
Conditions:
- Mandatory on-site onboarding in Tashkent (2-3 months) for product immersion, followed by fully remote work.
- Real production tasks in an international high-load product.
- Opportunity for professional growth and compensation review as tasks become more complex.
- Work in a team with strong engineering expertise and no bureaucracy.
📩 CV via Telegram: @haas_maru