Больше информации по резюме будет доступно после регистрации

Зарегистрироваться
Был более двух недель назад

Мужчина, 25 лет, родился 1 декабря 2000

Алматы, готов к переезду, готов к командировкам

Machine Learning engineer

Специализации:
  • Программист, разработчик

Тип занятости: полная занятость, частичная занятость, проектная работа/разовое задание

Опыт работы 5 лет 10 месяцев

Август 2023по настоящее время
2 года 9 месяцев

Санкт-Петербург

Информационные технологии, системная интеграция, интернет... Показать еще

Machine Learning engineer
◦ Enhanced acoustic model precision and accelerated vocoder performance, elevating sound quality and efficiency. ◦ Spearheaded innovative research initiatives to mitigate speaker accents in diverse languages without the need for audio recordings. ◦ Executed a successful migration of the speech synthesis system to a modern architecture, ensuring scalability and performance enhancements.
Март 2023Август 2023
6 месяцев
Galamat Technology
Team Lead
◦ Spearheaded the development and rollout of a cutting-edge speech recognition service, catering to over 100 languages in both offline and streaming modes. ◦ Led the successful deployment of a speech synthesis platform into production, featuring 6 Kazakh speakers and over 240 Russian speakers, achieving exceptional quality metrics. ◦ Implemented an end-to-end development pipeline using ClearML, optimizing resource allocation, versioning data, logging experiments, and documenting hypotheses and solutions, while managing a cross-functional team of 2 ML engineers, MLOps expert, and Backend developer. ◦ Implemented a seamless language identification service integrated into the ASR pipeline, enhancing overall system capabilities.
Ноябрь 2020Март 2023
2 года 5 месяцев
Galamat Technology
Machine Learning Engineer
◦ Developed and optimized an end-to-end ASR training pipeline for Kazakh and Russian languages, producing models that are competitive with other companies’ solutions. ◦ Built and implemented the entire TTS model training pipeline, from data collection through to production deployment, resulting in highly controlled synthesis with 4+ MOS for each speaker. ◦ Deployed an inference service to accelerate the TTS model, achieving a high bandwidth of 300+ RTFX and low latency between audio chunks for more concurrent streams. ◦ Optimized the training pipeline by adding various techniques, resulting in a significant speed-up of the process. This reduced the waiting time for model training and improved overall efficiency. ◦ Deployed an inference service to optimize the ASR models for Kazakh and Russian languages, achieving a high bandwidth of 450+ RTFX and an low latency for 500+ streams. ◦ Implemented MLOps practices including Data Version Control (DVC) and Weights & Biases (W&B) experiment tracking, which optimized the machine learning development cycle, improved reproducibility of experiments, and increased collaboration within the team. ◦ Designed and deployed an efficient API service using REST and gRPC to leverage proprietary ASR and TTS solutions. Achieved exceptional low-latency and high-throughput performance. ◦ Constructed a universal pipeline for audio data processing and preparation to train ASR models. Accelerated work with low-resource languages from 1 week to just 1 day.
Июль 2020Октябрь 2020
4 месяца
Galamat Technology
Machine Learning Intern
◦ Built program for automated payment transfers via Kaspi app, processing 2000+ daily transactions for Kazakh speech data platform. Streamlined payments, drove growth, and replaced 10+ manual positions for platform success. ◦ Designed and developed a sophisticated Kazakh speech data collection platform, attracting over 44,000 users. Achieved successful collection of 10,000+ hours of speech in 3 months, on a limited budget.

Навыки

Уровни владения навыками
Python
Git
Docker
Flask
Machine Learning
Bash
PyTorch
Lightning
NeMo
Scikit
Unix
gRPC
TensorRT
ONNX
Triton
Riva
DVC
WandB
FastAPI
ClearML

Обо мне

Machine Learning Engineer with over 3 years of experience, I specialize in the Speech domain, specifically ASR, TTS, VAD etc. I have a proven track record of designing and building ML solutions from scratch. I am passionate about research and enjoy deploying state-of-the-art models into production environments.

Высшее образование

2022
Высшее образование
ИСУИТ, Вычислительная техника и программное обеспечение

Знание языков

Казахский — Родной

Английский — B2 — Средне-продвинутый

Русский — C2 — В совершенстве

Повышение квалификации, курсы

2021
Introduction to Machine Learning in Production
Coursera, DeepLearning AI
2021
Нейронные сети и обработка текста Samsung Research Russian Open Education
Stepik
2020
Математика и Python для анализа данных
Coursera, Яндекс, МФТИ, Машинное обучение
2020
Обучение на размеченных данных
Coursera, Яндекс, МФТИ, Машинное обучение
2020
Нейронные сети и компьютерное зрение
Coursera, Samsung Research, Deep learning

Гражданство, время в пути до работы

Гражданство: Казахстан

Разрешение на работу: Другое, Казахстан, Россия

Желательное время в пути до работы: Не имеет значения