Больше информации по резюме будет доступно после регистрации
ЗарегистрироватьсяБыл более двух недель назад
Мужчина, 25 лет, родился 1 декабря 2000
Алматы, готов к переезду, готов к командировкам
Machine Learning engineer
Специализации:
- Программист, разработчик
Тип занятости: полная занятость, частичная занятость, проектная работа/разовое задание
Опыт работы 5 лет 10 месяцев
Август 2023 — по настоящее время
2 года 9 месяцев
Санкт-Петербург
Информационные технологии, системная интеграция, интернет... Показать еще
Machine Learning engineer
◦ Enhanced acoustic model precision and accelerated vocoder performance, elevating sound quality and efficiency.
◦ Spearheaded innovative research initiatives to mitigate speaker accents in diverse languages without the need for
audio recordings.
◦ Executed a successful migration of the speech synthesis system to a modern architecture, ensuring scalability and
performance enhancements.
Март 2023 — Август 2023
6 месяцев
Galamat Technology
Team Lead
◦ Spearheaded the development and rollout of a cutting-edge speech recognition service, catering to over 100
languages in both offline and streaming modes.
◦ Led the successful deployment of a speech synthesis platform into production, featuring 6 Kazakh speakers and
over 240 Russian speakers, achieving exceptional quality metrics.
◦ Implemented an end-to-end development pipeline using ClearML, optimizing resource allocation, versioning data,
logging experiments, and documenting hypotheses and solutions, while managing a cross-functional team of 2 ML
engineers, MLOps expert, and Backend developer.
◦ Implemented a seamless language identification service integrated into the ASR pipeline, enhancing overall system
capabilities.
Ноябрь 2020 — Март 2023
2 года 5 месяцев
Galamat Technology
Machine Learning Engineer
◦ Developed and optimized an end-to-end ASR training pipeline for Kazakh and Russian languages,
producing models that are competitive with other companies’ solutions.
◦ Built and implemented the entire TTS model training pipeline, from data collection through to production deployment, resulting in highly controlled synthesis with 4+ MOS for each speaker.
◦ Deployed an inference service to accelerate the TTS model, achieving a high bandwidth of 300+ RTFX and low latency between audio chunks for more concurrent streams.
◦ Optimized the training pipeline by adding various techniques, resulting in a significant speed-up of the process. This reduced the waiting time for model training and improved overall efficiency.
◦ Deployed an inference service to optimize the ASR models for Kazakh and Russian languages, achieving
a high bandwidth of 450+ RTFX and an low latency for 500+ streams.
◦ Implemented MLOps practices including Data Version Control (DVC) and Weights & Biases (W&B) experiment tracking, which optimized the machine learning development cycle, improved reproducibility of experiments, and increased collaboration within the team.
◦ Designed and deployed an efficient API service using REST and gRPC to leverage proprietary ASR and TTS solutions. Achieved exceptional low-latency and high-throughput performance.
◦ Constructed a universal pipeline for audio data processing and preparation to train ASR models. Accelerated work with low-resource languages from 1 week to just 1 day.
Июль 2020 — Октябрь 2020
4 месяца
Galamat Technology
Machine Learning Intern
◦ Built program for automated payment transfers via Kaspi app, processing 2000+ daily transactions for
Kazakh speech data platform. Streamlined payments, drove growth, and replaced 10+ manual positions
for platform success.
◦ Designed and developed a sophisticated Kazakh speech data collection platform, attracting over 44,000
users. Achieved successful collection of 10,000+ hours of speech in 3 months, on a limited budget.
Навыки
Уровни владения навыками
Обо мне
Machine Learning Engineer with over 3 years of experience, I specialize in the Speech domain, specifically ASR, TTS, VAD etc. I have a proven track record of designing and building ML solutions from scratch. I am passionate about research and enjoy deploying state-of-the-art models into production environments.
Высшее образование
2022
Высшее образование
ИСУИТ, Вычислительная техника и программное обеспечение
Знание языков
Повышение квалификации, курсы
2021
Introduction to Machine Learning in Production
Coursera, DeepLearning AI
2021
Нейронные сети и обработка текста Samsung Research Russian Open Education
Stepik
2020
Математика и Python для анализа данных
Coursera, Яндекс, МФТИ, Машинное обучение
2020
Обучение на размеченных данных
Coursera, Яндекс, МФТИ, Машинное обучение
2020
Нейронные сети и компьютерное зрение
Coursera, Samsung Research, Deep learning
Гражданство, время в пути до работы
Гражданство: Казахстан
Разрешение на работу: Другое, Казахстан, Россия
Желательное время в пути до работы: Не имеет значения