We are looking for an expert in Speech. Our vacancy is for those who have strong expertise in different areas of Speech and also want to expand their knowledge in computer vision and neural networks for image synthesis.

At Rask.ai, you will work on cutting-edge machine learning models to solve the challenges of creating ultra-realistic AI voiceovers.

Your role would be designing machine learning systems for Speaker Diarization, Transcription, and other Speech processing tasks. You will research and implement appropriate ML algorithms and tools; read, understand, and reproduce papers from recent ML conferences.

Requirements

Proven experience as a Machine Learning Engineer in Speech processing or similar role (3+)
Strong prototyping skills: you are able to work in fast iterations and create solutions based on open-source code
Strong software engineering skills
Strong model implementation, training, and debugging skills in PyTorch
Strong expertise in Speech processing; expertise in speaker recognition/ speaker diarization is a plus
ClearML / Wandb / Neptune / MLFlow.
You have deep knowledge of the latest state-of-the-art research, techniques, and innovations in machine learning, you feel confident in implementing papers from scratch

Nice to have but not a deal breaker

Publications at top ML conferences.
Strong results in ML competitions (e.g., gold/silver medals on Kaggle)
Experience with distributed computing.
Experience in deep learning model deployment (Triton) and optimization.
Experience in collecting and processing big amounts of data and creating large datasets.

Ready to apply for this role?

Apply Now →

Senior Deep Learning Engineer in Speech R&D

Requirements

Nice to have but not a deal breaker

Related jobs

Senior Machine Learning Engineer (f/m/d)

(Senior) Backend Engineer, Platform

Account Executive

Partnerships Manager