Rask AI logo
Rask AI

Senior Deep Learning Engineer in Speech R&D

RemoteFull-timeSeniorWorldDevelopment

We are looking for an expert in Speech. Our vacancy is for those who have strong expertise in different areas of Speech and also want to expand their knowledge in computer vision and neural networks for image synthesis.

At Rask.ai, you will work on cutting-edge machine learning models to solve the challenges of creating ultra-realistic AI voiceovers.

Your role would be designing machine learning systems for Speaker Diarization, Transcription, and other Speech processing tasks. You will research and implement appropriate ML algorithms and tools; read, understand, and reproduce papers from recent ML conferences.

Requirements

  • Proven experience as a Machine Learning Engineer in Speech processing or similar role (3+)
  • Strong prototyping skills: you are able to work in fast iterations and create solutions based on open-source code
  • Strong software engineering skills
  • Strong model implementation, training, and debugging skills in PyTorch
  • Strong expertise in Speech processing; expertise in speaker recognition/ speaker diarization is a plus
  • ClearML / Wandb / Neptune / MLFlow.
  • You have deep knowledge of the latest state-of-the-art research, techniques, and innovations in machine learning, you feel confident in implementing papers from scratch

Nice to have but not a deal breaker

  • Publications at top ML conferences.
  • Strong results in ML competitions (e.g., gold/silver medals on Kaggle)
  • Experience with distributed computing.
  • Experience in deep learning model deployment (Triton) and optimization.
  • Experience in collecting and processing big amounts of data and creating large datasets.

Ready to apply for this role?

Apply Now →

Related jobs

Apply Now →