Senior Researcher – Speech AI

Huawei Technologies Canada Co., Ltd. • Markham, York Region • 8h ago

Huawei Canada has an immediate permanent opening for a Senior Researcher.

About the team: The Human-Machine Interaction Lab unites global talents to redefine the relationship between humans and technology. Focused on innovation and user-centered design, the lab strives to advance human-computer interaction research. Our team includes researchers, engineers, and designers collaborating across disciplines to develop novel interactive systems, sensing technologies, wearable and IoT systems, human factors, computer vision, and multimodal interfaces. Through high-impact products and cutting-edge research, we aim to enhance user experiences and interactions with technology.

About the job:

Design and implement speech foundation models and speech language models (SLMs) for a variety of applications
Develop algorithms for speech and acoustic signal processing, including ASR, speech enhancement, beamforming, and acoustic event detection
Conduct original research and prototyping using deep learning, transformers, RNNs, and other modern machine learning techniques
Collaborate closely with cross-functional teams to bring speech-related solutions from concept to integration into real-world systems
Evaluate and benchmark algorithms using both quantitative metrics (e.g., WER, PESQ, STOI) and qualitative assessments
Develop robust infrastructure and pipelines to support rapid experimentation and deployment of research ideas
Stay at the forefront of developments in speech technology, foundation models, and conversational AI, and incorporate state-of-the-art methods into research and product development

About the ideal candidate:

Ph.D. in Computer Science, Electrical Engineering, or a related field with a strong focus on speech processing and machine learning
Demonstrated experience with LLMs, ASR systems, and modern speech models (e.g., Whisper, wav2vec, HuBERT, Conformer)
Proficiency in Python and experience with at least one additional programming language (e.g., C++, Java, JavaScript)
Expertise in deep learning frameworks such as PyTorch, TensorFlow, or Keras
Familiarity with common audio/speech processing libraries (e.g., TorchAudio, Librosa, PyAudio)
Strong grasp of digital and statistical signal processing, including spectral and spatial filtering
Experience working with large-scale, noisy datasets in real-world environments
A solid publication record in relevant venues (e.g., NeurIPS, ICML, ICLR, Interspeech, ICASSP, AES)