Senior Deep Learning Compiler Engineer

Huawei Technologies Canada Co., Ltd. • Markham • 1m ago

Huawei Canada has an immediate permanent opening for a Senior Deep Learning Compiler Engineer.

About the team:

Initially founded in 1991 as Huawei's ASIC Design Center, the IC Lab is a leading global fabless semiconductor lab. This lab delivers trusted, cutting-edge semiconductor products and services for smart devices, contributing to smart home and mobility solutions. The local team in Canada specializes in semiconductors, and chipset solutions.

About the job:

Lead research efforts to develop state-of-the-art algorithms that significantly improve deep learning model inference times, memory usage, and overall performance. Focus on ground breaking techniques that push the boundaries of compiler optimization, and drive innovations that directly impact the effectiveness of the system in real-world wireless applications.
Drive initiatives that accelerate the development workflow within the team. You will identify bottlenecks, streamline processes, and introduce best practices to ensure that the team remains agile, efficient, and able to deliver high-quality results quickly. You’ll play a key role in defining engineering practices that enhance both individual and team output.
Provide mentorship to junior engineers and foster a culture of continuous learning. You’ll guide their professional development, help refine their skills, and ensure the team remains technically strong and aligned with best practices. Additionally, you’ll review designs and code to ensure high standards are maintained while providing constructive feedback to elevate the team's overall capabilities.
Lead the design and development of optimized code in C++ and Python for deep learning inference systems. You will take ownership of performance-critical components and help define appropriate high-level abstractions that balance clarity with efficiency.
Work closely with other engineering teams to ensure that optimizations and new algorithms are implemented efficiently across the entire system. Your ability to clearly communicate complex ideas and collaborate with cross-functional teams will be crucial in driving system-level changes that result in tangible improvements.
Lead efforts to integrate and optimize deep learning systems for custom hardware accelerators, such as GPUs, TPUs, or other AI accelerators. Your deep understanding of hardware-software interactions will drive major performance improvements in real-time applications, ensuring that the system runs at peak efficiency.
Work closely with product managers and other stakeholders to help define the technical roadmap. You will take an active role in determining the direction of the technology stack and prioritize initiatives based on business needs and technological advancements.

About the ideal candidate:

5+ years of experience in software engineering or a related domain, with a proven track record of developing complex systems and optimizing deep learning pipelines. A background in compiler design or machine learning systems is an asset.
Deep knowledge of C++ and Python, with experience writing optimized, high-performance code. Strong understanding of algorithms, data structures, and performance profiling.
Passion for researching new algorithms that improve deep learning systems. You stay up-to-date with the latest research in compiler optimization, deep learning, and hardware acceleration, and you’re eager to apply new findings to solve practical challenges.
You are an experienced mentor who enjoys guiding junior engineers, sharing knowledge, and helping others grow technically. Experience reviewing code, providing feedback, and driving technical discussions to ensure high-quality engineering standards.
Excellent problem solver, with the ability to navigate complex technical challenges independently. You take initiative, and you excel at driving projects forward with minimal supervision.
Ability to articulate complex technical concepts clearly to both technical and non-technical stakeholders is critical. You thrive in a collaborative environment and can effectively balance individual contributions with team goals.
Experience with MLIR (Multi-Level Intermediate Representation) and other specialized compiler frameworks is an asset, as is a background in optimizing deep learning models for edge devices and real-time applications.