Huawei Canada has an immediate permanent opening for a Distinguished System
Reliability Architect.
About the team:
Initially founded in 1991 as Huawei's ASIC Design Center, the IC Lab is a
leading global fabless semiconductor lab. This lab delivers trusted,
cutting-edge semiconductor products and services for smart devices, contributing
to smart home and mobility solutions. The local team in Canada specializes in
semiconductors, and chipset solutions.
About the job:
-
System Chip Reliability Management and Control: Closely cooperate with chip
development and deeply participate in chip reliability design based on
application scenario requirements to ensure system reliability from the
beginning. Participate in the chip reliability test plan. Analyze chip
failure cases, identify potential design defects or process problems, and
promote improvement. Establish a chip reliability warning mechanism to detect
and resolve chip risks.
-
Network Reliability Management and Control: Design and optimize network
architectures to maintain stability under high traffic. Develop redundancy
strategies, monitor performance, and lead fault troubleshooting.
-
Hardware Reliability Management and Control: Oversee hardware reliability
across the product lifecycle, from component selection to post-market
analysis. Implement rigorous testing and maintain a fault database.
-
System Engineering Reliability Management and Control: Develop reliability
strategies considering system architecture, software-hardware collaboration,
and interface compatibility. Use FTA and FMEA for risk analysis.
-
System Reliability Problem Definition and Analysis: Rapidly diagnose system
failures, coordinate cross-functional teams to resolve issues, and maintain a
reliability knowledge base for future improvements.
About the ideal candidate:
-
Master’s or Ph.D. in Electronic Engineering, Computer Science, or Reliability
Engineering.
-
15+ years in system reliability, with expertise in chip, network, and
hardware reliability management.
-
Strong knowledge of chip reliability testing, network architecture
optimization, and hardware fault analysis.
-
Proficiency in reliability analysis methods (FTA, FMEA) and related software
tools.
-
Excellent communication skills, with experience presenting to executives and
global R&D teams.