Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs.
Our novel wafer-scale architecture provides the AI compute power of dozens of
GPUs on a single chip, with the programming simplicity of a single device. This
approach allows Cerebras to deliver industry-leading training and inference
speeds and empowers machine learning users to effortlessly run large-scale ML
applications, without the hassle of managing hundreds of GPUs or TPUs.
Cerebras' current customers include global corporations across multiple
industries, national labs, and top-tier healthcare systems. In January, we
announced a multi-year, multi-million-dollar partnership with Mayo Clinic,
underscoring our commitment to transforming AI applications across various
fields. In August, we launched Cerebras Inference, the fastest Generative AI
inference solution in the world, over 10 times faster than GPU-based hyperscale
cloud inference services.
ABOUT THE ROLE
As a Network Engineer on the Cluster Architecture Team, you will work closely
with the vendors, internal networking teams and industry peers to develop
best-in-class interconnect architecture of the current and future generations of
the Cerebras AI clusters. You will be responsible for developing
proof-of-concept of new network designs and features enabling resilient and
reliable network for AI workloads. The role will require cross-functional
collaboration and interaction with diverse hardware components (e.g., network
devices and the Wafer-Scale Engine) as well as software at several layers of the
stack, from host-side networking to cluster-level coordination. The role also
requires understanding of network monitoring systems and network debugging
methodologies.
RESPONSIBILITIES
- Design AI/ML and HPC Clusters with a focus on the network technology.
- Identify and address performance or efficiency bottlenecks, ensuring high
resource utilization, low latency, and high throughput communication.
- Stay current on emerging networking technologies: evaluate new hardware,
fabrics, and protocols to improve cluster performance, scalability, and cost
efficiency.
- Drive technical projects involving multiple teams, various software and
hardware components coming together to realize advanced networking
technologies.
- Bring effective communication skills.
- Collaborate with vendors and industry peers to drive network hardware and
feature roadmap.
- Pre-deployment readiness & port mapping: build/validate rack/row and
patch-panel port maps, cabling plans, if required in rare cases.
- Bring-up & rare deployment debugging: assist with lab/staging validation,
packet captures, link level diagnostics, and synthetic traffic tests.
SKILLS & QUALIFICATIONS
- Ph.D. in Computer Science or Electrical Engineering + 5 years industry
experience or Master’s in CS or EE + 8 years industry experience.
- 3+ Years of experience in large scale network designs in WAN or Datacenter.
- Extensive experience debugging networking issues in large distributed systems
environment with multiple networking platforms and protocols.
- Experience of managing and leading multi-phase and multi-team projects.
- Networking platforms like Juniper, Arista, Cisco, open-box architectures
(SONiC, FBOSS).
- Networking protocols like RoCE, BGP, DCQCN, PFC, streaming telemetry.
- Familiarity with automation languages like Python or Go.
- Familiarity with network visibility and management systems.
WHY JOIN CEREBRAS
People who are serious about software make their own hardware. At Cerebras we
have built a breakthrough architecture that is unlocking new opportunities for
the AI industry. With dozens of model releases and rapid growth, we’ve reached
an inflection point in our business. Members of our team tell us there are five
main reasons they joined Cerebras:
- Build a breakthrough AI platform beyond the constraints of the GPU.
- Publish and open source their cutting-edge AI research.
- Work on one of the fastest AI supercomputers in the world.
- Enjoy job stability with startup vitality.
- Our simple, non-corporate work culture that respects individual beliefs.
Read our blog: Five Reasons to Join Cerebras in 2025.
[https://www.cerebras.net/blog/5-reasons-to-join-cerebras]
APPLY TODAY AND BECOME PART OF THE FOREFRONT OF GROUNDBREAKING ADVANCEMENTS IN
AI!
Cerebras Systems is committed to creating an equal and diverse environment and
is proud to be an equal opportunity employer. We celebrate different
backgrounds, perspectives, and skills. We believe inclusive teams build better
products and companies. We try every day to build a work environment that
empowers people to do their best work through continuous learning, growth and
support of those around them.
This website or its third-party tools process personal data. For more details,
click here [https://www.cerebras.net/privacy/] to review our CCPA disclosure
notice.