Staff Engineer, ML Models

Tenstorrent • Toronto • 1w ago

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.

As a Staff Machine Learning Engineer on the AI Models team at Tenstorrent, you’ll take the lead in bringing up and optimizing cutting edge AI models to run on our custom AI devices. You’ll experiment, optimize, and push boundaries while solving real world problems. If you love the craft of ML and want to work on models that are used in real world applications, you’ll feel right at home.

This role is hybrid, based out of Toronto, ON; Austin, TX; Santa Clara, CA, with the opportunity to be remote on a candidate-by-candidate basis.

Who You Are

Confident with Python programming and hands-on experience with PyTorch for developing deep learning models.
Driven by curiosity and a desire to experiment, always seeking to understand how complex systems work and how to make them better.
Possess a deep understanding of ML model architectures, with the ability to optimize both individual components and overall model performance.
Comfortable leading a small group of engineers and working closely with cross-functional teams.

What We Need

Hands-on experience bringing up state-of-the-art ML models on new hardware platforms.
Strong debugging instincts to investigate performance issues, tune architectures, and boost model accuracy and robustness.
Working knowledge of model optimization techniques, like quantization, flash attention, kernel fusing, and multi-device parallelization.
A curiosity-driven mindset that stays current with ML research and brings practical insights to real-world engineering challenges.

What You Will Learn

How to get real ML models to fly on a custom AI accelerator
Ways to optimize ML model performance, from application to silicon level.
What it takes to go from research paper to production ready ML deployment
How to work alongside compiler, kernel, and hardware teams to drive new features, performance optimizations, and fixes.

Compensation for all engineers at Tenstorrent ranges from $100k - $500k including base and variable compensation targets. Experience, skills, education, background and location all impact the actual offer made.

Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.

This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology. Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2). These requirements apply to persons located in the U.S. and all countries outside the U.S. As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency. If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.