Sr. ML Engineer, AI Cloud

Tenstorrent • Toronto • 3w ago

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.

We're looking for a Senior Machine Learning Engineer to join the ML Applications Team in the Tenstorrent Cloud. Primary focus will be on developing realistic demonstrations of AI applications, and APIs for them, for the Tenstorrent Cloud. Experience spanning into platform infrastructure and Kubernetes is also highly valuable.

This role is hybrid OR remote, based out of Santa Clara, CA, Austin, TX, Toronto, ON.

We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.

Who You Are

Fluent in Python; experience with Pytorch and HuggingFace models.
Have used and developed APIs for AI/ML applications.
Experienced with AI serving frameworks such as vllm, llm-d, LlamaIndex, torchserve, etc.
Experienced in performance benchmarking, profiling, and real-time system optimization.
Experienced with Containers, Docker, Git, CI/CD, Agile.

What We Need

Hands-on design, development, and support of realistic demonstrations of AI applications running on Tenstorrent hardware in our cloud.
Your help in anticipating, clarifying, and de-risking technological requirements for such demonstrations.
Your help encouraging development and design choices in sustainable directions that will allow our capabilities to compound.
Your support enabling customer access to trying our latest hardware and software.

What You Will Learn

Vertical integration of AI applications on Tenstorrent software and hardware.
Deployment of containerized applications and APIs with Kubernetes.
Leverage the latest AI tools in your day-to-day work.
Collaborate closely with a diverse range of experts, from HW, SW, AI/ML, and ops backgrounds.

Compensation for all engineers at Tenstorrent ranges from $100k - $500k including base and variable compensation targets. Experience, skills, education, background and location all impact the actual offer made.

Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.

This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology. Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2). These requirements apply to persons located in the U.S. and all countries outside the U.S. As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency. If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.