Tenstorrent is leading the industry on cutting-edge AI technology,
revolutionizing performance expectations, ease of use, and cost efficiency. With
AI redefining the computing paradigm, solutions must evolve to unify innovations
in software models, compilers, platforms, networking, and semiconductors. Our
diverse team of technologists have developed a high performance RISC-V CPU from
scratch, and share a passion for AI and a deep desire to build the best AI
platform possible. We value collaboration, curiosity, and a commitment to
solving hard problems. We are growing our team and looking for contributors of
all seniorities.
We're looking for a Senior Machine Learning Engineer to join the ML Applications
Team in the Tenstorrent Cloud. Primary focus will be on developing realistic
demonstrations of AI applications, and APIs for them, for the Tenstorrent Cloud.
Experience spanning into platform infrastructure and Kubernetes is also highly
valuable.
This role is hybrid OR remote, based out of Santa Clara, CA, Austin, TX,
Toronto, ON.
We welcome candidates at various experience levels for this role. During the
interview process, candidates will be assessed for the appropriate level, and
offers will align with that level, which may differ from the one in this
posting.
Who You Are
- Fluent in Python; experience with Pytorch and HuggingFace models.
- Have used and developed APIs for AI/ML applications.
- Experienced with AI serving frameworks such as vllm, llm-d, LlamaIndex,
torchserve, etc.
- Experienced in performance benchmarking, profiling, and real-time system
optimization.
- Experienced with Containers, Docker, Git, CI/CD, Agile.
What We Need
- Hands-on design, development, and support of realistic demonstrations of AI
applications running on Tenstorrent hardware in our cloud.
- Your help in anticipating, clarifying, and de-risking technological
requirements for such demonstrations.
- Your help encouraging development and design choices in sustainable
directions that will allow our capabilities to compound.
- Your support enabling customer access to trying our latest hardware and
software.
What You Will Learn
- Vertical integration of AI applications on Tenstorrent software and hardware.
- Deployment of containerized applications and APIs with Kubernetes.
- Leverage the latest AI tools in your day-to-day work.
- Collaborate closely with a diverse range of experts, from HW, SW, AI/ML, and
ops backgrounds.
Compensation for all engineers at Tenstorrent ranges from $100k - $500k
including base and variable compensation targets. Experience, skills, education,
background and location all impact the actual offer made.
Tenstorrent offers a highly competitive compensation package and benefits, and
we are an equal opportunity employer.
This offer of employment is contingent upon the applicant being eligible to
access U.S. export-controlled technology. Due to U.S. export laws, including
those codified in the U.S. Export Administration Regulations (EAR), the Company
is required to ensure compliance with these laws when transferring technology to
nationals of certain countries (such as EAR Country Groups D:1, E1, and E2).
These requirements apply to persons located in the U.S. and all countries
outside the U.S. As the position offered will have direct and/or indirect
access to information, systems, or technologies subject to these laws, the offer
may be contingent upon your citizenship/permanent residency status or ability to
obtain prior license approval from the U.S. Commerce Department or applicable
federal agency. If employment is not possible due to U.S. export laws, any
offer of employment will be rescinded.