Boson AI is an early-stage startup building large language tools for interaction
and entertainment. Our founders, Alex Smola [https://alex.smola.org/], Mu Li
[https://github.com/mli], and a team of Deep Learning, Optimization, NLP, AutoML
and Statistics scientists and engineers are working on high quality generative
AI models for language and beyond.
We are seeking research scientists and engineers to join our team full-time in
our Santa Clara office. As part of your role, you will work on modeling and
training LLMs, understanding and interpreting model behavior and aligning models
to human values. The ideal candidate will possess a strong background in machine
learning, and have motivations for developing state-of-the-art models towards
AGI.
\n
Responsibilities
- Design and verify novel model architectures and training objectives.
- Investigate novel model alignment algorithms.
- Write efficient and clean code for ML training.
- Conduct large-scale experiments to verify the modeling choices and identify
improvement areas.
Experience
- Summarize results and clearly communicate the motivations and observations in
your work
- Proficiency in at least one deep learning framework, such as PyTorch.
- Participation in at least one research project related to LLM or multimodal
models, e.g. experience in training or fine-tuning them.
- Experience in alignment research
- Experience in large-scale distributed model training
- Experience in writing GPU kernels in CUDA
Qualifications
- PhD or Master's degree with solid scientific contributions
- Active GitHub repository
- Active scientific track record
- Excellent problem-solving skills
\n
$150,000 - $600,000 a year
Total compensations includes base pay, equity, and benefits. We have a 401k
plan, HSA, FSA, free food (even dried mangoes).
\n