Member of Technical Staff, Modeling
💰 $80,000 – $130,000/yr
Job Description
About Cohere
Cohere's mission is to scale intelligence to serve humanity. We train and deploy frontier AI models for developers and enterprises building next-generation AI systems that power content generation, semantic search, retrieval-augmented generation (RAG), and autonomous agents. Our work is instrumental to the widespread adoption of AI across industries.
We are a team of world-class researchers, engineers, and designers obsessed with building exceptional products. Each team member contributes directly to increasing model capabilities and customer value. We operate with velocity and intentionality, prioritizing what's best for our customers while maintaining the highest standards of technical excellence.
The Opportunity
As a Member of Technical Staff in Modeling, you'll design and implement novel research ideas, ship state-of-the-art models to production, and maintain deep connections to the academic research community. We maintain one of the highest ratios of compute to engineers in the world, with no strict boundaries between engineering and research roles. You'll contribute to writing production-grade code and conducting cutting-edge research based on your interests and organizational needs.
This role offers access to world-class compute infrastructure, proprietary datasets, and a team of leading researchers—everything you need to do your best work.
Key Responsibilities
- Design, build, and scale AI systems for production deployment to end users
- Research, implement, and experiment with novel ideas on supercompute and data infrastructure
- Collaborate with leading researchers and stay at the forefront of AI/ML innovation
- Contribute to both foundational research and production systems
Required Qualifications
- Exceptional software engineering skills with proven ability to write clean, maintainable production code
- Strong proficiency in Python and ML frameworks including TensorFlow, TF-Serving, JAX, and XLA/MLIR
- Experience writing GPU kernels using CUDA
- Demonstrated experience with large-scale distributed training strategies
- Deep familiarity with autoregressive sequence models, particularly Transformers and related architectures
- Strong understanding of model serving, optimization, and production deployment
Bonus Qualifications
- Published research at top-tier venues (NeurIPS, ICML, ICLR, AISTATS, MLSys, JMLR, AAAI, COLING, ACL, EMNLP, Nature)
- Experience contributing to open-source ML projects
- Background in large language model development or deployment
Why Join Cohere?
Work with a diverse team of world-leading researchers and engineers. Access cutting-edge compute infrastructure and proprietary datasets. Operate with genuine remote flexibility: we have offices in Toronto, London, Paris, San Francisco, and New York, but this role has no location restrictions. Contribute directly to shaping the future of AI adoption globally.
💰 Compensation not publicly listed. Market estimate for similar roles: from $80K, varying by experience and location.
Note: We are not actively hiring for this position at this time. We encourage qualified candidates to submit resumes for future consideration. Your application will be reviewed if a suitable opening becomes available.