Member of Technical Staff, Modeling

cohere·April 11, 2026·0 views

🌍 Remote · WorldwideFull-time

💰 $80,000 – $130,000/yr

Python CUDA JAX Transformers TensorFlow Distributed Training Machine Learning AI Research

Job Description

About Cohere

Cohere's mission is to scale intelligence to serve humanity. We're training and deploying frontier AI models for developers and enterprises building next-generation AI systems powering content generation, semantic search, retrieval-augmented generation (RAG), and autonomous agents. Our work is instrumental to the widespread adoption of AI across industries.

We are a team of world-class researchers, engineers, and designers obsessed with building exceptional products. Each team member contributes directly to increasing model capabilities and customer value. We operate with velocity and intentionality, prioritizing what's best for our customers while maintaining the highest standards of technical excellence.

The Opportunity

As a Member of Technical Staff in Modeling, you'll design and implement novel research ideas, ship state-of-the-art models to production, and maintain deep connections to the academic research community. We maintain one of the highest ratios of compute to engineers in the world, with no strict boundaries between engineering and research roles. You'll contribute to writing production-grade code and conducting cutting-edge research based on your interests and organizational needs.

This role offers access to world-class compute infrastructure, proprietary datasets, and a team of leading researchers—everything you need to do your best work.

Key Responsibilities

Design, build, and scale AI systems for production deployment to end users
Research, implement, and experiment with novel ideas on supercompute and data infrastructure
Collaborate with leading researchers and stay at the forefront of AI/ML innovation
Contribute to both foundational research and production systems

Required Qualifications

Exceptional software engineering skills with proven ability to write clean, maintainable production code
Strong proficiency in Python and ML frameworks including TensorFlow, TF-Serving, JAX, and XLA/MLIR
Experience writing GPU kernels using CUDA
Demonstrated experience with large-scale distributed training strategies
Deep familiarity with autoregressive sequence models, particularly Transformers and related architectures
Strong understanding of model serving, optimization, and production deployment

Bonus Qualifications

Published research at top-tier venues (NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, COLING, ACL, EMNLP, Nature)
Experience contributing to open-source ML projects
Background in large language model development or deployment

Why Join Cohere?

Work with a diverse team of world-leading researchers and engineers. Access cutting-edge compute infrastructure and proprietary datasets. Operate with genuine remote flexibility—we have offices in Toronto, London, Paris, San Francisco, and New York, but with no location restrictions for this role. Contribute directly to shaping the future of AI adoption globally.

💰 Compensation not publicly listed. Market estimate for similar roles: from $80K, varying by experience and location.

Note: This position is currently not actively hiring. We encourage qualified candidates to submit resumes for future consideration. Your application will be reviewed if a suitable opening becomes available.