Senior Cloud Engineer (Multicloud)
💰 $80,000 – $130,000/yr
Job Description
About the Role
Pragmatike is hiring on behalf of a fast-growing AI startup recognized as a Top 10 GenAI company by GTM Capital, founded by MIT CSAIL researchers. This is an exceptional opportunity to join a world-class team building infrastructure for the next generation of AI systems.
We are searching for a Senior Cloud Engineer (Multicloud) with deep, hands-on experience building, operating, and scaling production infrastructure across AWS, GCP, and Azure. You will work directly on the cloud and platform layer supporting large-scale, distributed AI systems used by Fortune 500 customers.
This role is ideal for an engineer who has operated real multicloud environments in production—not someone limited to a single provider. You will be responsible for building reliable, scalable systems while navigating the complexity of differing cloud primitives, networking models, and operational trade-offs. Your expertise will directly impact the performance and reliability of mission-critical AI infrastructure.
What You'll Do
- Build, deploy, and operate production infrastructure across AWS, GCP, and Azure
- Maintain consistent environments using Infrastructure as Code (Terraform preferred)
- Deploy and operate Kubernetes clusters and containerized workloads across multiple cloud providers
- Design and manage cloud networking (VPC/VNet design, peering, load balancing, private connectivity)
- Implement monitoring, logging, alerting, and incident response for multicloud systems
- Optimize performance, reliability, and cost across providers through autoscaling and capacity planning
- Support AI training and inference workloads in multicloud environments
- Troubleshoot complex production issues spanning compute, networking, storage, and Kubernetes layers
- Collaborate closely with AI, backend, and platform teams to support production systems
What We're Looking For
- 5+ years of experience as a Cloud, Platform, or Infrastructure Engineer
- Hands-on production experience with AWS, GCP, and Azure (deep expertise in at least one)
- Strong experience running Kubernetes in production across multiple clouds
- Terraform expertise managing multicloud infrastructure at scale
- Solid understanding of cloud networking differences and security models across providers
- Experience operating distributed systems with on-call ownership and incident response
- Proficiency with containerization, CI/CD pipelines, and modern DevOps practices
- English (required)
Location & Start
Primary Location: Cambridge, MA (Eastern Time / UTC -4). Relocation package available or remote option for out-of-state applicants. Start Date: ASAP
💰 Compensation not publicly listed. Market estimate for similar roles: from $80K, varying by experience and location.