About the Role

We are seeking a Senior Machine Learning Engineer to join our pioneering AI Infrastructure & Cloud Services firm, dedicated to dismantling the barriers of large-scale AI innovation. This role is essential for architecting and operating the core systems that power massive-scale distributed training and inference. As a Senior Machine Learning Engineer, you will play a crucial role in creating seamless, resilient, and secure environments for the world’s builders.

What You'll Do

  • Architect Infrastructure: Design and optimize ML systems that support massive distributed training and high-concurrency inference workloads.
  • Orchestration & Reliability: Build repeatable execution patterns across shared, high-density compute environments.
  • Performance Engineering: Troubleshoot and resolve complex scalability and reliability bottlenecks.
  • Cross-Functional Partnership: Collaborate with Systems and Platform teams to bridge the gap between hardware capabilities and developer experience.
  • Innovate: Solve problems that haven't been solved yet, pushing the boundaries of AI development.

Requirements

  • Proven experience managing ML workloads across large-scale clusters.
  • Proficiency in orchestrating GPU-accelerated environments (Kubernetes, SLURM).
  • Deep understanding of the Linux networking stack, drivers, and low-level performance tuning.
  • Experience solving problems that emerge when moving from "handfuls of devices" to massive, warehouse-scale compute.
  • Strong commitment to scaling a promising company with a builder mindset.

Nice to Have

  • Experience with Infrastructure as Code.
  • Knowledge of cloud services and deployment strategies.
  • Familiarity with machine learning frameworks and libraries.

What We Offer

  • Competitive industry salary with RSUs.
  • 100% paid insurance plans (medical, dental, vision).
  • PTO and Paid Holidays.
  • 401(k) with company match.
  • Paternity/Maternity Leave.
  • FSA and Life Insurance.
  • Mental Health Support.
  • Comprehensive relocation packages to help you move and settle in your new role.

This is not a maintenance job; you will be pushed daily to innovate and build the infrastructure that will define the next decade of AI development. If you are interested in relocating to the United States, check out our comprehensive Relocation Jobs in the United States page for detailed relocation packages and benefits.

Language Requirements
EnglishC1
BasicIntermediateAdvancedNative
Why This Job8.5 of 10

This role offers a unique opportunity to innovate in AI infrastructure at a leading firm, with competitive compensation and relocation support.

Salary Range
Required
0/1
Optional
0/1
Bonus
0/1

Generating success profile...

Analyzing job requirements and market data

Loading market overview...

Analyzing market trends and skill demands

Industry News

Loading latest industry news...

Finding relevant articles from the last 6 months

All job postings are automatically gathered by algorithms. We do not review or verify listings, be careful when applying and do not sign-in with iCloud or Google services.