Remote Software Engineer - Infrastructure Supercomputing
About the Role
We are seeking a Remote Software Engineer - Infrastructure Supercomputing to join our innovative team at xAI. This role is crucial for operating some of the world’s largest GPU supercomputing clusters, focusing on both AI training and serving production models. As a Remote Software Engineer, you will be part of a small, highly motivated team that thrives on curiosity and engineering excellence.
What You'll Do
- Design and implement scalable, highly available containerized applications using Rust.
- Manage compute fleets with Pulumi, Terraform, Ansible, or other stateful automation libraries.
- Enhance deployment pipelines and ensure robust, secure service delivery across our production environments.
- Work with both on-premise clusters and cloud providers to optimize performance and security.
- Implement Infrastructure as Code (IaC) best practices to streamline operations.
- Collaborate with internal researchers to establish security best practices for live external traffic.
Requirements
- 3+ years of experience in software engineering, specifically in infrastructure and supercomputing.
- Strong proficiency in Rust and experience with containerization technologies.
- Experience managing compute fleets using automation tools like Pulumi or Terraform.
- Excellent communication skills to share knowledge effectively with teammates.
- Ability to work independently and take initiative in a flat organizational structure.
Nice to Have
- Familiarity with Kubernetes, Flux, and ArgoCD.
- Experience in AI or machine learning environments.
- Knowledge of security best practices in cloud environments.
What We Offer
- Competitive salary ranging from $100,000 to $150,000 per year.
- Fully remote work environment with flexible hours.
- Opportunities for professional growth and development.
- Collaborative and innovative team culture.
- Engagement in meaningful projects that impact the future of AI.
This Remote Software Engineer position at xAI offers a unique opportunity to work on cutting-edge supercomputing infrastructure in a fully remote setting.
About xAI
Explore xAI careers in 2026 and discover exciting job openings in remote, hybrid, and office roles. Our platform offers tailored application tracking, insightful company information, and advanced filters to help you find the perfect position at xAI. Stay updated with the latest industry news and vacancy scores to enhance your job search experience and unlock your potential in the innovative world of artificial intelligence.
Who Will Succeed Here
Proficiency in Rust for developing high-performance systems, with experience in asynchronous programming and memory management to optimize supercomputing tasks.
Strong familiarity with Kubernetes and Terraform for managing containerized applications and infrastructure as code, enabling efficient deployment and scaling of GPU clusters in a cloud environment.
A proactive mindset with a passion for continuous learning and improvement, able to adapt to evolving technologies and methodologies in a fully remote work setting, ensuring effective collaboration with a distributed team.
Learning Resources
Career Path
Market Overview
Skills & Requirements
Domain Trends
Industry News
Loading latest industry news...
Finding relevant articles from the last 6 months