AI SCORE 9.0 / 10
Remote Principal Software Engineer - AI Inference Systems
$272K–$431K/year
About the Role
We are seeking a Remote Principal Software Engineer to join our innovative NVIDIA Dynamo team, focused on building scalable AI inference systems. This role offers an exciting opportunity to work on cutting-edge technology in distributed GPU environments, enhancing the performance of large language models (LLMs) and reasoning systems.
What You'll Do
- Collaborate on the design and development of the Dynamo Kubernetes stack, enhancing the deployment of AI inference systems.
- Introduce new features to the Dynamo Python SDK and Rust Runtime Core Library, ensuring robust performance.
- Design, implement, and optimize distributed inference components using Rust and Python, contributing to high-performance AI workloads.
- Architect and optimize the separation of prefill and decode phases across GPU clusters to improve throughput.
- Develop dynamic GPU scheduling algorithms to ensure efficient resource allocation based on workload demands.
- Enhance intelligent routing systems to minimize latency and improve request handling for complex reasoning tasks.
- Contribute to open-source repositories, participate in code reviews, and assist with issue triage on GitHub.
- Write clear documentation and contribute to user and developer guides, ensuring a smooth experience for the community.
Requirements
- BS/MS or higher in computer engineering, computer science, or a related field (or equivalent experience).
- 15+ years of proven experience in software engineering, particularly in systems programming.
- Strong proficiency in Rust and/or C++, with experience in Python for workflow and API development.
- Experience with Kubernetes and cloud-native deployment practices.
- Deep understanding of distributed systems, parallel computing, and GPU architectures.
- Familiarity with large-scale inference serving and high-performance AI workloads.
- Excellent problem-solving and communication skills.
- Prior contributions to open-source AI frameworks are a plus.
Nice to Have
- Experience with GPU resource scheduling and cache management.
- Understanding of LLM-specific inference challenges, such as context window scaling.
What We Offer
- Highly competitive salaries with a base salary range of $272,000 - $431,250.
- Eligibility for equity and a comprehensive benefits package.
- Opportunities to work with some of the most forward-thinking and hardworking people in the technology industry.
- A diverse work environment that values inclusion and equal opportunity.
- Remote work flexibility and a chance to contribute to groundbreaking AI technology.
Language Requirements
EnglishC1
BasicIntermediateAdvancedNative
Why This Job9.0 of 10
This Remote Principal Software Engineer position at NVIDIA offers a unique opportunity to work on cutting-edge AI inference systems with a competitive salary and equity options.
Salary Range
Required
0/1
Optional
0/1
Bonus
0/1
Generating success profile...
Analyzing job requirements and market data
Loading market overview...
Analyzing market trends and skill demands
Industry News
Loading latest industry news...
Finding relevant articles from the last 6 months
All job postings are automatically gathered by algorithms. We do not review or verify listings, be careful when applying and do not sign-in with iCloud or Google services.