Remote Staff Software Engineer - Observability
About the Role
We're hiring a Remote Staff Software Engineer - Observability to join our team at Anthropic. As a member of the Observability team, you'll improve the reliability and operational excellence of our AI systems. Your work will directly shape how our engineers and researchers monitor and manage complex infrastructure.
What You'll Do
- Design and implement high-throughput ingest pipelines for operational data.
- Develop cost-efficient columnar storage solutions to manage large volumes of telemetry data.
- Create unified query layers across various signals to facilitate actionable insights.
- Build agentic diagnostic tools that enable engineers to detect and resolve issues rapidly.
- Collaborate with cross-functional teams to enhance monitoring and telemetry infrastructure.
Requirements
- 5+ years of experience as a Software Engineer, with a focus on observability and monitoring systems.
- Strong proficiency in programming languages such as Python, Go, or Java.
- Experience with distributed systems and cloud infrastructure.
- Familiarity with metrics, logging, and tracing technologies.
- Ability to work in a fast-paced, collaborative environment.
Nice to Have
- Experience with AI/ML systems and their observability challenges.
- Knowledge of GPU, TPU, and Trainium architectures.
- Familiarity with modern data storage solutions and query languages.
What We Offer
- Competitive salary ranging from $140,000 to $180,000 per year.
- Fully remote work environment with flexible hours.
- Opportunities for professional growth and development.
- Collaborative and innovative team culture.
- Health and wellness benefits.
This role offers an exciting opportunity to work at the forefront of AI observability, with a focus on enhancing operational excellence and reliability. Anthropic is committed to building beneficial AI systems.
Who Will Succeed Here
- Proficient in building and optimizing telemetry solutions in Python and Go, with a strong understanding of distributed systems architecture and the ability to improve observability across complex cloud infrastructure.
- Self-motivated and disciplined, focused on delivering high-quality software in a remote work environment, with the time management skills to balance multiple tasks and projects.
- At least 5 years of software engineering experience, particularly in observability and monitoring systems, with a mindset of continuous improvement and a proactive approach to identifying and resolving system inefficiencies.