Remote Senior Site Reliability Engineer - AWS & Kubernetes
About the Role
We're hiring a Remote Senior Site Reliability Engineer to join our innovative team at Cloudbeds. As a key player, you'll ensure our platform's reliability and performance, supporting millions of hospitality transactions globally. This is an exciting opportunity to work with cutting-edge technologies and a fully remote team.
What You'll Do
- Design and implement reliable and scalable AWS architecture to meet the needs of the organization.
- Maintain and support highly loaded Kubernetes (EKS) clusters and infrastructure-related components.
- Support the CI/CD process with ArgoCD and GitOps.
- Automate platform deployments using Terraform infrastructure-as-code.
- Develop and continuously improve product observability and monitoring systems using Grafana, Prometheus, DataDog, and Cloudwatch.
- Participate in incident management and root cause analysis, ensuring minimal impact on services.
- Optimize system performance and troubleshoot issues as they arise.
Requirements
- 5+ years of experience as a Site Reliability Engineer or similar role.
- Strong expertise in AWS services and architecture.
- Proficient in Kubernetes, especially EKS.
- Experience with CI/CD tools like ArgoCD and GitOps.
- Familiarity with infrastructure-as-code using Terraform.
- Strong analytical and troubleshooting skills.
- Excellent communication and collaboration skills.
Nice to Have
- Experience with observability tools like Grafana and DataDog.
- Knowledge of incident management processes.
- Familiarity with Agile methodologies.
What We Offer
- Competitive salary ranging from $140,000 to $180,000 annually.
- Fully remote work environment with flexible hours.
- Opportunity to work with a diverse and talented team.
- Access to cutting-edge technology and tools.
- Professional development opportunities and learning budget.
- Health and wellness benefits.
This Remote Senior Site Reliability Engineer position at Cloudbeds offers a competitive salary, the chance to work with cutting-edge technologies, and a fully remote work environment.
Who Will Succeed Here
Expertise in AWS architecture design and implementation, specifically leveraging services like EC2, S3, and RDS to ensure high availability and scalability of applications.
Strong experience with Kubernetes orchestration and management, including proficiency in deploying and managing containerized applications using Helm charts and understanding of service mesh technologies.
Proactive mindset with a focus on continuous improvement and automation, demonstrated by hands-on experience with GitOps practices using ArgoCD and infrastructure as code with Terraform.
Learning Resources
Career Path
Market Overview
Skills & Requirements
Domain Trends
Industry News
Loading latest industry news...
Finding relevant articles from the last 6 months