Remote Senior Site Reliability Engineer - Cloud Solutions
About the Role
We're hiring a Remote Senior Site Reliability Engineer to join our innovative team at Cloudbeds. As a leader in hospitality technology, we empower hotels and properties across 150 countries with our cutting-edge platform. In this role, you will ensure the reliability and performance of our systems, enabling seamless transactions for millions of users globally.
What You'll Do
- Design and implement reliable and scalable AWS architecture to meet the needs of the organization.
- Maintain and support highly loaded Kubernetes (EKS) clusters and infrastructure-related components.
- Support the CICD process with ArgoCD and GitOps.
- Automate platform deployments using Terraform infrastructure-as-code.
- Develop and continuously improve product observability and monitoring systems based on Grafana, Prometheus, DataDog, and Cloudwatch.
- Participate in incident management and root cause analysis, ensuring minimal impact on services.
- Optimize system performance and troubleshoot issues as they arise.
Requirements
- 5+ years of experience as a Site Reliability Engineer or in a similar role.
- Strong expertise in AWS cloud services and architecture.
- Proficiency with Kubernetes and container orchestration.
- Experience with CI/CD tools such as ArgoCD and GitOps.
- Familiarity with infrastructure as code tools like Terraform.
- Knowledge of monitoring and observability tools such as Grafana, Prometheus, and DataDog.
- Excellent problem-solving skills and a proactive approach to system reliability.
Nice to Have
- Experience in the hospitality or travel industry.
- Familiarity with incident management tools.
- Knowledge of scripting languages such as Python or Bash.
What We Offer
- Competitive salary ranging from $140,000 to $180,000 annually.
- Fully remote work environment with flexible hours.
- Opportunities for professional growth and development.
- Collaborative and innovative team culture.
- Health and wellness benefits.
- Access to cutting-edge technologies and tools.
- Work with a diverse and talented team from around the world.
This role offers a unique opportunity to work remotely as a Senior Site Reliability Engineer at Cloudbeds, a leader in hospitality technology. You'll be part of a collaborative team, ensuring system reliability for millions of users.
Who Will Succeed Here
Proficient in AWS services, including EC2, S3, and RDS, with hands-on experience in deploying and managing Kubernetes clusters using EKS for high availability and scalability.
Strong understanding of GitOps principles and hands-on experience with ArgoCD for continuous delivery, ensuring efficient and reliable deployment processes in a remote work environment.
Demonstrated experience in monitoring and performance tuning using Grafana and Prometheus, coupled with a proactive mindset to identify and resolve system issues before they impact users.
Learning Resources
Career Path
Market Overview
Skills & Requirements
Domain Trends
Industry News
Loading latest industry news...
Finding relevant articles from the last 6 months