Achievers16.02.26
AI SCORE 8.5

Remote Staff Site Reliability Engineer - Cloud Infrastructure

$124K–$170K/year

About the Role

We are seeking a Remote Staff Site Reliability Engineer to join our dynamic team at Achievers. In this role, you will play a critical part in managing and advancing our global infrastructure, ensuring that our systems are reliable and scalable. As a Remote Staff Site Reliability Engineer, you will leverage your extensive technical expertise to architect our GCP/GKE environment and lead the integration of AI-driven workflows.

What You'll Do

  • Lead the design and ongoing evolution of our global, high-availability infrastructure on Google Cloud Platform (GCP) and Kubernetes (GKE).
  • Implement AI-integrated workflows, such as Slack or Teams bots for incident triage and automated PR generation.
  • Collaborate with Product, Engineering, and Leadership teams to manage complex changes and define the long-term reliability roadmap.
  • Establish best practices for Terraform and CI/CD pipelines, empowering development teams to deploy code rapidly and securely.
  • Lead initiatives in disaster recovery, multi-region networking, and the design of zero-trust security architectures.
  • Guide design reviews and promote best practices, enhancing the technical skills of the entire SRE organization.

Requirements

  • 15 years of systems engineering experience with in-depth knowledge of Linux kernels, network protocols (TCP/IP, BGP, DNS), and cloud-native architecture.
  • Hands-on experience in architecting and managing production workloads on Google Cloud Platform and GKE.
  • Practical experience or a strong vision for integrating AI tools and LLMs to automate SRE tasks.
  • Advanced skills in Python or Go for developing internal tools and automation frameworks.
  • Expert understanding of observability frameworks (New Relic, Prometheus, Grafana).
  • Deep knowledge of managing relational databases (MySQL, MongoDB) at scale.

Nice to Have

  • Hands-on experience with Service Mesh (Istio) and advanced GCP Networking features.
  • A proven history of migrating legacy automation systems to modern, AI-augmented CI/CD workflows.

What We Offer

  • Competitive salary range of $124,000 - $170,000 based on experience and skills.
  • Health Benefits and Life Insurance Coverage starting on your first day.
  • Parental Leave Top-up and Employer matched RRSP contributions.
  • Flexible Vacation policy to recharge and bring your best self to work.
  • Employee and Family Assistance Program offering mental health, legal, and financial counselling.
  • Supported professional development and career growth opportunities.
  • Hybrid flexibility with time in our Liberty Village, Toronto office.
  • Regular events designed to build connection, belonging, and well-being.
Language Requirements
EnglishC1
BasicIntermediateAdvancedNative
Why This Job8.5 of 10

This role offers a unique opportunity to lead cloud infrastructure initiatives in a supportive and innovative environment. Achievers values its employees and provides numerous benefits.

Salary Range
Required
0/1
Optional
0/1
Bonus
0/1

Who Will Succeed Here

Proficient in Google Cloud Platform and Kubernetes, with hands-on experience in deploying and managing containerized applications in a GKE environment, ensuring high availability and scalability.

Strong proficiency in automation and infrastructure as code using Terraform, along with a deep understanding of CI/CD pipelines to facilitate rapid deployment and integration of applications.

A proactive mindset with a strong focus on observability practices, capable of implementing monitoring solutions using tools like Prometheus and Grafana to ensure system reliability and performance.

Learning Resources

Google Cloud Platform Documentationguide

Career Path

Remote Staff Site Reliability Engineer - Cloud Infrastructure(Now)Lead Site Reliability Engineer(1-2 years)Director of Site Reliability Engineering(3-5 years)

Market Overview

Market Size 2024
$50B
Annual Growth
20.5%
AI Adoption
72%
Investment
+150%
Labour Demand
+30%
Avg Salary
$135K

Skills & Requirements

Required
Google Cloud PlatformKubernetesPython
Growing in Demand
Container SecurityServerless ArchitectureInfrastructure as Code (IaC)
Declining
Traditional Virtualization (e.g., VMware)Legacy Database Systems (e.g., Oracle 11g)

Domain Trends

Increased Adoption of Kubernetes
Kubernetes is expected to dominate container orchestration, with 85% of organizations adopting it by 2025.
Rise of Serverless Computing
Serverless architecture is projected to grow by 25% annually, as organizations seek to reduce overhead and increase scalability.
Focus on Observability
Companies are investing heavily in observability tools, with a 40% increase in demand for professionals skilled in monitoring and troubleshooting cloud environments.

Industry News

Loading latest industry news...

Finding relevant articles from the last 6 months

All job postings are automatically gathered by algorithms. We do not review or verify listings, be careful when applying and do not sign-in with iCloud or Google services.