About the Role

Wormhole Foundation is on a mission to empower passionate individuals in the research and development of blockchain interoperability technologies. We are currently seeking a Remote Site Reliability Engineer to enhance the reliability, security, and operational excellence of our production infrastructure. As a Remote Site Reliability Engineer, you will play a crucial role in ensuring that our services maintain a minimum uptime of 99.99%, excluding scheduled maintenance windows.

What You'll Do

  • Act as the first responder and incident commander during production incidents.
  • Lead incident triage, root cause analysis, and retrospective documentation.
  • Build detailed incident timelines and preventative runbooks.
  • Respond to incidents related to performance issues, CCQ failures, degraded throughput, observability pipeline outages, and core Wormhole products.
  • Deliver remediation recommendations and implement approved fixes.
  • Improve reliability and uptime across all Wormhole services.
  • Strengthen observability, monitoring, and alerting systems.
  • Harden infrastructure for security and operational resiliency.
  • Enhance deployment workflows and reduce operational friction.
  • Support operational tooling used by engineering, DevOps, and validator partners.

Requirements

  • Relevant tertiary qualifications in computer science or a closely related field (Bachelor's/Master's) and/or relevant work experience.
  • Experience in incident response and operational excellence.
  • Strong understanding of blockchain technologies and networking services.
  • Proficiency in observability and monitoring tools.
  • Ability to work collaboratively with engineering and DevOps teams.

Nice to Have

  • Experience with cloud infrastructure (AWS, Azure, etc.).
  • Familiarity with container orchestration (Kubernetes, Docker).
  • Knowledge of security best practices in cloud environments.

What We Offer

  • Competitive salary ranging from $120,000 to $150,000 per year.
  • Fully remote work environment.
  • Opportunities for professional growth and development.
  • Collaborative and innovative team culture.
  • Flexible working hours to accommodate different time zones.
Why This Job8.5 of 10

This Remote Site Reliability Engineer position at Wormhole Foundation offers a unique opportunity to work on cutting-edge blockchain technologies while ensuring high service reliability. With a competitive salary and a fully remote work environment, this role is ideal for tech enthusiasts passionate about blockchain.

Salary Range
Required
0/1
Optional
0/1
Bonus
0/1

Who Will Succeed Here

Strong proficiency in cloud infrastructure technologies such as AWS or Google Cloud, with hands-on experience in setting up monitoring tools like Prometheus and Grafana to ensure observability in blockchain applications.

Ability to work autonomously in a remote setting, demonstrating self-discipline and proactive problem-solving skills, particularly in incident response situations that require quick decision-making and collaboration with distributed teams.

Deep understanding of security best practices in blockchain environments, including familiarity with cryptographic principles and frameworks that safeguard decentralized applications, ensuring the integrity and availability of services.

Learning Resources

Incident Response and Handlingcourse

Career Path

Remote Site Reliability Engineer - Blockchain Focus(Now)SRE Team Lead or Blockchain Systems Architect(2-4 years)Director of Site Reliability Engineering or Chief Security Officer(5-7 years)

Market Overview

Market Size 2024
$8.5B
Annual Growth
12.3%
AI Adoption in Incident Response
35%
Investment in Blockchain Security
+150%
Labour Demand for SREs
+22%
Avg Salary for SREs with Blockchain Skills
$135K

Skills & Requirements

Required
Incident ResponseObservabilityMonitoring Tools
Growing in Demand
KubernetesDevSecOpsAutomated Incident Response
Declining
Traditional ITIL Incident ManagementBasic Monitoring Tools (e.g., Nagios)

Domain Trends

Rise of AI in Incident Response
AI-driven tools are increasingly used for automated threat detection, with 40% of organizations adopting AI solutions by 2025.
Blockchain Integration in Incident Management
Over 30% of companies are integrating blockchain for enhanced security in incident response, improving data integrity and transparency.
Shift to Cloud-Native Incident Response
Cloud-native incident response strategies are gaining traction, with a projected 25% increase in adoption among enterprises by 2025.

Industry News

Loading latest industry news...

Finding relevant articles from the last 6 months

All job postings are automatically gathered by algorithms. We do not review or verify listings, be careful when applying and do not sign-in with iCloud or Google services.