Senior Site Reliability Data Engineer - Remote Opportunity
About the Role
We are seeking a Senior Site Reliability Data Engineer to join our team at Workable, where you will be responsible for the reliability, scalability, and performance of our data and cloud infrastructure. This remote position gives you the chance to make a significant impact on the operational excellence and growth of our data platform.
What You'll Do
- Design, build, and maintain core data engineering infrastructure, including ETL/ELT pipelines, Apache Spark workloads, and data warehouse systems.
- Ensure availability, scalability, and performance of data infrastructure and pipelines with deep operational ownership.
- Implement and maintain scalable reliability tooling and automation to streamline deployment, monitoring, and incident response across distributed services.
- Operate and optimize Kubernetes-based cloud infrastructure to ensure high availability, performance, and cost-efficiency.
- Collaborate cross-functionally with developers and analysts to design, release, and troubleshoot production systems, contributing data engineering expertise throughout.
- Lead cross-functional projects with development teams to improve system reliability, automate capacity planning, and enforce SRE best practices.
- Develop and maintain centralized observability, including logging, metrics, tracing, and alerting pipelines; continuously improve incident detection and response workflows.
- Own observability for data pipelines (freshness, completeness, latency, error rates) and ensure SLOs are met.
Requirements
- BS or MS degree in Computer Science or Engineering, or equivalent proven experience.
- 5+ years of relevant professional experience, including programming.
- Strong troubleshooting and analytical skills for large-scale distributed data systems.
- Hands-on experience with Apache Spark or similar distributed data processing frameworks.
- Experience designing and operating ETL/ELT pipelines at scale using tools such as Airflow or dbt.
- Experience with cloud data warehouses (BigQuery, Redshift, or Snowflake).
- Experience with a major cloud provider (GCP or AWS).
- Experience with infrastructure automation tools (Terraform or Ansible).
- Excellent written communication skills in English.
Nice to Have
- Production experience with Kubernetes.
- Experience with streaming systems (Kafka, Flink, Spark Streaming).
- Familiarity with relational and NoSQL databases.
What We Offer
- Comprehensive health coverage, including dependents.
- Flexible work model with a hybrid setup.
- Access to top-tier tools and Apple gear.
- Mobile data plan for connectivity.
- Relocation bonus to help you settle in Athens.
This Senior Site Reliability Data Engineer role at Workable offers the opportunity to work remotely while contributing to a team focused on data infrastructure. With a competitive salary and comprehensive benefits, it is well suited to experienced engineers looking to make an impact.