Founding Platform & Reliability Engineer - AI-Driven Infrastructure
About the Role
We’re hiring a Founding Platform & Reliability Engineer to join our innovative team at OpenArt. This role offers the unique opportunity to shape the future of AI-driven creative tools while working remotely. As a Founding Platform & Reliability Engineer, you will be instrumental in designing, scaling, and ensuring the reliability of our infrastructure stack, impacting millions of users worldwide.
What You'll Do
- Define and operationalize SLOs/SLIs across critical user journeys, driving prioritization and error budgets.
- Participate in an on-call rotation and lead incident response improvements, ensuring high-quality alerts and effective escalation paths.
- Implement reliability patterns at external boundaries and establish health measurement mechanisms for vendors.
- Build end-to-end observability with structured logs, metrics, and dashboards to quickly diagnose issues.
- Develop deploy safety practices, including automated rollbacks and reliable CI/CD gates.
- Guide the direction of our infrastructure architecture, balancing serverless and containerized approaches as we scale.
- Establish cost observability and control mechanisms, including caching strategies and budget alerts.
- Act as a senior technical voice, influencing architecture and engineering best practices across the team.
Requirements
- 5+ years of experience building and operating production systems focused on reliability and scaling.
- Strong software engineering skills with the ability to ship production code.
- Cloud-native experience with AWS or GCP, particularly in serverless and event-driven systems.
- Deep knowledge of observability practices, including dashboards and incident response maturity.
- Ability to design resilient interactions with external dependencies and communicate effectively with non-infra peers.
- Comfortable operating with ambiguity and defining problems before solving them.
Nice to Have
- Experience designing internal platform abstractions that enable product teams to ship faster.
- Proven track record of shipping concrete reliability outcomes, such as reduced MTTR or improved SLO attainment.
- Prior startup experience or experience owning large surface-area features.
What We Offer
- Competitive base salary and bonus program.
- Equity - meaningful ownership in what you build.
- High autonomy in a high growth environment.
- Visa sponsorship available for international candidates.
- Flexible work setup with remote options.
This role offers a unique opportunity to shape the future of AI-driven creative tools while enjoying competitive compensation and equity.
Generating success profile...
Analyzing job requirements and market data
Loading market overview...
Analyzing market trends and skill demands
Industry News
Loading latest industry news...
Finding relevant articles from the last 6 months