Director of Site Reliability Engineering
Stellar Development Foundation
$210,000 – $310,000 USD
Remote
lead
about 6 hours ago
crypto, web3, Site Reliability Engineering, Infrastructure, Operations, Kubernetes, Terraform, Ansible, Puppet
Description
What you'll do
In this role, you will:
- Establish a clear vision and mandate for the Site Reliability Engineering team
- Define the SRE team's quarterly OKRs to best align with the company's goals
- Define how SREs and development teams collaborate throughout the software development lifecycle
- Define a career growth path for the SRE team, and coach and mentor individual contributors on the team
- Define and track metrics across engineering and help hold engineering teams accountable for their KPIs
- Coordinate priorities with other teams and areas of the organization
- Participate in sprint planning and execution, track progress, and oversee day-to-day tactical decisions
- Design and build reliable systems and infrastructure that software engineers find easy to use
- Monitor and troubleshoot systems in production
- Define and participate in 24/7 on-call rotations alongside the team
- Mediate technical discussions and review PRs
- Jump in as needed with code fixes, troubleshooting, and hands-on contributions
- Collaborate across the Stellar ecosystem, engaging with key partners and advising on their integration to set them up for success
Conditions
We offer competitive pay with a base salary range for this position of $210,000 - $310,000 depending on job-related knowledge, skills, experience, and location. In addition, we offer lumen-denominated grants along with the following perks and benefits:
- Competitive health, dental & vision coverage with most plans covered at 100% for the employee + any dependents
- Flexible time off + 15 company holidays including a company-wide holiday break
- Generous paid parental leave for all parents, plus paid pregnancy disability leave for birthing parents
- Gym reimbursement ($80 per month)
- Life & AD&D (up to $50K)
- Short- & long-term disability
- 401(k) with 4% match
- Health & Dependent Care FSA accounts
- Commuter benefits with $250/month employer contribution
- Health Savings Account (HSA) with monthly employer contribution
- Family building benefits through Kindbody
- Wellbeing benefits (One Medical, Rightway, Headspace)
- L&D budget of $1,500/year
- Daily lunch and snacks in office
- Company retreats
Requirements
You have:
- 3+ years of experience working as a Site Reliability Engineer
- 3+ years of experience managing an SRE team
- Site Reliability Engineering experience:
  - Strong track record of collaborating with dev teams at all stages of product development (design, development/CI, beta testing, production)
  - Strong track record of collaborating on defining, measuring, and driving improvements in KPIs
  - Strong track record of assisting teams during root cause analysis and postmortems
- Infrastructure and Operations experience:
  - Designing and building out the infrastructure for large distributed systems
  - Maintaining highly available infrastructure
  - Troubleshooting and understanding complex technical problems
  - Using configuration management or IaC tooling such as Terraform, Ansible, or Puppet
  - Building and maintaining infrastructure using Kubernetes
- Highly autonomous; able to find clarity in ambiguous circumstances
- Excellent communicator; comfortable working with remote team members