Stellar Development Foundation

Director of Site Reliability Engineering

$210,000 – $310,000 USD
Remote
lead
about 6 hours ago
crypto, web3, Site Reliability Engineering, Infrastructure, Operations, Kubernetes, Terraform, Ansible, Puppet

Description

What you'll do

In this role, you will:

  • Establish a clear vision and mandate for the Site Reliability Engineering team
  • Define the SRE team's quarterly OKRs to best align with the company's goals
  • Define processes of collaboration between SREs and development teams throughout the software development lifecycle
  • Define a career growth path for the SRE team, as well as coach and mentor individual contributors on the team
  • Define and track metrics across engineering and help hold engineering teams accountable for their KPIs
  • Coordinate priorities with other teams and areas of the organization
  • Participate in sprint planning and execution, track progress and oversee day-to-day tactical decisions
  • Design and build reliable systems and infrastructure that software engineers can use easily
  • Monitor and troubleshoot systems in production
  • Define and participate in 24/7 on-call rotations alongside the team
  • Mediate technical discussions and review PRs
  • Jump in as needed with code fixes, troubleshooting and hands-on contributions
  • Collaborate across the Stellar ecosystem, engaging with key partners and advising on their integration to set them up for success

Conditions

We offer competitive pay with a base salary range for this position of $210,000 - $310,000 depending on job-related knowledge, skills, experience, and location. In addition, we offer lumen-denominated grants along with the following perks and benefits:

  • Competitive health, dental & vision coverage with most plans covered at 100% for the employee + any dependents
  • Flexible time off + 15 company holidays including a company-wide holiday break
  • Generous paid parental leave for all parents, plus paid pregnancy disability leave for birthing parents
  • Gym reimbursement ($80 per month)
  • Life & AD&D insurance (up to $50K)
  • Short- & long-term disability
  • 401(k) with 4% match
  • Health & Dependent Care FSAs
  • Commuter benefits with $250/month employer contribution
  • Health Savings Account (HSA) with monthly employer contribution
  • Family building benefits through Kindbody
  • Wellbeing benefits (One Medical, Rightway, Headspace)
  • L&D budget of $1,500/year
  • Daily lunch and snacks in office
  • Company retreats

Requirements

You have:

  • 3+ years of experience working as a Site Reliability Engineer
  • 3+ years of experience managing an SRE team
  • Site Reliability Engineering experience:
      • Strong track record of collaborating with dev teams at all stages of product development (design, development/CI, beta testing, production)
      • Strong track record of collaborating on defining, measuring and driving improvements in KPIs
      • Strong track record of assisting teams during root cause analysis and postmortems
  • Infrastructure and Operations experience:
      • Designing and building out the infrastructure for large distributed systems
      • Maintaining highly available infrastructure
      • Troubleshooting and understanding complex technical problems
      • Using configuration management or IaC tooling such as Terraform, Ansible, Puppet
      • Building and maintaining infrastructure using Kubernetes
  • Highly autonomous; able to find clarity in ambiguous circumstances
  • Excellent communicator; comfortable working with remote team members