
Software Engineer, Reliability
Checkr
$100,000 – $150,000 USD
Hybrid
mid
about 5 hours ago
Tech: Python, Go, Ruby, Linux, AWS, Azure, Kubernetes, Docker, Terraform, Datadog, Splunk
Description
What you'll do
- Design, build, ship, and maintain the core observability libraries, tools, and patterns used by all of Checkr’s engineering teams
- Troubleshoot complex production issues across the stack, with respect to performance, availability, and data quality
- Participate in a cross-organization incident response team, driving continuous improvement
- Contribute to architectural discussions within the SRE team and with cross-functional teams
- Influence cross-team projects and the reliability roadmap to enable engineering and help Checkr customers
- Provide consultation and feedback across teams to ensure we are building highly reliable, efficient, and scalable systems
Conditions
- A fast-paced and collaborative environment
- Learning and development allowance
- Competitive cash and equity compensation, and opportunity for advancement
- 100% medical, dental, and vision coverage
- Up to $25K reimbursement for fertility, adoption, and parental planning services
- Flexible PTO policy
- Monthly wellness stipend
- In-office perks such as lunch five times a week, a commuter stipend, and an abundance of snacks and beverages
- A relocation stipend may be available for those willing to relocate to a Checkr hub location
Requirements
- Bachelor’s degree in Computer Science or related field, or equivalent practical experience.
- 2+ years of software engineering experience, including 1+ years focused on reliability, scalability, and efficiency of distributed systems.
- Proficiency in Python (preferred), Go, or Ruby within Linux environments, and strong understanding of microservices, asynchronous systems, and remote APIs.
- Experience developing and operating production, customer-facing systems in AWS or Azure using Kubernetes, Docker, and Terraform.
- Skilled in observability and incident response practices using tools such as Datadog, Splunk, Grafana, Prometheus, and OpenTelemetry, with a focus on continuous improvement.
- Strong collaboration, documentation, and communication skills, with experience leading small projects, promoting platform adoption, and fostering a self-service, product-first mindset.
- An A-player mindset with a strong bias for action: you raise the bar, move with urgency, stay resilient through ambiguity, and take ownership to deliver meaningful outcomes.