Site Reliability Engineer
Overview
Join Offchain Labs as a Site Reliability Engineer and help transform blockchain technology. Work remotely with a competitive salary and be part of a pioneering team in decentralized applications.
At Offchain, we aren’t just building products: we’re leading a movement. As pioneers in blockchain scalability and security, we're at the forefront of transforming how the world interacts with decentralized applications. We're laying the foundation that will define the next generation of digital commerce, governance, and human interaction. This involves tackling the real-world challenges of scaling blockchain technology without compromising on its core principles: decentralization, security, and transparency.
At the center of this vision is our people. Our team is made up of thinkers and doers who embrace new challenges and seek solutions that push existing boundaries. If you’re energized by solving unprecedented problems and believe in the role that decentralized systems will play in creating a more equitable digital future, then we want to hear from you.
What You Will Do
- Operate production Kubernetes clusters and build scalable, declarative infrastructure using Terraform or similar tools.
- Deploy and maintain Kubernetes environments, manage system components, and troubleshoot applications running on the platform.
- Design CI/CD workflows with ArgoCD, GitHub Actions, CodeBuild, or similar tools, covering both infra and app deployments.
- Design and operate observability systems using time-series metrics, logs, and dashboards with tools like Prometheus, Loki, Mimir, Grafana, and CloudWatch.
- Diagnose tough networking and storage issues across complex, distributed systems.
- Implement secure-by-default infrastructure and contribute to architecture reviews and threat models.
- Automate operational workflows using scripting or programming in Python, Go, or Bash.
Perks
- Remote-first global workforce + NY office.
- Annual company offsite + team onsites.
- Professional reimbursement program (covers industry conference attendance, certifications, and more).
- Medical, dental & vision coverage (US + some other countries).
- 401k retirement plan + company match (US only).
- Wellness stipend.
- Home office setup / ergonomic equipment program.
What You've Done
- Eager to dive into blockchain technology, even if it’s new territory.
- Enjoy solving infrastructure problems in unconventional ways and thinking beyond standard patterns.
- Use tools like k9s or ArgoCD for speed and abstraction, but are comfortable dropping into YAML, logs, or low-level debugging when things go sideways.
- Experienced with GitOps-style systems and treating both infrastructure and application delivery as code.
- Have scaled deployment automation using patterns like ArgoCD ApplicationSets or similar tooling.
- Curious about how things work under the hood and not satisfied with surface-level fixes.
- Comfortable in Linux, fluent in shell scripting, and productive in languages like Python or Go.
- Comfortable operating within a cloud platform (e.g., AWS, GCP, Azure), with a strong understanding of the underlying components, making it easy to adapt to or migrate across providers.
- Have participated in an on-call rotation, responding to incidents, troubleshooting under pressure, and driving postmortems to improve system reliability over time.
- Design systems with security in mind, applying principles like least privilege and threat modeling.
- Bring a strong technical foundation, excellent problem-solving skills, and a genuine commitment to high-quality work.
- Take ownership, collaborate openly, and contribute to a culture of clarity, curiosity, and continuous improvement.