Site Reliability Engineer
8.0/10
Alpaca
$98,000 - $162,000 USD
Remote
mid
6 days ago
crypto, dev, fintech, tech, web3, Incident Response, SRE, Linux, Python, Kubernetes, Cloud Networking, PostgreSQL
Description
Your Role
As a Site Reliability Engineer at Alpaca, you'll help keep our brokerage platform reliable, observable, and operable as we grow - working across our cloud infrastructure, Kubernetes platform, observability stack, messaging layer, and data layer.
Things You Get To Do
- Operate production day-to-day - on-call, incident response, postmortems, and the follow-ups that actually close the loop.
- Own reliability practice - define and refine SLIs/SLOs and error budgets, and help product teams live within them.
- Strengthen our observability across metrics, logs, traces, and alerting.
- Ship infrastructure through code in a GitOps workflow - cloud resources and Kubernetes workloads alike.
- Look after PostgreSQL: performance tuning, schema and migration review, online migrations on large tables, HA/DR, and CDC pipelines.
- Mentor engineers on reliability and database fundamentals through code review, design review, and pairing.
How We Take Care of You
- Competitive Salary & Stock Options
- Health Benefits
- New Hire Home-Office Setup: One-time USD $500
- Monthly Stipend: USD $150 per month via a Brex Card
Alpaca is proud to be an equal opportunity workplace dedicated to pursuing and hiring a diverse workforce.
Requirements
Who You Are (must-haves)
- 4+ years in SRE, DevOps, Platform/Infrastructure, or backend engineering with significant production operations ownership.
- Hands-on experience operating production services on Kubernetes, and shipping infrastructure as code in a GitOps workflow.
- Solid working knowledge of PostgreSQL in production: query plans, pg_stat_*, indexing and schema trade-offs, and what a safe online migration looks like on a non-trivial table.
- Cloud networking fundamentals (VPCs, routing, L4/L7 load balancing, DNS, TLS) and comfort debugging cross-service connectivity.
- Comfortable with a modern observability stack and proficient with Linux at the operator level.
- Practiced in incident response - calm under pressure, structured debugging, postmortems that drive change.
- At least working proficiency in Go or Python, plus strong written and verbal communication.
- Genuine interest in databases and in growing your PostgreSQL/DBA expertise.