Senior Site Reliability Engineer, Core AI Infrastructure
9.0/10
$112,000 β $188,000 USD2.0% below market
Remote
senior
4 days ago
aicryptotechweb3AWSTerraformAnsibleChefPuppetSaltDockerKubernetes
AI SummaryVerified by Aipplify AI
The vacancy is well-structured and informative, providing clarity on tasks, compensation, and company profile.
AI quality score8.7 / 10
Check Match β Just drop your CV
See your fit for Senior Site Reliability Engineer, Core AI Infrastructure in seconds.
Overview
Join Coinbase as a Senior Site Reliability Engineer to drive AI transformation and ensure the reliability of critical AI infrastructure in a fast-paced environment. Coinbase is a remote-first, but not remote-only company. Expect to get together quarterly for intense in-person working sessions called βsurges.β You'll join a high-performing team of engineers driving AI transformation at Coinbase as a Senior Site Reliability Engineer on the IT Operations team.
What youβll be doing
- β’Own the reliability, monitoring, and incident response lifecycle for AI infrastructure services, including on-call support for AWS deployment pipelines, root cause analysis, and blameless retros.
- β’Build automation and tooling to streamline operational IT workflows, eliminate manual tasks, and improve deployment velocity across CI/CD frameworks and Kubernetes environments.
- β’Partner with the Coinbase Infrastructure team to extend CI/CD frameworks supporting IT services and enterprise network platforms, and with Security and Compliance to integrate surveillance tooling into deployment pipelines.
- β’Strengthen observability and documentation standards across IT engineering by defining metrics, implementing monitoring solutions, and maintaining technical documentation that sets a standard of excellence.
- β’Develop full-stack applications that power internal AI products and infrastructure with Go or Python.
What we offer
- β’Base salary varies by location (see range below). Total compensation may also include equity and bonus eligibility, and benefits (medical, dental, vision, 401(k)).
- β’Annual base salary range (excluding equity and bonus): $186,065 β $218,900 USD.
What we look for in you
- β’5+ years of experience automating and supporting cloud infrastructure (AWS) and network environments, with hands-on use of infrastructure-as-code tools (Terraform, Ansible, Chef, Puppet, or Salt).
- β’Proven experience deploying, managing, and troubleshooting containerized workloads using Docker and Kubernetes in production environments.
- β’Proficiency in at least one scripting or programming language (Python, Bash, Ruby, or Go) and version control workflows using Git-based CI/CD pipelines.
- β’Track record of leading incident response in environments with strict SLAs, including root cause analysis, blameless retros, and measurable reliability improvements.
- β’Utilizes generative AI responsibly, maintaining human oversight to deliver business-ready outputs and drive measurable improvements in workflow efficiency, cost, and quality.
Skills
AWSTerraformAnsibleChefPuppetSaltDockerKubernetesPythonBashRubyGoGit
Loading similar jobs...