Alchemy

Cloud Infrastructure Engineer

9.0/10

Alchemy

$135,000 – $240,000 USD
Remote
senior
about 1 month ago
cryptodevweb3KubernetesTerraformAWSGCPPrometheusGrafanaOpenTelemetryHelmGitOps

AI Summary

The vacancy is well-structured with clear responsibilities, compensation, and requirements, but could improve by providing direct company links.

Check Match β€” Just drop your CV

See your fit for Cloud Infrastructure Engineer in seconds.

Description

What you'll do

  • β€’Architect and operate scalable, self-healing infrastructure leveraging Kubernetes, Terraform, and cloud-native tools across multi-region deployments.
  • β€’Drive AI enablement across engineering β€” ensuring repos, tooling, and workflows are optimized for agentic development with tools like Claude Code, Cursor, and Codex.
  • β€’Build AI-powered infrastructure tooling and automation (e.g., automated K8s upgrades, IaC plan analysis, cost optimization advisors, MCP servers, n8n workflows).
  • β€’Build and maintain internal developer platform (IDP) capabilities for self-service deployments, observability, and reliability.
  • β€’Develop observability frameworks using Prometheus and Grafana for metrics, dashboards, and alerting.
  • β€’Lead incident management with blameless post-mortems; define and enforce SLIs, SLOs, and error budgets across services.
  • β€’Design and manage multi-cloud, multi-region network architecture β€” VPC design, IPAM, DNS (Cloudflare), cross-cloud connectivity, security groups, and edge-proxy/istio gateway configuration.
  • β€’Collaborate with security teams to embed compliance into infrastructure, including IaC scanning and runtime protection.
  • β€’Provide technical leadership and mentorship to elevate the team’s operational capabilities.

Conditions

  • β€’Medical, Dental, & Vision
  • β€’Gym Reimbursement
  • β€’Home Office Build-out Budget
  • β€’In-Office Group Meals
  • β€’Wellbeing & Mental Health Perks
  • β€’Learning & Development Stipend
  • β€’Company Sponsored Conferences & Events
  • β€’HSA and FSA Plans
  • β€’Fertility Benefits
  • β€’Competitive compensation, including base salary as well as equity
  • β€’Comprehensive medical, dental, and vision coverage
  • β€’401k and unlimited flexible time off

Requirements

  • β€’5+ years as an Infrastructure Engineer focused on reliability (SRE, Production Engineer, Platform Engineer).
  • β€’Experience driving company-wide reliability efforts, including SLO frameworks and error budget policies.
  • β€’Strong proficiency with observability stacks: OpenTelemetry, Prometheus/Grafana.
  • β€’Deep experience with cloud infrastructure (AWS/GCP), Kubernetes, and multi-region architectures.
  • β€’Skilled with Terraform, Helm, and GitOps workflows (e.g., ArgoCD) with an automation-first mindset.
  • β€’Experience leveraging agentic development tools (Claude Code, Cursor, Codex) and workflow automation (n8n) to accelerate IaC and build internal tooling is a strong plus.
  • β€’Solid networking fundamentals β€” VPC design, DNS, IPAM, security groups, cross-cloud connectivity, and service mesh (e.g., Istio) experience is a plus.
  • β€’Calm and effective incident responder with a focus on systemic improvement.
  • β€’Strong cross-functional communicator across SRE, security, and product engineering.
  • β€’Blockchain infrastructure, distributed systems, or high-throughput RPC experience β€” not required but a plus.
Loading similar jobs...