Cloud Infrastructure Engineer
9.0/10
Alchemy
$135,000 β $240,000 USD
Remote
senior
about 1 month ago
cryptodevweb3KubernetesTerraformAWSGCPPrometheusGrafanaOpenTelemetryHelmGitOps
AI Summary
The vacancy is well-structured with clear responsibilities, compensation, and requirements, but could improve by providing direct company links.
Check Match β Just drop your CV
See your fit for Cloud Infrastructure Engineer in seconds.
Description
What you'll do
- β’Architect and operate scalable, self-healing infrastructure leveraging Kubernetes, Terraform, and cloud-native tools across multi-region deployments.
- β’Drive AI enablement across engineering β ensuring repos, tooling, and workflows are optimized for agentic development with tools like Claude Code, Cursor, and Codex.
- β’Build AI-powered infrastructure tooling and automation (e.g., automated K8s upgrades, IaC plan analysis, cost optimization advisors, MCP servers, n8n workflows).
- β’Build and maintain internal developer platform (IDP) capabilities for self-service deployments, observability, and reliability.
- β’Develop observability frameworks using Prometheus and Grafana for metrics, dashboards, and alerting.
- β’Lead incident management with blameless post-mortems; define and enforce SLIs, SLOs, and error budgets across services.
- β’Design and manage multi-cloud, multi-region network architecture β VPC design, IPAM, DNS (Cloudflare), cross-cloud connectivity, security groups, and edge-proxy/istio gateway configuration.
- β’Collaborate with security teams to embed compliance into infrastructure, including IaC scanning and runtime protection.
- β’Provide technical leadership and mentorship to elevate the teamβs operational capabilities.
Conditions
- β’Medical, Dental, & Vision
- β’Gym Reimbursement
- β’Home Office Build-out Budget
- β’In-Office Group Meals
- β’Wellbeing & Mental Health Perks
- β’Learning & Development Stipend
- β’Company Sponsored Conferences & Events
- β’HSA and FSA Plans
- β’Fertility Benefits
- β’Competitive compensation, including base salary as well as equity
- β’Comprehensive medical, dental, and vision coverage
- β’401k and unlimited flexible time off
Requirements
- β’5+ years as an Infrastructure Engineer focused on reliability (SRE, Production Engineer, Platform Engineer).
- β’Experience driving company-wide reliability efforts, including SLO frameworks and error budget policies.
- β’Strong proficiency with observability stacks: OpenTelemetry, Prometheus/Grafana.
- β’Deep experience with cloud infrastructure (AWS/GCP), Kubernetes, and multi-region architectures.
- β’Skilled with Terraform, Helm, and GitOps workflows (e.g., ArgoCD) with an automation-first mindset.
- β’Experience leveraging agentic development tools (Claude Code, Cursor, Codex) and workflow automation (n8n) to accelerate IaC and build internal tooling is a strong plus.
- β’Solid networking fundamentals β VPC design, DNS, IPAM, security groups, cross-cloud connectivity, and service mesh (e.g., Istio) experience is a plus.
- β’Calm and effective incident responder with a focus on systemic improvement.
- β’Strong cross-functional communicator across SRE, security, and product engineering.
- β’Blockchain infrastructure, distributed systems, or high-throughput RPC experience β not required but a plus.
Loading similar jobs...