Vice President Site Reliability Engineering (Data Centers)

9.0/10

Galaxydigitalservices

$120,000 – $200,000 USD

Remote

mid

10 days ago

cryptotechObservability ToolsPackerLinuxPythonCI/CDAWSAnsibleTeam LeadershipTerraformInfrastructure Automation

AI Summary

The vacancy is well-structured, providing clear expectations and a competitive salary, making it attractive to applicants.

Check Match — Just drop your CV

See your fit for Vice President Site Reliability Engineering (Data Centers) in seconds.

Description

Responsibilities

•**Automation Platform Leadership**: Oversee a specialized SRE team focused on the design, deployment, and maintenance of automation toolsets as well as the systems they interact with.
•**Infrastructure as Code (IaC) Governance**: Establish and enforce standards for IaC to ensure consistent, repeatable, and secure deployments across an entire infrastructure ecosystem. Strong proficiency in Terraform is required.
•**Configuration Management**: Lead the strategy for automated configuration and state management, ensuring Ansible playbooks and Packer image pipelines are optimized for both Windows, Linux, and ESXi Platforms.
•**Monitoring & Observability**: Manage the monitoring and health of the automation platforms themselves. Implement SLIs/SLOs to ensure the "tools that build the servers" are highly available and performant.
•**Lifecycle Management**: Drive the automated lifecycle of both physical and virtual assets, from initial template creation/deployment to automated patching, scaling, and decommissioning.
•**Custom Tooling & Scripting**: Lead the development of custom scripts and internal providers (Python, Go, PowerShell, Bash) to provide better insights and tooling for our systems.
•**Collaboration**: Outside of the automation team you will need to be able to collaborate and foster workflows alongside the rest of the Datacenter team and be able to facilitate needs for the team as a whole.
•**Capacity & Performance**: Analyze system behavior and resource utilization in virtual environments to optimize the performance of automated deployments.
•**Mentorship & Growth**: Provide technical guidance and career mentorship to SREs, fostering a culture of "automate-first" and continuous improvement.

Conditions

•Competitive salary range of $120K-$200K.
•Remote work environment.
•Opportunities for career mentorship and growth.

Requirements

•6-10 years’ experience in Infrastructure, SRE or DevOps, specifically focused on infrastructure automation at scale.
•Deep proficiency with Terraform (providers, modules, state management) and Ansible (roles, playbooks, Tower/AWX).
•Hands-on experience with Image Creation (i.e. Packer, Ansible, SCCM) to build standardized, hardened images for both Windows and Linux in hybrid environments.
•Strong experience managing and automating virtual platforms such as VMware (vSphere/vCenter) as well as Cloud providers such as Azure and AWS.
•High-level scripting skills in mediums such as Python, Go, PowerShell, and Bash.
•Experience with observability tools (Splunk, ELK, Prometheus, or Grafana) to monitor infrastructure health and automation telemetry.
•Good understanding of Network topology and design as well as experience with platforms such as Juniper Networks or Palo Alto.
•Strong mastery of Git (branching strategies, PR workflows) and CI/CD platforms (Jenkins, GitLab CI, or GitHub Actions).
•Equal comfort managing, troubleshooting, and tuning performance for both Windows Server and Linux.

Company Info

Galaxydigitalservices

FinTech

Galaxy Digital is a technology-driven financial services and investment management firm focused on digital assets, blockchain, and AI infrastructure, offering trading, asset management, principal investments, investment banking, mining, and data center solutions to institutions, startups, and investors.[1][2]

New York City, United States

200-1000 employees

Founded 2016

Website

linkedin instagram

AI Quality Score

8.7

out of 10

Tasks & KPI clarity9.0

Responsibilities are detailed with measurable outcomes like SLIs/SLOs and specific tasks.

Compensation clarity8.0

Salary range of $120K-$200K is clearly stated, but payment terms are not specified.

Stack & processes9.0

The tech stack and operational processes are well-defined, covering various tools and technologies.

Requirement logic9.0

Required skills and experience align well with the seniority level expected for the role.

Company profile9.0

Company verified via external enrichment: Marketing.

Loading similar jobs...