OKX

DevOps / Site Reliability Engineer

Not specified
Office / on-site
mid
about 3 hours ago
crypto, dev, web3, Python, Go, Java, React, Vue, GitLab, Nexus, Sonar, Alibaba Cloud

Overview

Join OKX as a DevOps Engineer to build and maintain core infrastructure for AIOps, optimize R&D infrastructure, and support cloud security operations at a leading crypto exchange.

At OKX, we believe that the future will be reshaped by crypto, and that crypto will ultimately contribute to every individual's freedom. OKX is a leading crypto exchange and the developer of OKX Wallet, giving millions access to crypto trading and decentralized crypto applications (dApps). OKX is also a brand trusted by hundreds of large institutions seeking access to crypto markets. We are safe and reliable, backed by our Proof of Reserves.

Across our multiple offices globally, we are united by our core principles: *We Before Me*, *Do the Right Thing*, and *Get Things Done*. These shared values drive our culture, shape our processes, and foster a friendly, rewarding, and diverse environment for every OK-er.

OKX is part of OKG, a group that brings the value of blockchain to users around the world through our leading products OKX, OKX Wallet, OKLink, and more.

What You’ll Be Doing

  • Build and maintain the core infrastructure of the AIOps platform, including the unified monitoring & alerting system and the FinOps cost observability platform.
  • Maintain and continuously optimize internal R&D infrastructure (GitLab, Nexus, Sonar, etc.).
  • Manage monitoring data collection, alert governance, and cost data visualization across multi-cloud environments (Alibaba Cloud / AWS).
  • Support cloud security operations, including cloud security alert management and compliance auditing.

Perks & Benefits

  • Competitive total compensation package
  • L&D programs and education subsidy for employees' growth and development
  • Various team building programs and company events
  • Wellness and meal allowances
  • Comprehensive healthcare schemes for employees and dependants
  • More that we would love to tell you about along the way!

What We Look For In You

  • 3+ years of DevOps or SRE experience; experience with AIOps or observability platform development is a plus.
  • Proficient in Python; familiar with at least one of Go or Java. Full-stack capability (React/Vue frontend + backend API) is a plus.
  • Hands-on experience with at least one major cloud platform (Alibaba Cloud or AWS); familiar with cloud monitoring products (CloudWatch / Alibaba Cloud CloudMonitor) and cost management tools.
  • Familiar with monitoring and logging stacks such as Prometheus, Grafana, and ELK.
  • Experience maintaining and optimizing CI/CD toolchains (GitLab CI, Nexus, container registries).
  • Experience with AI/LLM application development (e.g., LLM API integration, RAG, Agent frameworks) is a plus.
  • Good written and verbal English communication skills.

Skills

Python, Go, Java, React, Vue, GitLab, Nexus, Sonar, Alibaba Cloud, AWS, Prometheus, Grafana, ELK