Apicworld

Site Reliability Engineer (SRE)

6.0/10

Apicworld

Not specified
Remote
mid
about 2 months ago
devtechLinuxPostgreSQLMongoDBClickhouseRedisRabbitMQKafkaGitLabVictoriaMetricsPrometheus

AI Summary

The vacancy provides clear responsibilities and tech stack but lacks compensation details and company information.

Check Match β€” Just drop your CV

See your fit for Site Reliability Engineer (SRE) in seconds.

Description

What you'll do

  • β€’Ensure the stability of production and development infrastructure
  • β€’Develop and improve monitoring, alerting, and observability (metrics, logs, tracing)
  • β€’Configure and optimize metrics and logging systems
  • β€’Analyze incidents and prevent their recurrence
  • β€’Work with alerts and improve their quality
  • β€’Increase service reliability and fault tolerance
  • β€’Optimize system performance and stability

Conditions

  • β€’Remote work or from our office in Limassol
  • β€’Compensation for English or Greek classes
  • β€’Health insurance (only for Cyprus)
  • β€’Office lunches (only for Cyprus)
  • β€’Flexible start of the working day

Requirements

  • β€’Strong understanding of Linux
  • β€’Experience as an SRE / DevOps / System Engineer
  • β€’Solid experience with monitoring and alerting tools (Prometheus, Grafana, or similar)
  • β€’Understanding of observability (metrics, logs, tracing)
  • β€’Experience with Kubernetes and containerization
  • β€’Experience in incident analysis and production troubleshooting
  • β€’Automation skills (Bash, Python)
  • β€’Understanding of networking, performance, and fault tolerance
  • β€’Experience with GCP is a plus
Loading similar jobs...