
Site Reliability Engineer (SRE)
6.0/10
Apicworld
Not specified
Remote
mid
about 2 months ago
devtechLinuxPostgreSQLMongoDBClickhouseRedisRabbitMQKafkaGitLabVictoriaMetricsPrometheus
AI Summary
The vacancy provides clear responsibilities and tech stack but lacks compensation details and company information.
Check Match β Just drop your CV
See your fit for Site Reliability Engineer (SRE) in seconds.
Description
What you'll do
- β’Ensure the stability of production and development infrastructure
- β’Develop and improve monitoring, alerting, and observability (metrics, logs, tracing)
- β’Configure and optimize metrics and logging systems
- β’Analyze incidents and prevent their recurrence
- β’Work with alerts and improve their quality
- β’Increase service reliability and fault tolerance
- β’Optimize system performance and stability
Conditions
- β’Remote work or from our office in Limassol
- β’Compensation for English or Greek classes
- β’Health insurance (only for Cyprus)
- β’Office lunches (only for Cyprus)
- β’Flexible start of the working day
Requirements
- β’Strong understanding of Linux
- β’Experience as an SRE / DevOps / System Engineer
- β’Solid experience with monitoring and alerting tools (Prometheus, Grafana, or similar)
- β’Understanding of observability (metrics, logs, tracing)
- β’Experience with Kubernetes and containerization
- β’Experience in incident analysis and production troubleshooting
- β’Automation skills (Bash, Python)
- β’Understanding of networking, performance, and fault tolerance
- β’Experience with GCP is a plus
Loading similar jobs...