Space International is hiring an Observability Engineer


Job Title: Observability Engineer
Company: Space International
Location: Tbilisi, Georgia
Job Description:
Build and operate highly available Kubernetes platforms (EKS/EKS-Anywhere on vSphere) and supporting AWS networking, storage, and DNS (Route 53);
Implement Infrastructure-as-Code and GitOps (Terraform, Helm, Argo CD/Flux) for repeatable, audited changes;
Design and maintain CI/CD pipelines (GitHub Actions/GitLab CI) with progressive delivery and policy controls;
Own observability (Prometheus, Grafana, Loki, alerting), SLOs/error budgets, incident response and post-mortems;
Ensure platform security and compliance (secrets management, image scanning, RBAC/IAM, PCI-DSS/ISO 27001 controls);
Plan and execute backup/restore, disaster recovery, and multi-DC failover; manage capacity and cost optimization;
Operate ingress/gateway and L4/L7 load balancing (Emissary/Traefik/Nginx), CNI (Cilium/Calico), and GSLB/DNS (dnsdist/PowerDNS/Route 53);
Support core data/messaging platforms (PostgreSQL, Kafka, Redis) with HA and performance tuning.

Qualifications:
5+ years in DevOps/SRE/Platform Engineering running production systems at scale;
Strong Kubernetes expertise (cluster lifecycle, upgrades, scaling, zero-downtime deploys);
Solid AWS knowledge (VPC, IAM, ALB/NLB, Route 53, S3) and Linux fundamentals;
Infrastructure-as-Code (Terraform/Terragrunt), GitOps (Argo CD or Flux), Helm/Helmfile;
CI/CD design and automation (GitHub Actions/GitLab CI), artifact management, release strategies;
Observability stack (Prometheus/Grafana/Loki), on-call experience, incident management and SLOs;
Security-by-default mindset: Vault/KMS, image scanning/SBOM, policy as code (OPA/Gatekeeper/Kyverno), hardening;
Networking fundamentals (TCP/IP, TLS/mTLS, BGP/Anycast concepts, L7 proxies), troubleshooting skills;
Scripting in Bash/Python (Go is a plus); documentation and collaboration skills;
Nice to have: vSphere/EKS-Anywhere, MetalLB (BGP), Cilium/eBPF, service mesh/Emissary, Kafka/PostgreSQL ops, Ceph/NFS, exposure to PCI-DSS/ISO 27001.
Seniority Level: Mid-Senior level
Employment Type: Full-time
Job Function: Engineering and Information Technology
Industries: Financial Services
