Skip to main content

EPAM Systems is hiring Senior Site Reliability Engineer (AWS)

➡️ Apply here: Senior Site Reliability Engineer (AWS)

🔔 Monitor #sre jobs

👩‍💼 Want to stand out? Improve your resume to appeal to recruiters, hiring managers, and Applicant Tracking Systems. ➡️ Improve your resume


Are you a talented Site Reliability Engineer (SRE) passionate about building scalable, efficient, and reliable cloud systems?

Join our team of innovative professionals who are shaping the future of cloud infrastructure and delivering world-class solutions. If you’re seeking a challenging role where your technical expertise and problem-solving abilities can make a real impact, we’d love to hear from you!

Experience the freedom of remote work from anywhere in Georgia, whether from the comfort of your home, our modern offices in Tbilisi and Batumi or a coworking space in Kutaisi.

Responsibilities
Design, develop, and maintain scalable cloud infrastructure solutions using AWS technologies and AWS CDK
Collaborate with development and operations teams to ensure efficient delivery of applications, enhance deployments, and improve overall system reliability
Enhance server-side code using TypeScript to support application functionality and scalability
Implement best practices for CI/CD pipelines to accelerate development and deployment processes
Respond to operational issues and incidents, troubleshoot effectively, and ensure high availability and resilience of production systems
Drive the implementation of observability practices, including monitoring, logging, and alerting, to proactively identify and resolve system issues
Support operational systems, ensuring optimized performance and seamless scalability
Foster collaboration and share knowledge across teams to uphold a robust culture of DevOps and SRE

Requirements
Proven experience as a Backend Engineer or Site Reliability Engineer
Deep understanding and practical experience with AWS services and infrastructure
Expertise in AWS Cloud Development Kit (AWS CDK) for infrastructure as code
Proficiency in TypeScript and its application in cloud-based systems
Strong knowledge and experience with operational support in a cloud environment
Excellent communication and interpersonal skills, with a focus on collaboration

Nice to have
Knowledge of and experience with CI/CD pipelines
Practical experience with DevOps practices, tools, and methodologies
Familiarity with Site Reliability Engineering (SRE) principles
Experience in observability practices, including monitoring and alerting tools such as Datadog, Prometheus, Grafana, or equivalent

We offer
We connect like-minded people
Delivering innovative solutions to industry leaders, making a global impact
Enjoyable working environment, whether it is the vibrant office or the comfort of your own home
Opportunity to work abroad for up to two months per year
Relocation opportunities within our offices in 55+ countries
Corporate and social events

We invest in your growth
Leadership development, career advising, soft skills and well-being programs
Certifications, including GCP, Azure and AWS
Unlimited access to LinkedIn Learning and Get Abstract
Free English classes with certified teachers

We cover it all
Participation in the Employee Stock Purchase Plan
Monetary bonuses for engaging in the referral program
Comprehensive medical & family care package
Five trust days per year (sick leave without a medical certificate)
Benefits package (sports activities, a variety of stores and services)

EPAM Georgia is a team of innovators united by a passion for technology. The dynamic and inclusive culture we embrace helps positively impact our communities, clients, and employees. Here you will collaborate with multi-national teams, contribute to numerous cutting-edge projects, deliver the most creative solutions, and have an opportunity to learn. Our people are at the heart of our success, and we are proud to provide talents with a solid ground to develop and grow.

Previous and next articles