Skip navigation EPAM
CONTACT US

Lead Systems Engineer (DevOps & SRE) Hyderabad, India

  • hot

Lead Systems Engineer (DevOps & SRE) Description

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

We are looking for a Lead Systems Engineer (DevOps & SRE) and play a crucial role in ensuring the reliability, scalability, capacity planning, and performance of our infrastructure and applications.

You will have a strong background in software engineering, system administration, containerization, and cloud technologies, and will lead the design, development, and maintenance of scalable and reliable infrastructure. You will also be responsible for implementing and managing CI/CD pipelines, monitoring system performance and reliability, developing and maintaining automation tools, ensuring security and compliance, mentoring and guiding junior SREs and DevOps engineers.


#LI-DNI#LI-KK18

Technologies

  • CI/CD, Jenkins, Docker, Kubernetes, Terraform, Ansible, Python, Prometheus, Grafana, ELK stack, Splunk, Dynatrace, Datadog or similar, SLI, SLO, SLA and Error Budget concepts

Responsibilities

  • Lead the design, development, and maintenance of scalable and reliable infrastructure
  • Implement and manage CI/CD pipelines to ensure efficient and smooth software releases
  • Develop and maintain automation tools to streamline infrastructure management and deployment processes
  • Monitor system performance and reliability, proactively identifying and resolving issues
  • Ensure security and compliance across all infrastructure and operations
  • Collaborate with development teams to ensure best practices for software development, deployment, and operations
  • Mentor and guide junior SREs and DevOps engineers, fostering a culture of collaboration and continuous learning
  • Optimize resource utilization to ensure cost-effective operations
  • Conduct root cause analysis of system failures and implement solutions to prevent recurrence
  • Stay up-to-date with the latest industry trends and technologies, integrating them into our processes where appropriate

Requirements

  • Minimum 8 years of experience in a DevOps/SRE role
  • Strong experience with cloud platforms (AWS, GCP, Azure)
  • Proficiency in infrastructure as code (IaC) tools (Terraform, CloudFormation, etc.)
  • Strong experience with containerization and orchestration (Docker, Kubernetes)
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK stack, etc.)
  • Strong knowledge of CI/CD tools (Jenkins, GitLab CI, CircleCI, etc.)
  • Proficiency with scripting languages (Python, Bash, etc.)
  • Ability to participate in capacity planning and scalability assessments to support business growth and requirements
  • Strong knowledge of of SLI, SLO, SLA and Error Budget concepts and their implementations and provide on-call support and participate in incident management & response activities as needed
  • Solid understanding of networking and security principles
  • Strong communication and collaboration skills
  • Excellent problem-solving skills and the ability to work under pressure
  • B2+ English level 

We offer

  • Opportunity to work on technical challenges that may impact across geographies
  • Vast opportunities for self-development: online university, knowledge sharing opportunities globally, learning opportunities through external certifications
  • Opportunity to share your ideas on international platforms
  • Sponsored Tech Talks & Hackathons
  • Unlimited access to LinkedIn learning solutions
  • Possibility to relocate to any EPAM office for short and long-term projects
  • Focused individual development
  • Benefit package:
    • Health benefits
    • Retirement benefits
    • Paid time off
    • Flexible benefits
  • Forums to explore beyond work passion (CSR, photography, painting, sports, etc.)

A DAY IN THE LIFE

BLOG

Salman Talat
Director, Account Management
TORONTO, CANADA

Read More

BLOG

Iryna Kovalenko
Delivery Manager
KYIV, UKRAINE

Read More

BLOG

Jan Mazurek
Chief Business Analyst
GDANSK, POLAND

Read More

GET IN TOUCH

Hello.
How can we help you?

Get in touch with us. We'd love to hear from you.

Our
Locations