Senior Site Reliability Engineer (Remote)

Description

We are looking for a Senior Site Reliability Engineer to join our motivated team.

As a Senior Site Reliability Engineer, you will play a crucial role in maintaining the reliability and performance of our systems. Your expertise in Site Reliability Engineering will ensure that our services run smoothly and efficiently, providing an outstanding experience for our users. If you are passionate about automation and system performance, we would love to hear from you.

Responsibilities

  • Optimize operating systems for high-impact, internet-facing production services and distributed systems
  • Implement and coordinate state-of-the-art telemetry using tools such as Splunk, Grafana, and Prometheus, enhancing our organizational capabilities
  • Address complex issues in Kubernetes, establishing high standards and best practices for the team
  • Utilize Bash and Python to create and maintain cutting-edge automation scripts, boosting our operational efficiency
  • Build and operate container platforms such as Kubernetes or Amazon EKS, sharing your knowledge and experience with the team
  • Design and maintain robust, high-performance cloud infrastructure with AWS, ensuring the highest levels of availability and reliability
  • Champion innovative automation solutions, minimizing manual work and driving our team toward more efficient processes
  • Show strong ownership and leadership, communicate effectively, and foster a transparent and collaborative team environment
  • Keep growing and learning, and inspire your team to do the same, nurturing a culture of continuous improvement and curiosity
  • Provide guidance and support to all team members, fostering a culture of clarity and efficiency in communication
  • Strategically manage disaster recovery and capacity planning to ensure system resilience and scalability
  • Take charge of deployment automation with tools such as Terraform or CloudFormation, increasing our team's productivity and reliability
  • Leverage your experience with technologies such as Cassandra, Kafka, Solr, PostgreSQL, and Redis, enhancing our SRE practices

Requirements

  • 3+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure
  • Experience with Amazon Web Services and cloud platforms
  • Strong knowledge of Linux operating systems
  • Proficiency in using Terraform and Terraform Cloud
  • Understanding of network protocols and security
  • Excellent problem-solving and troubleshooting skills
  • Ability to work collaboratively in a team environment
  • Strong communication skills in English (B2+ level)

Nice to have

  • Experience with container orchestration in Kubernetes
  • Familiarity with configuration management tools

We offer

  • We gather like-minded people:
    • Engineering community of industry professionals
    • Friendly team and enjoyable working environment
    • Flexible schedule and opportunity to work remotely within Poland
    • Chance to work abroad for up to 60 days annually
    • Opportunity to relocate to any of our 50+ offices
  • We provide growth opportunities:
    • Outstanding career roadmap
    • Leadership development, career advising, soft skills, and well-being programs
    • Certification (GCP, Azure, AWS)
    • Unlimited access to LinkedIn Learning, getAbstract, and A Cloud Guru
    • Language classes in English and Polish for foreigners
  • We cover it all:
    • Stable income (Employment Contract or B2B)
    • Participation in the Employee Stock Purchase Plan
    • Benefits package (health insurance, multisport, shopping vouchers)
    • Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
    • Referral bonuses
    • Corporate, social and well-being events
  • Please note:
    • The set of bonuses might vary based on the role you apply for – specifics will be discussed with our recruiter during the general interview
    • We will contact selected candidates only

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.