305 King St W
Suite 1100
Kitchener, ON N2G 1B9
Canada
Site Reliability Engineer Remote
Site Reliability Engineer Description
We're looking for a passionate Site Reliability Engineer with 3 to 5 years of experience in Site Reliability Engineering, DevOps, or Infrastructure to lead and elevate our team.
Our client, a world-famous, public California-based multinational technology company (from the Big Five Tech list) specializing in consumer electronics, software, and online services.
Join us in this thrilling opportunity to work at the forefront of technology, driving innovation and excellence in a vibrant and creative environment.
This position requires a shift in the working hours – working from 12pm to 8/9 pm.
#LI-DNI#EasyApply
Responsibilities
- Optimize operating systems for high-impact internet-facing production services and distributed systems
- Implement and coordinate state-of-the-art telemetry using tools like Splunk, Grafana, and Prometheus, enhancing our organizational capabilities
- Address complex issues in Kubernetes, establishing high standards and best practices for the team
- Utilize Bash and Python to create and maintain cutting-edge automation scripts, boosting our operational efficiency
- Build and operate advanced systems like Kubernetes or EKS, sharing your knowledge and experience with the team
- Design and maintain robust, high-performance cloud infrastructure with AWS, ensuring the highest levels of availability and reliability
- Champion innovative automation solutions, minimizing manual work and driving our team towards more efficient processes
- Show strong ownership and leadership, communicate effectively, and foster a transparent and collaborative team environment
- Keep growing and learning, and inspire your team to do the same, nurturing a culture of continuous improvement and curiosity
- Provide invaluable guidance and support to all team members, fostering a culture of clarity and efficiency in communication
- Strategically manage disaster recovery and capacity planning to ensure system resilience and scalability
- Take charge of deployment automation with tools like Terraform or CloudFormation, increasing our team's productivity and reliability
- Leverage your experience with technologies like Cassandra, Kafka, Solr, Postgres, and Redis, enhancing our SRE practices
Requirements
- Experience with Bash scripting
- Proficiency in Grafana for monitoring and visualization
- Familiarity with Internet Information Services (IIS) for web server management
- Strong knowledge and experience with Linux operating systems
- Expertise in Prometheus for monitoring and alerting
- Proficient coding skills in Python for automation and scripting purposes
Nice to have
- Experience with Amazon Web Services (AWS) for cloud infrastructure
- Familiarity with various Cloud Platforms for deployment and management
- Knowledge of Kubernetes for container orchestration
- Familiarity with Splunk for log management and analysis
- Experience with Terraform for Infrastructure as Code (IaC) provisioning
- Familiarity with Terraform Cloud for collaborative infrastructure management
- Skills in troubleshooting system issues and debugging
We offer
- We gather like-minded people:
- Engineering community of industry professionals
- Friendly team and enjoyable working environment
- Flexible schedule and opportunity to work remotely within Poland
- Chance to work abroad for up to 60 days annually
- Relocation within our 50+ offices
- We provide growth opportunities:
- Outstanding career roadmap
- Leadership development, career advising, soft skills, and well-being programs
- Certification (GCP, Azure, AWS)
- Unlimited access to LinkedIn Learning, Get Abstract, O’Reilly, Cloud Guru
- Language classes in English and Polish for foreigners
- We cover it all:
- Stable income (Employment Contract or B2B)
- Participation in the Employee Stock Purchase Plan
- Benefits package (health insurance, multisport, shopping vouchers)
- Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
- Referral bonuses
- Corporate, social and well-being events
- Please, note:
- The set of bonuses might vary based on the role you apply for – specifics will be discussed with our recruiter during the general interview
- We will reach out to selected candidates exclusively
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.