Skip navigation EPAM
CONTACT US

Lead Site Reliability Engineer Armenia or Remote

  • hot

Lead Site Reliability Engineer Description

Join our dynamic team as a Lead Site Reliability Engineer! If you have a substantial background in software and systems engineering and a focus on reliability and scalability in cloud environments, your expertise is needed in managing and communicating with IoT devices via our platform. You will have a critical role in duties such as device registration and connection, bi-directional messaging between devices and the cloud, device state tracking and data storage, issuing alerts and notifications for device state changes, and integrating other cloud services like Device Registry and Firmware Upgrade.

This is a fully remote position that offers you the flexibility to work from any location in Armenia, whether it's your home or well-equipped offices in Yerevan or Gyumri.


#LI-DNI#LI-VA2

Responsibilities

  • Design, implement, and maintain highly scalable and available systems across Azure cloud architectures
  • Regularly test and implement disaster recovery (DR) plans
  • Configure and enhance monitoring and alerting processes using Prometheus, Grafana, and OpsGenie
  • Develop dashboards to visualize system performance and reliability metrics
  • Use Terraform for infrastructure provisioning and management
  • Support the development team in ongoing projects
  • Communicate with the customer’s DevOps team to discuss requirements and collaborate on implementations
  • Enhance release management and CI/CD processes
  • Improve system security based on security team recommendations
  • Document system support processes and design, write and test runbooks for operational tasks and incident response

Requirements

  • Minimum 5 years of experience as a DevOps or SRE engineer
  • Proven experience with Azure cloud architectures
  • Proficiency in Kubernetes and Docker/Linux services
  • Familiarity with monitoring tools: Prometheus, Grafana, OpsGenie
  • Experience with .NET Core and ASP.NET Core applications
  • Strong knowledge of Cosmos DB (both Mongo API & SQL API) and MS SQL Server
  • Expertise in Terraform
  • Experience with CI/CD tools and Azure Networking concepts
  • Excellent communication skills, ability to manage tasks and projects independently
  • Experience with Azure IoT Hub and EventHub is an added advantage

We offer

  • We connect like-minded people:
    • Delivering innovative solutions to industry leaders, making a global impact
    • Enjoyable working environment, whether it is the vibrant office or the comfort of your home
    • Opportunity to work abroad for up to two months per year
    • Relocation opportunities within our offices in 55+ countries
    • Corporate and social events
  • We invest in your growth:
    • Leadership development, career advising, soft skills and well-being programs
    • Certifications, including GCP, Azure and AWS
    • Unlimited access to LinkedIn Learning, Get Abstract and O'ReillyFree
    • English classes with certified teachers
  • We cover it all:
    • Participation in the Employee Stock Purchase Plan
    • Monetary bonuses for engaging in the referral program
    • Comprehensive medical & family care package
    • Four trust days per year for personal needs
    • Discounts for fitness clubs
    • Benefits package (hotels, restaurants, stores and services)

EPAM Armenia is a team of talented innovators united by a passion for technology. In 2014, we opened our first office in Yerevan, and now we have a second engineering hub in Gyumri. We've built a continuously learning organization that helps its employees rapidly advance their careers. Here you will work with the world's industry leaders, support impactful projects using the latest technologies, collaborate with multi-national teams, and have access to a wide variety of development opportunities.

GET IN TOUCH

Hello.
How can we help you?

Get in touch with us. We'd love to hear from you.

Our
Locations