305 King St W
Suite 1100
Kitchener, ON N2G 1B9
Canada
Lead DevOps / Platform Engineer
Houston, TX, USA
Description
We are hiring a strategic Lead DevOps / Platform Engineer with experience working in a dynamic team, designing impactful strategies, and solving complex business problems for large, global organizations. This is a high-impact role with many opportunities to develop new skills and advance your career. If you are looking for a place where you can truly thrive, this is the opportunity for you. Apply now!
Req.#687800218
Responsibilities
- Understand user requirements, write documentation, and translate the requirements into software development
- High service quality: understand users' constraints, work with internal OXY teams (cloud ops, cyber, netsec) to resolve issues, and proactively attend change review meetings to catch, in advance, changes that would impact our accounts
- Without being told, know how to get help within OXY and whom to contact, by building rapport with OXY folks
- When the team wants to build a new solution, provide ideas and advise on what to build and how to build it, then build it for the team or help build parts of it
- Understand clearly how systems should work in AWS environments in terms of security, network security, and cloud operations
- Reading from and writing to S3 buckets
- Parallelization with CPU compute and GPU compute
- AutoScaling: AutoScaling Group, ScaleOut, Kubernetes (EKS)
- Email notifications when jobs fail or complete
- Monitoring dashboards: job runs, services utilization, EC2 utilization, duration, costs
- AutoScaling using Kubernetes
- Be able to work independently with minimal supervision
Requirements
- AWS Services
- Storage: S3 buckets. Understand the trade-offs between general purpose and directory buckets, and have exposure to benchmarking them
- Infrastructure as Code (IaC): Lambda
- IaC Framework: Terraform by HashiCorp and CloudFormation by AWS
- AutoScaling and Job Queue: AutoScaling Group
- Job Queueing: SQS with LaunchTemplate
- Database: DynamoDB
- Container: Docker implementation using ECR and ECS
- Logs: CloudWatch, Athena
- Infrastructure: Python CDK
- Frontend: Angular OR React, API Gateway
- Backend: C#
- Database: RDS
- Notification: SNS
- Static Web Hosting: S3 with ALB
- Programming languages
- Python, C#
- Program software functionalities using AWS services, such as AutoScaling Group, DynamoDB, CloudWatch, SNS
- Low-level support
- Spin up EC2 instances and configure them with the Palo Alto certificate for SSL; extend the SSH timeout
- Strong understanding of how security groups, subnets, VPC endpoints, and permissions boundaries work, what they are used for, and their impact
- Software installation, software implementation
- Others
- Familiar with UI/UX design and implementation
- Familiar with the business analyst process for translating software requirements from the business unit (BU) into software development
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.