305 King St W
Suite 1100
Kitchener, ON N2G 1B9
Canada
Senior Application Support Engineer (HPC) Remote
Description
We are looking for an experienced Senior Application Support Engineer to join our team and provide comprehensive support for our Data Science Environments, specializing in managing MLOps environments.
This role focuses on ensuring operational excellence and responding to user needs effectively while co-leading the development of our ML platforms.
Responsibilities
- Co-lead the Data Science Environments team, focusing on operational aspects
- Influence and contribute to the team’s strategic roadmaps by collecting and reporting stakeholder feedback and user needs
- Engage with ML platform users through communication, onboarding, technical documentation, education, and training including demos
- Maintain operational excellence of our platforms by leading support engineering teams in tasks including patching cycles and security assessments
- Influence the R and Python roadmap focusing on library dependencies, compatibility, and interaction with vendors
- Contribute to the governance and configuration of MLflow and manage the separation of users and use cases
- Provide support for MATLAB users, including license management and vendor interactions
- Develop advanced technical solutions in collaboration with platform users
- Lead the operational aspects of configuring and running scientific HPC applications
- Support computational model building using tools like RStudio and Jupyter Notebook
Requirements
- Strong experience with Linux
- Proficiency in high-performance computing (HPC) environments
- Experience writing documentation and tutorials, ideally with tools like Confluence, Jupyter Notebook, and R Markdown
- Familiarity with issue-tracking tools such as Jira
- General understanding of Machine Learning and Data Science concepts
- Background in Python programming, especially in package management
- Skills in Jupyter Notebook and JupyterLab
- Expertise in R programming and its ecosystem, particularly in RStudio Workbench and RStudio Connect
- Experience with SageMaker Studio
- Prior experience directly interacting with vendors
- Experience with MATLAB user support and license management
- Fluent English communication skills at a B2+ level
Nice to have
- Understanding of machine learning lifecycle management and experience with MLOps platforms such as MLflow
We offer
- CONTINUOUS UPSKILLING, LEARNING & DEVELOPMENT:
  - Diversity of tasks and projects
  - Assessment center for objective review of competency level
  - Personal development plan
  - Mentoring programs and leadership development
  - Certification and professional development support
  - Access to learning platforms including more than 2,500 internal courses and the LinkedIn Learning library with 20,000+ courses
  - English courses taught by certified teachers
- CORPORATE BENEFITS:
  - Extra leave days
  - Referral bonuses
- COMPENSATION PACKAGE:
  - Competitive compensation paid in USD
  - Regular salary and performance reviews
- MEDICAL & HEALTHCARE:
  - Private health insurance
  - Well-being events
- WORKING ENVIRONMENT:
  - Recreation areas and kitchens
  - Tea, coffee, and snacks
  - Well-being events
  - Sports equipment and game consoles
  - IT Equipment
  - Microsoft's Software Assurance Home Use Program (HUP)
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.