Job Summary
As a Systems Platform Engineer, you will design, implement, and maintain HPC platform services for a multi-tenant infrastructure. Your work will enhance the overall HPC system functionality and efficiency.
Responsibilities:
* Designing and deploying HPC systems platforms to meet user community requirements;
* Implementing, deploying, and testing platforms;
* Managing and maintaining platforms over time;
* Supporting and documenting platforms.
Requirements:
* Management of Linux systems, including compute, storage, and network components;
* Working knowledge of automation tools and frameworks, including CI/CD processes;
Preferred Qualifications:
* Configuration of Slurm HPC scheduler;
* JFrog Artifactory experience;
* Developing Ansible configurations;
* Developing services on top of Kubernetes.
Education:
Bachelor's or higher degree in Computer Engineering, Computer Science, or relevant technical field, or equivalent practical experience.
Work Environment:
A dynamic organization that values autonomy, ownership, and continuous learning, offering hands-on experience in the challenging aspects of the HPC field.