Job Title: Core - Ceph expert for cluster re-design, Ittigen
Job Reference: 638e7170c003
Job Views: 5
Posted: 21.01.2025
Expiry Date: 07.03.2025
Job Description:
Initial Situation:
We have a Ceph solution in place which is complex, not scalable, and unstable. Refactoring poses a high risk of bringing it down.
We have new machines and would like to spawn a new Ceph cluster following new architecture with the latest Ceph components and best practices to achieve a less complex and more stable cluster that we can confidently scale, operate, and maintain.
Delivery Objectives:
1. Support in architecting and deploying a high-performance and highly available Kubernetes-based Ceph storage solution for S3 and K8s persistent volumes supporting internal ETL, Data Analytics, and AI use cases.
2. Hands-on in automating deployment artifacts and developing a monitoring & alerting stack to operate our cluster.
3. Provide LCM plans including a set of automations that allow upgrades in a rolling-restart mode without affecting users.
Definition of Done:
1. Architectural choices are proposed and validated.
2. Deployment artifacts are tested to achieve efficient deployment from scratch.
3. Monitoring and alerting stack is deployed on PROD.
4. LCM plans and automated upgrades of all components across the cluster are designed, implemented, and tested on PROD.
5. Ceph solution is verified in a productive environment.
Professional and Technical Framework Conditions:
1. Proficiency in Ceph technology deployed on Kubernetes.
2. Experience in designing, deploying, operating, and enhancing multiple Ceph solutions.
#J-18808-Ljbffr