As a Senior Machine Learning Engineer at On, you'll play a critical role in the full lifecycle of our machine learning models. Besides being responsible for training and deploying models, you will spearhead our MLOps initiatives to ensure their seamless and efficient integration and operation in production. This includes championing MLOps best practices, enhancing deployment processes, developing essential tooling and automation to maximize the impact of our AI solutions, and implementing robust monitoring to optimize performance and reliability.
Your Mission
Lead the implementation and continuous improvement of our MLOps strategy, establishing best practices for model development, deployment, and monitoring.
Create and train machine learning models to solve specific business problems, such as product recommendations, customer segmentation, and demand forecasting. Implement such models into production systems to make predictions, drive real-time personalization, and support decision-making.
Design and build the necessary infrastructure and tooling to support efficient and scalable model deployment, including CI/CD pipelines and automated testing.
Implement and own Terraform to manage and provision our cloud infrastructure for machine learning operations.
Oversee the transition to a real-time streaming architecture for our machine learning applications, ensuring efficient data ingestion, feature engineering, and model serving in a streaming context.
Develop and implement a comprehensive monitoring framework to track model performance, identify potential issues, and ensure optimal model health in production. Monitor model performance and update them as needed to adapt to new data and changing conditions.
Collaborate closely with data scientists and engineers to ensure seamless integration of models into our existing systems and workflows. Stay abreast of the latest MLOps trends and technologies to continuously improve our processes and tools.
Your Story
You have 5+ years of experience as a Machine Learning Engineer with a strong focus on MLOps. You have a proven track record of successfully deploying and managing machine learning models in production environments.
You possess deep knowledge of MLOps principles, tools, and best practices.
You are proficient in cloud platforms (Google Cloud Platform is preferred) and infrastructure-as-code tools like Terraform.
You have experience with CI/CD pipelines, containerization technologies (e.g., Docker), and orchestration tools (e.g., Kubernetes) and using orchestration tools such as Kubeflow (our preferred tool) or similar frameworks like Apache Airflow to manage and automate ML workflows.
You have experience with real-time data streaming technologies such as Kafka and Confluent and feature stores in such settings.
You are skilled in building and maintaining monitoring systems for machine learning models.
You have excellent communication and collaboration skills, enabling you to effectively work with cross-functional teams.
Bonus:
Knowledge of frameworks such as LangChain used to orchestrate LLMs.
Experience in LLM evaluations, debugging, and monitoring using tools such as LangFuse or LangSmith.
Meet The Team
We're a growing team of passionate Data Scientists and Machine Learning Engineers working across On to build creative and impactful models end-to-end that personalize experiences, optimize decision making, and predict future trends. We sit within Technology and have the opportunity to collaborate across On - Optimizing how we use data, how we consume data, and how we support On's growth through data is something you could be a part of and we'd love to hear from you!
What We Offer
On is a place that is centered around growth and progress. We offer an environment designed to give people the tools to develop holistically - to stay active, to learn, explore, and innovate. Our distinctive approach combines a supportive, team-oriented atmosphere, with access to personal self-care for both physical and mental well-being, so each person is led by purpose.
On is an Equal Opportunity Employer. We are committed to creating a work environment that is fair and inclusive, where all decisions related to recruitment, advancement, and retention are free of discrimination.
#J-18808-Ljbffr