About the Role
We are seeking an experienced Data Engineer to design and implement robust data pipelines using Azure Databricks or equivalent cloud data services. The successful candidate will be responsible for managing and optimizing Data lakehouse solutions for efficient data storage, transformations, and processing.
Key Responsibilities
* Data Architecture: Design and implement data pipelines using Azure Databricks or equivalent cloud data services.
* Data Integration and ETL Processes: Develop and maintain ETL processes ensuring seamless data integration from various sources, including IoT devices in manufacturing.
* Machine Learning Operations (MLOps): Design, implement, automate, and maintain scalable machine learning pipelines using MLflow on Databricks.
* Generative AI Solutions: Design, implement, and maintain data pipelines to power generative AI solutions such as Retrieval Augmented Generation applications.
* Collaboration and Training: Work with IT and data science teams to integrate state-of-the-art data & AI solutions into business processes and workflows. Provide training and support to end-users if necessary.
* Compliance: Ensure all data solutions comply with regulatory standards in the pharmaceutical industry, utilizing Azure security and compliance features.
* Coaching and Support: Coach and support teams in various business functions to help them onboard onto Databricks or other cloud-native solutions.
Requirements
* Educational Background: Bachelor's degree in computer science, engineering, data science, or a related field.
* Communication and Interpersonal Skills: Good communication and interpersonal skills with the ability to work in an international, distributed team.
* Language: Fluent in English. German is a plus.
* Data Engineering Experience: Minimum of 5 years of experience in data engineering, with a strong focus on cloud native technologies.
* Databricks Experience: Demonstrated experience in deploying and maintaining data pipelines using Databricks or another equivalent cloud-based technology.
* Machine Learning Experience: Demonstrated experience in building Machine Learning pipelines and promoting them across different environments.
* Python Proficiency: Strong focus on data engineering, task orchestration, and machine learning libraries. A good knowledge of SQL is a plus.
* Collaborative Work: Demonstrated experience of collaborative work on code development, using Github or an equivalent technology.
* Azure or AWS Experience: Experience with Azure or AWS is required.
* Azure or Databricks Certification: Any Azure or Databricks certification is a plus.
* MLflow Knowledge: Knowledge of MLflow is a plus.
* IaC Experience: Experience in building cloud infrastructure using IaC is a plus.
What We Offer
* Mission-Driven Work: You help save lives - Every day is meaningful as we produce life-saving medicines.
* Family Values: Long-term perspective for employees and relationships.
* Attractive Salary and Benefits Package: Be rewarded with an attractive salary and benefits package.
* Influence and Autonomy: You will have a high level of influence where you can make a difference and leave your footprint.
* Skill Development: We offer various internal and external employee and leadership trainings, trainee programs, and digital solutions.