About the Role
We are seeking an experienced Data Engineer to join our team and help drive the transformative effort in a global pharmaceutical company.
As a Data Engineer, you will be responsible for designing and implementing robust data pipelines using Azure Databricks or equivalent cloud data services.
You will work with the rest of IT and data science teams to integrate state-of-the-art data & AI solutions into the business processes and workflows.
Your primary goal will be to ensure all data solutions comply with regulatory standards in the pharmaceutical industry, utilizing Azure security and compliance features.
Key Responsibilities:
* Data Architecture: Design and implement robust data pipelines using Azure Databricks or equivalent cloud data services.
* Data Integration and ETL Processes: Develop and maintain ETL processes ensuring seamless data integration from various sources, including IoT devices in manufacturing.
* Machine Learning Operations (MLOps): design, implement, automate and maintain scalable machine learning pipelines using MLflow on Databricks.
* Design, implement and maintain data pipelines to power generative AI solutions such as Retrieval Augmented Generation applications.
* Collaboration and Training: Work with the rest of IT and data science teams to integrate state-of-the-art data & AI solutions into the business processes and workflows. Provide training and support to end-users if necessary.
* Ensure all data solutions comply with regulatory standards in the pharmaceutical industry, utilizing Azure security and compliance features.
* Coach and support teams in the various business functions to help them onboard onto Databricks or other cloud-native solutions and fully leverage the potential of these technologies.
Requirements
* Educational Background: Bachelor's degree in computer science, Engineering, Data Science, or related field.
* Good communication and interpersonal skills with the ability to work in an international, distributed team
* Fluent in English. German is a plus.
* Data Engineering Experience: Minimum of 5 years of experience in data engineering, with a strong focus on cloud native technologies.
* Demonstrated experience in deploying and maintaining data pipelines using Databricks or another equivalent cloud-based technology.
* Demonstrated experience in building Machine Learning pipelines and promoting them across different environments, including production.
* Proficiency in Python with a strong focus on data engineering, task orchestration and machine learning libraries. A good knowledge of SQL is a plus.
* Demonstrated experience of collaborative work on code development (e.g. branching strategy...), using Github or an equivalent technology
* Experience with Azure or AWS is required.
* Any Azure or Databricks certification is a plus
* Knowledge of MLflow is a plus
* Experience on building cloud infrastructure using IaC is a plus
About Us
We are a global pharmaceutical company that produces life-saving medicines. We offer a unique opportunity to drive the transformative effort in a global pharmaceutical company.
You will have a high level of influence where you can make a difference and leave your footprint.
Our company values include family values, long-term perspective for employees and relationships. We also offer an attractive salary and benefits package, skills development opportunities, and a chance to work with skilled and fun colleagues in a relatively informal organization.
What We Offer
* You help save lives - Every day is meaningful as we produce life-saving medicines
* Family values - Long-term perspective for employees and relationships
* Be rewarded with an attractive salary and benefits package
* You will have a high level of influence where you can make a difference and leave your footprint
* Work with skilled and fun colleagues in a relatively informal organization
* Skills development - We offer various internal and external employee and leadership trainings, trainee programs and digital solutions