Job Description
This is a Data Engineer role where you will be responsible for designing and implementing robust data pipelines using Azure Databricks or equivalent cloud data services. You will also manage and optimize Data lakehouse solutions for efficient data storage, transformations, and processing.
Data Architecture
* Design and implement robust data pipelines using Azure Databricks or equivalent cloud data services.
* Manage and optimize Data lakehouse solutions for efficient data storage, transformations, and processing.
Data Integration and ETL Processes
* Develop and maintain ETL processes ensuring seamless data integration from various sources, including IoT devices in manufacturing.
Machine Learning Operations (MLOps)
* Design, implement, automate, and maintain scalable machine learning pipelines using MLflow on Databricks.
Collaboration and Training
* Work with the rest of IT and data science teams to integrate state-of-the-art data & AI solutions into business processes and workflows.
* Provide training and support to end-users if necessary.
* Ensure all data solutions comply with regulatory standards in the pharmaceutical industry, utilizing Azure security and compliance features.
* Capture Teams in the various business functions to help them onboard onto Databricks or other cloud-native solutions and fully leverage the potential of these technologies.
Educational Background
* Bachelor's degree in computer science, engineering, data science, or a related field.
Data Engineering Experience
* Minimum of 5 years of experience in data engineering, with a strong focus on cloud native technologies.
* Demonstrated experience in deploying and maintaining data pipelines using Databricks or another equivalent cloud-based technology.
* Demonstrated experience in building Machine Learning pipelines and promoting them across different environments, including production.
Requirements
* Proficiency in Python with a strong focus on data engineering, task orchestration, and machine learning libraries.
* A good knowledge of SQL is a plus.
* Demonstrated experience of collaborative work on code development, using GitHub or an equivalent technology.
* Experience with Azure or AWS is required.
* Any Azure or Databricks certification is a plus.
* Knowledge of MLflow is a plus.
* Experience on building cloud infrastructure using IaC is a plus.
About Us
We are a global pharmaceutical company that produces life-saving medicines. Our values include family, long-term perspective, employee relationships, and skilled and fun colleagues in a relatively informal organization.
We offer various internal and external employee and leadership trainings, trainee programs, and digital solutions for skills development.