Location: Brussels
Languages: English
Employment Type: Freelance
Desired Start Date: ASAP
Context of the mission
We are looking for a passionate and experienced AI & ML Data Engineer to join our growing team of experts. If you're excited about building robust data science platforms, designing complex data pipelines, and contributing to impactful use cases in the railway sector — this is your opportunity
Responsibilities
As AI & ML Data Engineer, you will join the advanced analytics team. We are a team of data scientists and data engineers working on advanced analytics and AI projects within different domains (punctuality, stations, HR, security, ...)
Within this team, your role will be to:
- Collaborate with our infrastructure team to setup, improve and maintain scalable data science platforms on Azure
- Design and implement complex data ingestion pipelines from diverse sources
- Collaborate with data scientists to operationalize Machine Learning models
- Put Large Language Models (LLMs) into production for a variety of use cases
- Work on strategic and operational use cases that directly impact railway services
- Collaborate with data governance, security, and performance teams
- Actively contribute to a collaborative and innovative team culture
Conformity criterium
- At least 5 years of experience in data engineering and Machine Learning operations
- Proven track record of setting up data science platforms for organizations with over 1,000 employees
- Minimum of 5 years of experience with cloud services for advanced data analytics. We use Azure
- Proficiency in PySpark, Python and SQL demonstrated through at least 3 successful projects
- Experience in setting up continuous integration/continuous deployment (CI/CD) infrastructure as code, and DevOps practices, with at least 3 implemented projects
- Experience in setting up infrastructure as code with TerraForm or similar
- Experience in maintaining data pipelines for large organizations.
- Fluent in English, with professional proficiency in both written and spoken communication.
Evaluation criterium
- Ability to setup a data platform for advanced analytics purposes in large organization
- Experience in bringing data science and Gen AI models in production and ensure run and maintenance of those
- Knowledge in coding for in afford mentioned programs
- Experience in setting up infrastructure as code
- Ability to contribute on the development of data science use cases
- Communication skills