Return to Careers
Apprenticeship – Data Engineer
Engineering | Marseille
Role Overview
We are looking for a Data Engineer apprentice, based at Syroco’s Marseille office, to join our data team and work closely with our Senior Data Engineer (based remotely in Paris).
You will contribute primarily to the operation of our data engineering platform: designing and developing ETL pipelines, ingesting and integrating diverse data sources, transforming and modelling data in our datalake, and more.
This apprenticeship offers an ideal environment to learn, grow quickly, and contribute to a concrete industrial project at the heart of the maritime energy transition.
Main Responsibilities
- Integration of New Vessels / IoT
- Set up ingestion connectors for new vessels.
- Define mappings and schemas for sensor data.
- Retrieve and structure dynamic vessel data (speed, wind, fuel consumption, GPS, etc.) from multiple sources (sensors, manual declarations, etc.).
- Contribute to improving the robustness of the data collection pipeline (APIs, web scraping, emails, etc.).
- Run & Maintenance of the Data Platform
- Operate and monitor existing ingestion pipelines.
- Run historical backfills.
- Respond to anomalies, failures, missing or inconsistent data.
- Work on observability: logs, metrics, alerts (Datadog, CloudWatch, etc.).
- Participate in continuous performance and quality improvements.
- Contribution to the Semantic Layer Project (optional)
- Support the implementation and documentation of a semantic layer to unify and structure business data.
- Work on models, roles, metrics, and conventions.
- Collaboration & Learning
- Learn daily from the Senior Data Engineer and the engineering team.
- Participate in code reviews and data engineering best practices.
- Join technical discussions on ingestion, architecture, tooling, and data quality.
- Contribute to the “You build it, you run it” culture.
Tech Stack (you will learn and work with)
- Ingestion & orchestration: Python, AWS Lambda, Step Functions
- Transformation: Pydantic, internal tools
- Storage & data: AWS S3, RDS/PostgreSQL, Parquet, Athena, IoT/time-series data
- Monitoring & alerting: Datadog, PagerDuty
- Infrastructure: AWS (ECS, S3, Lambda, Step Functions), Terraform, Terragrunt
- Tooling: GitHub, Docker, API clients, observability tools
- Languages: Python, SQL
About You
Don’t hesitate to apply even if you don’t meet every requirement - an apprenticeship is designed for learning.
- You are preparing a Master’s degree, engineering school diploma, or professional bachelor’s with a data focus.
- You have solid foundations in Python and SQL.
- You’re interested in data pipelines, ingestion, data quality, and reliability.
- You want to work with real, industrial, IoT, and high-volume data.
- You are autonomous, curious, and rigorous.
- You enjoy solving concrete problems and understanding systems.
- You align with Syroco’s engineering principles:
- Think about problems before talking about solutions.
- Never assume, always check.
- Done is better than perfect.
- You build it, you run it.
- Production first.
- Share more, learn more.
- Only the paranoid survive.
Practical Information
- Location: on-site in Marseille (Vieux Port)
- Duration: 12 or 24 months depending on your programme
- Start date: flexible, ideally early January
- Environment: great working conditions, committed team, high-impact environmental project