Opportunity Description
- Develop end-to-end data pipelines (ingestion, transformation, modeling, serving) and help build visualizations in Power BI or Databricks SQL.
- Continuously improve the SOCOTEC Lakehouse, particularly in the areas of governance, quality, and data pseudonymization.
- Experiment with generative AI solutions applied to data, such as Databricks Genie, to turn natural-language queries into actionable insights.
Qualifications
- Master's degree in Big Data, Computer Science, or Software Engineering, with a strong specialization in, or appetite for, data and distributed architectures. At least 3 years of experience in Data Engineering.
- Strong command of SQL and NoSQL databases (modeling, query optimization, integrity, and performance).
- Good understanding of Big Data architecture and distributed processing tools (Spark, Hadoop, Airflow, Kafka, Delta Lake, etc.).
- Prior experience with Databricks would...