Senior Data Engineer
Paris, France
About Us
We're partnered with an AI-driven startup with €5M+ funding, developing generative AI to discover new materials and reduce CO₂ emissions in carbon-intensive industries.
The Role
Reporting to the CTO, you'll lead two key initiatives:
- Design and build scalable data infrastructure integrating diverse sources (text, simulations, experiments) in support of ML and LLM applications.
- Develop internal tools enabling AI-enhanced data access and foster a data-centric culture
Key Responsibilities
- Build optimized data pipelines for simulation, textual, and experimental data
- Implement secure, scalable data storage systems supporting ML workflows
- Create automation tools for data processing
- Establish data governance policies and lineage tracking
- Collaborate with DevOps on cloud infrastructure integration
- Partner with scientists to enable data-driven decision making
- Contribute to open-source projects
Requirements
- Master's or PhD in Computer Science or related field
- 7+ years of data engineering experience
- Proficiency in multiple programming languages (Python, Rust, Scala, or Go)
- Strong SQL and NoSQL database experience
- Data modeling, ETL, and warehousing expertise
- Cloud platform experience (AWS/GCP) and infrastructure-as-code
- Excellent English communication skills
Nice-to-Have
- ML pipeline and AI infrastructure experience
- Open-source contributions
- Familiarity with scientific data, especially materials science
Benefits
- Competitive salary
- Equity package (BSPCE)
- Comprehensive health insurance (Alan Blue)
- Standard French PTO
- Flexible work environment
