Sep 2024
Present
Present
Data Engineer (Apprentice)
BNP Paribas CIB
Paris, France
- Collected and processed ESG data for 10+ providers through 10+ dedicated pipelines: hundreds of datasets/provider, large volumes, processing cut from several hours to around ten minutes; a full-time manual role became ~1 day of monitoring.
- Contributed to the data platform for data integration and delivery into client datalakes with Airflow, Step Functions, Lambda, S3, Glue, and Apache Iceberg.
- Industrialized nearly 400 ESG indicators with dbt.
- Built and maintained Snowflake objects and SQL workloads for ESG analytics; improved SQL perf on heavy joins and recurring aggregation jobs.
- Built an internal Python connector library (SFTP/Paramiko, internal APIs, Microsoft Graph, AWS) and bash env-bootstrap scripts; onboarding time 1 day → 1 min for new team members.
- Delivered ad-hoc Python/SQL tooling for product and sales to shorten client deliverable turnaround.
- Backend development and use-case scripts running on CircleCI to manipulate data in DynamoDB and PostgreSQL.
- Built Rust/Python CLIs to interact with internal APIs and speed up internal needs.
- Automated technical documentation from dbt lineage.
- Adopted spec-driven development with AI agents and reusable skills on pipeline and dbt feature delivery.
- Shipped client chatbot (AWS Bedrock, Chainlit) for self-service access to curated ESG datasets.
- Published around 20 QuickSight dashboard types across clients with regulatory and business templates; partnered with ESG analysts on ESG KPIs, backlog prioritization and data quality tests.