Duties & Accountabilities
- Responsible for data on-boarding and custom integration work.
- A solid knowledge of AWS infrastructure is a must in order to build different connectors such as FTP, API or
JDBC integrations.
- Making the data available in the data lake through AWS Glue, AppFlow and Lake Formation is also a responsibility,
as is writing unit/data tests and monitoring the quality of the overall on-boarding process.
Requirements
- Experience building data pipelines in Python (experience with PySpark is a plus).
- Understanding of AWS Cloud fundamentals (AWS certification is advised).
- Solid knowledge of infrastructure as code (AWS CDK).
- Git and CI/CD knowledge is a must.
- Experience with common Python data libraries (pandas, awswrangler, etc.).
- Understands REST APIs from the consumer perspective.
- ML knowledge is a plus (Kubeflow).
- Open to working in EMEA time zones.