- Design, build, and maintain scalable data pipelines using PySpark and Databricks
- Optimize data processing and storage for maximum performance and efficiency
- Troubleshoot and debug data-related issues, and implement solutions to prevent reoccurrence
- Collaborate with data scientists, software engineers, and other stakeholders to ensure that data solutions are aligned with business goals