Vacancy Description
Job Description
Your Role:
- Design, build, and maintain data pipelines and ETL processes using Databricks and Apache Spark.
- Optimize data workflows for performance, scalability, and cost efficiency.
- Implement data Lakehouse architecture and manage data ingestion from multiple sources.
- Collaborate with data scientists and analysts to enable advanced analytics and machine learning workloads.
- Ensure data quality, governance, and security across all data assets.
- Monitor and troubleshoot Databricks clusters, jobs, and workflows.
- Integrate Databricks with cloud services (AWS, Azure, or GCP) and other enterprise systems.
- Document processes, standards, and best practices for data engineering.
Your Profile:
- Hands‑on experience with Databricks, Apache Spark, and PySpark.
- Strong knowledge of SQL, Python, and data modeling principles.
- Experience w...
Ready to Apply?
अभी आवेदन करें
Submit your application for Lead Data Engineer at Capgemini
Apply for this Position