Position Details: Databricks Database Developers (267)
Description:
Key Responsibilities:
- Develop and optimize data pipelines in Databricks for transforming and processing data from various sources.
- Integrate data using Unity Catalog and external data sources (data lakes, APIs, etc.).
- Write Spark SQL and PySpark scripts for data transformations, optimizations, and creating views/procedures.
- Perform data analysis to identify quality issues, optimize pipelines, and enhance data processing for analytics.
- Collaborate on report generation and dashboard creation with front-end teams.
- Use GitLab for version control and CI/CD automation, and Jira for task management.
Required Skills:
- Strong experience with Databricks, Spark SQL, PySpark, and SQL.
- Expertise in creating and optimizing views and stored procedures in Databricks.
- Experience building ETL workflows and data models.
- Knowledge of cloud platforms (AWS, Azure) and version control tools (Git, GitLab).
Qualifications:
- Master’s degree (1-3 years of experience) or Bachelor’s degree (4-5 years of experience) in Data Engineering or a related field.
- 3+ years of experience with Databricks and advanced SQL.
- Experience with ETL processes, views, and procedures.
Nice-to-Have:
- Experience with healthcare or clinical trial data.
- Familiarity with DevOps practices.
Preferred Certification:
- Databricks Certified Professional.