Databricks Data Engineer: Lakehouse Pipelines & PySpark
Remote, Colombia • Posted May 26, 2026
Job Type:
Full-time
Location:
Remote
Posted:
May 26, 2026
Category:
Databases, Analytics, and BI
Application Deadline:
July 05, 2026
Role Description
- Design, build, and maintain end-to-end data pipelines for ingestion, transformation, and delivery of large‑scale data.
- Develop and optimize data processing logic using PySpark on Databricks (Apache Spark).
- Implement ETL/ELT pipelines integrating data from multiple structured and semi‑structured sources.
- Contribute to the design and implementation of lakehouse architectures (Delta Lake, Medallion architecture).
- Ensure data quality, reliability, performance, and observability across pipelines.
- Optimize Spark jobs through partitioning, caching, and performance tuning techniques.
- Collaborate with data architects, analysts, and business stakeholders to translate requirements into scalable data solutions.
- Implement best practices in CI/CD, version control, and pipeline automation.
- Support the evolution of modern data platforms and analytics capabilities.
- Work with o...
This position is with Perficient.