Databricks Data Engineer: Lakehouse Pipelines & PySpark

Remote, Colombia • Posted May 26, 2026

Job Type: Full-time

Category: Databases, Analytics, and BI

Application Deadline: July 05, 2026

Role Description

Design, build, and maintain end-to-end data pipelines for ingestion, transformation, and delivery of large‑scale data.
Develop and optimize data processing logic using PySpark on Databricks (Apache Spark). 
Implement ETL/ELT pipelines integrating data from multiple structured and semi‑structured sources. 
Contribute to the design and implementation of lakehouse architectures (Delta Lake, Medallion architecture). 
Ensure data quality, reliability, performance, and observability across pipelines. 
Optimize Spark jobs through partitioning, caching, and performance tuning techniques. 
Collaborate with data architects, analysts, and business stakeholders to translate requirements into scalable data solutions. 
Implement best practices in CI/CD, version control, and pipeline automation. 
Support the evolution of modern data platforms and analytics capabilities. 
Work with o...
                

Company: Perficient