Data Engineer - Python, AI

Pune, India • Posted July 01, 2026

Job Type: Full-time
Location: Pune, India
Posted: July 01, 2026
Application Deadline: July 06, 2026

Role Description

Role Summary
We are looking for a mid-level Python developer with experience in both data engineering and AI/NLP engineering. The candidate will build NLP pipelines using Flair, BERT-based models, and LLM frameworks, and will work on large-scale data processing with PySpark, Pandas, and related data tools. The role also includes developing APIs, integrating with platform services, and supporting CI/CD deployments using GitHub and LightSpeed Enterprise.

**Key Responsibilities**

+ Develop and optimize ETL/data processing jobs using PySpark, Pandas, PyArrow, and related libraries.
+ Build and maintain NLP pipelines using Flair, BERT, and LLM-based models.
+ Develop scalable ingestion and data transformation pipelines for AI and analytics use cases.
+ Build and maintain Flask-based APIs for model inference and service integrations.
+ Use regular expressions for text cleaning, parsing, and NLP preprocessing.
+ Integrate caching and fast lookups ...

Employer: Citigroup