Job Title: Senior Python Data Engineer (PySpark)
Location: McLean VA (Onsite Face-to-Face Interview Required)
Type: Contract
Job Description:
We are looking for a Senior Python Data Engineer with strong expertise in PySpark event-based systems and AWS cloud services to join our team onsite in McLean VA. The ideal candidate will bring deep technical skills in building scalable data pipelines microservices and real-time streaming solutions.
Responsibilities:
- Design build and optimize large-scale data pipelines using Python and PySpark.
- Develop event-driven and streaming solutions leveraging Kafka.
- Build and maintain microservices-based data processing systems.
- Work with AWS cloud services (EC2 EKS S3 AIM VPC) to deploy and scale data engineering solutions.
- Ensure reliability scalability and performance of data infrastructure.
- Implement CI/CD pipelines using Jenkins Git and Flyway for version control and database migration.
- Collaborate with data scientists product owners and architects to deliver data solutions that support business needs.
- Work with both RDBMS and NoSQL databases for structured and unstructured data storage.
Required Skills & Experience:
- 8 years of professional experience in Python development (senior-level).
- Strong hands-on experience in Data Engineering and building data pipelines.
- Expertise in PySpark and distributed data processing.
- Experience in event-based systems and streaming (Kafka).
- Strong knowledge of AWS services: EC2 EKS S3 AIM VPC.
- Proficiency with CI/CD tools: Flyway Jenkins Git.
- Experience with microservices development.
- Hands-on with both RDBMS (e.g. PostgreSQL Oracle MySQL) and NoSQL (e.g. MongoDB DynamoDB).
- Strong problem-solving skills and ability to work onsite with cross-functional teams.
Nice to Have:
- Knowledge of containerization (Docker Kubernetes).
- Experience with data security and compliance practices in financial services.
- Exposure to Agile/SAFe development methodologies.
If interested please share resume to