Data Engineer
Job Summary
We help the world run better
At SAP, we keep it simple: you bring your best to us, and we'll bring out the best in you. We're builders touching over 20 industries and 80% of global commerce, and we need your unique talents to help shape what's next. The work is challenging, but it matters. You'll find a place where you can be yourself, prioritize your wellbeing, and truly belong. What's in it for you? Constant learning, skill growth, great benefits, and a team that wants you to grow and succeed.
What you will build
As a Data Engineer on the Service & Support Data Lake team, you will help modernize our data engineering capabilities. We run an in-house data lake that powers AI services for SAP customer support and generates insights through project and agent mining. You will join a collaborative team evolving from isolated pipelines to a unified, intelligent platform for self-service analytics and autonomous data agents. Data governance underpins every solution we deliver, end to end.
Responsibilities
- Design and maintain scalable pipelines with clear architecture and high quality standards.
- Build real-time and batch ingestion from diverse sources with automated validation.
- Write production-grade code with strong testing discipline (unit/integration tests), code review quality, and maintainable, modular design.
- Troubleshoot complex data issues end to end, including root-cause analysis, performance bottlenecks, and reliability incidents.
- Implement anonymization and PII controls aligned with governance requirements.
- Develop metadata pipelines for schema profiling, business context extraction, and lineage tracking.
- Contribute to semantic layers that map technical fields to business terminology for self-service and natural-language analytics use cases.
- Use AI-assisted development practices to accelerate delivery and maintenance; prior autonomous-agent experience is a plus, not a requirement.
- Establish monitoring, alerting, and data quality controls to ensure secure, reliable analytical assets, and evaluate their AI/ML readiness based on data science requirements.
- Partner with the Tech Lead and Architect to convert requirements into production-ready systems.
What you bring
- Bachelor's degree or equivalent practical experience.
- 3 years of experience coding in Python (pandas, pytest) and SQL.
- 3 years of experience with Spark / Big Data processing (PySpark): transformations, partitioning, performance optimization.
- 3 years designing and deploying data pipelines, including managing data schemas and processing high-volume workflows.
- Strong software engineering fundamentals: clean code, modular design, debugging, version control, and maintainable documentation.
- Strong SQL and data modeling capabilities (normalized and denormalized patterns, data contracts, schema evolution).
- Hands-on testing and release practices: unit/integration testing, CI/CD pipelines, and safe production rollout.
- Experience in observability and operations: metrics, logging, alerting, and on-call-friendly troubleshooting.
- Experience with SQL databases (PostgreSQL preferred, others acceptable) and NoSQL databases (Elasticsearch and Delta Lake required).
- Experience with real-time and batch ingestion (APIs - polling & push, Kafka streaming).
- Experience with workflow orchestration (Kubeflow Pipelines, Airflow, Prefect, or Dagster).
- Experience with data governance: data redaction, anonymization, and PII handling.
- Proficiency in Git workflows: branching strategies, code review, CI/CD integration.
Preferred Qualifications
- 5 years designing enterprise-scale data platforms and analytics infrastructure.
- Familiarity with LLM-enabled data applications, including RAG, embeddings, vector search, and evaluation from a data platform perspective.
- Experience building data services and data APIs that support AI-related applications and analytics products.
- Experience productionizing data and feature pipelines that support machine learning and intelligent applications.
- Strong interest in AI-native data engineering and agentic data workflows, including orchestration, tool integration, evaluation, and workflow automation.
- Understanding of MLOps/LLMOps principles to ensure scalable and reliable deployment of text processing and redaction pipelines.
- Experience with monorepo or shared library architecture patterns.
- Ability to operate across ambiguity and influence cross-functional technical decisions.
- Demonstrated learning agility in adopting emerging data concepts (for example, data agents and semantic layer patterns).
What You'll Get
Technical Growth
- Build an end-to-end data platform from consolidation to metadata intelligence to AI agents.
- Deepen expertise in metadata pipelines, semantic layers, and data-agent foundations.
- Master data quality frameworks with testing patterns, quality gates, and large-scale validation.
Team & Impact
- Join as a foundation team member whose work shapes platform direction.
- Collaborate directly with the Tech Lead and Data Scientists.
- Enable next-generation capabilities: self-service analytics, intelligent agents, and automated insights.
Long-Term Vision
- Grow at the intersection of data engineering and AI agents.
- Help evolve the platform from raw data to intelligent, autonomous systems.
- Drive scalable impact across datasets, pipelines, and downstream analytics.
Where you belong
Culture:
- Shared ownership model: any engineer can maintain any pipeline
- AI-augmented workflows: tools like Claude Code support migration and scaffolding
- Quality-first: 100% branch coverage, automated quality gates enforced
- Collaborative learning: show & tell demos, pattern documentation, peer reviews
Bring out your best
SAP innovations help more than four hundred thousand customers worldwide work together more efficiently and use business insight more effectively. Originally known for leadership in enterprise resource planning (ERP) software, SAP has evolved to become a market leader in end-to-end business application software and related services for database, analytics, intelligent technologies, and experience management. As a cloud company with two hundred million users and more than one hundred thousand employees worldwide, we are purpose-driven and future-focused, with a highly collaborative team ethic and commitment to personal development. Whether connecting global industries, people, or platforms, we help ensure every challenge gets the solution it deserves. At SAP, you can bring out your best.
We win with inclusion
SAP's culture of inclusion, focus on health and well-being, and flexible working models help ensure that everyone, regardless of background, feels included and can run at their best. At SAP, we believe we are made stronger by the unique capabilities and qualities that each person brings to our company, and we invest in our employees to inspire confidence and help everyone realize their full potential. We ultimately believe in unleashing all talent and creating a better world.
SAP is committed to the values of Equal Employment Opportunity and provides accessibility accommodations to applicants with physical and/or mental disabilities. If you are interested in applying for employment with SAP and are in need of accommodation or special assistance to navigate our website or to complete your application, please send an e-mail with your request to Recruiting Operations Team:
For SAP employees: Only permanent roles are eligible for the SAP Employee Referral Program according to the eligibility rules set in the SAP Referral Policy. Specific conditions may apply for roles in Vocational Training.
Qualified applicants will receive consideration for employment without regard to their age, race, religion, national origin, ethnicity, gender (including pregnancy, childbirth, et al), sexual orientation, gender identity or expression, protected veteran status, or disability, in compliance with applicable federal, state, and local legal requirements.
Successful candidates might be required to undergo a background verification with an external vendor.
AI Usage in the Recruitment Process
For information on the responsible use of AI in our recruitment process, please refer to our Guidelines for Ethical Usage of AI in the Recruiting Process.
Please note that any violation of these guidelines may result in disqualification from the hiring process.
Requisition ID: 455503 | Work Area: Software-Design and Development | Expected Travel: 0 - 10% | Career Status: Professional | Employment Type: Regular Full Time | Additional Locations: #LI-Hybrid
Required Experience:
IC
About Company
SAP started in 1972 as a team of five colleagues with a desire to do something new. Together, they changed enterprise software and reinvented how business was done. Today, as a market leader in enterprise application software, we remain true to our roots. That’s why we engineer soluti ...