At CodeValue we specialize in delivering cutting-edge software solutions and driving innovation across industries. We are now looking for a Data Engineer to join our team and play a key role in building and optimizing large-scale Big Data systems in production environments.
Key Responsibilities
- Design implement and maintain Big Data pipelines in production.
- Work extensively with Apache Spark (2.x and above) focusing on complex joins shuffle optimization and performance improvements at scale.
- Integrate Spark with relational databases NoSQL systems cloud storage and streaming platforms.
- Contribute to system architecture and ensure scalability reliability and efficiency in data processing workflows.
Requirements:
- Proven hands-on experience as a Data Engineer in production Big Data environments.
- Hands-on experience in Python development is required
- Expertise in Apache Spark including advanced performance optimization and troubleshooting.
- Practical experience with complex joins shuffle optimization and large-scale performance improvements.
- Familiarity with relational and NoSQL databases cloud data storage and streaming platforms.
- Strong understanding of distributed computing principles and Big Data architecture patterns.
At CodeValue we specialize in delivering cutting-edge software solutions and driving innovation across industries. We are now looking for a Data Engineer to join our team and play a key role in building and optimizing large-scale Big Data systems in production environments.Key ResponsibilitiesDesign...
At CodeValue we specialize in delivering cutting-edge software solutions and driving innovation across industries. We are now looking for a Data Engineer to join our team and play a key role in building and optimizing large-scale Big Data systems in production environments.
Key Responsibilities
- Design implement and maintain Big Data pipelines in production.
- Work extensively with Apache Spark (2.x and above) focusing on complex joins shuffle optimization and performance improvements at scale.
- Integrate Spark with relational databases NoSQL systems cloud storage and streaming platforms.
- Contribute to system architecture and ensure scalability reliability and efficiency in data processing workflows.
Requirements:
- Proven hands-on experience as a Data Engineer in production Big Data environments.
- Hands-on experience in Python development is required
- Expertise in Apache Spark including advanced performance optimization and troubleshooting.
- Practical experience with complex joins shuffle optimization and large-scale performance improvements.
- Familiarity with relational and NoSQL databases cloud data storage and streaming platforms.
- Strong understanding of distributed computing principles and Big Data architecture patterns.
View more
View less