We are looking for 3 Software Development Engineers (Java) to join a managed service engagement supporting a large-scale data and analytics ecosystem. The team will contribute to an open-source table format project widely used in distributed data processing systems, with the goal of improving interoperability with a major cloud-based data lake service. The initial scope includes enhancing stream processing engine support for the latest table format specification, improving compatibility between cloud-native lakehouse capabilities and open-source tooling, addressing authentication inconsistencies in OSS integrations, implementing already-approved specification features, and closing functional gaps between specification and tooling behavior.
Tasks
- Contribute to open-source development related to distributed data table format specifications and ecosystem tooling;
- Improve stream processing engine support to align with the latest table format specification version;
- Enhance compatibility between cloud-based data lake services and open-source table format tooling;
- Identify and address authentication and authorization gaps in OSS integrations, especially those related to cloud identity mechanisms;
- Implement community-approved specification features that are not yet available in current implementations;
- Analyze and close behavioral gaps between formal specifications and real-world tooling implementations;
- Collaborate with distributed teams and contribute code, design discussions, and reviews within the open-source community;
- Ensure production-grade quality, maintainability, and performance of contributed components.
Requirements
- 3 years of experience as a Software Development Engineer with strong Java expertise;
- Solid experience working with distributed data systems and large-scale data processing frameworks;
- Experience with GCP and its services;
- Hands-on experience with Apache Iceberg and/or Apache Flink;
- Strong understanding of cloud environments and authentication mechanisms in distributed systems;
- Strong problem-solving skills and ability to work independently in a distributed team setup;
- Good written and verbal communication skills in English.
Nice to have:
- Experience with cloud-based data lake services and lakehouse architectures;
- Familiarity with catalog federation concepts and metadata management systems;
- Understanding of specification-driven development and standards alignment;
- Previous experience working in managed service or vendor-based delivery models;
- Experience optimizing performance and scalability of distributed data processing systems.
Benefits
Work conditions:
- Overlap until 10 AM PST;
- B2B contract;
- Remote/Hybrid work format;
- Company benefits.
Locations: Serbia, Poland, Croatia, Portugal, Kazakhstan, Armenia, Georgia, Egypt.