We are looking for 3 Software Development Engineers (Java) to join a managed service engagement supporting a large-scale data and analytics ecosystem. The team will contribute to an open-source table format project widely used in distributed data processing systems with the goal of improving interoperability with a major cloud-based data lake service. The initial scope includes enhancing stream processing engine support for the latest table format specification improving compatibility between cloud-native lakehouse capabilities and open-source tooling addressing authentication inconsistencies in OSS integrations implementing already-approved specification features and closing functional gaps between specification and tooling behavior.
Tasks
Contribute to open-source development related to distributed data table format specifications and ecosystem tooling;
Improve stream processing engine support to align with the latest table format specification version;
Enhance compatibility between cloud-based data lake services and open-source table format tooling;
Identify and address authentication and authorization gaps in OSS integrations especially related to cloud identity mechanisms;
Implement community-approved specification features that are not yet available in current implementations;
Analyze and close behavioral gaps between formal specifications and real-world tooling implementations;
Collaborate with distributed teams and contribute code design discussions and reviews within the open-source community;
Ensure production-grade quality maintainability and performance of contributed components.
Requirements
3 years of experience as a Software Development Engineer with strong Java expertise;
Solid experience working with distributed data systems and large-scale data processing frameworks;
Experinence with GCP and its services;
Hands-on experience with Apache Iceberg AND/OR Apache Flink;
Strong understanding of cloud environments and authentication mechanisms in distributed systems;
Strong problem-solving skills and ability to work independently in a distributed team setup;
Good written and verbal communication skills in English.
Nice to have:
Experience with cloud-based data lake services and lakehouse architectures;
Familiarity with catalog federation concepts and metadata management systems;
Understanding of specification-driven development and standards alignment;
Previous experience working in managed service or vendor-based delivery models;
Experience optimizing performance and scalability of distributed data processing systems.
Benefits
Work conditions:
Overlap until 10 AM PST;
B2B contact
Remote/Hybrid work format
Company benefits
Locations: Serbia Poland Croatia Portugal Kazakhstan Armenia Georgia Egypt.
We are looking for 3 Software Development Engineers (Java) to join a managed service engagement supporting a large-scale data and analytics ecosystem. The team will contribute to an open-source table format project widely used in distributed data processing systems with the goal of improving interop...
We are looking for 3 Software Development Engineers (Java) to join a managed service engagement supporting a large-scale data and analytics ecosystem. The team will contribute to an open-source table format project widely used in distributed data processing systems with the goal of improving interoperability with a major cloud-based data lake service. The initial scope includes enhancing stream processing engine support for the latest table format specification improving compatibility between cloud-native lakehouse capabilities and open-source tooling addressing authentication inconsistencies in OSS integrations implementing already-approved specification features and closing functional gaps between specification and tooling behavior.
Tasks
Contribute to open-source development related to distributed data table format specifications and ecosystem tooling;
Improve stream processing engine support to align with the latest table format specification version;
Enhance compatibility between cloud-based data lake services and open-source table format tooling;
Identify and address authentication and authorization gaps in OSS integrations especially related to cloud identity mechanisms;
Implement community-approved specification features that are not yet available in current implementations;
Analyze and close behavioral gaps between formal specifications and real-world tooling implementations;
Collaborate with distributed teams and contribute code design discussions and reviews within the open-source community;
Ensure production-grade quality maintainability and performance of contributed components.
Requirements
3 years of experience as a Software Development Engineer with strong Java expertise;
Solid experience working with distributed data systems and large-scale data processing frameworks;
Experinence with GCP and its services;
Hands-on experience with Apache Iceberg AND/OR Apache Flink;
Strong understanding of cloud environments and authentication mechanisms in distributed systems;
Strong problem-solving skills and ability to work independently in a distributed team setup;
Good written and verbal communication skills in English.
Nice to have:
Experience with cloud-based data lake services and lakehouse architectures;
Familiarity with catalog federation concepts and metadata management systems;
Understanding of specification-driven development and standards alignment;
Previous experience working in managed service or vendor-based delivery models;
Experience optimizing performance and scalability of distributed data processing systems.
Benefits
Work conditions:
Overlap until 10 AM PST;
B2B contact
Remote/Hybrid work format
Company benefits
Locations: Serbia Poland Croatia Portugal Kazakhstan Armenia Georgia Egypt.