We are looking for engineers who have strong coding skills and computer science foundation with passion for building resilient and highly performant distributed systems. As a software engineer in AiDP reliability engineering you will work on one or many projects related to GenAI ML Inference and Big data platform. You will:- Build enhance and maintain multi-tenant systems employing diverse technologies- Collaborate with multi-functional teams to deliver impactful customer features- Lead projects through full lifecycle from design discussions to release delivery- Operate scale and optimize high-throughput and highly concurrent services- Diagnose resolve and prevent production and operational challengesWe are looking for enthusiastic engineers with interest in one of the following areas:- ML Engineers- Big Data Engineer- Platform Reliability Engineer
Bachelors Degree in Computer Science Computer Engineering or equivalent technical degree
Proficient programming knowledge in one of the following areas: Python Java or Go Programming and ability to read and explain open source codebase
Good foundation of Operating Systems Networking and Security Principles
Relevant Internship experience
Excellent analytical & problem solving skills
Exposure to Model Training or Fine Tuning methodologies
Exposure to Spark/Flink/Trino/Iceberg and other modern cloud native big data technologies
Exposure to Kubernetes and other cloud native technologies like Flux/Argo CD
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.