Are you interested in advancing Amazons Generative AI capabilities Come work with a talented team of engineers and scientists in a highly collaborative and friendly team. We are building stateoftheart Generative AI technology that will benefit all Amazon businesses and customers.
Key job responsibilities As a Software Development Engineer you will be responsible for designing developing testing and deploying high performance inference capabilities including but not limited to multimodality SOTA model architectures latency throughput and cost. You will collaborate closely with a team of engineers and scientists to influence our overall strategy and define the teams roadmap. You will drive system architecture spearhead best practices and mentor junior engineers.
A day in the life You will read papers and consult with scientists to get inspiration of emerging techniques and blend those into our roadmap; You will design and experiment with new algorithms benchmark the latency and accuracy of your implementations; Most importantly you will implement production grade solutions and see them through the deployments swiftly; You may need to collaborate with other science and engineering teams to get things done properly; You will hold highest bar in operational excellence and support production systems and constantly create solutions to minimize the ops load.
About the team Our mission is to build bestinclass fast accurate and costefficient large language model inference solutions and infrastructure that will enable Amazon businesses to deliver more value to their customers.
3 years of noninternship professional software development experience 2 years of noninternship design or architecture (design patterns reliability and scaling) of new and existing systems experience Experience programming with at least one software programming language Prior experience with software performance optimization Or Knowledge of Machine Learning and Deep Learning
3 years of full software development life cycle including coding standards code reviews source control management build processes testing and operations experience Bachelors degree in computer science or equivalent Experience with Large Language Model inference Experience with Trainium and Inferentia Development Experience with GPU programming (TensorRTLLM) Experience with Python PyTorch and C programming and performance optimization
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status disability or other legally protected status.
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.