Work alongside the Applebot team to optimize crawling for cutting-edge models.
Work closely with product teams to build production-grade solutions and launch models serving millions of customers in real time.
Build tools to understand bottlenecks in crawling across different hardware and use cases.
Guide engineers in the organization.
BS in Computer Science or a similar field.
Demonstrated experience leading and driving complex, ambiguous projects.
Experience with high-throughput services, particularly at supercomputing scale.
Proficient in running applications in the cloud (AWS, Azure, or equivalent) using Kubernetes and Docker.
Experience with LLMs, multimodal models, or ML.
Familiar with GPU programming concepts using CUDA and with popular machine learning frameworks like PyTorch or TensorFlow.
MS or PhD preferred.
Proficient in building and maintaining systems written in modern languages (e.g., Go, Python).
Familiar with fundamental deep learning architectures such as Transformer models and encoder/decoder models.
Familiar with NVIDIA TensorRT-LLM, vLLM, DeepSpeed, and NVIDIA Triton Inference Server.
Experience writing custom GPU kernels using CUDA or OpenAI Triton.