Develop a prototype Proof of Concept (POC) for a specific computer vision and machine learning problem to validate an idea;
Collaborate with the team to follow software processes and standards;
Enhance and create various computer vision and machine learning services for production use;
Identify and address performance and scalability issues.
What we expect from you:
A minimum of 5 years of experience in GPU programming and optimization using CUDA or OpenCL;
Proven experience in GPU parallelization and optimization for at least 3 real-world workloads, each with thousands or more lines of GPU kernel code; not limited to simple tasks like matrix multiplication;
A deep understanding of GPU architecture and familiarity with commonly known optimization techniques;
Demonstrated experience in writing unit tests and establishing a CI (Continuous Integration) pipeline;
Previous experience in contributing to large-scale C++ software projects.
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.