CUDA Developer
Oriole is seeking a talented CUDA Developer to help co-optimize our AI/ML software stack with cutting-edge network hardware. You’ll be a key contributor to a high-impact, agile team focused on integrating middleware communication libraries and modelling the performance of large-scale AI/ML workloads.
Key Responsibilities:
- Design and optimize custom GPU communication kernels to enhance performance and scalability across multi-node environments.
- Develop and maintain distributed communication frameworks for large-scale deep learning models, ensuring efficient parallelization and optimal resource utilization.
- Profile, benchmark, and debug GPU applications to identify and resolve bottlenecks in communication and computation pipelines.
- Collaborate closely with hardware and software teams to integrate optimized kernels with Oriole’s next-generation network hardware and software stack.
- Contribute to system-level architecture decisions for large-scale GPU clusters, with a focus on communication efficiency, fault tolerance, and novel architectures for advanced optical network infrastructure.
Required Skills & Experience:
- Proficient in C++ and Python, with a strong track record in high-performance computing or machine learning projects.
- Expertise in GPU programming with CUDA, including deep knowledge of GPU memory hierarchies and kernel optimization.
- Hands-on experience debugging GPU kernels using tools such as cuda-gdb, CUDA-MEMCHECK, and Nsight Systems, and reading PTX and SASS.
- Strong understanding of communication libraries and protocols, including NCCL, NVSHMEM, OpenMPI, UCX, or custom collective communication implementations.
- Familiarity with HPC networking protocols/libraries such as RoCE, InfiniBand, libibverbs, and libfabric.
- Experience with distributed deep learning/MoE frameworks, including PyTorch Distributed, vLLM, or DeepEP.
- Solid understanding of deploying and optimizing large-scale distributed deep learning workloads in production environments, including Linux, Kubernetes, SLURM, OpenMPI, GPU drivers, Docker, and CI/CD automation.