About the company
IMC is a leading trading firm, known worldwide for our advanced, low-latency technology and world-class execution capabilities. Over the past 30 years, weāve been a stabilizing force in the financial markets ā providing the essential liquidity our counterparties depend on. Across offices in the US, Europe, and Asia Pacific, our talented employees are united by our entrepreneurial spirit, exceptional culture, and commitment to giving back. It's a strong foundation that allows us to grow and add new capabilities, year after year. From entering dynamic new markets, to developing a state-of-the-art research environment and diversifying our trading strategies, we dare to imagine what could be and work together to make it happen.
Job Summary
Your Core Responsibilities:
šLeverage your knowledge of GPU architecture & libraries to optimise model structure, library use and software-hardware integration šDeliver extensions, fixes and improvements on top of base CUDNN & TensorRT to support performant evaluation šBuild C++ and CUDA based performance enhancement libraries for common deep learning libraries like Pytorch and JAX
Your Skills and Experience:
šMS degree in CS or similar fields or equivalent experience š3+ years of relevant work experience šExperienced in fundamental libraries for accelerating ML workflows, like CUDNN/TensorRT, ROCm, OpenVino or OpenPPL.(understanding of one or more ML communication frameworks like NCCL, is an advantage) šBackground in deep learning fundamentals and common deep learning software, especially PyTorch šExperienced in C++ & CUDA. Experience in Python is also highly desirable