Machine Learning Engineer
This is a unique opportunity to solve problems at the intersection of advanced machine learning and high-frequency trading, where your contributions will shape the future of IMC’s technology and trading capabilities.
About Us
IMC is a leading trading firm, known worldwide for our advanced, low-latency technology and world-class execution capabilities. Over the past 30 years, we’ve been a stabilizing force in the financial markets – providing the essential liquidity our counterparties depend on. Across offices in the US, Europe, and Asia Pacific, our talented employees are united by our entrepreneurial spirit, exceptional culture, and commitment to giving back. It's a strong foundation that allows us to grow and add new capabilities, year after year. From entering dynamic new markets, to developing a state-of-the-art research environment and diversifying our trading strategies, we dare to imagine what could be and work together to make it happen.
Your Core Responsibilities:
- Develop large-scale distributed training pipelines to manage datasets and complex models
- Build and optimize low-latency inference pipelines, ensuring models deliver real-time predictions in production systems
- Develop libraries to improve the performance of machine learning frameworks
- Maximize performance in training and inference using GPU hardware and acceleration libraries
- Design scalable model frameworks capable of handling high-volume trading data and delivering real-time, high-accuracy predictions
- Collaborate with quantitative researchers to automate ML experiments, hyperparameter tuning, and model retraining
- Partner with HPC specialists to optimize workflows, improve training speed, and reduce costs
- Evaluate and roll out third-party tools to enhance model development, training, and inference capabilities
- Dig into the internals of open-source ML tools to extend their capabilities and improve performance
Your Skills and Experience:
- 5+ years of experience in machine learning with a focus on training and inference systems
- Hands-on experience with real-time, low-latency ML pipelines in high-performance environments is a strong plus
- Strong engineering skills, including Python, CUDA, or C++
- Knowledge of machine learning frameworks such as PyTorch, TensorFlow, or JAX
- Proficiency in GPU programming for training and inference acceleration (e.g., CuDNN, TensorRT)
- Experience with distributed training for scaling ML workloads (e.g., Horovod, NCCL)
- Exposure to cloud platforms and orchestration tools
- A track record of contributing to open-source projects in machine learning, data science, or distributed systems is a plus