Machine Learning Engineer

ConnexAI • bolton, greater manchester • Posted June 11, 2026

About the Role

Build Low Latency Conversational AI Systems

We are building real-time conversational AI systems built on top of large language models, speech AI, and agentic workflows. Our platform combines ASR, LLMs, and TTS into production-grade AI systems used globally across enterprise environments where latency, reliability, and scalability matter.


We are hiring a Machine Learning Engineer to build low-latency production systems for our LLM team. This role is centred around writing scalable code that enables real-time conversational AI to perform reliably under heavy production workloads.


You’ll work closely with our LLM and speech teams to solve challenges around inference speed, concurrency, request handling, GPU performance, distributed systems, and real-time response streaming.


What you’ll do

  • Build and optimise low-latency LLM systems for real-time conversational AI
  • Write production-grade Py...