Date listed
3 weeks agoEmployment Type
Full timeFound on:
Goal: 99.99% uptime
We serve custom inference stacks that have irregular GPU load.
We're looking for people that have done genuinely amazing work in infrastructure that are interested in a challenge, working with both traditional infrastructure such as load balancers, NLB, etc., as well as very different infrastructure around inference engines and GPU loads.
This is a role that will inherently require deep experience with inference engines.
Contributions to vLLM, SGLang, trtllm, or inference frameworks a plus
Newsletter
Let's simplify your job search. Receive your tailored set of opportunities today.
Subscribe to our Jobs