We're building the inference and orchestration layer for production Vision-Language Models. We care deeply about fast and ergonomic visual inference, reliable structured outputs, and the observability to iterate on them.
A few things we've shipped recently you can poke at:
1. Orion: our visual agent that reasons and acts over images, video, and documents. Chat at https://chat.vlm.run.
2. mm-ctx: a Unix-style multimodal CLI (find, cat, grep, wc) that gives coding agents real context over images, video, and PDFs. Rust core, Python devex.
3. vlmbench: single-file CLI for benchmarking VLM inference (TTFT, TPOT, throughput) across vLLM, Ollama, and SGLang.
Apply: https://app.dover.com/jobs/vlm-run1x Infrastructure Engineer + 2x AI/ML Engineer , VLM Run
Santa Clara, CA (HQ)1 month ago
1x Infrastructure Engineer + 2x AI/ML Engineer , VLM Run
Santa Clara, CA (HQ)1 month ago
Infrastructure Engineer + DevRel + AI/ML Engineer , VLM Run
Santa Clara, CA (HQ)2 months ago
Newsletter
Let's simplify your job search. Receive your tailored set of opportunities today.
Subscribe to our Jobs