C

Senior Systems Engineer - Ai Infrastructure

Clockwork.Io | Palo Alto, Aguascalientes, Mexico

Date listed

3 weeks ago

Employment Type

Full time

Found on:

Responsive image CNCF

Keywords: ancestry ai

About Clockwork Systems Clockwork.io – Software Driven Fabrics to increase GPU cluster utilization

Clockwork Systems was founded by Stanford researchers and veteran systems engineers who share a vision for redefining the foundations of distributed computing. As AI workloads grow increasingly complex, traditional infrastructure struggles to meet the demands of performance, reliability, and precise coordination. Clockwork is pioneering a software-driven approach to AI fabrics by delivering cross-stack observability to catch and quickly resolve problems, workload fault tolerance to keep jobs running through failures, and performance acceleration that dynamically routes and paces traffic to avoid congestion.

To learn more, visit www.clockwork.io.

About the Role We're building infrastructure for fault-tolerant, high-performance distributed GPU training. You'll work at the intersection of GPU systems, high-speed networking, and distributed coordination—designing and implementing systems that run at scale.

This is a systems building role. You'll dig into internals, understand why things break under pressure, and design solutions that handle the messy reality of distributed systems.

Clockwork Systems is an equal opportunity employer. We are committed to building world-class teams by welcoming bright, passionate individuals from all backgrounds. All qualified applicants will receive consideration for employment without regard to race, color, ancestry, religion, age, sex, sexual orientation, gender identity or expression, national origin, disability, or protected veteran status. We believe diversity drives innovation, and we grow stronger together.

Findwork Copyright © 2023

Newsletter


Let's simplify your job search. Receive your tailored set of opportunities today.

Subscribe to our Jobs