Founding Engineer, Mlops

Sync Labs

Date listed

2 months ago

Employment Type

Full time

at sync. we train state-of-the-art controllable video generation models, and we’re looking to serve them at scale.

we’re looking for a strong founding engineer who can help us scale inference from serving 100k users to 100M+ in the next 2 years.

as an MLOps engineer, you’ll be critical in architecting, building, and maintaining the infrastructure that serves our models to our users through our app + api.

as a founding member of our engineering team, you’ll be critical in shaping our engineering culture from the ground up as we scale revenue from $1M-$10M and then $10M-$1B+

key responsibilities:

  • design, implement, and maintain the infrastructure and processes for deploying, monitoring, and updating our ML models in production environments
  • automate and optimize our ML pipeline, including data ingestion, preprocessing, model training, validation, and deployment
  • collaborate with our ML engineers and research team to ensure the smooth transition of models from development to production
  • implement best practices for ML system reliability, scalability, and maintainability, including containerization, orchestration, and automation
  • monitor and troubleshoot ML systems in production, ensuring high availability and performance
  • drive continuous improvements in system performance, cost optimization, and user experience through rigorous testing, analysis, and iteration
  • stay up-to-date with the latest MLOps best practices and technologies, and drive their adoption within our organization

required skills and experience:

  • strong experience with ML infrastructure, including deployment, monitoring, and updating models in production environments
  • deep knowledge of containerization, orchestration, and automation technologies (e.g., Docker, Kubernetes, Airflow)
  • proven track record of optimizing system performance, reducing costs, and improving reliability in an ML context
  • excellent programming skills in Python, with experience in ML frameworks such as TensorFlow or PyTorch
  • familiarity with cloud platforms (e.g., AWS, GCP) and their ML offerings
  • strong problem-solving skills and the ability to tackle complex MLOps challenges in a fast-paced startup environment
  • intensely data driven, proactive, and focused on building systems with observability baked in
  • excellent communication and collaboration skills, with the ability to work effectively with both technical and non-technical stakeholders

preferred skills and experience:

  • experience with video processing and realtime delivery pipelines
  • familiarity with Generative AI models and their deployment challenges
  • contributions to open-source MLOps projects or relevant research papers

outcomes:

  • highly efficient, reliable, and scalable ML systems that power our cutting-edge video generation platform
  • significant improvements in system throughput, cost optimization, and user experience
  • seamless integration and deployment of new models from our research team
  • establishment of a culture of automation, continuous improvement, and operational excellence within our ML infrastructure

our goal is to keep the team lean, hungry, and shipping fast. these are qualities we look for:

[1] raw intelligence

[2] boundless curiosity

[3] exceptional resolve

[4] high agency

[5] outlier hustle

[6] obsessively driven by data

Findwork Copyright © 2023

Newsletter


Let's simplify your job search. Receive your tailored set of opportunities today.

Subscribe to our Jobs