Site Reliability Engineer

Artifakt | France

Date listed

1 month ago

Remote

Yes

Employees

11-50

Total Funding

$68.7 billion

Our Engineering team build and run a scalable and modular PaaS backed by modern and promising technologies. What else? They genuinely seek what is best for their peers and are proud to allow developers around the world to fast-track cloud deployments without a headache.

As an Artifakt Site Reliability Engineer, your core mission is to manage, create, and improve our systems and platforms to make them highly reliable, performant, secure, and scalable.

What You’ll Do:

Create the SRE activity

  • Ensure application and platforms meet our availability, performance, and reliability SLA;
  • Contribute to improve quality, remove regressions and increase performance in cooperation with the Software Engineering team;
  • Build the observability stack (monitoring, logging...) in cooperation with the Cloud Engineering team;
  • Extract relevant insights from the platform through metrics/logs/event exploitation, dashboard creation and alerts optimization;
  • Analyze performance dashboard based on metrics and event collected in the infrastructure to improve performance.

Security strategy

  • Review and audit application code and platforms;
  • Monitor, track, and prevent vulnerabilities;
  • Ensure platforms deployed are up to date and secure.

Exponential scaling

  • Be ready to scale to thousands of users and platforms;
  • Automate updates, security, and compliance of official runtimes.

Quality of excellence

  • Build the SRE team and spread expertise across Artifakt;
  • Organize security audit(s) and certification(s) in cooperation with external partners.

You're a great fit for this role if:

You have +5 years of experience in software engineering with a proven practice as DevOps or SRE.

You show both leadership mindset and hands-on expertise to build activities from scratch and run it!

You are used to develop applications and you have advanced knowledge on guaranteed quality, performance, and reliability in a cloud environment.

You already secured a complete platform and monitored security events to be able to continuously improve security and reach market security standards (SOC2)

Your technical expertise/experience/practice also cover:

  • Manage Kubernetes Cluster (CKA is a plus)
  • Deploy, manage and monitor any types of middleware (database, cache, queuemessaging system, Elasticsearch...)
  • Design, implement and automate any architecture on any cloud provider with cloud API or automation tools (Terraform)
  • Design and implement a full monitoring and logging system based on Prometheus, Loki, Grafana, Fluentd, ES

Bonus if you are familiar with:

  • 12 factors app and container compliant code
  • Event-based architecture and cloud event pattern
  • Distributed and asynchronous system
  • GitOps
  • Kubernetes Operator design pattern
  • Go and NodeJS

Findwork Copyright © 2021

Newsletter


Let's simplify your job search. Receive your tailored set of opportunities today.

Subscribe to our Jobs