About Us Be part of an exciting, well-funded startup changing the world of retail and beyond. RADAR’s mission is to revolutionize customer experience in retail through precise identification of inventory in the stores and distribution centers, completely transforming the in-store experience for employees and customers alike. RADAR's proprietary hardware and software platform combines RFID, Computer Vision and AI to provide hyper-precise, real-time location of every product and person in physical stores. This unprecedented capability enables some of the world’s top retailers to automate in-store inventory management, analytics, and checkout...and this is just the beginning of what we hope to accomplish together.
About the Team The mission of the cloud team at RADAR is to build the plumbing of our cloud infrastructure and the cloud services that run on it, with simple and fast deployment to production. The team's work also covers large-scale system design, networking, security, configuration, automatic VM orchestration, and many more areas.
About the Role As a cloud Site Reliability Engineer, you will be involved with our fast-paced releases and collaborate closely with the application development team. The role requires hands-on participation and a deep understanding of cloud-related technologies, management platforms, and networking.
Responsibilities:
Run the production environment by monitoring availability and taking a holistic view of system health, and correct any issues affecting low-latency, high-performance, scalable systems in a polyglot architecture.
Lead capacity planning, automate server capacity monitoring and scaling, and establish best practices for metrics gathering, monitoring, and alarming.
Provide tooling to monitor and resolve any issues with persistent data stores in the system, along with basic data administration and optimization for the data pipeline.
Evangelize high engineering standards and best practices across multiple areas.
Move quickly and intelligently, treating technical debt as your nemesis.
Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement.
Provide primary operational support and engineering for multiple large-scale distributed software applications.
Follow key SRE practices: preventive measures against failures, availability, performance, monitoring, alerting, and incident response.
Document “tribal knowledge”.
Conduct post-incident reviews and follow through on corrections.
Improve operational processes (such as deployments and upgrades) to make them as boring as possible.
Debug production issues across services and levels of the stack.
About You
Requirements:
Bachelor’s degree, or equivalent experience, in Engineering, Computer Science, or a related field.
7+ years of professional experience in DevOps / SRE handling production procedures, and a certification with a major cloud provider such as GCP, AWS, or Azure.
In-depth experience with Docker Compose / Docker Swarm, Kubernetes cluster deployment, cluster design, sizing, and containerization.
In-depth experience deploying microservice architectures and applications, and supporting serverless architectures.
In-depth experience with infrastructure-as-code and config management for VMs and containers, using Terraform, Ansible, or comparable tooling.
In-depth experience with Prometheus, the TICK stack, Elastic, Logstash / Filebeat, and Telegraf, amongst others.
Prior experience building out solutions with Vault and Consul for secrets and configuration.
Prior in-depth experience with open-source databases, cloud-native databases, and cloud-native messaging frameworks.
Rock solid with scripting languages such as Python, Ruby, Go, and shell, and with YAML constructs.
Working knowledge of networking concepts and of VPN and VPC constructs in the cloud.
Understanding of operations tools (PagerDuty, CloudWatch, Datadog, Sentry, etc.).
Deeply conversant with cloud infrastructure security best practices.
Good understanding of at least one of the following CI / CD toolsets: Atlassian tooling, Jenkins, CircleCI, or cloud-native deployment tools, plus a deep understanding of GitOps.
What We’re Looking For In Teammates Technology like what we’re building doesn’t happen on its own. It is the result of a collaborative environment and the hard work of passionate, dedicated individuals working intelligently towards a common goal. We are looking for exceptional people to join our growing team and have a positive impact on our culture, technology, and product from day one. We deeply value humility, curiosity, and a positive attitude, and you should as well. You should also believe that mutual respect is the foundation of any healthy and productive relationship. You should be unafraid to ask questions or challenge responses, no matter how simple or complex. Most importantly, you should value honest and direct communication, recognizing that this is the best way for any individual or team to continuously learn and grow. Accomplishing our collective goals will be fun, but it will also be hard; you should be in pursuit of an ongoing and rewarding challenge!
What It’s Like To Work With Us We’re passionate about the technology we’ve created and what we’re building, but we know that changing any industry and creating a successful company will take balance, maturity, and a sustained effort. We’ve combined retail industry expertise, amazing engineers with experience shipping real-world hardware and software solutions, and a team of brilliant minds who are not afraid to focus on solving “impossible” problems. But this passion doesn’t mean we live unbalanced lives. We have families and passions outside of work, and we know that the best work comes from sharp, rested people. We respect each other and each of our contributions, and we believe that the best solutions will come from a diversity of ideas and perspectives. Finally, we build our products with deep empathy for the people who will use them every day. Their input and insights are our clearest guide to building what they need; we respect our partners and clients, and listen closely to their feedback.