At Insight Engines, cloud operations is the backbone of delivering the engineering team’s product to the world. Our goal is to be able to sleep peacefully through the night while terabytes of data fly through our systems. We are looking for someone to help us maintain this large-scale data processing platform and lead site reliability engineering.
But enough about us, let’s talk about you. Do you enjoy developing, deploying, and monitoring auto-scaling clusters in the cloud? How about digging into and analyzing metrics for process optimization and resource tuning? As an integral member of our technology team, you’ll do all of the above for everything from ETL pipelines to OLAP datastores -- along with operationalizing the infrastructure that powers our groundbreaking natural language platform. You’ll wear different hats, touch many parts of our system, and have a significant impact on our products.
The kinds of problems you’ll work on include:
Technologies we use on the operations team:
As you can tell, we’re big fans of open source. We don’t expect you to have deep knowledge of all of these technologies -- but if you combine a growth mindset with experience in several of these or related technologies and a solid understanding of networking and GNU/Linux fundamentals, we would love to hear from you!
When applying, tell us about your real-world experience with GNU/Linux, cloud computing, and distributed systems, as well as with monitoring and maintaining those systems. Women, People of Color, Minorities, and LGBTQIA+ candidates are encouraged to apply.