Senior Staff Site Reliability Engineer
Fivetran
About the role
From Fivetran’s founding until now, our mission has remained the same: to make access to data as simple and reliable as electricity. With Fivetran, customer data arrives in their warehouses, canonical and ready to query, with no engineering or maintenance required. We’re proud that more organizations continue to leverage our technology every day to become truly data-driven.
About the Role
Fivetran is looking for a high-performance, experienced engineer to be a part of a team of Site Reliability Engineers. You will be working closely with engineering teams, product managers, as well as support and sales engineers to build the future of the Fivetran Data Platform Reliability.
As a member of the Site Reliability Engineering team, you will take ownership of the overall performance and reliability of Fivetran’s infrastructure, the robustness of the deployment pipeline, as well as timely and effective incident response and resolution. You will take responsibility for the growth and stability of Fivetran’s infrastructure, and be a key player driving effective incident response and overall issue avoidance.
Technologies You’ll Use
Working knowledge of managed Kubernetes (EKS, AKS and GKE)
Knowledge of Cloud Platforms and related tooling: AWS, Azure, GCP, Terraform, Ansible, Buildkite, Pulumi and ArgoCD
Experience in Python/Shell scripting. Bonus if you have Java, GO, etc
Experience with Linux operating systems internals and administration
Experience with cloud networking like VPNs, Privatelinks, and Private Service connect (GCP)
Experience with databases such as PostgreSQL
What You’ll Do
Responsible for ongoing reliability and robustness of Fivetran’s production infrastructure by monitoring availability, capacity, and throughput.
Evolve systems by adding reliability into our product roadmap
Coordinate the re-prioritize or fix critical bugs for support or sales requirements as needed
Make recommendations to production infrastructure by interfacing with engineering to ensure 100% availability
Ensure scalable artifacts deployment to all environments by automation scripts
Constantly monitor infrastructure vulnerabilities and remedy them by working with the security team
Skills We’re Looking For
12+ years of experience working with SaaS products at scale.
Working knowledge of managed Kubernetes (EKS, AKS and GKE).
Knowledge of Cloud Platforms and related tooling: AWS, Azure, Google Cloud (GCP), Terraform, Ansible, Buildkite, Pulumi and ArgoCD.
Experience in Python/Shell scripting and Go Language. Bonus if you have Java.
Experience with Linux operating systems internals and administration
Experience with cloud networking like Site-to-Site VPNs, Privatelinks and Private Service connect (GCP)
#LI-HYBRID
#LI-AV1
Perks and Benefits
100% employer-paid medical insurance*
Generous paid time-off policy (PTO), plus paid sick time, inclusive parental leave policy, holidays, and volunteer days off
RSU stock grants*
Professional develop
Underpaid estimate
~₹22 LPA for Site Reliability Engineers (industry-wide) · based on 5 submissions