Sr. Site Reliability Engineer - CPO
Addepar
About the role
Who We Are
Addepar is a global data and AI platform empowering investment professionals to turn complex financial information into actionable intelligence. Addepar unifies portfolio, market and client data in a total portfolio view and delivers AI-powered insights within investment and client workflows. More than 1,400 firms in nearly 60 countries use Addepar to manage and advise on nearly $9 trillion in assets. Its open platform integrates with nearly 650 software, data and consulting partners to power end-to-end investment operations across firms of all sizes and complexity. Addepar supports clients worldwide with offices in New York City, Salt Lake City, London, Edinburgh, Pune, Dubai, Geneva and São Paulo.
The Role
We are looking to add a highly experienced and impactful colleague to the organization to drive the transformation of Addepar’s Production Engineering and SRE team. This role focuses on evolving our platform towards enabling high-level declarative infrastructure orchestration and its operations. This platform closely integrates our Compute, Network, and Storage control planes, allowing us to develop highly efficient and fast-to-iterate-on services tailored to various product areas within the company, abstracting our developers from the nuances of underlying infrastructure.
The ideal candidate will play a senior leading role in implementing, maintaining, and strategically evolving Addepar’s Production Infrastructure. You will bring a robust combination of experience leading innovative solutions across functional teams and extensive hands-on development experience in AWS/cloud, Linux/Unix, networking, advanced scripting, containerization, Kubernetes, Terraform, Information Security, deep debugging, and comprehensive monitoring/observability. This includes designing, deploying, monitoring, automating, and optimizing all operational aspects of Addepar's platform with a focus on reliability, scalability, and efficiency.
Applicants must have legal authorization to work in the country where this role is based on the first day of employment. Visa sponsorship is not available for this position.
What You’ll Do
Lead the design, implementation, and operationalization of container infrastructure using Kubernetes (k8s), ensuring high availability, performance, and security
Build and maintain advanced, automated CI/CD pipelines using Jenkins, ArgoCD, AWS CodeBuild/Pipeline, GitHub Actions, or similar, establishing best practices for deployment strategies (e.g., blue/green, canary)
Drive the adoption and evangelism of Infrastructure as Code (IaC) principles using Terraform, scaling the Addepar Platform across regions with a focus on cost optimization and operational efficiency
Develop deep application-level knowledge to proactively inform and influence infrastructure requirements and constraints for Developers, QA, and Management, including implementing sophisticated dashboards for Cost and Inventory management, perf