Site Reliability Engineer
Axi
About the role
Please note that we will only be able to accept candidates who have the appropriate rights and documentation for employment in India.
Who We Are.
Axi is a leading global provider of margin and deliverable Foreign Exchange, Contracts for Difference (CFDs), and Financial Spread betting. Our business has evolved into a world-class, multifaceted brokerage with offices in six regions. With heavy investment in the latest trading technology, Axi seeks to offer the most comprehensive end-to-end trading experience available, servicing traders of all levels from beginners to institutional-level clients.
Let's talk about the cool stuff you will do at Axi!
The Site Reliability Engineer is accountable for availability, reliability and operational excellence of our technology infrastructure across Axi. Your role is to design, implement & maintain monitoring, alerting and log management solutions. Working in close collaboration with Technology teams through all areas of Development and Operations, the objective is to alert and act upon any business impacting incident before it is reported by the impacted party, and to ensure comprehensive observability and analysis through log management.
Your EDGE Assignment/You Will
To be the Product Owner for Monitoring and Observability within the Axi Technology Operations Environment
Review the current environment and propose a roadmap to optimise the product set and manage the lifecycle of existing product.
Be a key support to technology delivery teams for monitoring & observability during all phases of the product delivery; gather requirements, produce detailed designs, conduct PoCs, recommend and architect solutions.
Tune and adjust health rules and maintain existing monitoring solutions
Remove toil by documenting and automating repeatable processes
Present ideas and designs to a variety of technical or non-technical stakeholders
Clearly and consistently document and maintain up-to-date processes, creating and maintaining a knowledge base for your product expertise.
Coach and mentor other team members in your speciality and be keen to build your breadth of knowledge across the SRE & DevOps scope
Are you the one?
5 plus years of experience in Site Reliability Engineering, DevOps, or Observability-focused roles.
Strong hands-on experience with monitoring and observability platforms such as Datadog.
Proven ability to create and manage dashboards, alerts, Application Performance Monitoring (APM), RUM, and infrastructure metrics.
Experience with automation and scripting (any modern language is acceptable).
Exposure to CI/CD pipelines, containerized environments, and cloud-native platforms mainly Azure.
Working knowledge of Kubernetes, Terraform, and similar modern infrastructure practices.
Solid understanding of high availability, resilience, and failover design.
Strong analytical and troubleshooting skills with a proactive, reliability-first mindset.
Effective communicator who collaborates well i
Underpaid estimate
~₹22 LPA for Site Reliability Engineers (industry-wide) · based on 5 submissions