Senior System Engineer I
SurveyMonkey
About the role
SurveyMonkey is the world’s most popular platform for surveys and forms, built for business—loved by users. We combine powerful capabilities with intuitive design, effectively serving every use case, from customer experience to employee engagement, market research to payment and registration forms. With built-in research expertise and AI-powered technology, it’s like having a team of expert researchers at your fingertips.
Trusted by millions—from startups to Fortune 500 companies—SurveyMonkey helps teams gather insights and information that inspire better decisions, create experiences people love, and drive business growth. Discover how at surveymonkey.com.
What we’re looking for
As a member of the Infrastructure team at SurveyMonkey, you will have a direct impact in designing, engineering and maintaining our Cloud, Messaging and Observability Platform. Solutioning with best practices, deployment processes, architecture, and support the ongoing operation of our multi-tenant AWS environments. This role presents a prime opportunity for building world-class infrastructure, solving complex problems at scale, learning new technologies and offering mentorship to other engineers.
What you’ll be working on
Support and operate AWS environments following established best practices.
Contribute to infrastructure automation using Terraform,CloudFormation,Ansible and CI/CD pipelines.
Assist with deployments, monitoring, and troubleshooting production systems.
Develop and enhance automation for infrastructure provisioning and deployments.
Contribute to CI/CD pipeline improvements using tools like GitHub Actions,Jenkins, or GitLab.
Work with containerized applications using Docker and Kubernetes.
Implement and maintain monitoring, logging, and alerting systems.
Work with tools like CloudWatch, Splunk, Grafana, Prometheus, or ELK.
Participate in incident response and support efforts to improve system reliability and performance.
Work closely with senior engineers and gradually take ownership of components.
Participate in on-call rotations and support incident resolution.
We’d love to hear from people with:
5-7 years of relevant professional experience with cloud platforms such as AWS, Heroku.
Experience with Terraform, Cloudformation Docker, Kubernetes, scripting (Bash/Python/Yaml), and helm.
Experience with Splunk, Grafana/Prometheus, ELK (Elasticsearch/Logstash/Kibana).
Experience instrumenting PHP, Python, Java and Node.js applications to send metrics, traces, and logs to third-party Observability tooling.
Experience with GitOps and tools like ArgoCD/fluxcd.
Ability to listen and partner to understand requirements, troubleshoot problems, or promote the adoption of platforms.
Experience with GitHub/GitHub Actions/Jenkins/Gitlab in either a software engineering or DevOps environment.
Familiarity with databases and caching technologies, including PostgreSQL, MongoDB, Elasticsearch, Memcached, Redis, Kafka and Debezium.
Preferably experien