Principal GenAI Data Engineer
ZScaler
About the role
About Zscaler
Zscaler accelerates digital transformation to ensure our customers can be more agile, efficient, resilient, and secure. As an AI-forward enterprise, we are constantly pushing the envelope, leveraging the world’s largest security data lake to power our cloud-native Zero Trust Exchange platform. This innovation protects our customers from cyberattacks and data loss by securely connecting users, devices, and applications in any location.
Here, impact in your role matters more than title and trust is built on results. We say, impact over activity. We seek innovators who actively use AI to amplify their impact and who thrive in an environment where we leverage intelligent systems to stay ahead of evolving threats. We believe in transparency and value constructive, honest debate—we’re focused on getting to the best ideas, faster. We build high-performing teams that can make an impact quickly and with high quality. To do this, we are building a culture of execution centered on customer obsession, collaboration, ownership, and accountability.
We value high-impact, high-accountability with a sense of urgency where you’re enabled to do your best work and embrace your potential. If you’re driven by purpose, thrive on solving complex challenges, and want to be part of the team that’s helping to secure the AI age, we invite you to bring your talents to Zscaler and help shape the future of cybersecurity.
Role
We are looking for a Principal GenAI Data Engineer to join our IT Data Strategy team. This role is fully remote within the US, reporting to the Senior Manager, Enterprise AI Data Platform. We are seeking an experienced technical leader to drive the design and implementation of enterprise-grade Generative AI data ingestion, knowledge preparation, and platform architectures that enable scalable, production-ready GenAI applications. This role focuses on architecting robust pipelines and platforms for ingesting, processing, governing, and serving structured and unstructured enterprise data for AI/LLM workloads. The ideal candidate combines deep expertise in enterprise data architecture, unstructured data pipelines, GenAI platform engineering, and strong software engineering skills in Python.
What you’ll do (Role Expectations)
Architect enterprise-scale GenAI data platforms for ingestion, transformation, enrichment, and serving of structured and unstructured data
Design scalable pipelines for enterprise knowledge ingestion from diverse data sources including documents, SaaS platforms, knowledge bases, collaboration tools, and databases
Define architecture for metadata extraction, chunking, enrichment, embeddings generation, and knowledge preparation workflows
Design AI-ready data models and storage strategies for vector, graph, and hybrid knowledge systems
Architect scalable unstructured data processing pipelines for text, images, PDFs, tables, and multimodal content
Who You Are (Success Profile)
You act like an owner. Your passion f
Underpaid estimate
~₹20 LPA for Data Engineers (industry-wide) · based on 61 submissions