New job, posted less than a week ago!
Job Details
Posted date: Apr 22, 2026
Location: Kirkland, WA
Level: Director
Estimated salary: $313,500
Range: $262,000 - $365,000
Description
Lead, mentor, and scale a high-performing Quality and Evals team (12+ SWEs), overseeing specialized pods including retail vertical owners (e.g., apparel, home and garden, etc.), evals hill climbing. and Return on Investment (ROI) metric. Enforce the "Launch Bar" quality standards, managing the automated "No-Regression" release gates, hermetic holdout datasets, and ensuring strict pass-rate thresholds for all release applicants. Drive the "Vertical-First" architectural strategy, moving the team away from custom, client-specific prompts to modular, generic architecture that instantly elevates baseline performance across entire retail verticals. Orchestrate aspirational "Hill Climbing" efforts to continuously improve core agent metrics, including search accuracy, action accuracy and expectation compliance. Act as the strategic bridge between Core Engineering, Product Management, and Forward Deployed Engineers (FDEs), hosting bi-weekly "State of Quality" syncs and demystifying the AI quality process for stakeholders.Google Cloud’s mission is to make every business successful through AI by combining cutting-edge technology, infrastructure, and talent. AI/ML software engineers in Cloud bridge the gap between pioneering models and a massive product vehicle reaching billions. Our talent density and AI-powered tools drive rapid development, rooted in a culture of empowerment and a bias to action. In this role, you aren’t just building technology; you’re shaping the frontier of enterprise and driving the evolution of advanced models.
Join the Cloud Applied AI team to build the operational backbone for modern retail with Gemini Enterprise for Customer Experience. Our mission is to embed Google’s foundational AI directly into retailer infrastructure, creating a "flywheel effect" where search, sales, and support converge. We are building the Shopping Agent, a multimodal concierge (text, voice, visual) that acts as a full-stack sales and support expert for global enterprise brands like Macy’s and Home Depot.
As the Tech Lead Manager for the Quality and Evals Pillar, you will lead a dedicated team of 12+ Software Engineers responsible for guaranteeing response safety, brand alignment, and exceptional AI performance at scale. You will advocate our transition into a proactive, "Vertical-First" engineering organization, ensuring that every agent we launch is reliable, consistent, and demonstrably advanced to the competition.The Cloud Applied AI (AAI) powers business growth with Gemini Enterprise. Our portfolio includes Gemini Enterprise for Customer Experience (Shopping Agent, CX Agent Studio, Agent Assist, Vertex AI Search - Commerce, Customer Experience Insights), along with other vertical and domain packaged solutions. We enable high adoption and speed to value by building solutions that are quickly deployed, delivering new 0-to-1 capabilities with startup agility. Team members operate at the forefront of AI, collaborating directly with model builders with unprecedented speed. Join us to work on cutting-edge projects and shape the future of AI in a fast-paced, collaborative, and impactful environment.
The US base salary range for this full-time position is $262,000-$365,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.
Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google.
Qualifications
Minimum qualifications: Bachelor’s degree or equivalent practical experience. 8 years of experience with software development in one or more programming languages (e.g., Python, C, C++, Java, JavaScript). 7 years of experience leading technical project strategy, ML design, and optimizing ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine tuning). 5 years of experience in a technical leadership role. 5 years of experience in a people management or team leadership role. 2 years of experience with GenAI techniques (e.g., LLMs, Multi-Modal, Large Vision Models) or with GenAI-related concepts (language modeling, computer vision).Preferred qualifications: 5 years of experience working in a complex, matrixed organization. 5 years of experience in engineering leadership, particularly within rapidly scaling enterprise SaaS or AI product teams. Experience designing telemetry, observability, and data pipeline solutions to track real-time application metrics and user behavior. Experience leveraging user simulation (e.g., Monte Carlo runs) and deterministic checks for complex AI evaluation. Experience with prompt engineering, Retrieval-Augmented Generation (RAG) architectures, and AI agent orchestration/tool calling. Familiarity with the commerce/retail tech ecosystem, including e-commerce conversion funnels, catalog ingestion, and search/discovery platforms.