New job, posted less than a week ago!
Job Details
Posted date: Mar 28, 2025
Location: Seattle, WA
Level: Senior
Estimated salary: $171,500
Range: $141,000 - $202,000
Description
Develop and optimize machine learning models for audio processing, including speech denoising, separation, and conversation detection, ensuring high performance and low latency in XR devices.Collaborate with cross-functional teams, including hardware engineers, product managers, and designers, to ensure seamless integration of ML models into wearable XR devices and systems, focusing on best practices in code quality, performance, and testability.
Contribute and maintain technical documentation, providing clear insights into model development, updates, and optimizations based on feedback from product teams and real-world usage.
Identify, troubleshoot, and resolve issues related to model performance, analyzing the interaction between hardware, software, and network conditions to optimize audio processing on devices.
Design and implement GenAI solutions, leveraging ML infrastructure to improve model accuracy, optimize for real-time deployment, and enhance data processing workflows for robust audio performance in XR applications.
Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.
Google’s Extende Reality (XR) team is developing Augmented Reality (AR) and Virtual Reality (VR) technologies that integrate digital information into the physical world. Our mission is to empower devices like headsets and glasses to understand and interact with the environment from the user’s perspective.
The Extended Reality (XR) Audio Machine Learning team focuses on creating machine learning models to improve audio experiences in XR devices, including speech denoising, separation, conversation detection, and spatial audio. The role address challenges like detecting speech in noisy environments and sub-vocal speech detection.
In this role, you will design and optimize Machine Learning models for voice interfaces in wearable devices, ensuring real-time audio processing to understand users in dynamic environments and contribute to the future of Augmented Reality and Virtual Reality technology.
The Google Augmented Reality team is a group of experts tasked with building the foundations for great immersive computing and building helpful, delightful user experiences. We're focused on making immersive computing accessible to billions of people through mobile devices, and our scope continues to grow and evolve.
The US base salary range for this full-time position is $141,000-$202,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.
Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google.
Qualifications
Minimum qualifications:Bachelor’s degree or equivalent practical experience.2 years of experience with data structures or algorithms.
2 years of experience with software development in Python, C++, or Java, or 1 year of experience with an advanced degree.1 year of experience developing and deploying machine learning models for audio, including speech processing (e.g., denoising, separation) or generative AI in multi-modal applications (e.g., text, image, video, audio).1 year of experience with ML systems, including data preprocessing, model training, evaluation for real-time applications on mobile/embedded platforms (e.g., Android), with a focus on audio processing and production performance.
Preferred qualifications:Master's degree or PhD in Computer Science or related technical fields.
Experience in machine learning principles and model architectures.Experience in model development, including training and optimization.Experience in developing accessible technologies.
Extended Qualifications
Bachelor’s degree or equivalent practical experience.2 years of experience with data structures or algorithms.
2 years of experience with software development in Python, C++, or Java, or 1 year of experience with an advanced degree.1 year of experience developing and deploying machine learning models for audio, including speech processing (e.g., denoising, separation) or generative AI in multi-modal applications (e.g., text, image, video, audio).1 year of experience with ML systems, including data preprocessing, model training, evaluation for real-time applications on mobile/embedded platforms (e.g., Android), with a focus on audio processing and production performance.
Check out other jobs at Google.