Google Senior Research Scientist, Visual Language and Multimodal Modeling

Job Details

Posted date: Aug 13, 2024

Location: Seattle, WA

Level: Senior

Estimated salary: $200,000
Range: $161,000 - $239,000


Description

Explore multimodal technologies, including visual language models (VLMs) and multimodal learning techniques, with the team to improve our Machine Learning (ML) capabilities with Gemini and the user experience for in-market and next-generation Google devices and services.

Work with the data team to collect evaluation and training datasets (e.g., define requirements, and design the data collection system setup and data labeling programs). Train and evaluate ML models.

Identify quality problems and iterate on technical solutions.

Design, implement, optimize and integrate machine learning algorithms and models into Google’s production systems.

As an organization, Google maintains a portfolio of research projects driven by fundamental research, new product innovation, product contribution and infrastructure goals, while providing individuals and teams the freedom to emphasize specific types of work. As a Research Scientist, you'll set up large-scale tests and deploy promising ideas quickly and broadly, managing deadlines and deliverables while applying the latest theories to develop new and improved products, processes, or technologies. From creating experiments and prototyping implementations to designing new architectures, our research scientists work on real-world problems that span the breadth of computer science, such as machine (and deep) learning, data mining, natural language processing, hardware and software performance analysis, improving compilers for mobile platforms, as well as core search and much more.

As a Research Scientist, you'll also actively contribute to the wider research community by sharing and publishing your findings, with ideas inspired by internal projects as well as from collaborations with research programs at partner universities and technical institutes all over the world.

The US base salary range for this full-time position is $161,000-$239,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google.

Qualifications

Minimum qualifications:

PhD degree in Computer Science, a related field, or equivalent practical experience.

2 years of experience leading a research agenda.

One or more scientific publication submission(s) for conferences, journals, or public repositories.

Preferred qualifications:

Publications in related research venues (CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, etc.).

Experience in relevant areas such as face anti-spoofing, biometrics, three-dimensional/two-and-a-half-dimensional (3D/2.5D), and facial landmark/pose estimation.

Excellent problem-solving skills with attention to detail and quality.

Excellent software engineering skills (C++, Python, large-scale data processing, production backend development, etc.).

Enthusiasm for building production systems with Google-scale user impact.

Familiarity with TensorFlow, Flume, common computer libraries / frameworks and Android.


