Senior/Staff Research Scientist, Frontier Benchmarks

Remote role Full-time Open position

ABOUT THE ROLE We're looking for a Staff or Senior Research Scientist to collaborate with partners and lead the development of the next frontier benchmarks and datasets. This is a highly visible, customer-facing role at the intersection of research, company strategy, and go-to-market. You'll design datasets taking into account frontier model performance and work with our academic partners, and then partner with delivery, product and go-to-market to scale out production. You will also serve as a credible technical partner for our customers, prospects, and drive results that impact the broader research community. This role reports directly to the Head of Research and is ideal for someone who is energized by cross-functional work and wants to understand how startups operate across research, data operations, and commercial teams. MAIN RESPONSIBILITIES

Design state of the art datasets that drive frontier model training and evaluation based on current model performance and academic partnerships
Translate benchmark insights into clear, compelling narratives that articulate the ROI of expert-curated data for customer-facing presentations, technical reports, and go-to-market materials.
Work cross-functionally with data operations, product, engineering, and strategy to surface research findings that inform the company roadmap.
Stay at the frontier of LLM evaluation research and bring best practices into Snorkel's workflows
Represent Snorkel's research externally through publications, blog posts, conference talks, and customer engagements that advance the conversation around data-centric AI

PREFERRED QUALIFICATIONS

Strong research background in AI/ML evaluation, NLP, or related fields, with a track record of rigorous experimental design - especially around measuring the impact of training and evaluation data on model behavior.
Exceptional communication skills - able to present complex technical findings clearly to both technical and non-technical audiences
Comfort operating in a fast-moving, cross-functional environment with ambiguous problem spaces
Genuine interest in GTM strategy, startup dynamics, and the commercial side of AI data services.
Ph.D. in machine learning, NLP, or a related field preferred; equivalent industry or research lab experience considered.

Salary Range $220,000-$320,000 USD Apply tot his job Apply To this Job

Apply

Senior/Staff Research Scientist, Frontier Benchmarks

Further positions

Lead AI Research Scientist - NLP

Staff ML Research Scientist, Co-Folding and Affinity

AI Research Scientist, Biological Foundation Models

LLM - Applied AI Research Scientist (USA & LATAM Remote)

Research Scientist in Radiopharmaceutical Imaging and Dosimetry

Assistant Research Scientist/Research Scientist 1 32423

Computational Research Scientist/Sr. Scientist

Senior Research Scientist, Model Evaluation

Lead Bioinformatics Scientist, NGS

AI Training - Research Scientist (PST)

Experienced Customer Success Lead – Scaling Support Operations for arenaflex's DTC eCommerce Brand

Work From Home - Break Free of the 9-5

Remote Customer Service Representative – Pet‑Lovers E‑Commerce Support (Work‑From‑Home) – arenaflex

Pre-Sales, Infrastructure Architect - Hospital Patient Monitoring (South Carolina)

Customer Service Representative (Danish & English)

Experienced Data Entry Associate – Part-Time Remote Opportunity at arenaflex

Team Lead, Clinical Data Management (Remote)

Rewritten Job Title:

SR. MOBILE DEVELOPER (iOS & ANDROID)

Sales Manager Private Krankenversicherung (all genders)