The Pride Month Virtual Career Fair is on June 23. Register today and explore career opportunities at Abbvie, Land O'Lakes, Varsity Brands, NASCAR, Fidelity Investments, Alkermes, Strategic Education, and The TJX Companies.

This job is expired.

Machine Learning Engineer

Bespoke Labs

Full-Time

Machine Learning Engineer

Bespoke Labs

Full-Time

Jun 19, 2026

Information Technology

Job Description

About Us

We are AI researchers and builders who understand how to curate data and RL environments that truly improve models. We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart.

We are embarked on a journey to build Environments that are entire digital worlds that can be used to push the frontier of agents.

What You'll Be Working On

You will work directly with our research team on RL environment and task creation for agent training. This means designing observation spaces, action spaces, reward signals, and success criteria for new environments - and building the infrastructure that makes world-scale RL training possible. This is a high-ownership role; you will be building novel systems, not maintaining legacy ones.

Must-Have Skills

3+ years of ML engineering experience - model training, fine-tuning, or post-training pipelines in research or production

Strong Python and deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed precision)

Hands-on experience with LLM post-training - SFT, RLHF, PPO, DPO, or reward model training - and understanding of how training data quality affects model behavior

Familiarity with RL frameworks (Gymnasium, dm_env) and the ability to design or modify reward functions for agent training objectives

Experience running experiments at scale on cloud or HPC (AWS, GCP, SLURM, or Ray)

Solid understanding of evaluation methodology - held-out sets, benchmark design, avoiding train/eval contamination

About Us

We are AI researchers and builders who understand how to curate data and RL environments that truly improve models. We curated OpenThoughts, one of the best open reasoning datasets, and have trained SOTA models such as Bespoke-MiniCheck and Bespoke-MiniChart.

We are embarked on a journey to build Environments that are entire digital worlds that can be used to push the frontier of agents.

What You'll Be Working On

You will work directly with our research team on RL environment and task creation for agent training. This means designing observation spaces, action spaces, reward signals, and success criteria for new environments - and building the infrastructure that makes world-scale RL training possible. This is a high-ownership role; you will be building novel systems, not maintaining legacy ones.

Must-Have Skills

3+ years of ML engineering experience - model training, fine-tuning, or post-training pipelines in research or production

Strong Python and deep learning proficiency (PyTorch preferred; familiar with training loops, optimizers, mixed precision)

Hands-on experience with LLM post-training - SFT, RLHF, PPO, DPO, or reward model training - and understanding of how training data quality affects model behavior

Familiarity with RL frameworks (Gymnasium, dm_env) and the ability to design or modify reward functions for agent training objectives

Experience running experiments at scale on cloud or HPC (AWS, GCP, SLURM, or Ray)

Solid understanding of evaluation methodology - held-out sets, benchmark design, avoiding train/eval contamination

About Bespoke Labs

Related Jobs

Long Range Networked Fires Engineer

Johns Hopkins Applied Physics Laboratory (APL)

Description Are you an experienced engineer who would like to be a major contributor to the design of future advanced integrated networked fires capabilities in support of homeland and theater defense...

Jun 22, 2026 laurel, md

Senior AFSIM Analyst

Johns Hopkins Applied Physics Laboratory (APL)

Description Are you searching for an opportunity to apply your AFSIM modeling and simulation experience to analyze interesting and complex problems with innovative software and computing capabilities?...

Jun 22, 2026 laurel, md

Senior Reverse Engineer / Cyber Capability Engineer

Johns Hopkins Applied Physics Laboratory (APL)

Description Are you a reverse engineer who loves to discover how bespoke systems work and how to break them? Are you energized by working with world-class experts to solve the hardest offensive cyber...

Jun 22, 2026 laurel, md

Submarine Combat System Test and Evaluation Analyst

Johns Hopkins Applied Physics Laboratory (APL)

Description Are you seeking to apply your submarine operational and tactical system expertise to assess essential U.S. Navy submarine technologies and influence their future direction? Do you enjoy a...

Jun 22, 2026 laurel, md

Digital Communications Systems Engineer

Johns Hopkins Applied Physics Laboratory (APL)

Description Are you curious about complex problems and big-picture systems thinking? Do you have Systems Engineering (SE) / Digital Engineering (DE) experience and the desire to develop solutions for...

Jun 22, 2026 laurel, md

Senior Engineer / Analyst for ISRT & BMC2 Applications

Johns Hopkins Applied Physics Laboratory (APL)

Description Do you enjoy leading teams, mentoring staff, and guiding complex technology development efforts and analyses? Are you interested in researching, developing, and analyzing innovative ways...

Jun 22, 2026 laurel, md

Apply For This Job

Machine Learning Engineer

Bespoke Labs

Jun 19, 2026

Full-time

Your Information

First Name *

Last Name *

Email Address *

This email belongs to another account. Please use a diferent email address or Sign In.

Zip Code *

Password *

Confirm Password *

Which groups do you identify with?

Veteran
Hispanic
Black or African-American
Woman
LGBTQ+
Asian
Disabled
Other / Choose not to identify

Create your Profile from your Resume

Resume

Allow employers to search for my resume

Job is Expired

Follow us:

Company:

About TalentAlly

Investor Relations

Site Resources:

Terms of Services

Support Request

For Employers:

Contact an Associate

©2026 TalentAlly.

Powered by TalentAlly.