For engineers

Eval & RL Engineer jobs.

An eval & RL engineer designs evals and RL pipelines so model quality is measured, not guessed. Here is what the role is, what it pays, and what is open now.

01 / Definition

What is an eval & RL engineer?

An eval engineer makes model quality measurable. They build the harnesses, datasets, and scoring that turn a vague sense of "this feels better" into a number you can move on purpose.

On the RL side, they design the data and reward pipelines that improve a model. Without good evals, RL is guessing with extra steps, so the two jobs sit close together.

What they do

Build eval harnesses that catch real regressions, not noise.
Design datasets that represent the work the model actually does.
Run RLHF and RLAIF pipelines (reinforcement learning from human or AI feedback) and read the results honestly.
Turn "it feels worse" into a number someone can fix.
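
One way to picture "real regressions, not noise" is a paired bootstrap over per-example scores. The sketch below is illustrative only: the function names, the 95% interval, and the 2,000 resamples are assumptions, not a description of any particular team's harness.

```python
import random
from statistics import mean


def paired_bootstrap(baseline, candidate, n_resamples=2000, seed=0):
    """Return (mean_delta, low, high) for candidate minus baseline.

    Both inputs are per-example scores on the SAME eval set, in the same
    order. low/high bound an approximate 95% bootstrap interval.
    Hypothetical helper, written for illustration.
    """
    if not baseline or len(baseline) != len(candidate):
        raise ValueError("need two non-empty score lists of equal length")
    rng = random.Random(seed)  # fixed seed so reruns give the same answer
    deltas = [c - b for b, c in zip(baseline, candidate)]
    n = len(deltas)
    # Resample the per-example deltas with replacement and record each mean.
    means = sorted(mean(rng.choices(deltas, k=n)) for _ in range(n_resamples))
    low = means[int(0.025 * n_resamples)]
    high = means[int(0.975 * n_resamples) - 1]
    return mean(deltas), low, high


def is_regression(baseline, candidate, **kwargs):
    """Flag a regression only when the whole interval sits below zero."""
    _, _, high = paired_bootstrap(baseline, candidate, **kwargs)
    return high < 0


if __name__ == "__main__":
    old_scores = [1.0] * 50               # baseline passes every example
    new_scores = [0.0] * 20 + [1.0] * 30  # candidate fails 20 of them
    print(paired_bootstrap(old_scores, new_scores))
    print(is_regression(old_scores, new_scores))
```

With 20 of 50 examples flipping from pass to fail, the interval sits well below zero and the check flags a regression. Comparing a run against itself does not. A small score drop on a small eval set would usually produce an interval that straddles zero, which is the "noise" case.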

02 / Pay

What it pays

In the US, total compensation for strong eval and RL engineers usually lands between $190k and $280k, and goes higher at frontier labs or with equity at startups. We do not post a role we would not take ourselves.

03 / Open now

Open eval & RL engineer roles

Nothing open under this track this week. If you build this way, send us what you have built and we will keep you close.

FAQ

Common questions

What does an eval engineer do?

They build the evals that measure model quality, so teams can tell whether a change made things better or worse instead of guessing.

Why are eval engineers in demand?

Because most AI teams ship faster than they can measure. An eval engineer is what turns shipping into improving.

What should an eval or RL engineer be paid?

In the US, total compensation usually lands between $190k and $280k, higher at frontier labs.

We only hire the best AI-native engineers.
