Reward Model
Intermediate
Model trained to predict human preferences (or utility) for candidate outputs; used in RLHF-style pipelines.
Definition
Full Definition
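A model trained to predict human preferences (or utility) for candidate outputs. It is typically fit on human comparisons between candidate outputs and returns a scalar score, which RLHF-style pipelines use as the reward signal when optimizing a policy.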
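As a rough illustration of the definition, the sketch below fits a tiny reward model on preference pairs using a pairwise logistic (Bradley-Terry style) loss, which is one common way such models are trained. Everything specific here is an assumption made for the example: the linear scoring function, the two-dimensional feature vectors, the toy data, and the helper names (`reward`, `pairwise_loss`, `train`). Practical reward models usually score text with a neural network rather than hand-made features.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def reward(weights, features):
    """Scalar score for one candidate output (hypothetical linear model)."""
    return sum(w * x for w, x in zip(weights, features))


def pairwise_loss(weights, chosen, rejected):
    """-log sigmoid(r(chosen) - r(rejected)), computed as a stable softplus."""
    margin = reward(weights, chosen) - reward(weights, rejected)
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


def train(pairs, dim, lr=0.1, epochs=200):
    """Plain stochastic gradient descent on the pairwise loss."""
    weights = [0.0] * dim
    for _ in range(epochs):
        for chosen, rejected in pairs:
            margin = reward(weights, chosen) - reward(weights, rejected)
            # Gradient of the loss w.r.t. weights is
            # -sigmoid(-margin) * (chosen - rejected).
            scale = sigmoid(-margin)
            for i in range(dim):
                weights[i] += lr * scale * (chosen[i] - rejected[i])
    return weights


# Toy preference data (made up): each pair is (preferred, dispreferred),
# with each candidate output described by two illustrative features.
pairs = [
    ([1.0, 0.2], [0.0, 0.3]),
    ([0.9, 0.5], [0.2, 0.5]),
    ([0.8, 0.1], [0.1, 0.9]),
]

weights = train(pairs, dim=2)
for chosen, rejected in pairs:
    print(reward(weights, chosen), reward(weights, rejected))
```

After training, the preferred output in each pair should receive the higher score. In an RLHF-style pipeline, a score of this kind would then be used as the reward when updating the policy model.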