Results for "policy consistency"

AdvertisementAd space — search-top

13 results

Data Labeling Intermediate

Human or automated process of assigning targets; quality, consistency, and guidelines matter heavily.

Foundations & Theory
Inter-Annotator Agreement Intermediate

Measure of consistency across labelers; low agreement indicates ambiguous tasks or poor guidelines.

Foundations & Theory
Off-Policy Learning Intermediate

Learning from data generated by a different policy.

AI Economics & Strategy
RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization
Audit Intermediate

Systematic review of model/data processes to ensure performance, fairness, security, and policy compliance.

Governance & Ethics
Red Teaming Intermediate

Stress-testing models for failures, vulnerabilities, policy violations, and harmful behaviors before release.

Security & Privacy
Actor-Critic Intermediate

Combines value estimation (critic) with policy learning (actor).

AI Economics & Strategy
Explainability Requirement Intermediate

Legal or policy requirement to explain AI decisions.

AI Economics & Strategy
Self-Consistency Intro

Sampling multiple outputs and selecting consensus.

Prompting & Instructions
Policy Intermediate

Strategy mapping states to actions.

AI Economics & Strategy
Policy Gradient Intermediate

Optimizing policies directly via gradient ascent on expected reward.

AI Economics & Strategy
On-Policy Learning Intermediate

Learning only from current policy’s data.

AI Economics & Strategy
Policy Search Advanced

Directly optimizing control policies.

Reinforcement Learning

Welcome to AI Glossary

The free, self-building AI dictionary. Help us keep it free—click an ad once in a while!

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.