A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 3 Home / Browse R / Reward Hacking Reward Hacking Advanced EN Share Print Maximizing reward without fulfilling real goal. AdvertisementAd space — term-top Definition Full Definition Maximizing reward without fulfilling real goal. Keywords proxy exploitation Domains AI Safety & Alignment Related Terms Instrumental Convergence related to Tendency for agents to pursue resources regardless of final goal. Value Misalignment related to Model optimizes objectives misaligned with human values. Outer Alignment related to Correctly specifying goals. Inner Alignment related to Ensuring learned behavior matches intended objective. Alignment Problem related to Ensuring AI systems pursue intended human goals. Specification Gaming related to Model exploits poorly specified objectives.