A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 3 Home / Browse S / Specification Gaming Specification Gaming Advanced EN Share Print Model exploits poorly specified objectives. AdvertisementAd space — term-top Definition Full Definition Model exploits poorly specified objectives. Keywords reward hacking Domains AI Safety & Alignment Related Terms Reward Hacking related to Maximizing reward without fulfilling real goal. Instrumental Convergence related to Tendency for agents to pursue resources regardless of final goal. Value Misalignment related to Model optimizes objectives misaligned with human values. Outer Alignment related to Correctly specifying goals. Alignment Problem related to Ensuring AI systems pursue intended human goals.