A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 3 Home / Browse G / Gating Network Gating Network Intermediate EN Share Print Chooses which experts process each token. AdvertisementAd space — term-top Definition Full Definition Chooses which experts process each token. Keywords expert selection Domains AI Economics & Strategy Related Terms Emergent Abilities related to Capabilities that appear only beyond certain model sizes. Scaling Laws related to Empirical laws linking model size, data, compute to performance. Markov Decision Process related to Formal framework for sequential decision-making under uncertainty. State Space related to All possible configurations an agent may encounter. Absolute Positional Encoding related to Encodes token position explicitly, often via sinusoids. Context Compression related to Techniques to handle longer documents without quadratic cost. Sparse Attention related to Attention mechanisms that reduce quadratic complexity. Mixture of Experts related to Routes inputs to subsets of parameters for scalable capacity.