Results for "layers"
Vanishing gradients: Gradients shrink as they propagate backward through layers, slowing learning in early layers; mitigated by ReLU activations, residual connections, and normalization.
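A minimal sketch of why this happens: backpropagation multiplies one local derivative per layer, and the sigmoid derivative is bounded by 0.25, so the product shrinks geometrically with depth. The depth of 10 and the fixed pre-activation 0.5 are illustrative assumptions.

```python
import math

def sigmoid_grad(x):
    s = 1.0 / (1.0 + math.exp(-x))
    return s * (1.0 - s)  # bounded above by 0.25

grad = 1.0
for _ in range(10):            # 10 hypothetical sigmoid layers
    grad *= sigmoid_grad(0.5)  # fixed illustrative pre-activation
print(grad)                    # many orders of magnitude below 1

# By contrast, ReLU's derivative is 1 on positive pre-activations,
# so the product does not shrink on the active path.
relu_grad = 1.0
for _ in range(10):
    relu_grad *= 1.0
print(relu_grad)
```

This is why swapping sigmoids for ReLUs (or adding identity paths, as residual connections do) keeps early layers trainable.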
Neural network: A parameterized function composed of interconnected units organized in layers with nonlinear activations.
Weight initialization: Methods for setting starting weights so that signal and gradient magnitudes are preserved across layers.
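One common scale-preserving scheme is He initialization, which draws weights with standard deviation sqrt(2/fan_in) to compensate for ReLU zeroing half the activations. The sketch below pushes a random vector through a deep ReLU stack and checks that its magnitude stays of order 1; the width of 256 and depth of 10 are illustrative assumptions.

```python
import math
import random

def he_layer(fan_in, fan_out, rng):
    """Weight matrix with He initialization: std = sqrt(2 / fan_in)."""
    std = math.sqrt(2.0 / fan_in)
    return [[rng.gauss(0.0, std) for _ in range(fan_in)] for _ in range(fan_out)]

def relu_forward(x, W):
    """One layer: relu(W @ x), on plain lists."""
    return [max(0.0, sum(w * v for w, v in zip(row, x))) for row in W]

rng = random.Random(0)
width, depth = 256, 10  # illustrative sizes
x = [rng.gauss(0.0, 1.0) for _ in range(width)]
for _ in range(depth):
    x = relu_forward(x, he_layer(width, width, rng))

mean_sq = sum(v * v for v in x) / width
print(mean_sq)  # stays O(1) rather than exploding or vanishing
```

With a naive scale (e.g. std = 1), the same loop would blow up by a factor of roughly the width per layer.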
Transformer: Architecture based on self-attention and feedforward layers; the foundation of modern LLMs and many multimodal models.
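The core self-attention operation can be sketched in a few lines: each query scores all keys, the scaled scores are softmaxed into weights, and the output is the weighted average of the values. Toy dimensions and plain lists here; a real implementation uses batched tensor ops and learned Q/K/V projections.

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def attention(Q, K, V):
    """Scaled dot-product attention on plain lists (toy sizes)."""
    d = len(K[0])
    out = []
    for q in Q:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in K]
        w = softmax(scores)  # weights sum to 1
        out.append([sum(wi * v[j] for wi, v in zip(w, V)) for j in range(len(V[0]))])
    return out

# One query attending over two key/value pairs (illustrative numbers).
out = attention([[1.0, 0.0]], [[1.0, 0.0], [0.0, 1.0]], [[1.0, 2.0], [3.0, 4.0]])
print(out)
```

Because the weights are a convex combination, each output coordinate lies between the minimum and maximum of the corresponding value column.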
LoRA (Low-Rank Adaptation): Parameter-efficient fine-tuning (PEFT) method that injects trainable low-rank matrices into existing layers, so only a small fraction of parameters needs training.
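A sketch of where the savings come from: instead of updating a full d x d weight matrix, LoRA trains two factors A (d x r) and B (r x d) with rank r much smaller than d, and the effective weight is W + A @ B. The hidden size 512 and rank 8 below are illustrative assumptions.

```python
d, r = 512, 8                    # illustrative hidden size and LoRA rank
full_update_params = d * d       # fine-tuning W directly
lora_params = d * r + r * d      # trainable factors A (d x r) and B (r x d)
print(full_update_params, lora_params)
print(full_update_params / lora_params)  # compression ratio d / (2 * r)
```

In practice the low-rank update is typically applied as A @ (B @ x) during the forward pass, so the product A @ B never needs to be materialized.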
Residual (skip) connections: Allow gradients to bypass layers, enabling very deep networks.
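A numeric sketch of the effect: through a plain stack the gradient is a product of small local derivatives, while each residual block y = x + f(x) contributes a factor of (1 + f'(x)) because of the identity path, so the product cannot collapse to zero. The transform f and the fixed evaluation point are illustrative assumptions.

```python
import math

def f_prime(x):
    # derivative of a hypothetical small learned transform f(x) = 0.1 * tanh(x)
    return 0.1 * (1.0 - math.tanh(x) ** 2)

x = 0.7  # fixed illustrative point; real backprop evaluates f' at each layer's input
plain_grad, resid_grad = 1.0, 1.0
for _ in range(10):                  # 10 hypothetical blocks
    plain_grad *= f_prime(x)         # plain stack: small derivatives multiply
    resid_grad *= 1.0 + f_prime(x)   # residual: identity path adds a 1 to each factor
print(plain_grad, resid_grad)
```

The plain product is vanishingly small after 10 blocks, while the residual product stays above 1, which is the mechanism that lets networks with hundreds of layers train.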
Depth vs. width: Tradeoffs between stacking many layers (depth) and using many neurons per layer (width).