Results for "text"
Context Window
IntermediateMaximum number of tokens the model can attend to in one forward pass; constrains long-document reasoning.
Generating speech audio from text, with control over prosody, speaker identity, and style.
Converting text into discrete units (tokens) for modeling; subword tokenizers balance vocabulary size and coverage.
The text (and possibly other modalities) given to an LLM to condition its output behavior.
Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.
Converting audio speech into text, often using encoder-decoder or transducer architectures.
Joint vision-language model aligning images and text.
Generating human-like speech from text.