Multimodal Model
Intermediate
Models that process or generate more than one modality, such as text, images, audio, and video.
Full Definition
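Models that process or generate more than one modality (for example text, images, audio, or video) within a single system. This enables vision-language tasks, speech processing, video understanding, and similar cross-modal applications.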