CLIP
IntermediateJoint vision-language model aligning images and text.
AdvertisementAd space — term-top
Definition
Full Definition
Joint vision-language model aligning images and text.
Joint vision-language model aligning images and text.