cm0002@lemmy.world to Artificial Intelligence@lemmy.worldEnglish · 7 days agoLong-Context Multimodal Understanding No Longer Requires Massive Models: NVIDIA AI Introduces Eagle 2.5, a Generalist Vision-Language Model that Matches GPT-4o on Video Tasks Using Just 8B Parameterswww.marktechpost.comexternal-linkmessage-square0fedilinkarrow-up110arrow-down10
arrow-up110arrow-down1external-linkLong-Context Multimodal Understanding No Longer Requires Massive Models: NVIDIA AI Introduces Eagle 2.5, a Generalist Vision-Language Model that Matches GPT-4o on Video Tasks Using Just 8B Parameterswww.marktechpost.comcm0002@lemmy.world to Artificial Intelligence@lemmy.worldEnglish · 7 days agomessage-square0fedilink