Seeing it All: LLaVA-UHD Perceives High-Resolution Images at Any Aspect Ratio
Marktechpost
MARCH 23, 2024
It turns out part of the issue may stem from how these models process high-resolution images. The overview effect is that LLaVA-UHD can flexibly parse high-res images up to 672×1088 pixels using just 94% of the compute needed for low-res 336×336 images with previous models. Clever, right?
Let's personalize your content