LLaVA-Mini
LLaVA-Mini, a lightweight multimodal model by ICTNLP, enhances visual content understanding with one visual token, ideal for researchers and developers needing fast, accurate image and video analysis.
LLaVA-Mini
multi-mode mode