福模

免费开源AI模型下载_本地AI工具资源平台

多模态AIMultimodal AI

LLaVA视觉语言模型 - 融合图像理解的对话AI

LLaVA Vision-Language Model - Conversational AI with Image Understanding

LLaVA视觉语言模型,融合图像理解的对话AI。将视觉编码器与语言模型相结合,支持图像相关的对话和推理,适用于教育、客户服务等场景。

LLaVA vision-language model, conversational AI with image understanding. Combines visual encoder with language model, supports image-related conversations and reasoning, suitable for educational, customer service and other scenarios.

LLaVA视觉语言对话AI图像理解LLaVAVision-LanguageConversational AIImage Understanding

文件大小

15.3 GB

Upload Size

15.3 GB

上传日期

2025-04-13

Upload Date

2025-04-13

下载次数

16,800

Downloads

16,800

评分

4.7/5.0

Rating

4.7/5.0

下载资源 Download Resources

下载资源表示您同意我们的使用条款和隐私政策

By downloading this resource, you agree to our Terms of Service and Privacy Policy

相关资源推荐

CoCa多模态生成模型 - 联合图像文本生成CoCa Multimodal Generative Model - Joint Image-Text Generation

CoCa多模态生成模型,联合图像文本生成模型。独特地将图像编码和文本生成结合起来,实现高效的视觉语言理解与生成,适用于内容创作和图像编辑。

CoCa multimodal generative model, joint image-text generation model. Uniquely combines image encoding and text generation, achieving efficient visual language understanding and generation, suitable for content creation and image editing.

CoCa多模态图像文本CoCaMultimodalImage-Text
8.7 GB2025-04-11
Flamingo视觉语言模型 - 少样本视觉语言理解Flamingo Vision-Language Model - Few-Shot Visual Language Understanding

Flamingo视觉语言模型,实现少样本视觉语言理解。结合图像和文本信息,支持问答、描述生成等多模态任务,具有优秀的泛化能力。

Flamingo vision-language model, achieving few-shot visual language understanding. Combines image and text information, supporting multimodal tasks such as question answering and description generation, with excellent generalization capabilities.

视觉语言多模态FlamingoVision-LanguageMultimodalFlamingo
72.6 GB2025-03-11
ALIGN多模态AI模型 - 大规模图像文本对齐ALIGN Multimodal AI Model - Large-Scale Image-Text Alignment

ALIGN多模态AI模型,利用大规模图像文本对进行对比学习。在多个视觉语言任务中取得了优异成果,支持图像检索和文本生成。

ALIGN multimodal AI model, utilizing large-scale image-text pairs for contrastive learning. Achieves excellent results in multiple vision-language tasks, supporting image retrieval and text generation.

ALIGN多模态图像文本ALIGNMultimodalImage-Text
5.6 GB2024-12-15