福模

Free Open-Source AI Model Downloads | Local AI Tool Resource Platform

Multimodal AI

PaLI Vision-Language Model - End-to-End Language-Image Understanding

PaLI is a vision-language model for end-to-end language-image understanding. It supports multiple tasks, including image classification, visual question answering, and image captioning, through a single unified architecture.

Tags: Vision-Language, PaLI, End-to-End, Image Understanding

File Size: 18.9 GB

Upload Date: 2025-03-13

Downloads: 10,200

Rating: 4.6/5.0

Download Resources

By downloading this resource, you agree to our Terms of Service and Privacy Policy.

Related Resources

MUSE Multimodal AI Generation Model - High-Quality Text-to-Image Synthesis

MUSE is a multimodal AI generation model: a Transformer-based system for high-quality text-to-image generation. The listing describes it as combining the advantages of diffusion models and Transformers.

Tags: MUSE, Multimodal, Text-to-Image
18.7 GB | 2025-02-03
LLaVA Vision-Language Model - Conversational AI with Image Understanding

LLaVA is a vision-language model for conversational AI with image understanding. It combines a visual encoder with a language model to support image-related conversation and reasoning, and is suited to scenarios such as education and customer service.

Tags: LLaVA, Vision-Language, Conversational AI
15.3 GB | 2025-04-13
MiniGPT-4 Multimodal AI Model - Image-to-Text Generation Expert

MiniGPT-4 is a multimodal AI model specialized in image-to-text generation. It combines a visual encoder with a language model to generate detailed descriptions and stories from images, and is suited to tasks such as image understanding and content creation.

Tags: MiniGPT-4, Multimodal, Image Understanding
4.2 GB | 2025-04-05