福模

免费开源AI模型下载_本地AI工具资源平台

MiniGPT-4多模态AI模型 - 图像到文本生成专家

MiniGPT-4 Multimodal AI Model - Image-to-Text Generation Expert

MiniGPT-4多模态AI模型,图像到文本生成专家。结合视觉编码器和语言模型,能够根据图像生成详细描述和故事,适用于图像理解、内容创作等任务。

MiniGPT-4 multimodal AI model, image-to-text generation expert. Combines visual encoder and language model, capable of generating detailed descriptions and stories from images, suitable for image understanding, content creation and other tasks.

MiniGPT-4多模态图像理解文本生成MiniGPT-4MultimodalImage UnderstandingText Generation
4.2 GB2025-04-05

BLIP-2视觉语言模型 - 先进的图像字幕生成

BLIP-2 Vision-Language Model - Advanced Image Captioning

BLIP-2视觉语言模型,先进的图像字幕生成工具。能够理解图像内容并生成准确、富有表现力的描述,支持零样本学习,在多个视觉语言基准测试中取得领先成绩。

BLIP-2 vision-language model, advanced image captioning tool. Understands image content and generates accurate, expressive descriptions, supports zero-shot learning, achieving leading results in multiple vision-language benchmarks.

BLIP-2视觉语言图像字幕零样本学习BLIP-2Vision-LanguageImage CaptioningZero-Shot Learning
6.8 GB2025-04-07

Stable Diffusion XL 1.0专业版 - 企业级高分辨率图像生成

Stable Diffusion XL 1.0 Professional - Enterprise-Grade High-Resolution Image Generation

Stable Diffusion XL 1.0专业版,企业级高分辨率图像生成模型。支持1024x1024分辨率,具备改进的文本到图像生成能力,适用于商业设计和专业创作。

Stable Diffusion XL 1.0 professional version, enterprise-grade high-resolution image generation model. Supports 1024x1024 resolution, features improved text-to-image generation capabilities, suitable for commercial design and professional creation.

Stable Diffusion高分辨率企业级专业版Stable DiffusionHigh ResolutionEnterpriseProfessional
12.6 GB2025-04-09

CoCa多模态生成模型 - 联合图像文本生成

CoCa Multimodal Generative Model - Joint Image-Text Generation

CoCa多模态生成模型,联合图像文本生成模型。独特地将图像编码和文本生成结合起来,实现高效的视觉语言理解与生成,适用于内容创作和图像编辑。

CoCa multimodal generative model, joint image-text generation model. Uniquely combines image encoding and text generation, achieving efficient visual language understanding and generation, suitable for content creation and image editing.

CoCa多模态图像文本内容生成CoCaMultimodalImage-TextContent Generation
8.7 GB2025-04-11

LLaVA视觉语言模型 - 融合图像理解的对话AI

LLaVA Vision-Language Model - Conversational AI with Image Understanding

LLaVA视觉语言模型,融合图像理解的对话AI。将视觉编码器与语言模型相结合,支持图像相关的对话和推理,适用于教育、客户服务等场景。

LLaVA vision-language model, conversational AI with image understanding. Combines visual encoder with language model, supports image-related conversations and reasoning, suitable for educational, customer service and other scenarios.

LLaVA视觉语言对话AI图像理解LLaVAVision-LanguageConversational AIImage Understanding
15.3 GB2025-04-13

轻量级对话AI模型 - 适用于移动设备的LLM方案

Lightweight Conversation AI Model - LLM Solution for Mobile Devices

轻量级对话AI模型,专为移动设备优化的LLM方案。体积小、功耗低,可在智能手机和平板电脑上运行,支持离线对话功能,保护用户隐私。

Lightweight conversation AI model, LLM solution optimized for mobile devices. Small size, low power consumption, runs on smartphones and tablets, supports offline conversations, protecting user privacy.

轻量级对话模型移动设备离线LightweightConversation ModelsMobileOffline
2.1 GB2025-04-15

声音克隆模型下载 - 5秒音频即可克隆人声

Voice Cloning Model Download - Clone Voices with Just 5 Seconds of Audio

声音克隆模型,只需5秒音频即可克隆人声。支持高保真度的声音复制,适用于配音、虚拟主播、语音助手等应用场景,提供详细的训练教程。

Voice cloning model, requiring only 5 seconds of audio to clone voices. Supports high-fidelity voice replication, applicable to dubbing, virtual streamers, voice assistants and other application scenarios, providing detailed training tutorials.

声音克隆语音复制TTS5秒克隆Voice CloningVoice ReplicationTTS5 Second Clone
3.2 GB2025-04-19

离线运行 AI 模型包 - 完全本地化的AI解决方案

Offline Running AI Model Package - Fully Localized AI Solution

离线运行AI模型包,完全本地化的AI解决方案。无需网络连接,所有计算均在本地完成,确保数据隐私和安全性,适合对隐私要求高的场景。

Offline running AI model package, a fully localized AI solution. Requires no internet connection, all calculations are performed locally, ensuring data privacy and security, suitable for privacy-sensitive scenarios.

离线运行本地化数据隐私安全Offline OperationLocalizationData PrivacySecurity
45.7 GB2025-04-29