Hubert语音表示学习模型 - 无监督语音表征学习

Hubert Speech Representation Learning Model - Unsupervised Speech Representation Learning

HuBERT语音表示学习模型，Facebook提出的无监督语音表征学习模型。通过聚类平滑预测和掩码重建，实现了语音表示的层次化学习。

HuBERT speech representation learning model, an unsupervised speech representation learning model proposed by Facebook. Achieves hierarchical learning of speech representations through cluster-smoothed prediction and masked reconstruction.

HuBERT语音表示无监督学习语音识别HuBERTSpeech RepresentationUnsupervised LearningSpeech Recognition

1.9 GB2025-01-24

LayoutLM文档理解模型 - 图文结合的文档解析

LayoutLM Document Understanding Model - Document Analysis with Text and Layout

LayoutLM文档理解模型，结合文本和布局信息的文档理解模型。通过融合视觉和文本特征，提升了表格解析和文档分类的准确性。

LayoutLM document understanding model, a document understanding model combining text and layout information. Improves the accuracy of table parsing and document classification by fusing visual and textual features.

LayoutLM文档理解文档解析OCRLayoutLMDocument UnderstandingDocument AnalysisOCR

2.7 GB2025-01-26

SimCLR自监督视觉学习模型 - 对比学习表征学习

SimCLR Self-Supervised Visual Learning Model - Contrastive Learning Representation Learning

SimCLR自监督视觉学习模型，通过对比学习进行视觉表征学习。采用增强对比策略，大幅提升了无监督学习的性能。

SimCLR self-supervised visual learning model, performing visual representation learning through contrastive learning. Adopting augmented contrastive strategies, significantly improves the performance of unsupervised learning.

SimCLR自监督学习对比学习视觉表征SimCLRSelf-Supervised LearningContrastive LearningVisual Representation

4.2 GB2025-01-28

DeBERTa语言理解模型 - 增强版BERT模型

DeBERTa Language Understanding Model - Enhanced BERT Model

DeBERTa语言理解模型，对BERT的增强改进版本。通过分解注意力和增强掩码解码，进一步提升了语言理解任务的性能。

DeBERTa language understanding model, an enhanced improved version of BERT. Further improves the performance of language understanding tasks through disentangled attention and enhanced masked decoding.

DeBERTa语言理解NLPBERT改进DeBERTaLanguage UnderstandingNLPBERT Improvement

3.1 GB2025-01-30

ProGAN渐进式生成AI模型 - 高分辨率图像合成

ProGAN Progressive Generation AI Model - High-Resolution Image Synthesis

ProGAN渐进式生成AI模型，能够逐步生成高分辨率图像。从低分辨率开始逐渐增加细节，生成逼真的图像，广泛应用于艺术和设计领域。

ProGAN progressive generation AI model, capable of generating high-resolution images progressively. Starting from low resolution and gradually increasing detail, generating realistic images, widely used in art and design fields.

ProGAN渐进式生成高分辨率图像合成ProGANProgressive GenerationHigh ResolutionImage Synthesis

9.1 GB2025-02-01

MUSE多模态AI生成模型 - 高质量文本到图像合成

MUSE Multimodal AI Generation Model - High-Quality Text-to-Image Synthesis

MUSE多模态AI生成模型，基于Transformer的高质量文本到图像生成系统。结合了扩散模型和Transformer的优势，生成高质量图像。

MUSE multimodal AI generation model, a high-quality text-to-image generation system based on Transformer. Combines the advantages of diffusion models and Transformers to generate high-quality images.

MUSE多模态文本到图像TransformerMUSEMultimodalText-to-ImageTransformer

18.7 GB2025-02-03

PaLM-E具身AI模型 - 多模态大语言模型

PaLM-E Embodied AI Model - Multimodal Large Language Model

PaLM-E具身AI模型，结合视觉和语言能力的多模态大语言模型。能够在物理世界中执行任务，将语言理解与感知相结合。

PaLM-E embodied AI model, a multimodal large language model combining vision and language capabilities. Capable of performing tasks in the physical world, combining language understanding with perception.

PaLM-E具身AI多模态大语言模型PaLM-EEmbodied AIMultimodalLarge Language Model

22.3 GB2025-02-05

Flamingo多模态AI模型 - 视觉语言理解

Flamingo Multimodal AI Model - Visual-Language Understanding

Flamingo多模态AI模型，先进的视觉语言理解模型。可以回答关于图像的问题、描述视觉内容，并执行各种视觉语言任务。

Flamingo multimodal AI model, an advanced visual-language understanding model. Can answer questions about images, describe visual content, and perform various vision-language tasks.

Flamingo视觉语言理解模型问答系统FlamingoVisual-LanguageUnderstanding ModelQuestion Answering

14.2 GB2025-02-07

...