Skip to content Skip to sidebar Skip to footer

All Posts

Viewing 521-528 posts

Ziya-BLIP2-14B-Visual-v1

Ziya-BLIP2-14B-Visual-v1 Main Page:Fengshenbang Github: Fengshenbang-LM 姜子牙系列模型 Ziya-BLIP2-14B-Visual-v1 Ziya-LLaMA-13B-v1.1 Ziya-LLaMA-13B-v1 Ziya-LLaMA-7B-Reward Ziya-LLaMA-13B-Pretrain-v1 简介 Brief Introduction Ziya-Visual多模态大模型基于姜子牙通用大模型V1训练,具有视觉问答和对话能力。今年3月份OpenAI发布具有识图能力的多模态大模型GPT-4,遗憾的是,时至今日绝大部分用户也都还没有拿到GPT-4输入图片的权限,Ziya-Visual参考了Mini-GPT4、LLaVA等优秀的开源实现,补齐了Ziya的识图能力,使中文用户群体可以体验到结合视觉和语言两大模态的大模型的卓越能力。 The Ziya-Visual multimodal Big Model is based on the Ziya-LLaMA-13B-v1 training and has visual question and answer and dialogue capabilities. In March this year, OpenAI released GPT-4, a multimodal big model with image recognition capabilities.…

dpt-large

Model Details: DPT-Large (also known as MiDaS 3.0) Dense Prediction Transformer (DPT) model trained on 1.4 million images for monocular depth estimation. It was introduced in the paper Vision Transformers for Dense Prediction by Ranftl et al. (2021) and first released in this repository. DPT uses the Vision Transformer (ViT) as backbone and adds…

resnet-50

Edit model card ResNet-50 v1.5 Model description Intended uses & limitations How to use BibTeX entry and citation info ResNet-50 v1.5 ResNet model pre-trained on ImageNet-1k at resolution 224x224. It was introduced in the paper Deep Residual Learning for Image Recognition by He et al. Disclaimer: The team releasing ResNet did not…

PPO-Huggy

Edit model card ppo Agent playing Huggy Usage (with ML-Agents) Resume the training Watch your Agent play ppo Agent playing Huggy This is a trained model of a ppo agent playing Huggy using the Unity ML-Agents Library. Usage (with ML-Agents) The Documentation: https://unity-technologies.github.io/ml-agents/ML-Agents-Toolkit-Documentation/ We…

grasp_diffusion

Edit model card Trained Models for Grasp SE(3) DiffusionFields. Check SE(3)-DiffusionFields: Learning smooth cost functions for joint grasp and motion optimization through diffusion for additional details. [Paper] Source link

mms-tts-eng

Edit model card Massively Multilingual Speech (MMS): English Text-to-Speech Model Details Usage BibTex citation License Massively Multilingual Speech (MMS): English Text-to-Speech This repository contains the English (eng) language text-to-speech (TTS) model checkpoint. This model is part of Facebook's Massively Multilingual Speech project, aiming to provide speech technology across a diverse range of languages. You…

AI advisory is only for significant platforms, says MoS IT Rajeev Chandrasekhar | Latest News India

Days after the IT ministry issued an advisory to multiple companies including Google, Microsoft and Adobe that they must get explicit permission from the government for all their “under-testing” or “unreliable” artificial intelligence (AI) models before releasing them to users in India, minister of state for electronics and information technology, Rajeev Chandrasekhar tweeted that the…