Skip to content Skip to sidebar Skip to footer

llava-v1.6-vicuna-13b-gguf

GGUF Quantized LLaVA 1.6 Vicuna 13B Updated quants and projector from PR #5267 Name Quant method Bits Size Use case llava-v1.6-vicuna-13b.Q3_K_XS.gguf Q3_K_XS 3 5.31 GB very small, high quality loss llava-v1.6-vicuna-13b.Q3_K_M.gguf Q3_K_M 3 6.34 GB very small, high quality loss llava-v1.6-vicuna-13b.Q4_K_M.gguf Q4_K_M 4 7.87 GB medium, balanced quality - recommended llava-v1.6-vicuna-13b.Q5_K_S.gguf Q5_K_S 5 8.97 GB large, low quality loss - recommended llava-v1.6-vicuna-13b.Q5_K_M.gguf Q5_K_M 5 9.23 GB large, very low quality loss - recommended llava-v1.6-vicuna-13b.Q6_K.gguf Q6_K 5 10.7 GB very large, extremely low quality loss llava-v1.6-vicuna-13b.Q8_0.gguf Q8_0 5 13.8 GB very large, extremely low quality loss…

pix2struct-ocrvqa-base

Model card for Pix2Struct - Finetuned on OCR-VQA (Visual Question Answering over book covers) Table of Contents TL;DR Using the model Contribution Citation TL;DR Pix2Struct is an image encoder - text decoder model that is trained on image-text pairs for various tasks, including image captionning and visual question answering.…

vit-base-blur

Edit model card vit-base-blur Model description Intended uses & limitations Training and evaluation data Training procedure Training hyperparameters Training results Framework versions vit-base-blur This model is a fine-tuned version of google/vit-base-patch16-224-in21k on the blurry images dataset. It achieves the following results on the evaluation set: Loss: 0.0008 Accuracy: 1.0 Model description Model…

decision_transformer_atari

Edit model card Find here pretrained model weights for the [Decision Transformer] (https://github.com/kzl/decision-transformer). Weights are available for 4 Atari games: Breakout, Pong, Qbert and Seaquest. Found in the checkpoints directory. We share models trained for one seed (123), whereas the paper contained weights for 3 random seeds. Usage git clone https://huggingface.co/edbeeching/decision_transformer_atari conda env create -f conda_env.yml Then, you…

tts_hifigan

Edit model card NVIDIA Hifigan Vocoder (en-US) Usage Automatically instantiate the model Generate audio Save the generated audio file Input Output Model Architecture Training Datasets Performance Limitations Deployment with NVIDIA Riva References NVIDIA Hifigan Vocoder (en-US) | | | | | HiFiGAN [1] is a generative adversarial network (GAN) model that generates…