Visual Question Answering

Visual Question Answering • March 8, 2024

blip-fine-tuned-2ep

icyheat23/blip-fine-tuned-2ep • Visual Question Answering • Updated Mar 21, 2023

Visual Question Answering • March 8, 2024

Video-LLaMA-Series

Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding. This is the Hugging Face repo for storing pre-trained & fine-tuned checkpoints of our Video-LLaMA, which is a multi-modal conversational large…

Visual Question Answering • March 7, 2024

deplot

Model card for DePlot. TL;DR: The abstract of the paper states that: Visual language such as charts and…

Visual Question Answering • March 7, 2024

pix2struct-infographics-vqa-large

Model card for Pix2Struct - Finetuned on Infographics-VQA (Visual Question Answering over high-res infographics) - large version…

Visual Question Answering • March 7, 2024

pix2struct-ai2d-large

Model card for Pix2Struct - Finetuned on AI2D (scientific diagram VQA) - large version. TL;DR: Pix2Struct…

Visual Question Answering • March 7, 2024

pix2struct-ocrvqa-base

Model card for Pix2Struct - Finetuned on OCR-VQA (Visual Question Answering over book covers)…

Visual Question Answering • March 7, 2024

pix2struct-ocrvqa-large

Model card for Pix2Struct - Finetuned on OCR-VQA (Visual Question Answering over book covers) - large version…

Visual Question Answering • March 7, 2024

pix2struct-widget-captioning-base

Model card for Pix2Struct - Finetuned on Widget Captioning (Captioning a UI component on a screen)…
