ACHVR AI

OpenAI: Singapore’s Temasek in talks to invest in OpenAI: report

Singapore's Temasek Holdings is in discussions to invest in Microsoft-backed artificial intelligence company OpenAI, the Financial Times reported on Tuesday, citing two people familiar with the matter. Senior executives at Singapore's state investment firm have met the ChatGPT maker's CEO, Sam Altman, multiple times in recent months, the report added. OpenAI did not immediately respond to Reuters requests…

March 6, 2024

AI News

Reinforcement learning helping fine-tune Gemini Pro, says Google Cloud CEO | Technology News

Google has started rolling out its Gemini Pro API to developers and organisations, along with a range of other AI tools, models, and infrastructure. Developers on Google Studio will now be able to access the Gemini Pro API, which is also available to enterprises through Google Cloud’s Vertex AI platform. “We are moving quickly to bring…

March 6, 2024

Text Classification

NSFW-detector

Facilitating NSFW Text Detection in Open-Domain Dialogue Systems via Knowledge Distillation. Overview: CensorChat is a dialogue monitoring dataset aimed at NSFW dialogue detection. Leveraging knowledge distillation techniques involving GPT-4 and ChatGPT, this dataset offers a cost-effective means of constructing NSFW content detectors. The process entails…

March 6, 2024

VLLMs

udop-large-512-300k

UDOP model: The UDOP model was proposed in "Unifying Vision, Text, and Layout for Universal Document Processing" by Zineng Tang, Ziyi Yang, Guoxin Wang, Yuwei Fang, Yang Liu, Chenguang Zhu, Michael Zeng, Cha Zhang, and Mohit Bansal. Model description: UDOP adopts an encoder-decoder Transformer architecture based on T5 for document AI tasks like…

March 6, 2024

Document Question Answering

CQI_Visual_Question_Awnser_PT_v0

Getting started with the model

To run these examples, you must have PIL, pytesseract, and PyTorch installed in addition to transformers.

    from transformers import pipeline

    nlp = pipeline(
        "document-question-answering",
        model="impira/layoutlm-document-qa",
    )

    nlp(
        "https://templates.invoicehome.com/invoice-template-us-neat-750px.png",
        "What is the invoice number?"
    )
    # {'score':…

March 6, 2024

Visual Question Answering

git-large-textvqa

GIT (short for GenerativeImage2Text) model, large-sized version, fine-tuned on TextVQA. It was introduced in the paper "GIT: A Generative Image-to-text Transformer for Vision and Language" by Wang et al. and first released in this repository. Disclaimer: The team releasing GIT did not write a model card for this…

March 6, 2024

Depth Estimation

lap-depth-nyu

LapDepth-release: This repository is a PyTorch implementation of the paper "Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals" by Minsoo Song, Seokjae Lim, and Wonjun Kim*, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT). Official Repository: LapDepth-release. License: GPL-3.0 license. Usage: from model import…

March 6, 2024

Image Classification

cafe_aesthetic

Info: Since people are downloading this and I don't know why, I'll add some information. This model is an image classifier fine-tuned on microsoft/beit-base-patch16-384. Its purpose is to be used in the dataset conditioning step for the Waifu Diffusion project, a fine-tune effort for Stable Diffusion. As WD1.4 is…

March 6, 2024

TF_Decision_Trees

Creatie

Dottypost
