Singapore's Temasek Holdings is in discussions to invest in Microsoft-backed artificial intelligence company OpenAI, the Financial Times reported on Tuesday, citing two people familiar with the matter.
Senior executives at Singapore's state investment firm have met ChatGPT maker's CEO, Sam Altman, multiple times in recent months, the report added.OpenAI did not immediately respond to Reuters requests…
Google has started rolling out its Gemini Pro API to developers and organisations along with a range of other AI tools, models, and infrastructure. Developers on Google Studio will now be able to access Gemini Pro API which is also available to enterprises through Google Cloud’s Vertex AI platform.
“We are moving quickly to bring…
Facilitating NSFW Text Detection in Open-Domain Dialogue Systems via Knowledge Distillation
⚙️ GitHub •
📄 Paper •
🤗 Model
Overview
CensorChat is a dialogue monitoring dataset aimed at NSFW dialogue detection. Leveraging knowledge distillation techniques involving GPT-4 and ChatGPT, this dataset offers a cost-effective means of constructing NSFW content detectors. The process entails…
UDOP model
The UDOP model was proposed in Unifying Vision, Text, and Layout for Universal Document Processing by Zineng Tang, Ziyi Yang, Guoxin Wang, Yuwei Fang, Yang Liu, Chenguang Zhu, Michael Zeng, Cha Zhang, Mohit Bansal.
Model description
UDOP adopts an encoder-decoder Transformer architecture based on T5 for document AI tasks like…
Getting started with the model
To run these examples, you must have PIL, pytesseract, and PyTorch installed in addition to transformers.
from transformers import pipeline
nlp = pipeline(
"document-question-answering" ,
model="impira/layoutlm-document-qa" ,
)
nlp(
"https://templates.invoicehome.com/invoice-template-us-neat-750px.png" ,
"What is the invoice number?"
)
# {'score':…
GIT (GenerativeImage2Text), large-sized, fine-tuned on TextVQA
GIT (short for GenerativeImage2Text) model, large-sized version, fine-tuned on TextVQA. It was introduced in the paper GIT: A Generative Image-to-text Transformer for Vision and Language by Wang et al. and first released in this repository.
Disclaimer: The team releasing GIT did not write a model card for this…
LapDepth-release
This repository is a Pytorch implementation of the paper "Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals"
Minsoo Song, Seokjae Lim, and Wonjun Kim* IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
Official Repository: LapDepth-release
License: GPL-3.0 license
Usage
from model import…
Edit model card
Info
License
Info
Since people are downloading this and I don't know why, I'll add some information. This model is an image classifier fine-tuned on microsoft/beit-base-patch16-384.
Its purpose is to be used in the dataset conditioning step for the Waifu Diffusion project, a fine-tune effort for Stable Diffusion. As WD1.4 is…
