Facilitating NSFW Text Detection in Open-Domain Dialogue Systems via Knowledge Distillation
⚙️ GitHub •
📄 Paper •
🤗 Model
Overview
CensorChat is a dialogue monitoring dataset aimed at…
How to Get Started with the Model
To use the model through Hosted inference API, follow the code snippet provided below:
from transformers import BertTokenizer, BertForSequenceClassification
def …
Reward model trained from human feedback
Reward model (RM) trained to predict which generated answer is better judged by a human, given a question.
RM are useful in these…
⚠️ Disclaimer:
The huggingface models currently give different results to the detoxify library (see issue here). For the most up to date models we recommend using the models from
…
SiEBERT - English-Language Sentiment Classification
Overview
This model ("SiEBERT", prefix for "Sentiment in English") is a fine-tuned checkpoint of RoBERTa-large (Liu et al. 2019). It enables reliable…
Emotion English DistilRoBERTa-base
Description ℹ
With this model, you can classify emotions in English text data. The model was trained on 6 diverse datasets (see Appendix below)…
Twitter-roBERTa-base for Sentiment Analysis
This is a roBERTa-base model trained on ~58M tweets and finetuned for sentiment analysis with the TweetEval benchmark. This model is suitable for English (for…
CAMeLBERT-DA SA Model
Model description
CAMeLBERT-DA SA Model is a Sentiment Analysis (SA) model that was built by fine-tuning the CAMeLBERT Dialectal Arabic (DA) model.
For the fine-tuning,…
