Transformers
PyTorch
English
roberta
Machine Learning
Research Papers
Scientific Language Model
Entity
Instructions to use shrutisingh/MLEntityRoBERTa with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use shrutisingh/MLEntityRoBERTa with Transformers:
# Load model directly from transformers import AutoTokenizer, MaskedRoBERTa tokenizer = AutoTokenizer.from_pretrained("shrutisingh/MLEntityRoBERTa") model = MaskedRoBERTa.from_pretrained("shrutisingh/MLEntityRoBERTa") - Notebooks
- Google Colab
- Kaggle
MLEntityRoBERTa
How to use:
from transformers import AutoTokenizer, AutoModel
tok = AutoTokenizer.from_pretrained('shrutisingh/MLEntityRoBERTa')
model = AutoModel.from_pretrained('shrutisingh/MLEntityRoBERTa')
Pretraining Details:
This is a variant of the MLRoBERTa model which is trained on a masked dataset. The dataset of MLRoBERTa is modified to replace specific scientific entities in a paper with generic labels. The idea is to make the model focus more on the syntax and semantics of the text without getting confused by specific entity names. Scientific entities which belong to any one of the classes: TDMM (task, dataset, method, metric) are masked with these specific labels. The entity set is manually cleaned and mapped to appropriate labels. Eg: The authors present results on MNIST. -> The authors present results on dataset.
Citation:
@inproceedings{singh2021compare,
title={COMPARE: a taxonomy and dataset of comparison discussions in peer reviews},
author={Singh, Shruti and Singh, Mayank and Goyal, Pawan},
booktitle={2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)},
pages={238--241},
year={2021},
organization={IEEE}
}
- Downloads last month
- 2
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support