deberta-v3-large-snli_mnli_fever_anli_R1_R2_R3-nli
Datasets
This model was trained on the snli-v1.0, multi-nli-1.0, nli-fever and anli-1.0-r1/anli-1.0-r2/anli-1.0-r3 datasets with the training weights of 1,1,1,10,20,10 respectively.
The training codes are mostly referenced from: https://github.com/facebookresearch/anli
Hyperparameters
learning_rate: 1e-5
max_length: 156
batch_size: 16
warmup_ratio: 0.1
weight_decay: 0.0
num_epochs: 2
Dev results
snli-v1.0 | multi-nli-1.0-m | multi-nli-1.0-mm | anli-1.0-r1 | anli-1.0-r2 | anli-1.0-r3 |
---|---|---|---|---|---|
0.938 | 0.914 | 0.912 | 0.796 | 0.627 | 0.610 |
Test results
snli-v1.0 | anli-1.0-r1 | anli-1.0-r2 | anli-1.0-r3 |
---|---|---|---|
0.929 | 0.775 | 0.636 | 0.612 |
- Downloads last month
- 101
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.