Orca task clusters

ORCA is composed of 60 publicly available datasets and covers 29 tasks within seven NLU task clusters as follows.


(1) Natural Language Inference (NLI)

Task Name Identifier
Variation
Score MetricURL
BibTeX
ANS Stanceans-stance
MSA
Macro F1-score
Baly Stancebaly-stance
MSA
Macro F1-score
XLNIxlni
MSA
Macro F1-score


(2) Question Answering (QA)

Task Name Identifier
Variation
Score MetricURL
BibTeX
Question Answeringqa
MSA
Macro F1-score


(3) Semantic Textual Similarity and and Paraphrase (STSP)

Task Name Identifier
Variation
Score MetricURL
BibTeX
Emotion Regression emotion-reg
MSA
Spearman Correlation
MQ2Qmq2q
MSA
Macro F1-score
STSsts
MSA
Spearman Correlation


(4) Sentence Classification (SC)

Task Name Identifier
Variation
Score MetricURL
BibTeX
Abusiveabusive
DA
Macro F1-score
Adultadult
DA
Macro F1-score
Ageage
DA
Macro F1-score
ANS Claimans-claim
MSA
Macro F1-score
Dangerousdangerous
DA
Macro F1-score
Dialect at Binary Leveldialect-binary
DA
Macro F1-score
Dialect at Country Leveldialect-country
DA
Macro F1-score
Dialect at Region Leveldialect-region
DA
Macro F1-score
Emotionemotion
DA
Macro F1-score
Gendergender
DA
Macro F1-score
Hate Speechhate-speech
DA
Macro F1-score
Ironyirony
DA
Macro F1-score
Machine Generation machine-generation
MSA
Macro F1-score
Offensiveoffensive
DA
Macro F1-score
Sarcasmsarcasm
DA
Macro F1-score
Sentiment Analysissentiment
DA
Macro F1-score


(5) Structure Predictions (SP)

Task Name Identifier
Variation
Score MetricURL
BibTeX
Aqmar NERaqmar-ner
MSA
Macro F1-score
Arabic NER Corpusarabic-ner
MSA
Macro F1-score
Dialect Part Of Speechdialect-pos
DA
Macro F1-score
MSA Part Of Speechmsa-pos
MSA
Macro F1-score


(6) Topic Classification (TC)

Task Name Identifier
Variation
Score MetricURL
BibTeX
Topictopic
MSA
Macro F1-score


(7) Word Sense Disambiguation (WSD)

Task Name Identifier
Variation
Score MetricURL
BibTeX
Word Sense Disambiguationwsd
MSA
Macro F1-score