Recent Updates


  • Feb 8, 2022 Please check out this page for information on paper submission.
  • Feb 5, 2022 Rankings are out.
  • Jan 25, 2022 Test phase duration is increased by 2 days. New end date Jan 30 23:59 UTC.
  • Jan 04, 2022 FAQs about test phase are answered.
  • Oct 28, 2021 Submission site on Codalab is open now. Register now!
  • Oct 25, 2021 Updated information on participation and preparing submissions.
  • Sep 10, 2021 A Baseline system with dev set results is available.
  • Join us in Slack.
  • Sep 03, 2021 Training data is available.
  • Aug 23, 2021 Competition page has some trial data.

Important Dates

* All deadlines are calculated at 11:59 pm
UTC-12 hours

Trial Data Ready Jul 31 (Sat), 2021
Training Data Ready Sep 3 (Fri), 2021
Evaluation Start Jan 24 (Mon), 2022
Evaluation End Jan 30 (Sun), 2022
System Description Paper Submission Due Feb 28 (Mon), 2022
Notification to Authors Mar 31 (Thu), 2022
Camera-ready Due Apr 21 (Thu), 2022
Workshop 14-15 July 2022 co-located with NAACL

The dataset is publicly available here.

Data Statistics

Language Training Validataion Test
BN-Bangla 15,300 800 133,119
DE-German 15,300 800 217,824
EN-English 15,300 800 217,818
ES-Spanish 15,300 800 217,887
FA-Farsi 15,300 800 165,702
HI-Hindi 15,300 800 141,565
KO-Korean 15,300 800 178,249
NL-Dutch 15,300 800 217,337
RU-Russian 15,300 800 217,501
TR-Turkish 15,300 800 136,935
ZH-Chinese 15,300 800 151,661
MULTI-Multilingual 168,300 8,800 471,911
MIX-Code mixed 1,500 500 100,000
Total 338,100 18,100 2,567,509

Communication