CALL FOR TASK PARTICIPATION
SHINRA2020-ML Classification Task
<http://shinra-project.info/shinra2020ml/?lang=en>

Data release: January 2020
Registration & Result submission deadline: July 31, 2020
NTCIR-15 Conference: December 2020

SHINRA <http://shinra-project.info/?lang=en> is a resource creation project started
in the year 2017, aiming to structure the knowledge in Wikipedia.
SHINRA2020-ML is the first shared-task of text classification in project SHINRA,
tackling the challenge of classifying 30 language Wikipedia entities in fine-grained
categories. The task is conducted as one of the NTCIR-15 tasks.

[Video] (approx.11 min):
Introduction of SHINRA2020-ML task (categorization of 30-language Wikipedia into ENE)


TASK OVERVIEW
The task is to classify 30 language (*1) Wikipedia pages into 219 categories using
categorized Japanese Wikipedia pages and the interlanguage links to the corresponding
pages in target languages. The categories are defined in Extended Named Entity (ENE)
ver.8.0 <http://ene-project.info/ene8/?lang=en>, a four-layer ontology for classifying
names, time, and numbers.

The participants are expected to select one or more target languages, and for each
language, use the Wikipedia pages linked from the categorized Japanese pages as the
training data, and run the system to classify the remaining pages which are not linked
from the Japanese pages. Please see the TASK DESCRIPTION
<http://shinra-project.info/call-for-participation/?lang=en#task_description> on the home
page for further details.

After the task is over, we (including the participants) will combine the results by all the
participants (i.e. by Ensemble learning), and publish the results to the public. It is a
scheme called “Resource by Collaborative Contribution (RbCC)”.

We are expecting many participants with a good will.

(*1): The 30 languages are English, Spanish, French, German, Chinese, Russian,
Portuguese, Italian, Arabic, Indonesian, Turkish, Dutch, Polish, Persian, Swedish,
Vietnamese, Korean, Hebrew, Romanian, Norwegian, Czech, Ukrainian, Hindi, Finnish,
Hungarian, Danish, Thai, Catalan, Greek, Bulgarian.

IMPORTANT DATES
January 2020 Data release
July 31, 2020 Registration & Result submission deadline
August 20, 2020 Evaluation results due back to participants
December 2020 NTCIR-15 Conference (NII, Tokyo)

HOW TO PARTICIPATE
We encourage new participants to have a look at the data in “Trial datasets"
<http://shinra-project.info/shinra2020ml/2020ml_data/?lang=en#trial>. How to download
the datasets and participate in the task is described here
<http://shinra-project.info/shinra2020ml/howtoparticipate/?lang=en>.

Please note that the task is conducted as one of the NTCIR-15 tasks
<http://research.nii.ac.jp/ntcir/ntcir-15/tasks.html> and you have to register through
the NTCIR-15 registration page <http://research.nii.ac.jp/ntcir/ntcir-15/howto.html> to
participate in it.

ORGANIZERS
Chair
Satoshi Sekine (RIKEN AIP, Japan)

Organizing Committee
Masako Nomoto (RIKEN AIP, Japan)
Asuka Sumida (RIKEN AIP, Japan)
Kouta Nakayama (University of Tsukuba/ RIKEN AIP, Japan)
Koji Matsuda (RIKEN AIP/ Tohoku University, Japan)

PC Members
Jiewen Wu (A*STAR, Singapore)
Christophe Gravier (Université de Lyon, France)
Hsin-Hsi Chen (National Taiwan University, Taiwan)
Haizhou Li (National University of Singapore, Singapore)
Virach Sornlertlamvanich (Thammasat Univercity, Thailand / Musashino University,
Japan)
Massimo Poesio (Mary Queen University of London, England)
Rafael Muñoz Guillena (Universitat d’Alacant, Spain)
Min Zhang (Soochow University, China)
Wenliang Chen (Soochow University, China)
Johan Bos (University of Groningen, Netherland)
Gerhard Weikum (DFKI, Germany)
Asif Ekbal (IIT Patna, India)
Gjergji Kasneci (Tübingen University, Germany)
Vasudeva Varma (IIIT Hyderabad, India)
Asanee Kasetsart (Kasetsart University, Thailand)
Pierpaolo Basile (Università degli Studi di Bari Aldo Moro, Italy)
David Nadeau (Innodata, Canada)
Murat Can Ganiz (Marmara University, Turkey)
Adrian Iftene (“Alexandru Ioan Cuza” University, Romania)
Tommi A Pirinen (Universität Hamburg, Germany)
Tru Cao (The University of Texas Health Science Center at Houston, USA)
Petya Osenove (Sofia University “St. Kl. Ohridski”, Bulgaria)
Le Hong Phuong (Vietnam National University, Hanoi, Vietnam)
Nguyen Thi Minh Huyen (Vietnam National University, Hanoi Vietnam)
Nicolas Heist (Universität Mannheim, Germany)
Zdenek Zabokrtsky (Charles University, Czech Republic)
Tim Finin (University of Maryland, USA)
Su Jian (A*STAR, Singapore)
Manar Alkhatib (The British University in Dubai, United Arab Emirates)
Key-Sun Choi (Korea Advanced Institute of Science and Technology, Korea)
Nigel Collier (University of Cambridge, UK)
Ikuya Yamada (Studio Ousia, Japan)
Kentaro Inui (Tohoku University/ RIKEN AIP, Japan)
Tomoya Iwakura (Fujitsu, Japan)
Mehrnoush Shamsfard (Shahid Beheshti University, Iran)
Galia Angelova (Bulgarian Academy of Sciences, Bulgaria)
Yusuke Miyao (The University of Tokyo, Japan)
Kiril Simov (Bulgarian Academy of Sciences, Bulgaria)
Yukino Baba (University of Tsukuba, Japan)
Masaharu Yoshioka (Hokkaido University, Japan)
Heng Ji (University of Illinois at Urbana-Champaign, USA)
Miloslav Konopik (University of West Bohemia, Czech Republic)
Steven Skiena (Stony Brook University, USA)
Catherine Legg (Deakin University, Australia)

CONTACT
Email to the organizers:

Slack among the participants and organizers:
<http://shinra2020-ml.slack.com>
[Invitation link]


LINKS
SHINRA2020-ML homepage:
<http://shinra-project.info/shinra2020ml/?lang=en>
Extended Named Entity:
<http://ene-project.info/?lang=en>
NTCIR-15 Task Overview and Call for Task Participation:
<http://research.nii.ac.jp/ntcir/ntcir-15/tasks.html>