AI Health Data Challenge: part of the International Workshop on AI in Health: Transferring and Integrating Knowledge for Better Health
Sponsored by Suit Endowment Fund and Mary R. Boyvey Dean's Excellence Fund at School of Information, University of Texas at Austin

Co-located with The Web Conference 2021
April 19, 2021, Ljubljana, Slovenia

1st Prize: $1,000; 2nd Prize: $800; 3rd Prize: $200

This AI Health Data Challenge is part of the International Workshop on AI in Health at The Web Conference 2021. The goal of the challenge is to promote data-driven and AI-driven approaches that enable better health. Participants can join individually or as a team to develop either a tool or a data analytics package based on the MIMIC datasets and the PubMed Knowledge Graph. Potential examples include (but are not limited to):
- Applying and developing novel AI algorithms to automatically generate bounding boxes that annotate MIMIC CXR chest x-ray images; enable visual question answering using CXR radiology reports and images; automatically generate radiology reports from CXR chest x-ray images; or create human-centered AI approaches for medical imaging diagnosis.
- Increasing the interpretability of AI approaches for patient risk prediction based on MIMIC EHR datasets.
- Developing apps that enable evidence-based care for doctors and patients, based on MIMIC EHR datasets and using the FHIR standards.
- Applying graph mining to the PubMed Knowledge Graph.
- Building a front end for the PubMed Knowledge Graph.

Thanks to sponsorship from the Suit Endowment Fund and the Mary R. Boyvey Dean's Excellence Fund at the School of Information, University of Texas at Austin, cash awards will go to the top three groups: $1,000 for 1st Prize, $800 for 2nd Prize, and $200 for 3rd Prize.

Data Access

There is a formal process for requesting access to the MIMIC datasets. You will need to pass the Human Subject Online Course from MIT in order to get permission to download the data. This process can take anywhere from several days to several weeks, so plan accordingly. Participants must agree to the MIMIC data use agreement.

PubMed Knowledge Graph
The PubMed Knowledge Graph covers PubMed articles from 1800 to 2020. It combines bio-entities extracted from 29 million PubMed articles using BioBERT, disambiguated author names, funding data integrated through NIH ExPORTER, the affiliation history and educational background of authors from ORCID, and fine-grained affiliation data from MapAffil (Xu et al., 2020). By integrating these credible multi-source datasets, the knowledge graph captures connections among bio-entities (e.g., gene/protein, disease, drug/chemical, species, and mutation), authors, articles, affiliations, and funding; its author name disambiguation reaches an F1 score of 98.09%. The graph contains 14,830,461 authors, 18,361,409 bio-entities, 8,300,984 affiliations, and 102,070 NIH-funded projects. For more details, please visit:

Judging Criteria
A panel of judges will be formed from experts in healthcare, data science, artificial intelligence, and entrepreneurship. Your submission will be judged on the following criteria:
- Usability: Does your app/tool usefully address a healthcare issue? Is your app/tool easy to use?
- Novelty: What are the new features in your app/tool? Does your app/tool do anything creative and exciting?
- Reproducibility: Can others reproduce your app/tool?

Submission Guidelines
Please submit the following items via EasyChair before 11 PM Central Time on April 5, 2021 (please check the data challenge website for the updated EasyChair link). Submit only one zip file containing everything, with the submitter's full name as the filename:
- App/tool: a link to your GitHub repository
- A 2-5 page report on how you built the app/tool. This report should describe your methods in detail and include screenshots of your app/tool.
- A 5-minute video about your app/tool
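
The single-zip requirement above can be met with a few lines of Python. The following is a minimal sketch, not an official submission tool: the function name build_submission_zip, the example file names, and the choice to replace spaces in the full name with underscores are all assumptions made for illustration.

```python
import zipfile
from pathlib import Path


def build_submission_zip(full_name: str, files: list[Path], out_dir: Path) -> Path:
    """Bundle the given files into one zip named after the submitter."""
    # Fail early if any submission item is missing.
    for path in files:
        if not path.is_file():
            raise FileNotFoundError(f"Missing submission item: {path}")

    # Assumed convention: spaces in the full name become underscores.
    zip_path = out_dir / (full_name.strip().replace(" ", "_") + ".zip")

    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for path in files:
            # Store each file at the top level of the archive.
            archive.write(path, arcname=path.name)
    return zip_path
```

For example, build_submission_zip("Jane Doe", [Path("github_link.txt"), Path("report.pdf"), Path("demo.mp4")], Path(".")) would write Jane_Doe.zip to the current directory, assuming those three hypothetical files exist.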

Important Dates
- AI health data challenge submission: April 5, 2021
- Notification for final round: April 15, 2021
- Competition for final awards: April 19, 2021

Data Challenge Organizers
School of Information, University of Texas at Austin, USA
Dell Medical School, University of Texas at Austin, USA
MIT Critical Data, MIT, USA

Contact Information
Ying Ding: