Eliciting Disease Data from Wikipedia Articles

Authors

  • Geoffrey Fairchild
  • Lalindra De Silva
  • Sara Y. Del Valle
  • Alberto M. Segre

DOI:

https://doi.org/10.5210/ojphi.v8i1.6526

Abstract

Traditional disease surveillance systems suffer from several disadvantages, including reporting lags and antiquated technology, that have caused a movement towards internet-based disease surveillance systems. This study presents the use of Wikipedia article content in this sphere.  We demonstrate how a named-entity recognizer can be trained to tag case, death, and hospitalization counts in the article text. We also show that there are detailed time series data that are consistently updated that closely align with ground truth data.  We argue that Wikipedia can be used to create the first community-driven open-source emerging disease detection, monitoring, and repository system.

Downloads

Published

2016-03-24

How to Cite

Fairchild, G., De Silva, L., Del Valle, S. Y., & Segre, A. M. (2016). Eliciting Disease Data from Wikipedia Articles. Online Journal of Public Health Informatics, 8(1). https://doi.org/10.5210/ojphi.v8i1.6526

Issue

Section

Poster Presentations