Wikipedia usage data has been harnessed to estimate the prevalence of influenza-like illness (ILI) in the US population. By observing the number of times certain key Wikipedia articles are viewed each day, a model was developed that accurately estimated ILI, within 0.27% of official Centers for Disease Control and Prevention data. Additionally, this method was able to accurately determine the week in which ILI peaked 17% more often than Google Flu Trends. This work demonstrates the power of open, freely available data to aid in disease surveillance.
Authors own copyright of their articles appearing in the Online Journal of Public Health Informatics. Readers may copy articles without permission of the copyright owner(s), as long as the author and OJPHI are acknowledged in the copy and the copy is used for educational, not-for-profit purposes. Share-alike: when posting copies or adaptations of the work, release the work under the same license as the original. For any other use of articles, please contact the copyright owner. The journal/publisher is not responsible for subsequent uses of the work, including uses infringing the above license. It is the author's responsibility to bring an infringement action if so desired by the author.