Chi-Myung KwonSungwon HwangJae-Un Jung
ABSTRACTSeasonal influenza epidemics cause 3 to 5 millions severe illness and 250,000 to 500,000 deaths worldwide each year. To prepare better controls on severe influenza epidemics, many studies have been proposed to achieve near real-time surveillance of the spread of influenza. Korea CDC publishes clinical data of influenza epidemics on a weekly basis typically with a 1-2-week reporting lag. To provide faster detection of epidemics, recently approaches using unofficial data such as news reports, social media, and search queries are suggested. Collection of such data is cheap in cost and is realized in near real-time. This research aims to develop regression models for early detecting the outbreak of the seasonal influenza epidemics in Korea with keyword query information provided from the Naver (Korean representative portal site) trend services for PC and mobile device. We selected 20 key words likely to have strong correlations with influenza-like illness (ILI) based on literature review and proposed a logistic regression model and a multiple regression model to predict the outbreak of ILI. With respect of model fitness, the multiple regression model shows better results than logistic regression model. Also we find that a mobile-based regression model is better than PC-based regression model in estimating ILI percentages.
Qingyu YuanElaine O. NsoesieBenfu LvPeng GengRumi ChunaraJohn S. Brownstein
Jeremy GinsbergMatthew H. MohebbiRajan PatelLynnette BrammerMark S. SmolinskiLarry Brilliant
Mauricio SantillanaElaine O. NsoesieSumiko R. MekaruDamon C. ScalesJohn S. Brownstein
Pi GuoJianjun ZhangLi WangShaoyi YangGanfeng LuoChangyu DengYe WenQingying Zhang
Yuzhou ZhangLaith YakobMichael B. BonsallWenbiao Hu