Skip to content

[EDA] Statistical modelling of Novel Coronavirus breakout nCoV-2019

Notifications You must be signed in to change notification settings

ganeshkumartk/nCoV-2019

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

nCoV-2019

Date: Jan 29 2020

Estimating Potential Outbreak Size in Wuhan before Shut Down (Jan 22)

Background

  1. Wuhan shut down time: Jan 23 2 am
  2. The offical confirmed cases in wuhan is just more than 100 before shut down, then this offical number goes up too 2000 on Jan 29.
  3. Population of international city Wuhan: 11 mllion; estimated airport catchment 19 million.

Considering the incubation period and under-reporting, we are trying to estmate the potential number of 2019-nCoV cases in wuhan. However, the sample we use is the number of detected cases overseas out from wuhan, thus we only be able to estimate the possible cases in wuhan before wuhan shut down.

Goal:

  1. Re-produce model in Estimating the potential total number of novel Coronavirus cases in Wuhan City, China (Jan 21 2020)
  2. Estimate potential Outbreak Size in Wuhan until Jan 22, using the number of cases detected overseas (Jan 22, Jan 26 and Jan 29). We believe the estimated result based on the observation of Jan 29 is more accurate since most of oversea cases have been detected until this day.

The sample, used in the model estimation, is the number of cases detected outside mainland China (international travel from Wuhan).

  • 8 detected cases overseas until 22 Jan. (all travelled out from Wuhan)
  • 29 detected cases overseas until 26 Jan (all travelled out from Wuhan)
  • 67 detected cases overseas until 29 Jan (all travelled out from Wuhan)

Main Conclusion:

Since 29 Jan is 7 days after Wuhan Shut down, most of oversea cases have been detected until 29 Jan. We believe the estimated result based on the observation of Jan 29 is more accurate.

  • There are at least 4600 95% CI(2100, 8550) cases in Wuhan, until Jan 19. (There is time deley between suspected and confirmed)
  • Based on 29 confirmed overseas cases in Jan 26, There are more than 16600 cases 95% CI(11310, 23480) before Jan 23.
  • Based on 67 confirmed overseas cases in Jan 29, There are more than 38500 cases 95% CI(30000, 48470) before Jan 23.
  • Commuting flows has significant impact on 2019-nCov epidemic growth rate

Statistical Modelling:

Suppose population follows binomial distribution Bin(p,N)

  • p: probability any one case will be detected overseas (international travel from Wuhan)
  • N: negative binomially distributed function of X (number of cases detected outside mainland China)

Sensitivity analysis:

Sensitivity analysis to estimate current cases in wuhan based on 3 scenarios:

  1. Baseline
    • 9 or (67) overseas confirmed cases until Jan 22 and 29.
    • 10 day mean time to detection
    • 19 million airportCatchment
  2. Smaller catchment:
    • airportCatchment = wuhan_population = 11 millio.
  3. Shorter detection window:
    • 8 day mean time to detection

Profile likelihood CI

In general, the confidence interval based on the standard error strongly depends on the assumption of normality for the estimator, which is something we cannot guarantee in this case. The "profile likelihood confidence interval" provides an alternative.

We defined a Binomial likelihood for the number of exported cases and used this function to find the MLE of the number of cases in Wuhan, using the profile likelihood approach to identify the 95% CI around the MLE. The lower and upper bounds of the 95%CI are those values by which the log-likelihood difference from the maximum log-likelihood is 1.92 (95%-percentile of the Chi-square(1) distribution).


Key assumptions:

95% Confidence interval (uncertainty range) represents uncertainty in Epi-based assumptions as well as statistical assumptions

Epi-based assumptions

  1. Unable to estimate the epidemic growth rate
  2. Total number of international travel from Wuhan over the last 2 months has been 3301 per day.
    • This number is based on previous year data, we can calculate more accurately by airport data of 2019
    • This number is 0 after Jan 23 (Wuhan shut down)
  3. Estimates donot include cases with mild or no symptoms
  4. The incubation period is unknown, and has been approximated by MERS-nCov and SARS.

Statistical assumptions

  1. We assume independent relationship between exported cases and confirmed cases within Wuhan
    • However, exit screening may have reduced exports in recent days
    • Outcome: underestimate of the true number
  2. The number of cases detected overseas is binomially distributed with Binomial(p), where p is the probability any one case will be detected overseas.
    • This assumption only influence the interval estimation
  3. Assume international travel is independent of the risk of exposure to 2019n-CoV and infection status.
    • However, the international traveller might be wealthier people and with lower risk of exposure to 2019n-CoV

About

[EDA] Statistical modelling of Novel Coronavirus breakout nCoV-2019

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published