Background: Monitoring levels and trends in premature mortality is crucial to understanding how societies can address prominent sources of early death. The Global Burden of Disease 2016 Study (GBD 2016) provides a comprehensive assessment of cause-specifc mortality for 264 causes in 195 locations from 1980 to 2016. This assessment includes evaluation of the expected epidemiological transition with changes in development and where local patterns deviate from these trends. Methods: We estimated cause-specifc deaths and years of life lost (YLLs) by age, sex, geography, and year. YLLs were calculated from the sum of each death multiplied by the standard life expectancy at each age. We used the GBD cause of death database composed of: vital registration (VR) data corrected for under-registration and garbage coding; national and subnational verbal autopsy (VA) studies corrected for garbage coding; and other sources including surveys and surveillance systems for specifc causes such as maternal mortality. To facilitate assessment of quality, we reported on the fraction of deaths assigned to GBD Level 1 or Level 2 causes that cannot be underlying causes of death (major garbage codes) by location and year. Based on completeness, garbage coding, cause list detail, and time periods covered, we provided an overall data quality rating for each location with scores ranging from 0 stars (worst) to 5 stars (best). We used robust statistical methods including the Cause of Death Ensemble model (CODEm) to generate estimates for each location, year, age, and sex. We assessed observed and expected levels and trends of cause-specifc deaths in relation to the Socio-demographic Index (SDI), a summary indicator derived from measures of average income per capita, educational attainment, and total fertility, with locations grouped into quintiles by SDI. Relative to GBD 2015, we expanded the GBD cause hierarchy by 18 causes of death for GBD 2016. Findings: The quality of available data varied by location. Data quality in 25 countries rated in the highest category (5 stars), while 48, 30, 21, and 44 countries were rated at each of the succeeding data quality levels. Vital registration or verbal autopsy data were not available in 27 countries, resulting in the assignment of a zero value for data quality. Deaths from non-communicable diseases (NCDs) represented 72·3% (95% uncertainty interval [UI] 71·2-73·2) of deaths in 2016 with 19·3% (18·5-20·4) of deaths in that year occurring from communicable, maternal, neonatal, and nutritional (CMNN) diseases and a further 8·43% (8·00-8·67) from injuries. Although age-standardised rates of death from NCDs decreased globally between 2006 and 2016, total numbers of these deaths increased; both numbers and age-standardised rates of death from CMNN causes decreased in the decade 2006-16 – age-standardised rates of deaths from injuries decreased but total numbers varied little. In 2016, the three leading global causes of death in children under-5 were lower respiratory infections, neonatal preterm birth complications, and neonatal encephalopathy due to birth asphyxia and trauma, combined resulting in 1·80 million deaths (95% UI 1·59 million to 1·89 million). Between 1990 and 2016, a profound shift toward deaths at older ages occurred with a 178% (95% UI 176-181) increase in deaths in ages 90-94 years and a 210% (208-212) increase in deaths older than age 95 years. The ten leading causes by rates of age-standardised YLL signifcantly decreased from 2006 to 2016 (median annualised rate of change was a decrease of 2·89%); the median annualised rate of change for all other causes was lower (a decrease of 1·59%) during the same interval. Globally, the fve leading causes of total YLLs in 2016 were cardiovascular diseases; diarrhoea, lower respiratory infections, and other common infectious diseases; neoplasms; neonatal disorders; and HIV/AIDS and tuberculosis. At a fner level of disaggregation within cause groupings, the ten leading causes of total YLLs in 2016 were ischaemic heart disease, cerebrovascular disease, lower respiratory infections, diarrhoeal diseases, road injuries, malaria, neonatal preterm birth complications, HIV/AIDS, chronic obstructive pulmonary disease, and neonatal encephalopathy due to birth asphyxia and trauma. Ischaemic heart disease was the leading cause of total YLLs in 113 countries for men and 97 countries for women. Comparisons of observed levels of YLLs by countries, relative to the level of YLLs expected on the basis of SDI alone, highlighted distinct regional patterns including the greater than expected level of YLLs from malaria and from HIV/AIDS across sub-Saharan Africa; diabetes mellitus, especially in Oceania; interpersonal violence, notably within Latin America and the Caribbean; and cardiomyopathy and myocarditis, particularly in eastern and central Europe. The level of YLLs from ischaemic heart disease was less than expected in 117 of 195 locations. Other leading causes of YLLs for which YLLs were notably lower than expected included neonatal preterm birth complications in many locations in both south Asia and southeast Asia, and cerebrovascular disease in western Europe. Interpretation: The past 37 years have featured declining rates of communicable, maternal, neonatal, and nutritional diseases across all quintiles of SDI, with faster than expected gains for many locations relative to their SDI. A global shift towards deaths at older ages suggests success in reducing many causes of early death. YLLs have increased globally for causes such as diabetes mellitus or some neoplasms, and in some locations for causes such as drug use disorders, and confict and terrorism. Increasing levels of YLLs might refect outcomes from conditions that required high levels of care but for which efective treatments remain elusive, potentially increasing costs to health systems.
The GBD study provides a highly standardised approach to dealing with the multiple measurement challenges in cause of death assessment, including variable completeness of vital registration (VR) data, levels and trends in the fraction of deaths assigned to garbage codes, the use of verbal autopsy (VA) studies in locations with incomplete VR, and overall data missingness. Here we provide a general description, organised in 12 sections; detail is provided in the methods appendix (appendix 1 p 288). Statistical code used in estimation is available through an online repository; analyses were done using Python version 2.7.12 and 2.7.3, Stata version 13.1, and R version 3.2.2. As in GBD 2015, we follow the Guidelines for Accurate and Transparent Health Estimates Reporting (GATHER) for the development and documentation of GBD 2016 (appendix 1 p 292). The GBD geographical hierarchy includes 195 countries and territories grouped within 21 regions and seven GBD super-regions (appendix 1 p 460). For the GBD 2016 estimation, new subnational assessments were developed for Indonesia by province and for England by local government area. In this publication, we present subnational estimates for all countries with a population greater than 200 million in 2016: Brazil, China, India, Indonesia, and the USA. The likelihood of substantial geographical heterogeneity in these large populations is high, requiring disaggregated assessments to be policy relevant. Due to space limitations, we only provide these subnational estimates in maps; detailed subnational assessments will be provided in separate publications. Cause-specific estimation for GBD 2016 covers the years 1980 to 2016. For a subset of analyses in this paper, we focus on the past decade, from 2006 to 2016, to address more current policy priorities. GBD 2016 results for all years and by location can be explored further with dynamic data visualisations. For GBD, each death is attributed to a single underlying cause—the cause that initiated the series of events leading to death—in accordance with ICD principles. This categorical attribution of causes of death differs from the counterfactual approach, which calculates how many deaths would not have occurred in the absence of disease. GBD also differs from approaches involving excess mortality in people with disease monitored through cohort or other studies. Deaths in such studies might be assigned as the underlying cause, be causally related to the disease, or include deaths with confounding diagnoses.3 The GBD cause list is organised as a hierarchy (appendix 1 p 477), with each level composed of causes of death that are mutually exclusive and collectively exhaustive. The GBD cause hierarchy, with corresponding ICD9 and ICD10 codes, is detailed in appendix 1 (p 300). GBD Level 1 causes are grouped as three broad categories: communicable, maternal, neonatal, and nutritional (CMNN) diseases; NCDs; and injuries. Level 2 causes contain 21 cause groups, including subsets of CMNN causes, cancers, cardiovascular diseases, and types of injuries (eg, transport injuries, self-harm, and interpersonal violence). Individual causes are primarily recorded at Level 3 (eg, malaria, asthma, and road injuries), while a subset of Level 3 causes are disaggregated further to Level 4 causes (eg, four sub-causes within chronic kidney disease). For GBD 2016, we disaggregated some Level 3 causes to expand the cause hierarchy used for GBD 2015 by 18 causes of death. GBD cause list expansion was motivated by two main factors: inclusion of causes that result in substantial burden and inclusion of causes that are of high policy relevance. New causes for GBD 2016 included Zika virus disease, congenital musculoskeletal anomalies, urogenital congenital anomalies, and digestive congenital anomalies. Other leukaemia was added as a Level 4 subcause to leukaemia rather than being estimated in the Level 3 residual category of other neoplasms. The Level 3 cause of collective violence and legal intervention was separated into “executions and police conflict” and “conflict and terrorism”. Disaggregation of existing Level 3 causes resulted in the addition of 11 detailed causes at Level 4 of the cause hierarchy: drug-susceptible tuberculosis, multidrug-resistant tuberculosis, and extensively drug-resistant tuberculosis; drug-susceptible HIV–tuberculosis, multidrug-resistant HIV–tuberculosis, and extensively drug-resistant HIV–tuberculosis; alcoholic cardiomyopathy, myocarditis, and other cardiomyopathy; and self-harm by firearm, and self-harm by other means. Within each level of the hierarchy the number of collectively exhaustive and mutually exclusive causes for which the GBD study estimates fatal outcomes is three at Level 1, 21 at Level 2, 145 at Level 3, and 212 at Level 4. For GBD 2016, separate estimates were developed for a total of 264 unique causes and cause aggregates. The GBD study combines multiple data types to assemble a comprehensive cause of death database. Sources of data included VR and VA data; cancer registries; surveillance data for maternal mortality, injuries, and child death; census and survey data for maternal mortality and injuries; and police records for interpersonal violence and transport injuries. Since GBD 2015, 24 new VA studies and 169 new country-years of VR data at the national level have been added. Six new surveillance country-years, 106 new census or survey country-years, and 528 new cancer-registry country-years were also added. An important development has been the release of the Sample Registration System (SRS) VA data by the Government of India for use in GBD. This includes cause of death data for 455 460 deaths covered by SRS from 2004–06, 2007–09, and 2010–13 across all Indian states and union territories. For this analysis, we established 2005, 2008, and 2012 as midpoint years for these three periods. The SRS in India is operated by the Office of the Registrar General of India working under the Ministry of Home Affairs, Government of India. Using the 2001 census, 7597 geographical units, 4433 (58·4%) of which were rural, were sampled for the 2004–13 SRS, ultimately covering a population of 6·7 million across all states and union territories.20 The inclusion of SRS for GBD 2016 offers a comprehensive picture of causes of death in India, particularly in rural areas. For a subset of causes, we used the India Medical Certification of Cause of Death (MCCD) data source or Survey of Causes of Death (SCD) data rather than SRS. The decision to use MCCD and SCD data in addition to SRS was limited to causes for which we had clear evidence of time trends not reflected by using the three SRS midpoint years alone (eg, maternal mortality). The Office of the Registrar General of India is not involved with the production of the GBD modelled estimates, and as a result their estimates might differ from those presented here. Methods for standardisation or correction of data sources are described in detail in appendix 1 (p 14). The SDI was developed for GBD 2015 to provide an interpretable synthesis of overall development, measured by the geometric mean of scores on relative scales of lag-dependent income per capita (LDI), average educational attainment in the population aged older than 15 years, and total fertility rates (TFR).3 For GBD 2016, the SDI was slightly revised; the correlation of the GBD 2015 and GBD 2016 versions of SDI is 0·977 (p0–9%, 1 star; 0%, 0 stars. Instances in the table that show 1 star despite all zeros in percent well certified are a result of very small values that round to 0 at one decimal place. In GBD, the vast majority of cause of death estimates are modelled using the Cause of Death Ensemble model (CODEm). Due to their unique epidemiology or known biases, a subset of causes of death are modelled using alternative estimation strategies: negative binomial models for relatively rare causes, incidence and case fatality models, subcause proportion models, and prevalence-based models. The estimation of HIV/AIDS also requires a different modelling approach;21 and in previous publications.3, 21, 24 Due to lags in reporting, estimates for the most recent years rely more on the modelling process. Additional details on CODEm and all alternative estimation strategies are provided below and in appendix 1 (p 33 and p 35). Major methodological changes from GBD 2015 were made for several models in GBD 2016: the distribution of antiretroviral therapies (ART) in countries with high HIV/AIDS prevalence were modelled based on an empirical pattern derived from household studies rather than on the assumption that ART was allocated to those individuals most in need; tuberculosis was modelled for prevalence of disease and then for prevalence of latent infection, which were then used as covariates for the CODEm model; malaria in high-endemicity Africa was estimated using a pixel-level geospatial model, while malaria outside of Africa was estimated using a new suite of spatiotemporal covariates in CODEm; and cancer mortality-to-incidence data inclusion and modelling were revised to better capture the likely effects of worse access to treatment in lower-SDI settings. CODEm, used for 177 causes of death for GBD 2016, is the GBD cause of death estimation approach in which a large number of model specifications are systematically tested in terms of functional forms and permutations of relevant covariates which are subsequently used to predict true levels for each cause of death.25, 26 CODEm uses multiple iterations of cross-validation tests to evaluate the out-of-sample predictive validity of model variants that met predetermined requirements for direction and significance of regression coefficients. These models were then combined into a weighted ensemble model, with models performing best on out-of-sample prediction error of both levels and trends weighted highest. Additional details of the methods used to develop these ensemble models are provided in appendix 1 (p 33). Independent CODEm models were run for each cause of death by sex, and separately for countries with and without extensive complete VR data. All data were included in models for countries without extensive VR coverage to enhance predictive validity; data from countries without extensive VR coverage were excluded from models for countries with this coverage to avoid inflation of uncertainty. We used negative binomial models for nine causes of death (other intestinal infectious diseases; upper respiratory infections; diphtheria; varicella and herpes zoster; schistosomiasis; cysticercosis; cystic echinococcosis; ascariasis; and iodine deficiency) for which death counts are typically very low, or might frequently have zero counts in high-SDI countries. For causes in locations with insufficient data from VR or VA data, we used incidence and case fatality models—also known as natural history models—separately estimating incidence and case fatality rates and then combining them to produce estimates of cause-specific mortality. We used incidence and case fatality models for 14 causes: measles; visceral leishmaniasis; African trypanosomiasis; yellow fever; syphilis (congenital); typhoid fever; paratyphoid fever; whooping cough; Zika virus disease; and acute hepatitis A, B, C, and E. We also used an incidence and case fatality model for malaria incidence in sub-Saharan Africa as produced by the Malaria Atlas Project and age-sex-specific case fatality rates from available data.27 For some causes—meningitis, maternal disorders, liver cancer, cirrhosis, and chronic kidney disease—data other than VR data provide considerable additional detail (eg, end-stage renal disease registries), or data are reported in too few places to be modelled directly in the CODEm framework. In these cases, we first estimated the parent cause using CODEm and then estimated subcauses by each age-sex-location-year using the Bayesian meta-regression tool DisMod-MR 2.1, developed for the GBD studies.21, 26, 28 An increased likelihood of reporting Alzheimer’s disease and other dementias, Parkinson’s disease, and atrial fibrillation and flutter as underlying causes of death on death certificates has resulted in an apparent large increase in death rates associated with these diseases. The absence of a parallel increase of the same magnitude in reported rates of age-specific prevalence of these diseases supports the view that these changes are reporting artefacts rather than true changes in epidemiology. Because the redistribution algorithms used to build the cause of death database for previous iterations of GBD did not seem to adequately capture this trend in death certification over time for these causes, estimates for these three causes for GBD 2016 were derived from prevalence surveys and from estimates of excess mortality based on deaths certified in countries with the greatest proportion of deaths allocated to the correct underlying cause of death in recent years. The derivation of cause-specific mortality rates from prevalence and excess mortality models was completed in DisMod-MR 2.1. After generating underlying cause of death estimates and accompanying uncertainty, we combined these models into estimates that are consistent with the levels of all-cause mortality estimated for each age-sex-year-location group using a cause of death correction procedure (CoDCorrect). Using 1000 draws from the posterior distribution of each cause and 1000 draws from the posterior distribution of the estimation of all-cause mortality, we used CoDCorrect to rescale the sum of cause-specific estimates to equal the draws from the all-cause distribution (appendix 1 p 280). We introduced a change in the CoDCorrect algorithm to take into account that deaths from Alzheimer’s disease and Parkinson’s diseases are more likely miscoded to lower respiratory infections, protein-energy malnutrition, other nutritional deficiencies, cerebrovascular disease, interstitial nephritis and urinary tract infections, decubitus ulcer, and pulmonary aspiration and foreign body in airway than other causes (see appendix 1 p 279 for details).29, 30, 31 Fatal discontinuities occur when events such as military operations or terrorism, natural disasters, major transportation accidents, or large infectious disease outbreaks lead to abrupt departures from expected mortality rates in a given location. To capture these events, we used VR data for locations assigned a 4-star or 5-star data quality rating over the period from 1980 to 2016. For locations with a 3-star rating or lower (122 of 195 locations), we used the Uppsala Conflict Data Program for military operations and terrorism;14 the Centre for Research on the Epidemiology of Disasters’ International Emergency Disasters Database for natural disasters, transport accidents, fires, exposure to mechanical forces (eg, building collapses, explosions), and famine;32 and the Global Infectious Diseases and Epidemiology Network for cholera and meningococcal meningitis. The latter two infectious diseases were included as fatal discontinuities for GBD 2016 because CODEm smooths year-to-year irregularities in deaths from these causes and thus risks underestimating their effects. There is frequently a lag in reporting and data publishing for the most recent years, so we used supplementary data sources, including news reports, when gaps existed for known fatal discontinuities. Detail on the data and analytic approaches used for fatal discontinuities is available in appendix 1 (p 39). As for GBD 2015, we calculated the years of life lost (YLLs)—a measure of premature mortality—from the sum of each death multiplied by the standard life expectancy at each age. For GBD 2016, the standard life expectancy at birth was 86·6 years, derived from the lowest observed risk of death for each 5-year age group; to avoid problems associated with small numbers, we restricted this to all populations greater than 5 million individuals in 2016. Age-standardised mortality rates and YLL rates were computed using the world standard population developed for the GBD study,3 which is a time-invariant standard. Details of these calculations are available in appendix 1 (p 281). Point estimates for each quantity of interest were derived from the mean of the draws, while 95% uncertainty intervals (UIs) were derived from the 2·5th and 97·5th percentiles. Uncertainty in the estimation is attributable to sample size variability within data sources, different availability of data by age, sex, year, or location, and cause-specific model specifications. We determined UIs for components of cause-specific estimation based on 1000 draws from the posterior distribution of cause-specific mortality by age, sex, and location for each year included in the GBD 2016 analysis. In this way, uncertainty could be quantified and propagated into the final quantities of interest. Limits on computational resources mean we do not propagate uncertainty in the covariates used by cause of death models. We remain unable to incorporate uncertainty from garbage code redistribution algorithms into our final estimates. When measuring changes over time, the change was considered statistically significant if the posterior probability of an increase (or decrease) was at least 95%—ie, if the mortality rate increased (or decreased) in at least 95% of the draws. Future methodological improvements that allowed the incorporation of more sources of uncertainty could result in currently marginally significant results no longer being significant within our definition. The funder of the study had no role in the study design, data collection, data analysis, data interpretation, or the writing of the report. All authors had full access to the data in the study and had final responsibility for the decision to submit for publication.