Application of Bayesian probabilistic linkage model in birth and death data linking

Journal Title: Shanghai Journal of Preventive Medicine - Year 2024, Vol 36, Issue 1

Abstract

Objective To elucidate the principles and methods of the Bayesian probabilistic linkage model, and to demonstrate the effect of applying the model to the linkage of birth and death data.

Methods Data on 199 025 infants born in 2017 and 1 512 infants who died in 2017 and 2018 were collected from the Shanghai birth and death registration system. After cleaning, the records were divided into monthly blocks and fully linked. The Jaro-Winkler algorithm and Euclidean distance were used to measure the similarity of the matching fields. A Bayesian probabilistic linkage model was constructed, and its linkage performance was evaluated using a confusion matrix.

Results The Bayesian probabilistic linkage model effectively linked the infant birth and death records, revealing that 36.71% of infants who died in Shanghai were born outside the city and that the probability of infant death was 2.6‰. On the test set, the confusion matrix showed a recall of 0.86, a precision of 0.76, and an F-score of 0.81.

Conclusion In practical application, Bayesian probabilistic linkage shows good model performance and enables the establishment of birth-death cohorts that more accurately reflect the true level of infant mortality. Using this technique to integrate data from different departments can effectively improve research efficiency in public health.
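As an illustration of two quantities named in the abstract, the following is a minimal Python sketch of the Jaro-Winkler string similarity and of the F-score computed from precision and recall. It is not the authors' implementation. The function names, the prefix scale of 0.1, the prefix cap of 4 characters, and the reading of "F-score" as the balanced F1 measure are all assumptions, since the abstract does not specify them.

```python
def jaro(s1: str, s2: str) -> float:
    """Jaro similarity between two strings, in [0, 1]."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Characters count as matching only within this sliding window.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Count matched characters that appear in a different order.
    k = 0
    out_of_order = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    transpositions = out_of_order // 2
    return (matches / len1 + matches / len2
            + (matches - transpositions) / matches) / 3


def jaro_winkler(s1: str, s2: str, prefix_scale: float = 0.1) -> float:
    """Jaro-Winkler similarity: Jaro boosted for a shared prefix.

    prefix_scale=0.1 and the 4-character prefix cap are the commonly
    used defaults, assumed here; the abstract does not state them.
    """
    j = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1[:4], s2[:4]):
        if a != b:
            break
        prefix += 1
    return j + prefix * prefix_scale * (1 - j)


def f_score(precision: float, recall: float) -> float:
    """Balanced F-score (F1): harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Classic textbook pair with one transposition.
    print(round(jaro_winkler("MARTHA", "MARHTA"), 4))
    # Precision and recall as reported in the Results.
    print(round(f_score(0.76, 0.86), 2))
```

Under the F1 assumption, the reported precision of 0.76 and recall of 0.86 give 2 × 0.76 × 0.86 / (0.76 + 0.86) ≈ 0.807, which is consistent with the reported F-score of 0.81.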

Authors and Affiliations

YU Huiting, CAI Renzhi, LIN Weixiao, NI Jingyi, QIAN Naisi, XIA Tian, WU Fan

How To Cite

YU Huiting, CAI Renzhi, LIN Weixiao, NI Jingyi, QIAN Naisi, XIA Tian, WU Fan (2024). Application of Bayesian probabilistic linkage model in birth and death data linking. Shanghai Journal of Preventive Medicine, 36(1), -. https://europub.co.uk/articles/-A-741984