Application of Bayesian probabilistic linkage model in birth and death data linking

Journal Title: Shanghai Journal of Preventive Medicine - Year 2024, Vol 36, Issue 1

Abstract

Objective To elucidate the principles and methods of the Bayesian probabilistic linkage model, and to demonstrate the effect of applying the model in linking birth and death data.Methods Through the Shanghai birth and death registration system, data of 199 025 infants born in 2017 and 1 512 infants who died in 2017 and 2018 were collected. After cleaning the data, the data were divided into monthly blocks and fully linked. The Jaro-Winkler algorithm and Euclidean distance were employed to measure the similarity of fields for matching. A Bayesian probabilistic linkage model was constructed and the linking effect was evaluated using a confusion matrix.Results Using the Bayesian probabilistic linkage model, the birth and death data of infants were effectively linked, revealing that 36.71% of infants who died in Shanghai were born outside the city, and the probability of infant death was 2.6‰. The confusion matrix of the test set showed a recall rate of 0.86, precision of 0.76, and an F-score of 0.81.Conclusion The practical application of Bayesian probabilistic linkage demonstrates a good model performance, enabling the establishment of birth-death cohorts that more accurately reflect the true levels of infant mortality. Utilizing this technique to integrate data from different departments can effectively improve research efficiency in the field of public health.

Authors and Affiliations

YU Huiting,CAI Renzhi,LIN Weixiao,NI Jingyi,QIAN Naisi,XIA Tian,WU Fan,

Keywords

Related Articles

Epidemiological characteristics and influencing factors of adenovirus-associated acute conjunctivitis in Shenzhen,2014‒2022

Objective To investigate the epidemic characteristics and influencing factors of acute conjunctivitis caused by human adenovirus(HAdv)in Yantian District of Shenzhen City, Guangdong Province from 2014 to 2022, and to...

Analysis of obesity factors among public primary school students in a town, Minhang District, Shanghai

[Objective] To identify and analyze the possible influencing factors of obesity among public primary school students in Minhang District, Shanghai.[Methods] Basic data, collected through questionnaire stars, was impor...

2017 to 2021 suicidal behavior in hospitalized patients with major depression and the influencing factors

Objective To analyze the suicidal behavior in hospitalized patients with major depression and its influencing factors.Methods A total of 1 000 patients with major depression admitted to our hospital from January 2017 to...

Characteristics analysis of delayed hospital visits in elderly tuberculosis patients in Anji County, Zhejiang Province

ObjectiveTo characterize delayed visits of elderly patients with tuberculosis aged ≥60 years in Anji County, Zhejiang Province from 2010 to 2021, and to provide evidence for the prevention and control of tuberculos...

Prevalence and influencing factors of reduced visual acuity among young children in Changning District, Shanghai

Objective To analyze the prevalence of reduced visual acuity of young children in Changning district of Shanghai and to explore the influencing factors, so as to provide a reference basis for formulating prevention and...

Download PDF file
  • EP ID EP741984
  • DOI -
  • Views 57
  • Downloads 1

How To Cite

YU Huiting, CAI Renzhi, LIN Weixiao, NI Jingyi, QIAN Naisi, XIA Tian, WU Fan, (2024). Application of Bayesian probabilistic linkage model in birth and death data linking. Shanghai Journal of Preventive Medicine, 36(1), -. https://europub.co.uk/articles/-A-741984