Application of Bayesian probabilistic linkage model in birth and death data linking

Journal Title: Shanghai Journal of Preventive Medicine - Year 2024, Vol 36, Issue 1

Abstract

Objective To elucidate the principles and methods of the Bayesian probabilistic linkage model, and to demonstrate the effect of applying the model in linking birth and death data.Methods Through the Shanghai birth and death registration system, data of 199 025 infants born in 2017 and 1 512 infants who died in 2017 and 2018 were collected. After cleaning the data, the data were divided into monthly blocks and fully linked. The Jaro-Winkler algorithm and Euclidean distance were employed to measure the similarity of fields for matching. A Bayesian probabilistic linkage model was constructed and the linking effect was evaluated using a confusion matrix.Results Using the Bayesian probabilistic linkage model, the birth and death data of infants were effectively linked, revealing that 36.71% of infants who died in Shanghai were born outside the city, and the probability of infant death was 2.6‰. The confusion matrix of the test set showed a recall rate of 0.86, precision of 0.76, and an F-score of 0.81.Conclusion The practical application of Bayesian probabilistic linkage demonstrates a good model performance, enabling the establishment of birth-death cohorts that more accurately reflect the true levels of infant mortality. Utilizing this technique to integrate data from different departments can effectively improve research efficiency in the field of public health.

Authors and Affiliations

YU Huiting,CAI Renzhi,LIN Weixiao,NI Jingyi,QIAN Naisi,XIA Tian,WU Fan,

Keywords

Related Articles

Trends in antimicrobial use and hospital infection incidence among inpatients

Objective To understand the use of antibiotics in inpatients and the incidence and trend of hospital infections, to explore the implementation effect of comprehensive management measures, and to provide reference for h...

Analysis of obesity factors among public primary school students in a town, Minhang District, Shanghai

[Objective] To identify and analyze the possible influencing factors of obesity among public primary school students in Minhang District, Shanghai.[Methods] Basic data, collected through questionnaire stars, was impor...

Investigation of a foodborne poisoning incident caused by accidental consumption of medicinal liquor containing aconite alkaloids

A foodborne poisoning incident occurred in a street, Ouhai District, Wenzhou City, Zhejiang Province on May 9, 2023, which was caused by the accidental consumption of medicinal liquor containing aconitum alkaloids....

Application and management of human genetic resources in China

Human genetic resources include specimen samples and related information. The application and regulation of human genetic resources in China started at the end of the 20th century and have progressed rapidly. However,...

Spatiotemporal characteristics and prevention and control measures of SARS-CoV-2 Omicron pandemic in Shanghai

ObjectiveTo analyze the spatiotemporal characteristics and prevention and control measures of the pandemic caused by the SARS-CoV-2 Omicron variant in Shanghai in 2022, aiming to optimize future prevention and control...

Download PDF file
  • EP ID EP741984
  • DOI -
  • Views 65
  • Downloads 1

How To Cite

YU Huiting, CAI Renzhi, LIN Weixiao, NI Jingyi, QIAN Naisi, XIA Tian, WU Fan, (2024). Application of Bayesian probabilistic linkage model in birth and death data linking. Shanghai Journal of Preventive Medicine, 36(1), -. https://europub.co.uk/articles/-A-741984