Application of Bayesian probabilistic linkage model in birth and death data linking

Journal Title: Shanghai Journal of Preventive Medicine - Year 2024, Vol 36, Issue 1

Abstract

Objective To elucidate the principles and methods of the Bayesian probabilistic linkage model, and to demonstrate the effect of applying the model in linking birth and death data.Methods Through the Shanghai birth and death registration system, data of 199 025 infants born in 2017 and 1 512 infants who died in 2017 and 2018 were collected. After cleaning the data, the data were divided into monthly blocks and fully linked. The Jaro-Winkler algorithm and Euclidean distance were employed to measure the similarity of fields for matching. A Bayesian probabilistic linkage model was constructed and the linking effect was evaluated using a confusion matrix.Results Using the Bayesian probabilistic linkage model, the birth and death data of infants were effectively linked, revealing that 36.71% of infants who died in Shanghai were born outside the city, and the probability of infant death was 2.6‰. The confusion matrix of the test set showed a recall rate of 0.86, precision of 0.76, and an F-score of 0.81.Conclusion The practical application of Bayesian probabilistic linkage demonstrates a good model performance, enabling the establishment of birth-death cohorts that more accurately reflect the true levels of infant mortality. Utilizing this technique to integrate data from different departments can effectively improve research efficiency in the field of public health.

Authors and Affiliations

YU Huiting,CAI Renzhi,LIN Weixiao,NI Jingyi,QIAN Naisi,XIA Tian,WU Fan,

Keywords

Related Articles

Analysis of the current status and related factors of first aid knowledge among middle-aged and elderly residents in a community in Shanghai

Objective To investigate the current status of first aid knowledge among middle-aged and elderly residents aged 50 and above in a community in Shanghai, in order to provide reference for improving the self-rescue and mu...

Comparative analysis of Aedes albopictus surveillance index in Yangpu District, Shanghai

Objective To analyze the interannual fluctuation, seasonal fluctuation, habitat distribution and the correlation of the 3 monitoring indicators of Aedes albopictus in Yangpu District of Shanghai from 2017 to 2021, and...

Analysis of cerebrovascular disease between 1996 and 2021 in Minhang District of Shanghai

ObjectiveTo analyze the changing trend of cerebrovascular disease burden in Minhang District of Shanghai from 1996 to 2021, and to provide scientific evidence for government to formulate targeted cerebrovascular diseas...

The effect of ozone exposure on asthma and the potential mechanisms

Near-surface ozone is a profoundly reactive and highly oxidizing gas and one of the critical respiratory toxicants. Numerous epidemiological investigations have indicated that asthmatic individuals are the vulnerable gro...

Incidence and related influencing factors of parturients with postpartum depression in Xuhui District, Shanghai

ObjectiveTo understand the occurrence of postpartum depression among puerperae in six communities in Xuhui District, Shanghai, and to analyze the related influencing factors of postpartum depression, so as to provi...

Download PDF file
  • EP ID EP741984
  • DOI -
  • Views 39
  • Downloads 1

How To Cite

YU Huiting, CAI Renzhi, LIN Weixiao, NI Jingyi, QIAN Naisi, XIA Tian, WU Fan, (2024). Application of Bayesian probabilistic linkage model in birth and death data linking. Shanghai Journal of Preventive Medicine, 36(1), -. https://europub.co.uk/articles/-A-741984