Application of Bayesian probabilistic linkage model in birth and death data linking

Journal Title: Shanghai Journal of Preventive Medicine - Year 2024, Vol 36, Issue 1

Abstract

Objective To elucidate the principles and methods of the Bayesian probabilistic linkage model, and to demonstrate the effect of applying the model in linking birth and death data.Methods Through the Shanghai birth and death registration system, data of 199 025 infants born in 2017 and 1 512 infants who died in 2017 and 2018 were collected. After cleaning the data, the data were divided into monthly blocks and fully linked. The Jaro-Winkler algorithm and Euclidean distance were employed to measure the similarity of fields for matching. A Bayesian probabilistic linkage model was constructed and the linking effect was evaluated using a confusion matrix.Results Using the Bayesian probabilistic linkage model, the birth and death data of infants were effectively linked, revealing that 36.71% of infants who died in Shanghai were born outside the city, and the probability of infant death was 2.6‰. The confusion matrix of the test set showed a recall rate of 0.86, precision of 0.76, and an F-score of 0.81.Conclusion The practical application of Bayesian probabilistic linkage demonstrates a good model performance, enabling the establishment of birth-death cohorts that more accurately reflect the true levels of infant mortality. Utilizing this technique to integrate data from different departments can effectively improve research efficiency in the field of public health.

Authors and Affiliations

YU Huiting,CAI Renzhi,LIN Weixiao,NI Jingyi,QIAN Naisi,XIA Tian,WU Fan,

Keywords

Related Articles

Association between e-cigarette use and subjective cognitive decline among adults aged 45 and above

ObjectiveTo investigate the relationship between e-cigarette use and subjective cognitive decline.MethodsThis study included survey participants aged ≥45 years from the US Behavioral Risk Factor Surveillance System. Th...

Effect of mint juice on nitrite in pickled cabbage

ObjectiveTo explore the effect of mint juice on the nitrite content in pickled cabbage, and to determine the best concentration of mint juice through comprehensive sensory evaluation.MethodsThe control variates method...

The relationship between students’ visual acuity and the visual environment of primary and secondary school classrooms in Minhang District, Shanghai

Objective To understand the visual environment sanitation in primary and secondary school classrooms in Minhang District, Shanghai, and to investigate the factors affecting the decline in students’ visual acuity.Method...

Assessment on the implementation effect of basic public health service project in Jiangbei District of Chongqing

ObjectiveTo evaluate the implementation effect of the basic public health service project in Jiangbei District of Chongqing, so as to provide a basis for further improving the service content and social/economic benefi...

Epidemiology and exposure management of rabies in Shanxi Province, 2011‒2022

Objective To analyze the epidemiological features and influencing factors of rabies in Shanxi Province,and to provide evidence to further promote the elimination of rabies in Shanxi Province.Methods The incidence data o...

Download PDF file
  • EP ID EP741984
  • DOI -
  • Views 13
  • Downloads 0

How To Cite

YU Huiting, CAI Renzhi, LIN Weixiao, NI Jingyi, QIAN Naisi, XIA Tian, WU Fan, (2024). Application of Bayesian probabilistic linkage model in birth and death data linking. Shanghai Journal of Preventive Medicine, 36(1), -. https://europub.co.uk/articles/-A-741984