Probabilistic linkage to enhance deterministic algorithms and reduce data linkage errors in hospital administrative data
Journal article
Hagger-Johnson, Gareth, Harron, Katie, Goldstein, Harvey, Aldridge, Rob and Gilbert, Ruth. (2017). Probabilistic linkage to enhance deterministic algorithms and reduce data linkage errors in hospital administrative data. Journal of Innovation in Health Informatics. 24(2), pp. 234 - 246. https://doi.org/10.14236/jhi.v24i2.891
Authors | Hagger-Johnson, Gareth, Harron, Katie, Goldstein, Harvey, Aldridge, Rob and Gilbert, Ruth |
---|---|
Abstract | Background: The pseudonymisation algorithm used to link together episodes of care belonging to the same patient in England [Hospital Episode Statistics ID (HESID)] has never undergone any formal evaluation to determine the extent of data linkage error. Objective: To quantify improvements in linkage accuracy from adding probabilistic linkage to existing deterministic HESID algorithms. Methods: Inpatient admissions to National Health Service (NHS) hospitals in England (HES) over 17 years (1998 to 2015) for a sample of patients (born 13th or 28th of months in 1992/1998/2005/2012). We compared the existing deterministic algorithm with one that included an additional probabilistic step, in relation to a reference standard created using enhanced probabilistic matching with additional clinical and demographic information. Missed and false matches were quantified and the impact on estimates of hospital readmission within one year was determined. Results: HESID produced a high missed match rate, improving over time (8.6% in 1998 to 0.4% in 2015). Missed matches were more common for ethnic minorities, those living in areas of high socio-economic deprivation, foreign patients and those with ‘no fixed abode’. Estimates of the readmission rate were biased for several patient groups owing to missed matches, which were reduced for nearly all groups. Conclusion: Probabilistic linkage of HES reduced missed matches and bias in estimated readmission rates, with clear implications for commissioning, service evaluation and performance monitoring of hospitals. The existing algorithm should be modified to address data linkage error, and a retrospective update of the existing data would address existing linkage errors and their implications. |
Keywords | deterministic record linkage; evaluation; hospital discharge; probabilistic record linkage |
Year | 2017 |
Journal | Journal of Innovation in Health Informatics |
Journal citation | 24 (2), pp. 234 - 246 |
Publisher | BCS Learning and Development Limited |
ISSN | 2058-4563 |
Digital Object Identifier (DOI) | https://doi.org/10.14236/jhi.v24i2.891 |
Scopus EID | 2-s2.0-85042619898 |
Page range | 234 - 246 |
Research Group | Institute for Learning Sciences and Teacher Education (ILSTE) |
Publisher's version | File access level: Controlled |
Place of publication | United Kingdom |
https://acuresearchbank.acu.edu.au/item/8857q/probabilistic-linkage-to-enhance-deterministic-algorithms-and-reduce-data-linkage-errors-in-hospital-administrative-data