Assessing linkage accuracy using MC based Monte Carlo simulation approach #4

shovanur-haque · 2017-10-15T08:03:23Z

The main focus of my research is assessing the accuracy of matching in data linkage, i.e. assessing the likelihood that records matched from two files actually belong to the same individual. We propose a Markov Chain based Monte Carlo simulation method for assessing linkage accuracy and illustrate its utility using ABS (Australian Bureau of Statistics) synthetic data in realistic data settings.

Given the current state of the chain, A(n), the next state, A(n+1), is constructed by a defined algorithm that maintains the internal consistency of the agreement patterns. The idea is to generate re-sampled versions of the agreement array in a way that preserves the underlying probabilistic linking structure.
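To make the idea of internal consistency concrete, here is a minimal Python sketch. It is not the algorithm used in this work, which is not spelled out here. It assumes one simple way to keep an agreement array consistent: derive the array from record values and perturb the values rather than the array entries. The names `agreement_array`, `is_consistent`, `next_state`, `levels` and `redraw_prob`, and the toy records, are all hypothetical.

```python
import random
from itertools import product


def agreement_array(file_a, file_b):
    """A[i][j][k] = 1 if record i of file A equals record j of file B on variable k."""
    return [[[int(x == y) for x, y in zip(rec_a, rec_b)]
             for rec_b in file_b] for rec_a in file_a]


def is_consistent(A):
    """Check one consistency rule: if i agrees with j and j2, and i2 agrees
    with j, then i2 must also agree with j2 (all on the same variable k)."""
    n_a, n_b, n_vars = len(A), len(A[0]), len(A[0][0])
    for i, i2, j, j2, k in product(range(n_a), range(n_a),
                                   range(n_b), range(n_b), range(n_vars)):
        if A[i][j][k] and A[i][j2][k] and A[i2][j][k] and not A[i2][j2][k]:
            return False
    return True


def next_state(file_b, levels, rng, redraw_prob=0.2):
    """Hypothetical transition: redraw each value in file B with probability
    redraw_prob from the allowed levels of its variable."""
    return [[rng.choice(levels[k]) if rng.random() < redraw_prob else value
             for k, value in enumerate(rec)] for rec in file_b]


# Toy records (name, birth year); purely illustrative values.
file_a = [("ann", 1980), ("bob", 1975), ("ann", 1975)]
file_b = [("ann", 1980), ("bob", 1980), ("cat", 1975)]
levels = [["ann", "bob", "cat"], [1975, 1980]]

rng = random.Random(0)
state_b = [list(rec) for rec in file_b]
chain = []
for _ in range(5):
    state_b = next_state(state_b, levels, rng)
    chain.append(agreement_array(file_a, state_b))  # state A(n+1)
```

Because every array in `chain` is computed from actual values, it satisfies the consistency rule by construction. Whether such a transition also preserves the probabilistic linking structure depends on how values are redrawn, which this sketch does not address.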

To assess accuracy:

The proportion of correct links is investigated for each record under different blocking strategies.
The average proportion of correct links is observed for increasing block sizes and for a range of cut-off values, with the aim of facilitating an optimal choice of block size and cut-off value and improving existing linking processes by achieving higher accuracy.
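As a rough illustration of the cut-off part of this assessment, the sketch below computes the proportion of correct links among pairs whose score reaches a cut-off. It assumes each candidate pair carries a linkage score and a known true-match flag, as is possible with synthetic data. The function name `correct_link_proportion` and the scores are hypothetical, and blocking is left out.

```python
def correct_link_proportion(scored_pairs, cutoff):
    """Proportion of true matches among pairs with score >= cutoff.

    scored_pairs: iterable of (score, is_true_match) tuples.
    Returns None when no pair reaches the cut-off.
    """
    linked = [is_true for score, is_true in scored_pairs if score >= cutoff]
    return sum(linked) / len(linked) if linked else None


# Made-up scores and truth flags, for illustration only.
scored_pairs = [(9.5, True), (8.0, True), (6.5, False), (6.0, True), (3.0, False)]
cutoffs = [3.0, 6.0, 7.0, 10.0]
by_cutoff = {c: correct_link_proportion(scored_pairs, c) for c in cutoffs}
```

Repeating this calculation for each block size and each simulated agreement array, and then averaging, would give the kind of summary described above.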
To improve the existing method, I am working on using similarity weights in the agreement matrix. These weights will allow partial agreement between the linking variable values of a record pair to be recorded.
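A minimal sketch of what a similarity-weighted agreement vector could look like is given below. The similarity measure is an assumption: it uses the ratio from Python's standard `difflib.SequenceMatcher`, which is not necessarily the measure intended here. The helper names `similarity` and `agreement_vector` and the example records are hypothetical.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Similarity in [0, 1] between two strings, ignoring case and outer spaces."""
    a, b = a.strip().lower(), b.strip().lower()
    if not a and not b:
        return 0.0  # assumption: two missing values carry no evidence of agreement
    return SequenceMatcher(None, a, b).ratio()


def agreement_vector(rec_a, rec_b, fields, partial=True):
    """One agreement entry per linking variable for a record pair.

    partial=True  -> similarity weight in [0, 1]
    partial=False -> classic binary agreement (1.0 only for an exact match)
    """
    entries = []
    for field in fields:
        s = similarity(rec_a[field], rec_b[field])
        entries.append(s if partial else float(s == 1.0))
    return entries


# Illustrative record pair with a spelling variation in the surname.
rec_a = {"surname": "Smith", "suburb": "Brisbane"}
rec_b = {"surname": "Smyth", "suburb": "Brisbane"}
fields = ["surname", "suburb"]

binary = agreement_vector(rec_a, rec_b, fields, partial=False)
weighted = agreement_vector(rec_a, rec_b, fields, partial=True)
```

With binary agreement the surname counts as a full disagreement, while the weighted version records it as a partial agreement strictly between 0 and 1.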
