Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Assessing linkage accuracy using MC based Monte Carlo simulation approach #4

Open
shovanur-haque opened this issue Oct 15, 2017 · 0 comments

Comments

@shovanur-haque
Copy link

The main focus of my research is to assess the accuracy of matching in data linkage i.e. to assess the likelihood that records matched from the two files actually belongs to the same individual. We proposed a Markov Chain based Monte Carlo simulation method for assessing linkage accuracy and illustrates the utility of the approach using the ABS (Australian Bureau of Statistics) synthetic data in realistic data settings.

Given the current state of the chain, A(n), the next state, A(n+1), will be constructed following a defined algorithm developed, which maintains internal consistency patterns of agreement. The idea is to generate re-sampled versions of the agreement array in such a way as to preserve the underlying probabilistic linking structure.

For assessing the accuracy:

  • correctly linked proportions are investigated for each record with different blocking strategies
  • The average proportions of correct links are observed with the increasing block sizes and for a range of cut-off values with the aim of facilitating optimal choice of block sizes and cut-off values while improving existing linking processes by achieving higher accuracy.
  • To improve the existing method, I am working on a concept of using similarity weight in the agreement matrix. This weight will allow partial agreement of the linking variable values for record pairs in the form of similarity weight.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant