Functions to assist in performing probabilistic record linkage and deduplication: generating pairs, comparing records, estimating m- and u-probabilities with the EM algorithm (I. Fellegi & A. Sunter (1969) <doi:10.1080/01621459.1969.10501049>; T.N. Herzog, F.J. Scheuren, & W.E. Winkler (2007), "Data Quality and Record Linkage Techniques", ISBN:978-0-387-69502-0), and forcing one-to-one matching. Can also be used for pre- and post-processing for machine learning methods for record linkage. The focus is on memory efficiency, CPU performance, and flexibility.
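To make the m- and u-probabilities concrete, here is a minimal, language-neutral sketch in Python of the Fellegi-Sunter EM estimation for binary agree/disagree comparison vectors. It is not the package's R API: the function names `fs_em` and `pair_weight`, the starting values, and the clamping constant are all hypothetical choices made for illustration, and the sketch assumes that comparison fields are conditionally independent given match status.

```python
import math

EPS = 1e-6  # assumption: clamp probabilities away from 0 and 1 for numerical safety


def _clamp(x):
    return min(max(x, EPS), 1.0 - EPS)


def fs_em(patterns, n_iter=100, p=0.1, m_init=0.9, u_init=0.1):
    """Estimate m, u and the match proportion p from 0/1 comparison vectors.

    patterns: list of equal-length tuples, 1 = field agrees, 0 = field disagrees.
    m[j] = P(field j agrees | pair is a match)
    u[j] = P(field j agrees | pair is a non-match)
    """
    n = len(patterns)
    k = len(patterns[0])
    m = [m_init] * k
    u = [u_init] * k
    for _ in range(n_iter):
        # E-step: posterior probability that each pair is a match.
        g = []
        for gamma in patterns:
            pm, pu = p, 1.0 - p
            for j, agree in enumerate(gamma):
                pm *= m[j] if agree else 1.0 - m[j]
                pu *= u[j] if agree else 1.0 - u[j]
            g.append(pm / (pm + pu))
        # M-step: re-estimate p, m and u from the posterior weights.
        p = _clamp(sum(g) / n)
        sg = p * n
        m = [_clamp(sum(gi * gam[j] for gi, gam in zip(g, patterns)) / sg)
             for j in range(k)]
        u = [_clamp(sum((1.0 - gi) * gam[j] for gi, gam in zip(g, patterns)) / (n - sg))
             for j in range(k)]
    return m, u, p


def pair_weight(gamma, m, u):
    """Log-likelihood-ratio weight of one comparison vector."""
    return sum(
        math.log(m[j] / u[j]) if agree else math.log((1.0 - m[j]) / (1.0 - u[j]))
        for j, agree in enumerate(gamma)
    )


if __name__ == "__main__":
    # Toy comparison vectors: mostly full agreement or full disagreement.
    toy = [(1, 1, 1)] * 90 + [(1, 1, 0)] * 10 + [(0, 0, 0)] * 800 + [(1, 0, 0)] * 100
    m, u, p = fs_em(toy)
    print("m:", m, "u:", u, "p:", p)
    print("weight of (1, 1, 1):", pair_weight((1, 1, 1), m, u))
```

Pairs with a high weight are likely matches. A one-to-one assignment step, which the description also mentions, would then select at most one partner per record based on these weights.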
Version: | 0.5.0 |
Depends: | data.table, R (≥ 3.6.0) |
Imports: | stringdist, stats, utils, lpSolve, Rcpp, parallel |
LinkingTo: | Rcpp |
Suggests: | simplermarkdown |
Published: | 2024-02-09 |
DOI: | 10.32614/CRAN.package.reclin2 |
Author: | Jan van der Laan [aut, cre] |
Maintainer: | Jan van der Laan <r at eoos.dds.nl> |
BugReports: | https://github.com/djvanderlaan/reclin2/issues |
License: | GPL-3 |
URL: | https://github.com/djvanderlaan/reclin2 |
NeedsCompilation: | yes |
Materials: | README, NEWS |
In views: | OfficialStatistics |
CRAN checks: | reclin2 results |
Reference manual: | reclin2.pdf |
Vignettes: | Deduplication using reclin2; Introduction to reclin2; Record linkage using machine learning; Using a cluster for record linkage |
Package source: | reclin2_0.5.0.tar.gz |
Windows binaries: | r-devel: reclin2_0.5.0.zip, r-release: reclin2_0.5.0.zip, r-oldrel: reclin2_0.5.0.zip |
macOS binaries: | r-release (arm64): reclin2_0.5.0.tgz, r-oldrel (arm64): reclin2_0.5.0.tgz, r-release (x86_64): reclin2_0.5.0.tgz, r-oldrel (x86_64): reclin2_0.5.0.tgz |
Old sources: | reclin2 archive |
Reverse imports: | PriceIndices |
Please use the canonical form https://CRAN.R-project.org/package=reclin2 to link to this page.