What I Read: Probabilistic Linkage, Data Deduplication