Most Read Research Articles


Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79

Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79

Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79

Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79

Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79
Call for Paper - May 2015 Edition
IJCA solicits original research papers for the May 2015 Edition. Last date of manuscript submission is April 20, 2015. Read More

Improved One-to-Many Record Linkage using One-Class Clustering Tree

Print
PDF
IJCA Proceedings on International Conference on Simulations in Computing Nexus
© 2014 by IJCA Journal
ICSCN - Number 2
Year of Publication: 2014
Authors:
Sunandhini
S Suguna
M Sharmila. D

Sunandhini, S Suguna and Sharmila. M D. Article: Improved One-to-Many Record Linkage using One-Class Clustering Tree. IJCA Proceedings on International Conference on Simulations in Computing Nexus ICSCN(2):23-26, May 2014. Full text available. BibTeX

@article{key:article,
	author = {Sunandhini and S Suguna and M Sharmila. D},
	title = {Article: Improved One-to-Many Record Linkage using One-Class Clustering Tree},
	journal = {IJCA Proceedings on International Conference on Simulations in Computing Nexus},
	year = {2014},
	volume = {ICSCN},
	number = {2},
	pages = {23-26},
	month = {May},
	note = {Full text available}
}

Abstract

Record linkage is traditionally performed among the entities of same type. It can be done based on entities that may or may not share a common identifier. In this paper we propose a new linkage method that performs linkage between matching entities of different data types as well. The proposed technique is based on one-class clustering tree that characterizes the entities which are to be linked. The tree is built in such a way that it is easy to understand and can be transformed into association rules. The inner nodes of the tree consist of features of the first set of entities. The leaves of the tree represent features of the second set that are matching. The data is split using two splitting criteria. Also two pruning methods are used for creating one-class clustering tree. The proposed system results better in performance of precision and recall.

References

  • M. Dror, A. Shabtai, L. Rokach, Y. Elovici, "OCCT: A One-Class Clustering Tree for Implementing One-to- Many Data Linkage," IEEE Trans. on Knowledge and Data Engineering, TKDE-2011-09-0577, 2013.
  • M. Yakout, A. K. Elmagarmid, H. Elmeleegy, M. Quzzani and A. Qi, "Behavior Based Record Linkage," in Proc. of the VLDB Endowment, vol. 3, no 1-2, pp. 439-448, 2010.
  • A. J. Storkey, C. K. I. Williams, E. Taylorand R. G. Mann, "An Expectation Maximisation Algorithm for One-to- Many Record Linkage," University of Edinburgh Informatics Research Report, 2005.
  • S. Ivie, G. Henry, H. Gatrell and C. Giraud-Carrier, "A Metric Based Machine Learning Approach to Genea- Logical Record Linkage," in Proc. of the 7th Annual Workshop on Technology for Family History and Genealogical Research, 2007.
  • P. Christen and K. Goiser, "Towards Automated Data Linkage and Deduplication," Australian National University, Technical Report, 2005.
  • P. Langley, Elements of Machine Learning, San Franc- Isco, Morgan Kaufmann, 1996.
  • S. Guha, R. Rastogi and K. Shim, "Rock: A Robust Clustering Algorithm for Categorical Attributes," Informat- ion Systems, vol. 25, no. 5, pp. 345-366, July 2000.
  • D. D. Dorfmann and E. Alf, "Maximum-Likelihood EstiMation of Parameters of Signal-Detection Theory and Determination of Confidence Intervals-Rating-Method Data," Journal of Math Psychology, vol. 6, no. 3, pp. 487-496, 1969.
  • A. Gershman et al. , "A Decision Tree Based Recomme- nder System," in Proc. the 10th Int. Conf. on Innovative Internet Community Services, pp. 170-179, 2010.
  • J. R. Quinlan, "Induction of Decision Trees," Machine Learning, vol. 1, no. 1, pp. 81-106, March 1986.