Most Read Research Articles


Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79

Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79

Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79

Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79

Warning: Creating default object from empty value in /var/www/html/sandbox.ijcaonline.org/public_html/modules/mod_mostread/helper.php on line 79
Call for Paper - May 2015 Edition
IJCA solicits original research papers for the May 2015 Edition. Last date of manuscript submission is April 20, 2015. Read More

Context based Web Indexing for Storage of Relevant Web Pages

Print
PDF
International Journal of Computer Applications
© 2012 by IJCA Journal
Volume 40 - Number 3
Year of Publication: 2012
Authors:
Nidhi Tyagi
Rahul Rishi
R. P. Agarwal
10.5120/5021-7166

Nidhi Tyagi, Rahul Rishi and R P Agarwal. Article: Context based Web Indexing for Storage of Relevant Web Pages. International Journal of Computer Applications 40(3):1-5, February 2012. Full text available. BibTeX

@article{key:article,
	author = {Nidhi Tyagi and Rahul Rishi and R. P. Agarwal},
	title = {Article: Context based Web Indexing for Storage of Relevant Web Pages},
	journal = {International Journal of Computer Applications},
	year = {2012},
	volume = {40},
	number = {3},
	pages = {1-5},
	month = {February},
	note = {Full text available}
}

Abstract

A focused crawler downloads web pages that are relevant to a user specified topic. The downloaded documents are indexed with a view to optimize speed and performance in finding relevant documents for a search query at the search engine side. However, the information will be more relevant if the context of the topic is also made available to the retrieval system. This paper proposes a technique for indexing the keyword extracted from the web documents along with their contexts wherein it uses a height balanced binary search (AVL) tree, for indexing purpose to enhance the performance of the retrieval system.

References

  • Diligenti M., Coetzee F.M., Lawrence S., Giles C.L.and Gori M., “Focused Crawling using context graphs”, Proc. International Conference on Very Large Databases (VLDB ’00), pp. 527-534, 2000.
  • Yang Yongsheng and Wang Hui, “Implementation of Focused Crawler”, COMP630D Course Project Report.
  • A.K.Sharma, “Data Structures using C”, Pearson publication, 2011.
  • Fabrizio Silvestri, Raffaele Perego and Salvatore Orlando ”Assigning Document Identifiers to Enhance Compressibility of Web Search Engines Indexes”. Proceedings of SAC, 2004.
  • Oren Zamir and Oren Etzioni “Web Document Clustering: A feasibility demonstration”. Proceedings of SIGIR, 1998.
  • Changshang Zhou, Wei Ding and Na Yang, “Double Indexing Mechanism of Search Engine based on Campus Net”, Proceedings of the 2006 IEEE Asia-Pacific Conference on Services Computing (APSCC'06), 2006.
  • Naresh Chauhan and A. K. Sharma,” Design of an Agent Based Context Driven Focused Crawler”,BVICAM’S International Journal of Information Technology, pp 61-66, 2008.
  • Parul Gupta and A.K.Sharma,” Context based Indexing in Search Engines using Ontology”, International Journal of Computer Applications, Volume 1 No. 14, pp 49-52, 2010.
  • Steve Lawrence, “Context in Web Search”, IEEE Data Engineering Bulletin, 2000.
  • Wang Jicheng, Huang Yuan, Wu Gangshan and Zhang Fuyan, “Web Mining: Knowledge Discovery on the Web”, IEEE International Conference, Tokyo, 1999.
  • O. Zamir, O. Etzioni, O. Madanim, and R.M. Karp “Fast and Intuitive Clustering of Web Documents,” Proceeding Third International Conference Knowledge Discovery and Data Mining, pp. 287-290, Aug. 1997.