Effective Performance of Information Retrieval by using Domain Based Crawler

Sk. Abdul Nabi; Dr. P. Premchand

doi:10.14569/IJACSA.2013.040713

International Journal of Advanced Computer Science and Applications (IJACSA), Volume 4, Issue 7, 2013.


Abstract: The World Wide Web continuously introduces new capabilities and attracts many people [1]. It consists of more than 60 billion pages online. Because of this explosion in size, information retrieval systems, or search engines, are being upgraded day by day so that information can be accessed effectively and efficiently. In this paper, we address the Domain Based Information Retrieval (DBIR) System. In this system, we crawl information from the web and add to the database all links that are related to a specific domain; links that are not related to that domain are simply ignored. This saves Storage Space (SS) and Searching Time (ST) and, as a result, improves the performance of the system. It is an extension of the Effective Performance of Web Crawler (EPOW) System [2], and it has two crawler modules. The first is the Basic Crawler, which consists of multiple downloaders to achieve a parallelization policy. The second is the Master Crawler, which filters the URLs sent by the Basic Crawler based on the domain and sends them back to the Basic Crawler to extract the related links. All these related links are collectively stored in the database under a unique domain name.

Keywords: Domain Based Information Retrieval (DBIR); Storage Space (SS); Searching Time (ST); Master Crawler; Basic Crawler; EPOW.
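
The two-module flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the in-memory FAKE_WEB table, the DOMAIN_KEYWORDS vocabulary, the keyword-overlap relevance test, and the function names download, master_filter, and crawl are all assumptions made for illustration. A thread pool stands in for the Basic Crawler's multiple downloaders, and master_filter plays the role of the Master Crawler.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory "web": URL -> (page text, outgoing links).
# A real crawler would fetch and parse pages over HTTP instead.
FAKE_WEB = {
    "http://a.example/cricket": (
        "cricket match score",
        ["http://a.example/batting", "http://a.example/stocks"],
    ),
    "http://a.example/batting": ("batting average in cricket", []),
    "http://a.example/stocks": ("stock market prices", ["http://a.example/bonds"]),
    "http://a.example/bonds": ("bond yields", []),
}

# Hypothetical domain vocabulary (assumption: relevance = keyword overlap).
DOMAIN_KEYWORDS = {"sports": {"cricket", "batting", "match"}}


def download(url):
    """Stand-in for one Basic Crawler downloader: returns (url, text, links)."""
    text, links = FAKE_WEB.get(url, ("", []))
    return url, text, links


def master_filter(pages, domain):
    """Master Crawler role: keep only pages that mention a domain keyword."""
    keywords = DOMAIN_KEYWORDS[domain]
    return [
        (url, links)
        for url, text, links in pages
        if keywords & set(text.lower().split())
    ]


def crawl(seeds, domain, workers=4):
    """Crawl from the seeds and store related URLs under the domain name."""
    database = {domain: []}
    seen = set(seeds)
    frontier = list(seeds)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        while frontier:
            # Basic Crawler: download the current frontier in parallel.
            pages = list(pool.map(download, frontier))
            frontier = []
            # Master Crawler: filter by domain, then hand related pages back
            # so that only their links are extracted and followed.
            for url, links in master_filter(pages, domain):
                database[domain].append(url)
                for link in links:
                    if link not in seen:
                        seen.add(link)
                        frontier.append(link)
    return database


if __name__ == "__main__":
    print(crawl(["http://a.example/cricket"], "sports"))
```

In this sketch, pages judged unrelated are neither stored nor expanded, so their outgoing links are never downloaded. This is one way the savings in storage space and searching time claimed in the abstract could arise.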

Sk. Abdul Nabi and Dr. P. Premchand, “Effective Performance of Information Retrieval by using Domain Based Crawler,” International Journal of Advanced Computer Science and Applications (IJACSA), 4(7), 2013. http://dx.doi.org/10.14569/IJACSA.2013.040713

@article{Nabi2013,
title = {Effective Performance of Information Retrieval by using Domain Based Crawler},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2013.040713},
url = {http://dx.doi.org/10.14569/IJACSA.2013.040713},
year = {2013},
publisher = {The Science and Information Organization},
volume = {4},
number = {7},
author = {Sk. Abdul Nabi and Dr. P. Premchand}
}

Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.
