Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.
Digital Object Identifier (DOI) : 10.14569/IJACSA.2011.020103
Article Published in International Journal of Advanced Computer Science and Applications(IJACSA), Volume 2 Issue 1, 2011.
Abstract: In parallel (IR) systems, where a large-scale collection is indexed and searched, the query response time is limited by the time of the slowest node in the system. Thus distributing the load equally across the nodes is very important issue. Mainly there are two methods for collection indexing, namely document-based and term-based indexing. In term-based partitioning, the terms of the global index of a large-scale data collection are distributed or partitioned equally among nodes, and then a given query is divided into sub-queries and each sub-query is then directed to the relevant node. This provides high query throughput and concurrency but poor parallelism and load balance. In this paper, we introduce new methods for terms partitioning and then we compare the results from our methods with the results from the previous work with respect to load balance and query response time.
Ahmad Abusukhon and Mohammad Talib, “ Analyzing the Load Balance of Term-based Partitioning” International Journal of Advanced Computer Science and Applications(IJACSA), 2(1), 2011. http://dx.doi.org/10.14569/IJACSA.2011.020103