دانلود مقاله ISI انگلیسی شماره 79143
ترجمه فارسی عنوان مقاله

یک الگوریتم سریع برای ساخت فایل های معکوس بر روی سیستم عامل ناهمگن

عنوان انگلیسی
A fast algorithm for constructing inverted files on heterogeneous platforms
کد مقاله سال انتشار تعداد صفحات مقاله انگلیسی
79143 2012 11 صفحه PDF
منبع

Publisher : Elsevier - Science Direct (الزویر - ساینس دایرکت)

Journal : Journal of Parallel and Distributed Computing, Volume 72, Issue 5, May 2012, Pages 728–738

ترجمه کلمات کلیدی
نمایه سازی؛ فایل های معکوس؛ چند هسته ای - GPU؛ تجزیه خط لوله و موازی و نمایه سازی
کلمات کلیدی انگلیسی
Indexer; Inverted files; Multicore; GPU; Pipelined and parallel parsing and indexing
پیش نمایش مقاله
پیش نمایش مقاله  یک الگوریتم سریع برای ساخت فایل های معکوس بر روی سیستم عامل ناهمگن

چکیده انگلیسی

Given a collection of documents residing on a disk, we develop a new strategy for processing these documents and building the inverted files extremely quickly. Our approach is tailored for a heterogeneous platform consisting of multicore CPUs and highly multithreaded GPUs. Our algorithm is based on a number of novel techniques, including a high-throughput pipelined strategy, a hybrid trie and B-tree dictionary data structure, dynamic work allocation to CPU and GPU threads, and optimized CUDA indexer implementation. We have performed extensive tests of our algorithm on a single node (two Intel Xeon X5560 Quad-core CPUs) with two NVIDIA Tesla C1060 GPUs attached to it, and were able to achieve a throughput of more than 262 MB/s on the ClueWeb09 dataset. Similar results were obtained for widely different datasets. The throughput of our algorithm is superior to the best known algorithms reported in the literature even when compared to those run on large clusters.