Multi-threaded Extension of the IR Platform Terrier

Overview

Searching the large number of available patents is a time-consuming task. A parallel implementation of the Information Retrieval toolkit Terrier on a high-performance computer would make searching very large document collections more efficient.

The goal of this project was to extend the Information Retrieval toolkit Terrier so that it can be used in parallel on a supercomputing infrastructure. This included parallelising the indexing process and the resulting indexes, the query expansion algorithms, and the query processing itself.
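
As a rough illustration of what parallelised indexing can look like, the following minimal sketch splits a toy collection into shards, builds a partial inverted index per shard in a thread pool, and merges the results before answering a query. It does not use Terrier's API and is not the project's implementation; the function names (index_shard, build_index, search), the whitespace tokenisation, and the term-frequency scoring are all assumptions made for the example.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def index_shard(docs):
    """Build a partial inverted index (term -> {doc_id: term frequency}) for one shard."""
    partial = defaultdict(dict)
    for doc_id, text in docs:
        for term in text.lower().split():
            partial[term][doc_id] = partial[term].get(doc_id, 0) + 1
    return partial


def build_index(docs, workers=4):
    """Split the collection into shards, index them concurrently, then merge."""
    shards = [docs[i::workers] for i in range(workers)]
    merged = defaultdict(dict)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for partial in pool.map(index_shard, shards):
            # Shards hold disjoint documents, so postings never collide.
            for term, postings in partial.items():
                merged[term].update(postings)
    return dict(merged)


def search(index, query):
    """Score documents by summed term frequency over the query terms."""
    scores = defaultdict(int)
    for term in query.lower().split():
        for doc_id, tf in index.get(term, {}).items():
            scores[doc_id] += tf
    # Highest score first; ties broken by document id for a stable order.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical toy collection, for illustration only.
docs = [
    ("p1", "method for parallel indexing of patent documents"),
    ("p2", "patent search on a supercomputer"),
    ("p3", "query expansion for patent retrieval"),
]
index = build_index(docs, workers=2)
results = search(index, "patent search")
```

In CPython, threads only illustrate the shard-and-merge structure here; CPU-bound indexing would need separate processes, or a runtime such as the JVM, to gain real speedup.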

 

Project Partners