Skip to content. | Skip to navigation

Personal tools
Sections
Home  /  Research  /  Research Projects  /  Patent search  /  Multi-threaded Extension of the IR Platform Terrier

Multi-threaded Extension of the IR Platform Terrier

Overview

Searching for patents on the large amount of available patents is a time consuming task. A parallel implementation of the Information Retrieval toolkit Terrier on a high-performance computer will increase the efficiency of the search process in very large document collections.

 

Goals

The goal of this project is to extend the Information Retrieval toolkit Terrier in such a way that it can be employed in a parallelised way on a supercomputing infrastructure. This includes the parallelisation of the indexing process and resulting indexes as well as the parallelisation of query expansion algorithms and the query processing itself. As a result, users may search very large document collections on a scalable information retrieval service efficiently.

 

Expected outcome for IP experts

A highly efficient parallelised information retrieval toolkit.

 

Timeline

This one year project has started in 2008. First results were presented at the IRFS2008. The command line tool is expected by September 2009.

 

Project Partners

 

Links

Matrixware.net/Terrier (for more information about methods and findings, as well as publications and related works)

 

Contact

Please send your inquiry to: science@ir-facility.org.

IRF Conference

The 1st Information Retrieval Facility Conference provides a multi-disciplinary, scientific forum for researchers and aims at bringing young researchers into contact with industry at an early stage. The conference focuses on large scale research projects. read more
 

MAREC

IRF Scientific Members now have access to the first standardised patent data corpus for research purposes. read more