Skip to content. | Skip to navigation

Personal tools
Sections
Home  /  Events  /  IRF Workshops  /  AsPIRe’10  /  Dataset

Dataset

A subset of 400,000 documents of the MAREC dataset is available for download. These documents can be accessed after registering to the MATRIXWARE.NET community (free registration).

The MAREC 400.000 collection consists of 100.000 randomly picked patents from each sub-collection of the MAREC dataset (EPO, JPO, USPTO, WIPO). It is targeted at people submitting papers to the AsPIRe'10 workshop at the ECIR. Participants are encouraged to apply the techniques they develop to this dataset, where possible. This will allow the results of the presented techniques applied to the same dataset to be more easily comparable. Furthermore, the MAREC 400.000 collection will allow initial patent processing experiments to be done on a representative dataset of a reasonable size, before scaling these up to the 19 million patents of the MAREC collection.

IRF Symposium Box

The 3rd IRF Symposium is placed under the motto “Benchmarking Relevance” and will especially focus on methodology and evaluation in patent searching and retrieval. Participants will have the opportunity to discover and test prototypical versions of the most innovative technologies in the market. read more
 

IRF Conference

The 1st Information Retrieval Facility Conference provides a multi-disciplinary, scientific forum for researchers and aims at bringing young researchers into contact with industry at an early stage. The conference focuses on large scale research projects. read more