Skip to content. | Skip to navigation

Personal tools
Sections
Home  /  Research  /  Evaluation Tracks  /  CLEF-IP '10  /  Test Collection

Data Collection

The data collection for the CLEF-IP 2010 track is ready for download.

The collection is an extract of the MAREC dataset, containing over 2.6 million patent documents pertaining to 1.3 milion patents from the European Patent Office with content in English, German and French.

Prior to downloading the data, participants must sign a License Agreement and send it back to the track organizers by e-mail (pdf) and snail-mail. Please mention the name of the track organizers on the envelope.

 

Important: In addition to signing the Licence Agreement, please use the registration form on the CLEF 2010 website to register to this campaign.

MAREC

IRF Scientific Members now have access to the first standardised patent data corpus for research purposes. read more