The IRF provides a number of test data collections developed by the IRF itself, by one of its members, or by third parties. These data collections can be used freely for scientific experimentation.
MAREC consists of 19 million patent documents in different languages, normalised to a highly specific XML format developed by Matrixware for the IRF.