ExoPatent derives from the project "Semantic Annotation for Patents".

Data Sources:

* U.S. FDA Orange Book
* UMLS Medical Terms Database
* Matrixware’s Alexandria Patent Archive

Document Set:

* 40,000 U.S. Patents
* Randomly selected from IPC Class A 61 K ("Preparations for Medical, Dental, or Toilet Purposes")

Underlying Tools:

* Ontotext’s KIM semantic annotation and search platform
* Ontotext’s BigOWLIM semantic database
* University of Sheffield’s GATE text analysis platform