abstract |
An extension of the vector space model for computing chemical similarity using textual and chemical descriptors is described. The method uses a chemical and/or textual description of a molecule/chemical and a decomposes a molecule/chemical descriptor matrix by a suitable technique such as singular value decomposition to create a low dimensional representation of the original descriptor space. Similarities between a user probe and the textual and/or chemical descriptors are then computed and ranked. |