abstract |
A searchable protein database is disclosed. The protein database comprises a plurality of entries, each entry having a sufficiently short predicting sequence and a protein classifier corresponding to the predicting sequence. An unclassified protein sequence can be classified using the database by searching the unclassified sequence for a motif of amino acids matching a predicting sequence of the database, thereby attributing to the unclassified protein the corresponding protein classifier. |
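
The lookup described in the abstract can be illustrated with a minimal sketch. It assumes that each entry pairs a short predicting sequence, written in one-letter amino acid codes, with a classifier label, and that a match is an exact substring match. The entries, the labels, and the names `DATABASE` and `classify` are hypothetical and are not taken from the disclosure.

```python
from typing import Dict, List, Tuple

# Hypothetical database: predicting sequence -> protein classifier.
# The peptides and labels below are invented for illustration only.
DATABASE: Dict[str, str] = {
    "GKSTL": "class-A",
    "WDPRC": "class-B",
}


def classify(sequence: str, database: Dict[str, str]) -> List[Tuple[int, str, str]]:
    """Return (position, predicting sequence, classifier) for every exact match.

    Assumption: matching is plain substring search; the disclosure may use
    a different matching rule.
    """
    hits: List[Tuple[int, str, str]] = []
    for motif, label in database.items():
        start = sequence.find(motif)
        while start != -1:
            hits.append((start, motif, label))
            # Continue one position later so overlapping matches are found.
            start = sequence.find(motif, start + 1)
    return sorted(hits)


if __name__ == "__main__":
    # An unclassified sequence containing one hypothetical predicting sequence.
    print(classify("MAGKSTLQW", DATABASE))
```

A sequence with no matching motif yields an empty list, in which case no classifier is attributed to it. For a large number of entries, a multi-pattern search structure would likely scale better than this per-entry scan.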