http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-102682073-B
Outgoing Links
Predicate | Object |
---|---|
classificationIPCInventive | http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06F17-30 |
filingDate | 2012-03-09-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
grantDate | 2017-04-12-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
publicationDate | 2017-04-12-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
publicationNumber | CN-102682073-B |
titleOfInvention | Selection of atoms for search engine retrieval |
abstract | Methods are provided for populating search indexes with atoms identified in documents. Documents that are to be indexed are identified, and for each document, atoms are identified and are categorized as unigrams, n-grams, and n-tuples. A list of atom/document pairs is generated such that an information metric can be computed for each pair. An information metric represents a ranking of the atom in relation to the particular document. Based on the information metric, some atom/document pairs are discarded and others are indexed. |
priorityDate | 2011-03-10-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
type | http://data.epo.org/linked-data/def/patent/Publication |
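
The pipeline described in the abstract (identify atoms per document, form atom/document pairs, score each pair, then discard or index it) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration, since the abstract does not specify it: the function names `extract_atoms` and `build_index` are hypothetical, n-grams are taken as word bigrams, an n-tuple is taken as an unordered pair of distinct words co-occurring in a document, and a TF-IDF-style score stands in for the unspecified "information metric". It is not the patented method.

```python
import math
from collections import Counter, defaultdict
from itertools import combinations


def extract_atoms(text, n=2):
    """Count the atoms of one document, keyed by (kind, atom)."""
    words = text.lower().split()
    atoms = Counter()
    # Unigrams: individual words.
    for word in words:
        atoms[("unigram", word)] += 1
    # N-grams: runs of n consecutive words (bigrams by default).
    for i in range(len(words) - n + 1):
        atoms[("ngram", " ".join(words[i:i + n]))] += 1
    # N-tuples (assumed definition): unordered pairs of distinct words
    # that co-occur anywhere in the document, counted once each.
    for pair in combinations(sorted(set(words)), 2):
        atoms[("ntuple", pair)] += 1
    return atoms


def build_index(docs, threshold):
    """Index only the atom/document pairs whose score meets the threshold."""
    per_doc = {doc_id: extract_atoms(text) for doc_id, text in docs.items()}
    # Document frequency: number of documents containing each atom.
    df = Counter()
    for atoms in per_doc.values():
        df.update(atoms.keys())
    n_docs = len(per_doc)
    index = defaultdict(list)
    for doc_id, atoms in per_doc.items():
        for atom, tf in atoms.items():
            # Stand-in information metric: TF-IDF-style score (assumption).
            score = tf * math.log(n_docs / df[atom])
            if score >= threshold:
                index[atom].append(doc_id)  # keep this pair
            # Otherwise the pair is discarded and never reaches the index.
    return dict(index)


docs = {"d1": "red apple pie", "d2": "red car"}
index = build_index(docs, threshold=0.5)
```

With this stand-in metric, an atom that appears in every document scores zero and is discarded, while atoms specific to one document are kept. Any other per-pair ranking could be substituted for the score without changing the surrounding structure.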
Incoming Links
Total number of triples: 13.