http://rdf.ncbi.nlm.nih.gov/pubchem/patent/WO-2004042641-A2

Outgoing Links

Predicate Object
assignee http://rdf.ncbi.nlm.nih.gov/pubchem/patentassignee/MD5_be055db3c1a09879df07379ba969e223
classificationCPCAdditional http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06V30-10
classificationCPCInventive http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06V30-268
classificationIPCAdditional http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06V30-10
filingDate 2003-11-04-04:00^^<http://www.w3.org/2001/XMLSchema#date>
inventor http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_b382fa2ca446b4de0ddce2a1686c36ac
http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_4149d7470c67e4d2ff3eb728e13107f2
http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_aae436bdf05b4991d5fa9b2e53f3a670
http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_9a36df6db6754461df3a92d53ce8cfee
http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_99aa22726b81fb268a0ac4f213045dfe
http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_27c364cbf78e7a58f90b1c9b513c9517
publicationDate 2004-05-21-04:00^^<http://www.w3.org/2001/XMLSchema#date>
publicationNumber WO-2004042641-A2
titleOfInvention Post-processing system and method for correcting machine recognized text
abstract A method of post-processing character data from an optical character recognition (OCR) engine and apparatus to perform the method. This exemplary method includes segmenting the character data into a set of initial words. The set of initial words is word level processed to determine at least one candidate word corresponding to each initial word. The set of initial words is segmented into a set of sentences. Each sentence in the set of sentences includes a plurality of initial words and candidate words corresponding to the initial words. A sentence is selected from the set of sentences. The selected sentence is word disambiguity processed to determine a plurality of final words. A final word is selected from the at least one candidate word corresponding to a matching initial word. The plurality of final words is then assembled as post-processed (OCR) data.
isCitedBy http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-110046350-A
priorityDate 2002-11-04-04:00^^<http://www.w3.org/2001/XMLSchema#date>
type http://data.epo.org/linked-data/def/patent/Publication

Incoming Links

Predicate Subject
isDiscussedBy http://rdf.ncbi.nlm.nih.gov/pubchem/substance/SID128491820
http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID6582
http://rdf.ncbi.nlm.nih.gov/pubchem/gene/GID39330
http://rdf.ncbi.nlm.nih.gov/pubchem/gene/GID217335

Total number of triples: 22.