http://rdf.ncbi.nlm.nih.gov/pubchem/patent/JP-2009193571-A
Outgoing Links
Predicate | Object |
---|---|
assignee | http://rdf.ncbi.nlm.nih.gov/pubchem/patentassignee/MD5_be92719b856db86c092cc233e9512ca2 |
classificationIPCInventive | http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06F17-21 |
filingDate | 2008-12-19-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
inventor | http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_5aaa7393e5c215a55562b1af0216f89a |
publicationDate | 2009-08-27-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
publicationNumber | JP-2009193571-A |
titleOfInvention | Method and apparatus used to extract web page content |
abstract | 【Task】 An object of the present invention is to provide a method and an apparatus used for extracting web page content that can obtain a web page extraction result more optimal than the conventional technique. [Solution] The present invention disclosed a method and apparatus used to extract web page content. The method extracts a web page content for web page input based on a digital document analysis (DDA) method to generate a DDA extraction result, and a web based on a document image identification (DIR) method. Extracting web page content for page input and generating a DIR extraction result; and fusing the DDA extraction result and the DIR extraction result to generate a fusion result. [Selection] Figure 2 |
isCitedBy | http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-102314497-A http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-102314497-B |
priorityDate | 2008-02-18-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
type | http://data.epo.org/linked-data/def/patent/Publication |
Incoming Links
Predicate | Subject |
---|---|
isDiscussedBy | http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID7156993 http://rdf.ncbi.nlm.nih.gov/pubchem/substance/SID426135032 |
Total number of triples: 14.