http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-114841241-A

Outgoing Links

Predicate Object
assignee http://rdf.ncbi.nlm.nih.gov/pubchem/patentassignee/MD5_4baf61b82705468dd1ff913b8ce785f7
classificationCPCInventive http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06F18-231
http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06F18-2413
http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06F18-2411
http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06F18-23213
classificationIPCInventive http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06K9-62
filingDate 2022-03-31-04:00^^<http://www.w3.org/2001/XMLSchema#date>
inventor http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_a13ce21961eddfeda8079d0b9eef944c
http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_417f354bd8e9393a26b44d1a6727878f
publicationDate 2022-08-02-04:00^^<http://www.w3.org/2001/XMLSchema#date>
publicationNumber CN-114841241-A
titleOfInvention An Imbalanced Data Classification Method Based on Clustering and Distance Weighting
abstract The invention discloses an unbalanced data classification method based on clustering and distance weighting, which mainly solves the problem of low classification accuracy on unbalanced data with discontinuous distribution within a minority class in the prior art. The implementation steps are: (1) collect unbalanced data sets and divide them into majority class and minority class sample sets; (2) non-overlapping division of majority class and minority class sample sets; (3) calculate samples based on the distance between clusters weights; (4) down-weighting the majority class boundary samples; (5) using the samples and their weights to train a weighted support vector machine classifier; (6) classifying the samples to be tested. The invention can effectively describe the relative distribution relationship of the two types of samples in the feature space, and on this basis assign the sample weights according to the relative importance of the samples, which is conducive to constructing a correct classification boundary and improving the classification accuracy of minority classes, and can be used for distribution situations. Classification of complex imbalanced data.
priorityDate 2022-03-31-04:00^^<http://www.w3.org/2001/XMLSchema#date>
type http://data.epo.org/linked-data/def/patent/Publication

Incoming Links

Predicate Subject
isDiscussedBy http://rdf.ncbi.nlm.nih.gov/pubchem/substance/SID456171974

Total number of triples: 16.