http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-114841241-A
Outgoing Links
Predicate | Object |
---|---|
assignee | http://rdf.ncbi.nlm.nih.gov/pubchem/patentassignee/MD5_4baf61b82705468dd1ff913b8ce785f7 |
classificationCPCInventive | http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06F18-231 http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06F18-2413 http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06F18-2411 http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06F18-23213 |
classificationIPCInventive | http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06K9-62 |
filingDate | 2022-03-31-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
inventor | http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_a13ce21961eddfeda8079d0b9eef944c http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_417f354bd8e9393a26b44d1a6727878f |
publicationDate | 2022-08-02-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
publicationNumber | CN-114841241-A |
titleOfInvention | An Imbalanced Data Classification Method Based on Clustering and Distance Weighting |
abstract | The invention discloses an unbalanced data classification method based on clustering and distance weighting, which mainly solves the problem of low classification accuracy on unbalanced data with discontinuous distribution within a minority class in the prior art. The implementation steps are: (1) collect unbalanced data sets and divide them into majority class and minority class sample sets; (2) non-overlapping division of majority class and minority class sample sets; (3) calculate samples based on the distance between clusters weights; (4) down-weighting the majority class boundary samples; (5) using the samples and their weights to train a weighted support vector machine classifier; (6) classifying the samples to be tested. The invention can effectively describe the relative distribution relationship of the two types of samples in the feature space, and on this basis assign the sample weights according to the relative importance of the samples, which is conducive to constructing a correct classification boundary and improving the classification accuracy of minority classes, and can be used for distribution situations. Classification of complex imbalanced data. |
priorityDate | 2022-03-31-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
type | http://data.epo.org/linked-data/def/patent/Publication |
Incoming Links
Predicate | Subject |
---|---|
isDiscussedBy | http://rdf.ncbi.nlm.nih.gov/pubchem/substance/SID456171974 |
Total number of triples: 16.