http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-110738129-B
Outgoing Links
Predicate | Object |
---|---|
classificationCPCInventive | http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06V20-41 http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06N3-048 http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06F18-24 http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06F18-214 http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G06V40-20 |
classificationIPCInventive | http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06N3-04 http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06V10-764 http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06V10-774 http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06V40-20 http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06V10-82 http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06K9-62 http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G06V20-40 |
filingDate | 2019-09-20-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
grantDate | 2022-08-05-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
publicationDate | 2022-08-05-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
publicationNumber | CN-110738129-B |
titleOfInvention | An end-to-end video timing behavior detection method based on R-C3D network |
abstract | The invention discloses an end-to-end video timing behavior detection method based on an R-C3D network, belonging to the field of computer vision. The method includes: performing frame rate adjustment and frame extraction on an input video, normalizing the extracted frames, and, after data augmentation, using them as a training set and a test set; constructing a time-series behavior detection model that includes a feature extraction module, a long-term information coding module and a behavior recognition module, in which the extracted features are encoded to obtain features containing long-term information; inputting the training set and test set into the time-series behavior detection model for training; and inputting the video to be detected into the trained model to obtain the behavior categories and localization information present in the video. By designing a long-term information encoding network to encode the extracted features, the invention lets the network obtain the global temporal information of sequential actions, which improves the accuracy of action localization and classification. |
priorityDate | 2019-09-20-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
type | http://data.epo.org/linked-data/def/patent/Publication |
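
The date objects in the table above are typed literals: a lexical value such as `2019-09-20-04:00` (a calendar date followed by a UTC offset), then `^^` and the datatype IRI. A minimal sketch of splitting such a literal in Python follows. The function name `parse_xsd_date` is hypothetical and not part of any PubChem tooling, and the sketch assumes the literal is formatted exactly as it appears in this record.

```python
from datetime import date, timedelta, timezone

XSD_DATE = "http://www.w3.org/2001/XMLSchema#date"


def parse_xsd_date(literal):
    """Split 'YYYY-MM-DD[Z|+HH:MM|-HH:MM]^^<datatype>' into (date, tzinfo or None).

    Hypothetical helper: assumes the serialization used in this record.
    """
    lexical, sep, datatype = literal.partition("^^")
    if not sep or datatype.strip("<>") != XSD_DATE:
        raise ValueError(f"not an xsd:date literal: {literal!r}")

    # The first ten characters are the calendar date; the rest is the offset.
    day = date.fromisoformat(lexical[:10])
    suffix = lexical[10:]
    if suffix == "":
        return day, None
    if suffix == "Z":
        return day, timezone.utc
    if suffix[0] not in "+-":
        raise ValueError(f"unexpected timezone suffix: {suffix!r}")

    sign = -1 if suffix[0] == "-" else 1
    hours, minutes = suffix[1:].split(":")
    offset = timedelta(hours=int(hours), minutes=int(minutes))
    return day, timezone(sign * offset)
```

For example, passing the `filingDate` object from the table returns the date 2019-09-20 together with a fixed offset of four hours behind UTC.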
Incoming Links
Total number of triples: 23.
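
The abstract's first step (frame rate adjustment, frame extraction, normalization) can be illustrated with a small sketch. This is not the patented implementation: the record does not give a target frame rate or a normalization scheme, so the function names, the nearest-earlier-frame sampling, and the mapping of 8-bit pixel values to [-1, 1] are all assumptions made for illustration.

```python
import math


def resample_indices(num_frames, src_fps, dst_fps):
    """Return indices of source frames to keep when converting src_fps to dst_fps.

    Assumption: each output frame takes the source frame at or just before
    its timestamp; the patent record does not say how frames are chosen.
    """
    if num_frames <= 0 or src_fps <= 0 or dst_fps <= 0:
        raise ValueError("num_frames, src_fps and dst_fps must be positive")
    # Number of output frames covering the same duration at the new rate.
    count = math.floor(num_frames * dst_fps / src_fps)
    # Clamp so that upsampling never indexes past the last source frame.
    return [min(int(i * src_fps / dst_fps), num_frames - 1) for i in range(count)]


def normalize_frame(pixels, mean=127.5, scale=127.5):
    """Map 8-bit pixel values to [-1, 1] (assumed scheme, not from the record)."""
    return [(p - mean) / scale for p in pixels]


# Usage: halve the frame rate of a 10-frame clip, then normalize one frame,
# represented here as a flat list of pixel values.
kept = resample_indices(10, src_fps=30, dst_fps=15)
frame = normalize_frame([0, 255])
```

Data augmentation and the three model modules named in the abstract are left out, since the record gives no detail on them.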