http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-114839884-A
Outgoing Links
Predicate | Object |
---|---|
assignee | http://rdf.ncbi.nlm.nih.gov/pubchem/patentassignee/MD5_653c418e41edbb4e1ce8bc5e53e70cec |
classificationCPCAdditional | http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/Y02T10-40 |
classificationCPCInventive | http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G05B13-042 |
classificationIPCInventive | http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G05B13-04 |
filingDate | 2022-07-05-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
inventor | http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_6110dd4002fe5f84ba7ec6f62c79e4ef http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_86c5f0a2bd9183e1ac634dbe5bb3a42c http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_8c8c7dceba3bd690310aa4c5fb6b63c1 http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_1a85e25507e006c4b2369290dc773340 |
publicationDate | 2022-08-02-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
publicationNumber | CN-114839884-A |
titleOfInvention | A low-level control method and system for underwater vehicle based on deep reinforcement learning |
abstract | The invention proposes a bottom layer control method and system of an underwater vehicle based on deep reinforcement learning, including: determining the input, output and system control target of the underwater vehicle control system according to the state information and action information of the underwater vehicle; Convert the system control objective to the underlying control objective of the underwater vehicle under the deep reinforcement learning based on the policy-evaluation network; obtain the new action information and the reward value corresponding to the action according to the state information of the underwater vehicle and store it in the experience recovery The strategy-evaluation network is iteratively trained through the experience recovery pool; the strategy-evaluation network after iterative training is used as a control network to control the bottom layer of the underwater vehicle. By adopting the strategy-evaluation network structure, the collected raw sensor information is processed, and the thruster and rudder angle commands are output to achieve high-precision and adaptive bottom-level control of the underwater vehicle. |
isCitedBy | http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-116295449-A http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-116295449-B |
priorityDate | 2022-07-05-04:00^^<http://www.w3.org/2001/XMLSchema#date> |
type | http://data.epo.org/linked-data/def/patent/Publication |
Incoming Links
Total number of triples: 21.