patent/CN-114839884-A

http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-114839884-A

Outgoing Links

Predicate	Object
assignee	http://rdf.ncbi.nlm.nih.gov/pubchem/patentassignee/MD5_653c418e41edbb4e1ce8bc5e53e70cec
classificationCPCAdditional	http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/Y02T10-40
classificationCPCInventive	http://rdf.ncbi.nlm.nih.gov/pubchem/patentcpc/G05B13-042
classificationIPCInventive	http://rdf.ncbi.nlm.nih.gov/pubchem/patentipc/G05B13-04
filingDate	2022-07-05-04:00^^<http://www.w3.org/2001/XMLSchema#date>
inventor	http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_6110dd4002fe5f84ba7ec6f62c79e4ef http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_86c5f0a2bd9183e1ac634dbe5bb3a42c http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_8c8c7dceba3bd690310aa4c5fb6b63c1 http://rdf.ncbi.nlm.nih.gov/pubchem/patentinventor/MD5_1a85e25507e006c4b2369290dc773340
publicationDate	2022-08-02-04:00^^<http://www.w3.org/2001/XMLSchema#date>
publicationNumber	CN-114839884-A
titleOfInvention	A low-level control method and system for underwater vehicle based on deep reinforcement learning
abstract	The invention proposes a bottom layer control method and system of an underwater vehicle based on deep reinforcement learning, including: determining the input, output and system control target of the underwater vehicle control system according to the state information and action information of the underwater vehicle; Convert the system control objective to the underlying control objective of the underwater vehicle under the deep reinforcement learning based on the policy-evaluation network; obtain the new action information and the reward value corresponding to the action according to the state information of the underwater vehicle and store it in the experience recovery The strategy-evaluation network is iteratively trained through the experience recovery pool; the strategy-evaluation network after iterative training is used as a control network to control the bottom layer of the underwater vehicle. By adopting the strategy-evaluation network structure, the collected raw sensor information is processed, and the thruster and rudder angle commands are output to achieve high-precision and adaptive bottom-level control of the underwater vehicle.
isCitedBy	http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-116295449-A http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-116295449-B
priorityDate	2022-07-05-04:00^^<http://www.w3.org/2001/XMLSchema#date>
type	http://data.epo.org/linked-data/def/patent/Publication

Incoming Links

Predicate	Subject
isCitedBy	http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-112540614-A http://rdf.ncbi.nlm.nih.gov/pubchem/patent/CN-110404264-A
isDiscussedBy	http://rdf.ncbi.nlm.nih.gov/pubchem/substance/SID413130148 http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID16734986

Total number of triples: 21.