bibliographicCitation |
Tan C, Yang L, Wong WS. Learning-Based Control Policy and Regret Analysis for Online Quadratic Optimization With Asymmetric Information Structure. IEEE Trans Cybern. 2022 Jun;52(6):4797–810. doi: 10.1109/tcyb.2021.3049357. PMID: 33502987. |