<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Kaige. Yang | Learning And Signal Processing</title><link>https://ucl-lasp.github.io/author/kaige.-yang/</link><atom:link href="https://ucl-lasp.github.io/author/kaige.-yang/index.xml" rel="self" type="application/rss+xml"/><description>Kaige. Yang</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><image><url>https://ucl-lasp.github.io/media/icon_hu488c70cfa50b07216f285734af4abcd1_22080_512x512_fill_lanczos_center_3.png</url><title>Kaige. Yang</title><link>https://ucl-lasp.github.io/author/kaige.-yang/</link></image><item><title>Kaige Yang</title><link>https://ucl-lasp.github.io/author/kaige-yang/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://ucl-lasp.github.io/author/kaige-yang/</guid><description>&lt;p>I earned by BE in Computer &amp;amp; Communication Engineering from the American University of Beirut (AUB) in 2020 and my MSc in Human Robotics from Imperial College London in 2021. I have also completed a research internship within the Systems Group at ETH Zurich in Summer 2019. From January 2022, I joined the LASP Group as a Phd student. My current research interest is the design of reinforcement learning algorithms in such a way that the energy costs remain reasonable while retaining high performance. My goal is to develop novel data-efficient and generalisable learning strategies.&lt;/p>
&lt;p>I joined LASP as a Phd student with UCL Overseas Research Scholarship in April 2018. Prior to joining the group, I earned my BEng and MSc in Electrical Engineering from UCL during 2012-2017.&lt;/p>
&lt;p>My research interests lie in the field of machine learning. In particular, I am interested in sequential decision making and reinforcement learning.&lt;/p>
&lt;h4 id="research">Research&lt;/h4>
&lt;h5 id="graph-bandit">Graph Bandit&lt;/h5>
&lt;p>We consider the problem of stochastic linear bandit with multiple users where a user graph characterizes the affinity between users is available. The goal is to design an asymptotic optimal and computational light algorithm with improved finite-time regret guarantee. The question to answer is: On the basis of existing provably asymptotic optimal algorithms, could the user graph be exploited to improve the finite-time behaviour of asymptotic optimal algorithms, while keeping the computational complexity low.&lt;/p>
&lt;p>&lt;a href="https://arxiv.org/abs/1907.05632" target="_blank" rel="noopener">Link to ArXiv&lt;/a>&lt;/p>
&lt;h5 id="laplacian-regularized-estimator-error-analysis">Laplacian-regularized Estimator Error Analysis&lt;/h5>
&lt;p>We provide a theoretical analysis of the representation learning problem aimed at learning the latent variables (design matrix) Θ of observations Y with the knowledge of the coefficient matrix X. The design matrix is learned under the assumption that the latent variables Θ are smooth with respect to a (known) topological structure G. To learn such latent variables, we study a graph Laplacian regularized estimator, which is the penalized least squares estimator with penalty term proportional to a Laplacian quadratic form.&lt;/p>
&lt;p>&lt;a href="https://arxiv.org/abs/1902.03720" target="_blank" rel="noopener">Link to Arxiv&lt;/a>&lt;/p></description></item></channel></rss>