MetaLearning-Standford-Lecture4 Posted on 2021-04-14 Edited on 2021-11-03 MetaLearning Learning Note - 4: optimization-based meta-learning; non-parametric few-shot learning; properties of meta-learning algorithms. Read more »
ReinforcementLearning Principle Day4 Posted on 2021-03-05 Edited on 2021-10-08 Reinforcement Learning Day 4 (Coursera video notes on Finite Markov Decision Processes): Specifying Policies; Value Functions; Action-value function; Bellman Equation Derivation; Intuition - Bellman Equation Read more »
ReinforcementLearning-Principle-Day3 Posted on 2020-12-02 Edited on 2021-10-08 Reinforcement Learning Day 3 (Finite Markov Decision Processes): Return, Policy and Value Function; Optimal Policies and Optimal Value Functions; Coursera False Questions; Optimality and Approximation; Summary Read more »
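Confusion Caused by the HHKB's BS and Delete Buttons Posted on 2020-11-25 Edited on 2021-10-08 Recently I have been tinkering with my HHKB keyboard again. I had never figured out what the row of switches on the back of my HHKB is for, and the English description printed there did not make their use clear either. After looking into it carefully, I found that they serve three main purposes. Read more »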
MetaLearning-Standford-Lecture3 Posted on 2020-11-12 Edited on 2021-10-08 MetaLearning Learning Note - 2 Stanford CS330: Multi-Task and Meta-Learning, 2019 | Lecture 3 - Optimization-Based Meta-Learning. Recap: the probabilistic formulation of meta-learning; the general recipe of meta-learning algorithms; black-box adaptation approaches; optimization-based meta-learning algorithms Read more »
MetaLearning-Standford-Lecture2 Posted on 2020-11-04 Edited on 2021-10-08 Preface: Prof. Song and Dr. Xu asked me to do some work around reinforcement learning and meta-learning. Therefore, I chose to learn meta-learning through Stanford’s coursework taught by Chelsea Finn. Read more »
ReinforcementLearning-Principle-Day2 Posted on 2020-10-30 Edited on 2021-10-08 Reinforcement Learning Day 2 (Multi-armed Bandits): Q* Formula; Nonstationary problem; Optimistic Initial Value; Gradient Bandits Problem; Associative Search Read more »
Slurm-Day5 Posted on 2020-09-25 Edited on 2021-10-08 Insights for co-running HPC and GPU tasks - Day 5: Try some examples of co-running HPC and ML tasks; Build the production environment; NPB Read more »