- Sarsa: On-policy TD Control
- Q-learning: Off-policy TD Control
- Maximization Bias and Double Learning
- Games, Afterstates, and Other Special Cases
- Summary
ReinforcementLearning-Principle-Day8
- Abstract
- TD Prediction
- Advantages of TD Prediction Methods
- Optimality of TD(0)
ReinforcementLearning-Principle-Day7
- What is Monte Carlo
- Using Monte Carlo for Prediction
- Using Monte Carlo for Action Values
- Using Monte Carlo methods for generalized policy iteration
- Solving the Blackjack Example
Operating System Memory Address
Preface
In my systems research work, I have found it worthwhile to understand the real implementation of every detail of every component (file system, memory management, etc.). So I am starting a new series here to record my notes and experience from reading Understanding the Linux Kernel, Third Edition, by Daniel P. Bovet. I hope that after reading this book I will be able to understand the papers at OSDI and come up with more useful, novel ideas, rather than thinking in the abstract without considering the real problems and architecture of an operating system.
ReinforcementLearning-Principle-Day6
ReinforcementLearning-Principle-Day5
MetaLearning-Standford-Lecture4
ReinforcementLearning Principle Day4