Lectures
- 1. Introduction (lecture time 43:54)
- 2. Markov Decision Process (lecture time 34:44)
- 3. Dynamic Programming (lecture time 46:48)
- 4. Monte Carlo methods (lecture time 01:06:02)
  - Monte Carlo method overview (video, lecture time 17:48; updated 2022.03.18)
  - Stochastic approximation (video, lecture time 14:44; updated 2022.03.18)
  - MC policy evaluation (video, lecture time 15:04; updated 2022.03.18)
  - MC control (video, lecture time 18:26; updated 2022.03.18)
  - Quiz 4 (updated 2021.11.25)
- 5. Temporal difference methods (lecture time 57:15)
- 6. n-Step TD methods (lecture time 49:52)
  - n-step return (video, lecture time 11:58; updated 2022.03.18)
  - TD(λ) policy evaluation (video, lecture time 14:56; updated 2022.03.18)
  - Eligibility trace and TD control (video, lecture time 08:13; updated 2022.03.18)
  - Q(λ) algorithm (video, lecture time 14:45; updated 2022.03.18)
  - Quiz 6 (updated 2021.11.25)
- 7. Value function approximation (lecture time 53:02)
  - Value function approximation overview (video, lecture time 16:20; updated 2022.03.18)
  - Features for VFA (video, lecture time 14:20; updated 2022.03.18)
  - Application of VFA: Cartpole (video, lecture time 11:31; updated 2022.03.18)
  - Linear VFA for Cartpole (video, lecture time 10:51; updated 2022.03.18)
  - Quiz 7 (updated 2021.11.25)
Coming soon.