MDP vs MRP 강화학습 1 : edwith

강화학습 1

KAIST 산업및시스템공학과 신하용 교수님

http://kooc.kaist.ac.kr/reinforcement/forum/136325

백정 2024.04.13

강의 내용 중,

"If a polity pi is given to MDP, than it becoms a Markov Reward Process." 라는 내용이 있는데,

"If a polity pi is given to MRP, than it becoms a Markov Decision Process." 가 맞는 내용 아닐까요?

앞 페이지에서는 MDP is an MRP with decision이라고 되어 있어서요...

강화학습 1