cs231n lec 14 reinforcement learning

Question

cs231n lec 14 reinforcement learning

52 views Asked by sky park At 11 November 2021 at 11:47

I'm studying CS231N, lecture 14, "Reinforcement Learning". In the lecture, the instructor mentioned the value function, which is shown in the picture:

picture of value function

I am wondering what is that bar between rt and s0? I thought it was something like conditional probability, but I'm not sure about it. Or is it just a division?

Original Q&A

There are 1 answers

**AudioBubble** · Accepted Answer · 2021-11-11T19:38:14+00:00

AudioBubble On 11 November 2021 at 19:38 BEST ANSWER

It's the conditional probability. It literally means the reward at time t, given state s, following policy pi.

TechQA.

cs231n lec 14 reinforcement learning

There are 1 answers

Related Questions in REINFORCEMENT-LEARNING

Related Questions in CS231N

Popular Questions

Trending Questions