there are many different types of ML that we are familiar with
what separates RL from these other branches of ML?
typically in ML, you have:
except in the case of unsupervised! (only $y$)
short answer: it's a little less clear, it is a little less cut and dry.
an important thing to note is that, lots of the time you don't have the access to the full state $(X)$
these spaces can be continuous, discrete, or even pictorially!
they are tuples for a timestep $t$ that contain a state and an action for that cooresponding timestep
\[(s_t, a_t) \rightarrow s_{t+1}\]
in deep reinforcement learning $\pi$ is a neural network, but in more traditional approaches, $\pi$ is a table
drl becomes useful when you are operating in a non-differentiable environment
acrobot our reward function looks something like the following:
these problems naturally lend themselves to being effectively solved by these techniques!
Which of the following might be an area wherein DRL could be useful?
Which of the following might be an area wherein DRL could be useful?
Which of the following might be an area wherein DRL could be useful?
just to be clear this stuff is stolen straight from skrl