Commit df835ecc authored by Daniel Lukats's avatar Daniel Lukats

feedback fabian

parent 912fde77
\section{From Theory to Application}
\section{From Theory to Practice}
\label{sec:04:implementation}
\input{04_implementation/introduction}
......
......@@ -70,7 +70,7 @@ Finally, the four most recent maximized images seen by an agent are combined to
elements of the tensor are set to 0, which represents the color black. The same applies to simulated episode ends and
beginnings as performed by the \emph{episodic life} operation.
Although no explanation is given by \citeA{nature_dqn}, one advantage is at hand: by stacking images we provide an agent
Although no explanation is given by \citeA{nature_dqn}, one benefit is apparent: by stacking images we provide an agent
further information on the state of the game such as direction and movement. If the agent would see one frame only, it
could not determine which direction the ball is moving in Pong. By showing it four frames at once, the agent can discern
if the ball is moving towards itself or the enemy or whether it will hit a wall or not (cf.~figure
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment