Reinforcement Learning: What is, Algorithms, Applications, Example

What is Reinforcement Learning?

Reinforcement Learning is defined as a Machine Learning method that is concerned with how software agents should take actions in an environment. Reinforcement Learning is a part of the deep learning method that helps you to maximize some portion of the cumulative reward.

This neural network learning method helps you to learn how to attain a complex objective or maximize a specific dimension over many steps.

In Reinforcement Learning tutorial, you will learn:

Important terms used in Deep Reinforcement Learning method

Here are some important terms used in Reinforcement AI:

How Reinforcement Learning works?

Let's see some simple example which helps you to illustrate the reinforcement learning mechanism.

Consider the scenario of teaching new tricks to your cat

Explanation about the example:

How Reinforcement Learning works

In this case,

Reinforcement Learning Algorithms

There are three approaches to implement a Reinforcement Learning algorithm.

Value-Based:

In a value-based Reinforcement Learning method, you should try to maximize a value function V(s). In this method, the agent is expecting a long-term return of the current states under policy π.

Policy-based:

In a policy-based RL method, you try to come up with such a policy that the action performed in every state helps you to gain maximum reward in the future.

Two types of policy-based methods are:

Model-Based:

In this Reinforcement Learning method, you need to create a virtual model for each environment. The agent learns to perform in that specific environment.

Characteristics of Reinforcement Learning

Here are important characteristics of reinforcement learning

Types of Reinforcement Learning

Two kinds of reinforcement learning methods are:

Positive:

It is defined as an event, that occurs because of specific behavior. It increases the strength and the frequency of the behavior and impacts positively on the action taken by the agent.

This type of Reinforcement helps you to maximize performance and sustain change for a more extended period. However, too much Reinforcement may lead to over-optimization of state, which can affect the results.

Negative:

Negative Reinforcement is defined as strengthening of behavior that occurs because of a negative condition which should have stopped or avoided. It helps you to define the minimum stand of performance. However, the drawback of this method is that it provides enough to meet up the minimum behavior.

Learning Models of Reinforcement

There are two important learning models in reinforcement learning:

Markov Decision Process

The following parameters are used to get a solution:

The mathematical approach for mapping a solution in reinforcement Learning is recon as a Markov Decision Process or (MDP).

Q-Learning

Q learning is a value-based method of supplying information to inform which action an agent should take.

Let's understand this method by the following example:

Next, you need to associate a reward value to each door:

Explanation:

In this image, you can view that room represents a state

Agent's movement from one room to another represents an action

In the below-given image, a state is described as a node, while the arrows show the action.

For example, an agent traverse from room number 2 to 5

Reinforcement Learning vs. Supervised Learning

Parameters Reinforcement Learning Supervised Learning
Decision style reinforcement learning helps you to take your decisions sequentially. In this method, a decision is made on the input given at the beginning.
Works on Works on interacting with the environment. Works on examples or given sample data.
Dependency on decision In RL method learning decision is dependent. Therefore, you should give labels to all the dependent decisions. Supervised learning the decisions which are independent of each other, so labels are given for every decision.
Best suited Supports and work better in AI, where human interaction is prevalent. It is mostly operated with an interactive software system or applications.
Example Chess game Object recognition

Applications of Reinforcement Learning

Here are applications of Reinforcement Learning:

Why use Reinforcement Learning?

Here are prime reasons for using Reinforcement Learning:

When Not to Use Reinforcement Learning?

You can't apply reinforcement learning model is all the situation. Here are some conditions when you should not use reinforcement learning model.

Challenges of Reinforcement Learning

Here are the major challenges you will face while doing Reinforcement earning:

Summary:

 

YOU MIGHT LIKE: