What is reinforcement learning?

9th June 2020

Neil Ballinger explains how reinforcement learning works and delves deeper into its applications in manufacturing

What do cobots, dogs and dolphins have in common? The fact that they can all be trained by rewarding desired behaviours while ignoring undesired ones.

For animals, this training technique is called positive reinforcement. For machines, it is known as reinforcement learning, and falls under the broader umbrella of machine learning.

Reinforcement learning is a form of machine learning where a computer learns to complete a task through repeated interaction with a dynamic environment.

Through an iterative trial-and-error approach, the machine explores the environment. This exploration generates data, which the machine uses to determine the best course of action to complete its job. This happens without human intervention and without having to program the machine explicitly for the specific task.

Reinforcement learning differs from supervised machine learning, in which algorithms are trained on data sets that contain the correct answer to a given problem.

In reinforcement learning there is no answer – the machine has to find one by trying different courses of action and eventually selecting the one that gives the most reward with the least effort.

We could say that in the absence of answers, the machine learns through its own experience. The component that decides which action to take is known as the ‘agent’.

How it works

Imagine that a dog in a garden is given a tennis ball. The dog, which represents the agent, will first observe the garden and construct its own representation of the environment. It will then wonder: what can I do with this ball? What happens if I throw it? Can I hide it? If so, where?

It will choose a course of action, such as hiding the ball, and observe how the owner responds. If the owner simply stares at the dog and doesn’t interact, the dog will find this dull, receiving a negative reward.

The dog will repeat the process until it realises that bringing the ball back to the owner results in a smile and a treat, that is, a positive reward. It will then understand that this action is the best one to maximise rewards.

Reinforcement learning algorithms encourage a machine to act in a similar way, interacting with a dynamic environment – for example a factory floor with several production lines – until it finds the course of action that earns the most reward.
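The dog's trial-and-error process can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the three actions, their reward values and the function name train_dog are invented for the example, and the epsilon-greedy rule used here is one simple strategy among many.

```python
import random

def train_dog(episodes=500, epsilon=0.1, seed=0):
    """Epsilon-greedy trial and error over three hypothetical actions."""
    rng = random.Random(seed)
    # Assumed rewards standing in for the owner's reaction to each action.
    rewards = {"hide": -1.0, "chew": 0.0, "fetch": 1.0}
    actions = list(rewards)
    value = {a: 0.0 for a in actions}  # running estimate of each action's reward
    count = {a: 0 for a in actions}    # how often each action has been tried

    for _ in range(episodes):
        if rng.random() < epsilon:
            action = rng.choice(actions)          # explore: try something at random
        else:
            action = max(actions, key=value.get)  # exploit: best action so far
        # The owner's reaction is slightly noisy from one attempt to the next.
        reward = rewards[action] + rng.gauss(0, 0.1)
        count[action] += 1
        # Incremental average of the rewards observed for this action.
        value[action] += (reward - value[action]) / count[action]
    return value

estimates = train_dog()
best_action = max(estimates, key=estimates.get)
print(best_action, estimates)
```

Most of the time the agent repeats the action it currently rates highest, but it occasionally tries a random one. That occasional exploration is what lets it discover that fetching pays better than the alternatives.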

Applications in manufacturing

In industrial manufacturing, reinforcement learning is used in processes where complex decision-making skills are required, especially where machines need to cope with changes in dynamic environments.

For example, a cobot can be trained to find the best path around obstacles, such as objects or the limbs of human workers, while continuing to perform its task. This would be simple for a human, but for machines it is an incredibly complex process that requires a careful analysis of an unpredictable environment.

If successful, the cobot will be more productive, because it won’t need to stop to avoid impact.
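A toy version of this path-finding problem can be sketched with tabular Q-learning, a widely used reinforcement learning algorithm. Everything below is an assumption made for illustration: the floor plan, the reward values and the function names are hypothetical, the obstacles are static, and a real cobot would work with far richer sensor data.

```python
import random

# Hypothetical floor plan: S = start, G = goal, # = obstacle, . = free cell.
GRID = [
    "S..#",
    ".#..",
    "...G",
]
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}
START = (0, 0)

def step(state, action):
    """Apply a move and return (next_state, reward, done)."""
    row, col = state
    d_row, d_col = MOVES[action]
    n_row, n_col = row + d_row, col + d_col
    inside = 0 <= n_row < len(GRID) and 0 <= n_col < len(GRID[0])
    if not inside or GRID[n_row][n_col] == "#":
        return state, -5.0, False          # collision: stay put, large penalty
    if GRID[n_row][n_col] == "G":
        return (n_row, n_col), 10.0, True  # task completed
    return (n_row, n_col), -1.0, False     # small cost for every move

def train(episodes=2000, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn a table mapping (state, action) to estimated long-term reward."""
    rng = random.Random(seed)
    q = {}
    for _episode in range(episodes):
        state = START
        for _move in range(100):           # cap the length of each episode
            if rng.random() < epsilon:
                action = rng.choice(list(MOVES))                           # explore
            else:
                action = max(MOVES, key=lambda a: q.get((state, a), 0.0))  # exploit
            nxt, reward, done = step(state, action)
            best_next = 0.0 if done else max(q.get((nxt, a), 0.0) for a in MOVES)
            old = q.get((state, action), 0.0)
            # Q-learning update: nudge the estimate towards reward + discounted future.
            q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
            state = nxt
            if done:
                break
    return q

def greedy_path(q, max_steps=20):
    """Follow the highest-valued action from the start cell."""
    state, path = START, [START]
    for _move in range(max_steps):
        action = max(MOVES, key=lambda a: q.get((state, a), 0.0))
        state, _reward, done = step(state, action)
        path.append(state)
        if done:
            break
    return path

q_table = train()
path = greedy_path(q_table)
print(path)
```

The penalty for collisions and the small cost per move are what push the agent towards a short route that stays clear of the obstacles. Changing those assumed values changes which behaviour is rewarded.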

Reinforcement learning can also be used to streamline production, an approach used by researchers at the Industrial AI Lab at Hitachi America.

The researchers modelled a virtual shop floor as a two-dimensional matrix and used reinforcement learning algorithms to repeatedly interact with this virtual environment.

By doing this, they were able to determine the best setup to increase productivity and reduce delays in servicing their customers.

Applications of reinforcement learning in manufacturing are just emerging, but the first experiments are already offering promising results. Industrial machines work hard to increase your productivity. It’s time to reward them.

Neil Ballinger is head of EMEA sales at automation parts supplier EU Automation.

Engineer News Network The ultimate online news and information resource for today’s engineer
