Reinforcement Learning To Help Autonomous Vehicles and Underwater Robots to Locate and Track Marine Objects and Animals

Category Machine Learning

tldr #

A research team has demonstrated that reinforcement learning allows robots to identify and track marine objects and animals. This work was done by using range acoustic techniques and artificial intelligence which allows the robot to identify the best points to reach the goal. Neural networks were trained with the powerful supercomputer located at the Barcelona Supercomputing Center to adjust the parameters of different algorithms much faster.


content #

A research team has shown for the first time that reinforcement learning—i.e., a neural network that learns the best action to perform at each moment based on a series of rewards—allows autonomous vehicles and underwater robots to locate and carefully track marine objects and animals. The details are reported in a paper published in Science Robotics.

Currently, underwater robotics is emerging as a key tool for improving knowledge of the oceans in the face of the many difficulties in exploring them, with vehicles capable of descending to depths of up to 4,000 meters. In addition, the in-situ data they provide help to complement other data, such as that obtained from satellites. This technology makes it possible to study small-scale phenomena, such as CO2 capture by marine organisms, which helps to regulate climate change.

The barriers to ocean exploration has made it difficult to explore the depths of the ocean, but advances in underwater robotics have made it possible to explore more applications.

Specifically, this new work reveals that reinforcement learning, widely used in the field of control and robotics, as well as in the development of tools related to natural language processing such as ChatGPT, allows underwater robots to learn what actions to perform at any given time to achieve a specific goal. These action policies match, or even improve in certain circumstances, traditional methods based on analytical development.

Underwater robots are able to capture in-situ data which in turn helps to complement data from satellites.

"This type of learning allows us to train a neural network to optimize a specific task, which would be very difficult to achieve otherwise. For example, we have been able to demonstrate that it is possible to optimize the trajectory of a vehicle to locate and track objects moving underwater," explains Ivan Masmitjà, the lead author of the study, who has worked between Institut de Ciències del Mar (ICM-CSIC) and the Monterey Bay Aquarium Research Institute (MBARI).

The marine organisms capture CO2 which helps regulate climate change.

This "will allow us to deepen the study of ecological phenomena such as migration or movement at small and large scales of a multitude of marine species using autonomous robots. In addition, these advances will make it possible to monitor other oceanographic instruments in real time through a network of robots, where some can be on the surface monitoring and transmitting by satellite the actions performed by other robotic platforms on the seabed," points out the ICM-CSIC researcher Joan Navarro, who also participated in the study.

Reinforcement learning allows neural networks to learn the best action to perform at each moment based on a series of rewards.

To carry out this work, researchers used range acoustic techniques, which allow estimating the position of an object considering distance measurements taken at different points. However, this fact makes the accuracy in locating the object highly dependent on the place where the acoustic range measurements are taken.

And this is where the application of artificial intelligence and, specifically, reinforcement learning, which allows the identification of the best points and, therefore, the optimal trajectory to be performed by the robot, becomes important.

Researchers used range acoustic techniques, which gives an estimate of the position of an object based on distance measurements taken at different points.

Neural networks were trained, in part, using the computer cluster at the Barcelona Supercomputing Center (BSC-CNS), where the most powerful supercomputer in Spain and one of the most powerful in Europe are located. "This made it possible to adjust the parameters of different algorithms much faster than using conventional systems, which is of great help when designing an AI system to control a robotics platform," explains Yash Mulgaonkar, a Ramón y Cajal researcher at BSC.

Through the use of the powerful supercomputer located at the Barcelona Supercomputing Center, the parameters of different algorithms were adjusted faster.

hashtags #
worddensity #

Share