November 4, 2019

Navigation method may speed up autonomous last-mile delivery

by Jennifer Chu, Massachusetts Institute of Technology

In the not too distant future, robots may be dispatched as last-mile delivery vehicles to drop your takeout order, package, or meal-kit subscription at your doorstep—if they can find the door.

Standard approaches for robotic navigation involve mapping an area ahead of time, then using algorithms to guide a robot toward a specific goal or GPS coordinate on the map. While this approach might make sense for exploring specific environments, such as the layout of a particular building or planned obstacle course, it can become unwieldy in the context of last-mile delivery.

Imagine, for instance, having to map in advance every single neighborhood within a robot's delivery zone, including the configuration of each house within that neighborhood along with the specific coordinates of each house's front door. Such a task can be difficult to scale to an entire city, particularly as the exteriors of houses often change with the seasons. Mapping every single house could also run into issues of security and privacy.

Now MIT engineers have developed a navigation method that doesn't require mapping an area in advance. Instead, their approach enables a robot to use clues in its environment to plan out a route to its destination, which can be described in general semantic terms, such as "front door" or "garage," rather than as coordinates on a map. For example, if a robot is instructed to deliver a package to someone's front door, it might start on the road and see a driveway, which it has been trained to recognize as likely to lead toward a sidewalk, which in turn is likely to lead to the front door.

The new technique can greatly reduce the time a robot spends exploring a property before identifying its target, and it doesn't rely on maps of specific residences.

"We wouldn't want to have to make a map of every building that we'd need to visit," says Michael Everett, a graduate student in MIT's Department of Mechanical Engineering. "With this technique, we hope to drop a robot at the end of any driveway and have it find a door."

Everett will present the group's results this week at the International Conference on Intelligent Robots and Systems. The paper, which is co-authored by Jonathan How, professor of aeronautics and astronautics at MIT, and Justin Miller of the Ford Motor Company, is a finalist for "Best Paper for Cognitive Robots."

Credit: Massachusetts Institute of Technology

"A sense of what things are"

In recent years, researchers have worked on introducing natural, semantic language to robotic systems, training robots to recognize objects by their semantic labels, so they can visually process a door as a door, for example, and not simply as a solid, rectangular obstacle.

"Now we have an ability to give robots a sense of what things are, in real-time," Everett says.

Everett, How, and Miller are using similar semantic techniques as a springboard for their new navigation approach, which leverages pre-existing algorithms that extract features from visual data to generate a new map of the same scene, represented as semantic clues, or context.

In their case, the researchers used an algorithm to build up a map of the environment as the robot moved around, using the semantic labels of each object and a depth image. This algorithm is called semantic SLAM (Simultaneous Localization and Mapping).

While other semantic algorithms have enabled robots to recognize and map objects in their environment for what they are, they haven't allowed a robot to make decisions in the moment while navigating a new environment, on the most efficient path to take to a semantic destination such as a "front door."

"Before, exploring was just, plop a robot down and say 'go," and it will move around and eventually get there, but it will be slow," How says.

The cost to go

The researchers looked to speed up a robot's path-planning through a semantic, context-colored world. They developed a new "cost-to-go estimator," an algorithm that converts a semantic map created by preexisting SLAM algorithms into a second map, representing the likelihood of any given location being close to the goal.

"This was inspired by image-to-image translation, where you take a picture of a cat and make it look like a dog," Everett says. "The same type of idea happens here where you take one image that looks like a map of the world, and turn it into this other image that looks like the map of the world but now is colored based on how close different points of the map are to the end goal."

This cost-to-go map is colorized, in gray-scale, to represent darker regions as locations far from a goal, and lighter regions as areas that are close to the goal. For instance, the sidewalk, coded in yellow in a semantic map, might be translated by the cost-to-go algorithm as a darker region in the new map, compared with a driveway, which is progressively lighter as it approaches the front door—the lightest region in the new map.

The researchers trained this new algorithm on satellite images from Bing Maps containing 77 houses from one urban and three suburban neighborhoods. The system converted a semantic map into a cost-to-go map, and mapped out the most efficient path, following lighter regions in the map, to the end goal. For each satellite image, Everett assigned semantic labels and colors to context features in a typical front yard, such as grey for a front door, blue for a driveway, and green for a hedge.

During this training process, the team also applied masks to each image to mimic the partial view that a robot's camera would likely have as it traverses a yard.

"Part of the trick to our approach was [giving the system] lots of partial images," How explains. "So it really had to figure out how all this stuff was interrelated. That's part of what makes this work robustly."

The researchers then tested their approach in a simulation of an image of an entirely new house, outside of the training dataset, first using the preexisting SLAM algorithm to generate a semantic map, then applying their new cost-to-go estimator to generate a second map, and path to a goal, in this case, the front door.

The group's new cost-to-go technique found the front door 189 percent faster than classical navigation algorithms, which do not take context or semantics into account, and instead spend excessive steps exploring areas that are unlikely to be near their goal.

Everett says the results illustrate how robots can use context to efficiently locate a goal, even in unfamiliar, unmapped environments.

"Even if a robot is delivering a package to an environment it's never been to, there might be clues that will be the same as other places it's seen," Everett says. "So the world may be laid out a little differently, but there's probably some things in common."

Provided by Massachusetts Institute of Technology

This story is republished courtesy of MIT News (web.mit.edu/newsoffice/), a popular site that covers news about MIT research, innovation and teaching.

Citation: Navigation method may speed up autonomous last-mile delivery (2019, November 4) retrieved 17 July 2024 from https://techxplore.com/news/2019-11-method-autonomous-last-mile-delivery.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Bipedal robot Digit seen as final step in autonomous delivery system

143 shares

Feedback to editors

Flexible electronics researchers develop a completely stretchy lithium-ion battery

1 hour ago

A strategy to enhance the stability of perovskite solar cells under reverse bias conditions

3 hours ago

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

18 hours ago

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

20 hours ago

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

22 hours ago

Large language models make human-like reasoning mistakes, researchers find

23 hours ago

Unveiling a new class of synthetic fuels

23 hours ago

Microsoft unveils software that allows LLMs to work with spreadsheets

23 hours ago

New technique to assess a general-purpose AI model's reliability before it's deployed

Jul 16, 2024

New system enables intuitive teleoperation of a robotic manipulator in real-time

Jul 16, 2024

Load comments (0)

Navigation method may speed up autonomous last-mile delivery

The cost to go

Flexible electronics researchers develop a completely stretchy lithium-ion battery

A strategy to enhance the stability of perovskite solar cells under reverse bias conditions

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Bipedal robot Digit seen as final step in autonomous delivery system

A new approach to enhance robot navigation in indoor environments

New robot rolls with the rules of pedestrian conduct

Model helps robots navigate more like humans do

Robots' maps of their environments can make existing object-recognition algorithms more accurate

A new method to express robot incapability

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

New system enables intuitive teleoperation of a robotic manipulator in real-time

Open-TeleVision allows VR-type control of remote robot

New framework enables animal-like agile movements in four-legged robots

DeepMind demonstrates a robot capable of giving context-based guided tours of an office building

A chemical claw machine: Vapor exposure enables soft actuator to perform diverse tasks

Phys.org

Medical Xpress

Science X

Navigation method may speed up autonomous last-mile delivery

The cost to go

Flexible electronics researchers develop a completely stretchy lithium-ion battery

A strategy to enhance the stability of perovskite solar cells under reverse bias conditions

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Related Stories

Bipedal robot Digit seen as final step in autonomous delivery system

A new approach to enhance robot navigation in indoor environments

New robot rolls with the rules of pedestrian conduct

Model helps robots navigate more like humans do

Robots' maps of their environments can make existing object-recognition algorithms more accurate

A new method to express robot incapability

Recommended for you

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

New system enables intuitive teleoperation of a robotic manipulator in real-time

Open-TeleVision allows VR-type control of remote robot

New framework enables animal-like agile movements in four-legged robots

DeepMind demonstrates a robot capable of giving context-based guided tours of an office building

A chemical claw machine: Vapor exposure enables soft actuator to perform diverse tasks

Your Privacy