Figure 10 (available under a CC BY license). Clearpath dual UR5 arm Husky robot used for the real-world experiments.


Source publication
Article
Mobile manipulation has a broad range of applications in robotics. However, it is usually more challenging than fixed-base manipulation due to the complex coordination of a mobile base and a manipulator. Although recent works have demonstrated that deep reinforcement learning is a powerful technique for fixed-base manipulation tasks, most of them a...

Context in source publication

Context 1
... evaluate the learned policy in practice, we test the trained model and policy in the real environment. As shown in Figure 10, a Clearpath dual UR5 arm Husky robot is used to perform the mobile manipulation task, based on an on-board RGB camera. ...
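The context above only states that the trained policy is executed on the Husky from an on-board RGB camera. Below is a minimal sketch of what such a closed-loop deployment can look like; the `camera` and `robot` interfaces and the policy's output convention are illustrative assumptions, not the paper's actual deployment code.

```python
# Illustrative deployment loop (assumed interfaces, not the paper's code):
# read an RGB frame, run the trained policy, send the resulting base/arm command.
import time
import numpy as np
import torch

def preprocess(rgb_image: np.ndarray) -> torch.Tensor:
    """Convert an HxWxC uint8 camera frame into a 1xCxHxW float tensor in [0, 1]."""
    img = torch.from_numpy(rgb_image).float() / 255.0
    return img.permute(2, 0, 1).unsqueeze(0)

def control_loop(policy, camera, robot, rate_hz: float = 10.0):
    """Closed-loop execution of a trained policy from on-board RGB observations."""
    policy.eval()
    while not robot.task_done():                      # hypothetical helper
        obs = preprocess(camera.read())               # hypothetical camera interface
        with torch.no_grad():
            action = policy(obs).squeeze(0).numpy()   # e.g. base velocity + joint deltas
        robot.send_command(action)                    # hypothetical command interface
        time.sleep(1.0 / rate_hz)
```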

Similar publications

Preprint
Autonomous and Robotics Systems (ARSs) are widespread, complex, and increasingly coming into contact with the public. Many of these systems are safety-critical, and it is vital to detect software errors to protect against harm. We propose a family of novel techniques to detect unusual program executions and incorrect program behavior. We model exec...
Article
This paper presents a team of multiple Unmanned Aerial Vehicles (UAVs) to perform cooperative missions for autonomous construction. In particular, the UAVs have to build a wall made of bricks that need to be picked and transported from different locations. First, we propose a novel architecture for multi-robot systems operating in outdoor and unstr...
Article
Exploration is a fundamental problem in robot autonomy. A major limitation, however, is that during exploration robots oftentimes have to rely on on-board systems alone for state estimation, accumulating significant drift over time in large environments. Drift can be detrimental to robot safety and exploration performance. In this work, a submap-bas...

Citations

... The time efficiency of MM can be improved by jointly solving navigation and manipulation tasks, for example by using whole-body (base & manipulator) motion control [3], [4], [5], [6], [7], by determining an optimal base pose for grasping [8], [9], [10], or by jointly planning the base pose and pre-grasp manipulator configuration [11], [12]. The whole-body motion control methods attempt to directly reach the desired End-Effector (EE) pose by combining base & manipulator motion. ...
Article
In Mobile Manipulation (MM), navigation and manipulation are generally solved as subsequent disjoint tasks. Combined optimization of navigation and manipulation costs can improve the time efficiency of MM. However, this is challenging as precise object pose estimates, which are necessary for such combined optimization, are often not available until the later stages of MM. Moreover, optimizing navigation and manipulation costs with conventional planning methods using uncertain object pose estimates can lead to failures and hence requires replanning. Therefore, in the presence of object pose uncertainty, pre-active approaches are preferred. We propose such a pre-active approach for determining the base pose and pre-grasp manipulator configuration to improve the time efficiency of MM. We devise a Reinforcement Learning (RL) based solution that learns suitable base poses for grasping and pre-grasp manipulator configurations using layered learning that guides exploration and enables sample-efficient learning. Further, we accelerate learning of pre-grasp manipulator configurations by providing dense rewards using a predictor network trained on previously learned base poses for grasping. Our experiments validate that in the presence of uncertain object pose estimates, the proposed approach results in reduced execution time. Finally, we show that our policy learned in simulation can be easily transferred to a real robot. The code repository and the supplementary video can be found on the project webpage*.
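As a rough illustration of the dense-reward idea sketched in this abstract, the snippet below uses a small predictor network's output as a shaping signal for the base pose; the architecture, pose parameterization, and reward scale are assumptions for illustration, not the authors' implementation.

```python
# Illustrative sketch only: a small predictor network whose output is used as a
# dense shaping reward for the base pose (not the authors' code).
import torch
import torch.nn as nn

class BasePosePredictor(nn.Module):
    """Predicts a suitable base pose (x, y, yaw) for grasping from an object pose."""
    def __init__(self, obj_dim: int = 3, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obj_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 3),
        )

    def forward(self, obj_pose: torch.Tensor) -> torch.Tensor:
        return self.net(obj_pose)

def dense_base_pose_reward(predictor: BasePosePredictor,
                           obj_pose: torch.Tensor,
                           current_base_pose: torch.Tensor) -> float:
    """Negative distance to the predicted base pose acts as a dense shaping term."""
    with torch.no_grad():
        target = predictor(obj_pose)
    return -torch.norm(target - current_base_pose).item()
```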
... Learning [100]: Mobile manipulation is difficult because of the complex coordination of the mobile base and manipulator. ...
Article
This article presents a literature review of the past five years of studies using Deep Reinforcement Learning (DRL) and Inverse Reinforcement Learning (IRL) in robotic manipulation tasks. The reviewed articles are examined in various categories, including DRL and IRL for perception, assembly, manipulation with uncertain rewards, multitasking, transfer learning, multimodal, and Human-Robot Interaction (HRI). The articles are summarized in terms of the main contributions, methods, challenges, and highlights of the latest and relevant studies using DRL and IRL for robotic manipulation. Additionally, summary tables regarding the problem and solution are presented. The literature review then focuses on the concepts of trustworthy AI, interpretable AI, and explainable AI (XAI) in the context of robotic manipulation. Moreover, this review provides a resource for future research on DRL/IRL in trustworthy robotic manipulation.
... However, this method has a limitation in fully utilizing a mobile manipulator. The second approach considers a mobile manipulator as a single high-degree-of-freedom system [9][10]. However, the high computational cost of this method makes it challenging to perform other tasks such as online obstacle avoidance. ...
... They used the RRT-GoalBias algorithm and post-processed the initial path to smooth it. In [10], motion planning was performed using a reinforcement learning approach based on the proximal policy optimization (PPO) algorithm for grasping objects. However, the high computational cost of this method makes it difficult to perform other tasks such as online obstacle avoidance. ...
... As a result, the work is expressed by (9), representing the VIEF value at position 2. Because it takes a quadratic form in position 2, its minimum value can be computed by differentiating with respect to position 2, as shown in (10). Equation (11) confirms that when there are no external forces or dampers in the system, this minimum reduces to a term that resembles a potential field. ...
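The original symbols in the excerpt above were lost in extraction; purely as a generic illustration of the step being described, a quadratic form in the base position has its minimum where its derivative vanishes:

```latex
% Generic illustration only: k, x^{*}, and c are placeholder symbols,
% not the paper's notation.
W(x_2) = \tfrac{1}{2}\,k\,(x_2 - x^{*})^2 + c,
\qquad
\frac{\mathrm{d}W}{\mathrm{d}x_2} = k\,(x_2 - x^{*}) = 0
\;\Rightarrow\;
x_2 = x^{*}.
```

With no external forces or dampers, the constant term that remains behaves like a potential field, which matches the statement attributed to (11) in the excerpt.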
Article
Motion planning for mobile manipulators is challenging because of their high degrees of freedom. The most effective approach for the motion planning of a mobile manipulator is to consider the different characteristics of a mobile robot and a manipulator while planning and controlling each system separately. In a previous study, different characteristics were considered using virtual impedance. This method involves forming a virtual impedance relationship between the two subsystems, enabling the mobile robot to track the movement of the manipulator. However, this study had certain limitations. Firstly, this method is not applicable to non-holonomic mobile robots. Secondly, obstacle avoidance methods for mobile robots are not considered. To address these limitations, we propose a novel concept for our motion planner that is called the virtual impedance energy field (VIEF), which refers to the work and change in total energy. We solved the first limitation of the previous study by transforming the virtual impedance force into the VIEF. Moreover, by integrating an obstacle field with the VIEF and using it as a local costmap, the second limitation was addressed. To validate the performance of our motion planner, we conducted simulations in environments comprising obstacles, with straight and curved trajectories of the end-effector. We then analyzed the pose change graph of the mobile robot and end-effector, pose error of the end-effector, and velocity of the mobile robot. We therefore confirmed that the manipulator successfully followed the desired trajectory, while the mobile robot maintained a distance from the end-effector and avoided obstacles.
... In recent years, deep learning has been introduced into path-planning problems with good results [4]. Compared with deep reinforcement learning, Value Iteration Networks provide better performance on never-before-seen maps and enhanced generalization capabilities [5]. VINet embeds a "planning program" in the deep neural network, so that the network learns not only a direct mapping from state to decision but also how to plan over the long term, and this learned planning capability is used to make better decisions. ...
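Since the snippet above leans on Value Iteration Networks, below is a minimal sketch of a VIN-style value-iteration module in the spirit of Tamar et al. (2016); the channel counts, kernel size, and number of iterations are illustrative assumptions, not values from the cited works.

```python
# Minimal sketch of a VIN-style module: value iteration implemented as a
# convolution over [reward, value] maps followed by a max over action channels.
import torch
import torch.nn as nn

class ValueIterationModule(nn.Module):
    def __init__(self, num_actions: int = 8, k_iters: int = 20):
        super().__init__()
        self.k_iters = k_iters
        # Maps the stacked [reward, value] maps to per-action Q-value maps.
        self.q_conv = nn.Conv2d(2, num_actions, kernel_size=3, padding=1, bias=False)

    def forward(self, reward_map: torch.Tensor) -> torch.Tensor:
        # reward_map: (B, 1, H, W) grid of per-cell rewards.
        value = torch.zeros_like(reward_map)
        for _ in range(self.k_iters):
            q = self.q_conv(torch.cat([reward_map, value], dim=1))  # (B, A, H, W)
            value, _ = torch.max(q, dim=1, keepdim=True)            # greedy backup
        return value
```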
Article
Motion planning is a key technology for mobile robots: it decomposes a motion task that cannot be completed by a single action into multiple discrete, executable actions. This paper designs a robot motion planning algorithm based on reinforcement learning that enables a robot to carry out continuous multi-objective-point motion planning. The motion planning network is a neural-network-based planning algorithm, and DQN is a classical reinforcement learning algorithm. Combining the two, the Deep Q-learning algorithm chooses the robot's next target, and the motion planning network then plans a path from the current coordinates to that target. This paper analyzes the performance of the resulting multi-point motion planning algorithm; the results show that it completes the planning task with a high success rate, although the reward strategy derived from the experiments still leaves room for optimization.
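As a rough sketch of the two-level scheme this abstract describes (a DQN-style policy picks the next target point, and a motion-planning network produces the path to it), the snippet below illustrates the control flow; all interfaces and the state encoding are hypothetical stand-ins.

```python
# Illustrative control flow only (hypothetical interfaces): a DQN-style policy
# orders the targets, and a separate motion-planning network connects consecutive poses.
import random

def select_next_target(q_network, state, targets, epsilon: float = 0.1):
    """Epsilon-greedy choice of the next target based on per-target Q-values."""
    if random.random() < epsilon:
        return random.choice(targets)
    q_values = q_network(state)                      # assumed: dict target -> Q-value
    return max(targets, key=lambda t: q_values[t])

def run_episode(q_network, plan_path, robot_pose, targets):
    """Visit all target points: choose the next one, plan a path, move, repeat."""
    remaining = list(targets)
    while remaining:
        goal = select_next_target(q_network, (robot_pose, tuple(remaining)), remaining)
        path = plan_path(robot_pose, goal)           # motion-planning network stand-in
        robot_pose = path[-1]                        # assume the last waypoint reaches the goal
        remaining.remove(goal)
    return robot_pose
```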
... Other works focus on tabletop rearrangement [28,21,12], while our work focuses on longer-horizon mobile manipulation. Although mobile manipulation has witnessed significant achievements in robot learning [29,43,26,30,19], due to the vastly increased complexity of rearrangement tasks, a majority of previous works [6,40] explicitly separate the base and arm movements to simplify the task. A few works [16,47,45] have shown the superiority of mobile manipulation over static manipulation. ...
Preprint
We present Skill Transformer, an approach for solving long-horizon robotic tasks by combining conditional sequence modeling and skill modularity. Conditioned on egocentric and proprioceptive observations of a robot, Skill Transformer is trained end-to-end to predict both a high-level skill (e.g., navigation, picking, placing), and a whole-body low-level action (e.g., base and arm motion), using a transformer architecture and demonstration trajectories that solve the full task. It retains the composability and modularity of the overall task through a skill predictor module while reasoning about low-level actions and avoiding hand-off errors, common in modular approaches. We test Skill Transformer on an embodied rearrangement benchmark and find it performs robust task planning and low-level control in new scenarios, achieving a 2.5x higher success rate than baselines in hard rearrangement problems.
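To make the two-headed prediction described above concrete, here is a minimal sketch of a shared encoder with a discrete skill head and a continuous whole-body action head; the dimensions and the plain MLP encoder are placeholders, not the Skill Transformer architecture.

```python
# Illustrative two-headed policy sketch (assumed dimensions, not the actual model):
# one head classifies the current skill, the other regresses the whole-body action.
import torch
import torch.nn as nn

class SkillAndActionHead(nn.Module):
    def __init__(self, obs_dim: int = 512, num_skills: int = 3, action_dim: int = 10):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(obs_dim, 256), nn.ReLU())
        self.skill_head = nn.Linear(256, num_skills)    # e.g. navigate / pick / place
        self.action_head = nn.Linear(256, action_dim)   # base + arm command

    def forward(self, obs: torch.Tensor):
        h = self.encoder(obs)
        return self.skill_head(h), self.action_head(h)
```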
... However, designing planning and control systems for highly complex systems such as mobile manipulators is a challenge for conventional methods, as they are sensitive to parameters and require significant engineering effort [11]. To address this issue, reinforcement learning (RL) has emerged as a promising approach for complex robot control [12]- [14] including mobile manipulators [15]- [18]. As it can be less sensitive to hardware calibration [19], generate robot actions directly from low-level sensor observations, and learn the policy by itself without much prior knowledge and fine-tuning [20], it is suitable for tasks such as door opening. ...
Preprint
The ability of robots to navigate through doors is crucial for their effective operation in indoor environments. Consequently, extensive research has been conducted to develop robots capable of opening specific doors. However, the diverse combinations of door handles and opening directions necessitate a more versatile door opening system for robots to successfully operate in real-world environments. In this paper, we propose a mobile manipulator system that can autonomously open various doors without prior knowledge. By using convolutional neural networks, point cloud extraction techniques, and external force measurements during exploratory motion, we obtained information regarding handle types, poses, and door characteristics. Through two different approaches, adaptive position-force control and deep reinforcement learning, we successfully opened doors without precise trajectory or excessive external force. The adaptive position-force control method involves moving the end-effector in the direction of the door opening while responding compliantly to external forces, ensuring safety and manipulator workspace. Meanwhile, the deep reinforcement learning policy minimizes applied forces and eliminates unnecessary movements, enabling stable operation across doors with different poses and widths. The RL-based approach outperforms the adaptive position-force control method in terms of compensating for external forces, ensuring smooth motion, and achieving efficient speed. It reduces the maximum force required by 3.27 times and improves motion smoothness by 1.82 times. However, the non-learning-based adaptive position-force control method demonstrates more versatility in opening a wider range of doors, encompassing revolute doors with four distinct opening directions and varying widths.
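A minimal sketch of the adaptive position-force idea summarized above (advance the end-effector along the nominal opening direction while yielding compliantly to the measured external force) is given below; the gains, units, and interfaces are illustrative assumptions rather than the authors' controller.

```python
# Hedged sketch of a simple admittance-style update (assumed gains and interfaces):
# feed-forward motion along the door-opening direction plus compliant yielding
# to the measured external force.
import numpy as np

def admittance_step(ee_pos: np.ndarray,
                    opening_dir: np.ndarray,
                    external_force: np.ndarray,
                    step: float = 0.005,
                    compliance: float = 0.002) -> np.ndarray:
    """One control tick: nominal motion along the door, plus yielding to force."""
    opening_dir = opening_dir / np.linalg.norm(opening_dir)
    nominal = step * opening_dir                     # feed-forward opening motion
    yielding = compliance * external_force           # move with the measured force
    return ee_pos + nominal + yielding
```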
... Multi-task robotic learning has also been approached from the perspective of learning to reach goals [5,49,27,19], as well as learning policies that can perform tasks in a discrete set or some other parameterized form [7,8,13,29]. Mobile manipulation policy learning has also been studied, primarily via reinforcement learning [32,26,67,37,65,20,18,11,69], and recently through foundation models [3,38,66]. Herein, we add to this line of work, providing evidence that it is possible to learn simple mobile manipulation tasks in an end-to-end fashion with large-scale Transformer policies. ...
... Prior robot learning methods have achieved success in mobile manipulation in simulation [47][48][49] or the real world [50][51][52][53][54][55][56][57][58][59]. Some of these methods collect data in a continuous, online manner [50,52,53], while others break data collection into primitives [54,55] to learn to perform long-horizon mobile manipulation. ...
Preprint
In this paper, we propose a method to create visuomotor mobile manipulation solutions for long-horizon activities. We propose to leverage the recent advances in simulation to train visual solutions for mobile manipulation. While previous works have shown success applying this procedure to autonomous visual navigation and stationary manipulation, applying it to long-horizon visuomotor mobile manipulation is still an open challenge that demands both perceptual and compositional generalization of multiple skills. In this work, we develop Mobile-EMBER, or M-EMBER, a factorized method that decomposes a long-horizon mobile manipulation activity into a repertoire of primitive visual skills, reinforcement-learns each skill, and composes these skills to a long-horizon mobile manipulation activity. On a mobile manipulation robot, we find that M-EMBER completes a long-horizon mobile manipulation activity, cleaning_kitchen, achieving a 53% success rate. This requires successfully planning and executing five factorized, learned visual skills.
... A second group of methods directly uses reinforcement learning over the entire action space to learn a policy. This enables the robot to utilize simultaneously all of its actuation capabilities but, when applied directly, is limited in the complexity of tasks and environments it can handle [52], with performance quickly deteriorating as tasks get more complex [17]. More complex MoMa tasks can be tackled by Honerkamp et al. [12], who learn base trajectories to adapt to given MoMa trajectories. ...
Preprint
Developing the next generation of household robot helpers requires combining locomotion and interaction capabilities, which is generally referred to as mobile manipulation (MoMa). MoMa tasks are difficult due to the large action space of the robot and the common multi-objective nature of the task, e.g., efficiently reaching a goal while avoiding obstacles. Current approaches often segregate tasks into navigation without manipulation and stationary manipulation without locomotion by manually matching parts of the action space to MoMa sub-objectives (e.g. base actions for locomotion objectives and arm actions for manipulation). This solution prevents simultaneous combinations of locomotion and interaction degrees of freedom and requires human domain knowledge for both partitioning the action space and matching the action parts to the sub-objectives. In this paper, we introduce Causal MoMa, a new framework to train policies for typical MoMa tasks that makes use of the most favorable subspace of the robot's action space to address each sub-objective. Causal MoMa automatically discovers the causal dependencies between actions and terms of the reward function and exploits these dependencies in a causal policy learning procedure that reduces gradient variance compared to previous state-of-the-art policy gradient algorithms, improving convergence and results. We evaluate the performance of Causal MoMa on three types of simulated robots across different MoMa tasks and demonstrate success in transferring the policies trained in simulation directly to a real robot, where our agent is able to follow moving goals and react to dynamic obstacles while simultaneously and synergistically controlling the whole-body: base, arm, and head. More information at https://sites.google.com/view/causal-moma.