DOE-SLAM: Dynamic Object Enhanced Visual SLAM

Sensors ◽  
2021 ◽  
Vol 21 (9) ◽  
pp. 3091
Author(s):  
Xiao Hu ◽  
Jochen Lang

In this paper, we formulate a novel strategy to adapt monocular-vision-based simultaneous localization and mapping (vSLAM) to dynamic environments. When enough background features can be captured, our system not only tracks the camera trajectory based on static background features but also estimates the foreground object motion from object features. In cases when a moving object obstructs too many background features for successful camera tracking from the background, our system can exploit the features from the object and the prediction of the object motion to estimate the camera pose. We use various synthetic and real-world test scenarios and the well-known TUM sequences to evaluate the capabilities of our system. The experiments show that we achieve higher pose estimation accuracy and robustness over state-of-the-art monocular vSLAM systems.
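
The fallback strategy described above can be sketched as a simple selection rule; the threshold and helper names below are illustrative assumptions, not taken from the paper:

```python
# Hypothetical sketch of DOE-SLAM's tracking-source selection: track the
# camera from static background features when enough are visible; otherwise
# fall back to object features plus the predicted object motion.

MIN_BACKGROUND_FEATURES = 50  # assumed threshold

def select_tracking_source(background_features, object_features):
    """Return which feature set should drive camera pose estimation."""
    if len(background_features) >= MIN_BACKGROUND_FEATURES:
        return "background"   # standard static-world tracking
    if object_features:
        return "object"       # pose from object features + motion prediction
    return "lost"             # neither source is usable

print(select_tracking_source(list(range(80)), list(range(10))))  # background
print(select_tracking_source(list(range(12)), list(range(40))))  # object
```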

2021 ◽  
Vol 8 ◽  
Author(s):  
Chen-Lung Lu ◽  
Zi-Yan Liu ◽  
Jui-Te Huang ◽  
Ching-I Huang ◽  
Bo-Hui Wang ◽  
...  

Facilitating navigation in pedestrian environments is critical for enabling people who are blind and visually impaired (BVI) to achieve independent mobility. A deep reinforcement learning (DRL)–based assistive guiding robot with ultrawide-bandwidth (UWB) beacons that can navigate through routes with designated waypoints was designed in this study. Typically, a simultaneous localization and mapping (SLAM) framework is used to estimate the robot pose and navigational goal; however, SLAM frameworks are vulnerable in certain dynamic environments. The proposed navigation method is a learning approach based on state-of-the-art DRL and can effectively avoid obstacles. When used with UWB beacons, the proposed strategy is suitable for environments with dynamic pedestrians. We also designed a handle device with an audio interface that enables BVI users to interact with the guiding robot through intuitive feedback. The UWB beacons were installed with an audio interface to obtain environmental information. The on-handle and on-beacon verbal feedback provides points of interests and turn-by-turn information to BVI users. BVI users were recruited in this study to conduct navigation tasks in different scenarios. A route was designed in a simulated ward to represent daily activities. In real-world situations, SLAM-based state estimation might be affected by dynamic obstacles, and the visual-based trail may suffer from occlusions from pedestrians or other obstacles. The proposed system successfully navigated through environments with dynamic pedestrians, in which systems based on existing SLAM algorithms have failed.


2021 ◽  
Vol 33 (6) ◽  
pp. 1385-1397
Author(s):  
Leyuan Sun ◽  
Rohan P. Singh ◽  
Fumio Kanehiro ◽  
...  

Most simultaneous localization and mapping (SLAM) systems assume that SLAM is conducted in a static environment. When SLAM is used in dynamic environments, the accuracy of each part of the SLAM system is adversely affected. We term this problem dynamic SLAM. In this study, we propose solutions for three main problems in dynamic SLAM: camera tracking, three-dimensional map reconstruction, and loop closure detection. We propose to employ a geometry-based method, a deep-learning-based method, and a combination of the two for object segmentation. Using the segmentation information to generate a mask, we filter out both the keypoints that lead to errors in visual odometry and the CNN-extracted features from dynamic areas, improving the performance of loop closure detection. Then, we validate our proposed loop closure detection method using the precision-recall curve and also confirm the framework’s performance using multiple datasets. The absolute trajectory error and relative pose error are used as metrics to evaluate the accuracy of the proposed SLAM framework in comparison with state-of-the-art methods. The findings of this study can potentially improve the robustness of SLAM technology in situations where mobile robots work together with humans, while the object-based point cloud byproduct has potential uses in other robotics tasks.
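
The masking step described above amounts to rejecting keypoints that fall inside the segmented dynamic region before visual odometry runs. A minimal sketch, assuming a binary grid mask and (x, y) keypoints purely for illustration:

```python
# Keypoints whose mask cell is 1 (dynamic, e.g. a moving person) are
# discarded; only static-background keypoints feed visual odometry.

def filter_keypoints(keypoints, dynamic_mask):
    """Keep only keypoints (x, y) whose mask cell is 0 (static)."""
    return [(x, y) for (x, y) in keypoints if dynamic_mask[y][x] == 0]

# 4x4 mask: 1 marks the dynamic object, 0 is static background.
mask = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
kps = [(0, 0), (1, 1), (2, 2), (3, 3)]
print(filter_keypoints(kps, mask))  # [(0, 0), (3, 3)]
```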


Author(s):  
Zewen Xu ◽  
Zheng Rong ◽  
Yihong Wu

In recent years, simultaneous localization and mapping in dynamic environments (dynamic SLAM) has attracted significant attention from both academia and industry. Some pioneering work on this technique has expanded the potential of robotic applications. Compared to standard SLAM under the static world assumption, dynamic SLAM divides features into static and dynamic categories and leverages each type of feature properly. Therefore, dynamic SLAM can provide more robust localization for intelligent robots that operate in complex dynamic environments. Additionally, to meet the demands of some high-level tasks, dynamic SLAM can be integrated with multiple object tracking. This article presents a survey on dynamic SLAM from the perspective of feature choices. A discussion of the advantages and disadvantages of different visual features is provided in this article.


Sensors ◽  
2021 ◽  
Vol 21 (1) ◽  
pp. 230
Author(s):  
Xiangwei Dang ◽  
Zheng Rong ◽  
Xingdong Liang

Accurate localization and reliable mapping are essential for the autonomous navigation of robots. As one of the core technologies for autonomous navigation, Simultaneous Localization and Mapping (SLAM) has attracted widespread attention in recent decades. Based on vision or LiDAR sensors, great efforts have been devoted to achieving real-time SLAM that can support a robot’s state estimation. However, most mature SLAM methods generally work under the assumption that the environment is static, while in dynamic environments they yield degraded performance or even fail. In this paper, we first quantitatively evaluate the performance of state-of-the-art LiDAR-based SLAM methods, taking into account different patterns of moving objects in the environment. Through semi-physical simulation, we observed that the shape, size, and distribution of moving objects can all significantly impact the performance of SLAM, and we obtained instructive results from a quantitative comparison between LOAM and LeGO-LOAM. Secondly, based on the above investigation, we propose a novel approach named EMO for eliminating moving objects in SLAM by fusing LiDAR and mmW-radar, towards improving the accuracy and robustness of state estimation. The method fully exploits the complementary characteristics of the two sensors to fuse sensor information with two different resolutions. Moving objects can be efficiently detected by radar based on the Doppler effect, accurately segmented and localized by LiDAR, and then filtered out of the point clouds through data association with accurate synchronization in time and space. Finally, the point clouds representing the static environment are used as the input to SLAM. The proposed approach is evaluated through experiments using both semi-physical simulation and real-world datasets. The results demonstrate the effectiveness of the method at improving SLAM accuracy (a decrease of at least 30% in absolute position error) and robustness in dynamic environments.
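
The Doppler-based filtering idea can be sketched as follows; the velocity threshold, association gate, and data layout are assumptions for illustration, not the paper's actual parameters:

```python
# Radar returns whose radial velocity exceeds a threshold flag a moving
# object; LiDAR points near a flagged radar return are dropped before SLAM.

import math

DOPPLER_THRESH = 0.5   # m/s, assumed
ASSOC_RADIUS = 1.0     # m, assumed data-association gate

def remove_moving_points(lidar_points, radar_returns):
    """radar_returns: list of (x, y, radial_velocity) tuples."""
    movers = [(x, y) for (x, y, v) in radar_returns if abs(v) > DOPPLER_THRESH]
    def near_mover(p):
        return any(math.hypot(p[0] - mx, p[1] - my) < ASSOC_RADIUS
                   for mx, my in movers)
    return [p for p in lidar_points if not near_mover(p)]

lidar = [(0.0, 0.0), (5.0, 5.0), (5.2, 5.1)]
radar = [(5.0, 5.0, 2.3), (0.0, 10.0, 0.0)]  # one moving, one static return
print(remove_moving_points(lidar, radar))    # [(0.0, 0.0)]
```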


2021 ◽  
Vol 17 (2) ◽  
pp. 1-22
Author(s):  
Jingao Xu ◽  
Erqun Dong ◽  
Qiang Ma ◽  
Chenshu Wu ◽  
Zheng Yang

Existing indoor navigation solutions usually require pre-deployed comprehensive location services with precise indoor maps and, more importantly, all rely on dedicatedly installed or existing infrastructure. In this article, we present Pair-Navi, an infrastructure-free indoor navigation system that circumvents all these requirements by reusing a previous traveler’s (i.e., leader) trace experience to navigate future users (i.e., followers) in a peer-to-peer mode. Our system leverages advances in visual simultaneous localization and mapping (SLAM) on commercial smartphones. Visual SLAM systems, however, are vulnerable to environmental dynamics in terms of precision and robustness, and involve intensive computation that prohibits real-time applications. To combat environmental changes, we propose to cull non-rigid contexts and keep only the static and rigid contents in use. To enable real-time navigation on mobiles, we decouple and reorganize the highly coupled SLAM modules for leaders and followers. We implement Pair-Navi on commodity smartphones and validate its performance in three diverse buildings and on two standard datasets (TUM and KITTI). Our results show that Pair-Navi achieves an immediate navigation success rate of 98.6%, which remains at 83.4% even 2 weeks after the leaders’ traces were collected, outperforming state-of-the-art solutions by >50%. Being truly infrastructure-free, Pair-Navi sheds light on practical indoor navigation for mobile users.
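
The "cull non-rigid contexts" step amounts to discarding features whose semantic class is unlikely to stay put between the leader's and the follower's visits. A hedged sketch, where the class list and feature representation are illustrative assumptions:

```python
# Features labelled with a non-rigid class (people, doors, screens, ...)
# are removed so that only static, rigid content is reused across traces.

NON_RIGID_CLASSES = {"person", "door", "screen", "chair"}  # assumed list

def cull_non_rigid(features):
    """features: list of (descriptor_id, semantic_label) pairs."""
    return [f for f in features if f[1] not in NON_RIGID_CLASSES]

feats = [(1, "wall"), (2, "person"), (3, "pillar"), (4, "screen")]
print(cull_non_rigid(feats))  # [(1, 'wall'), (3, 'pillar')]
```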


2021 ◽  
Vol 2021 (1) ◽  
pp. 37-50
Author(s):  
A.A. Fokov ◽  
O.P. Savchuk
The realization of existing projects of on-orbit servicing and the development of new ones is a steady trend in the development of space technology. In many cases, on-orbit service clients are objects that exhibit an undesired rotary motion, which renders their servicing difficult or impossible. The problem of on-orbit service object motion control determines the topicality of studies aimed not only at the refinement of methods and algorithms of controlling both the translational and the rotary motion of an object, but also at the development and refinement of methods of onboard determination of the object – service spacecraft relative motion parameters. This paper overviews the state of the art of the problem of object motion parameter determination in on-orbit servicing tasks and existing methods of object motion control and angular motion damping and specifies lines of further investigations into the angular motion control of non-cooperative service objects. Based on the analysis of publications on the subject, the applicability of onboard means for object motion parameter determination is characterized. The analysis of the applicability of methods of remote determination of the parameters of an unknown non-cooperative object from a service spacecraft shows that they are at the research stage. The input data for the verification of methods proposed in the literature were simulated or taken from ground experiments or previous missions. Contact and contactless methods of angular motion control of non-cooperative on-orbit service objects are considered. From the state of the art of investigations into the contactless motion control of on-orbit service objects it may be concluded that the most advanced contactless method of motion control of an on-orbit service object is a technology based on the use of an ion beam directed to the object from an electrojet engine onboard a service spacecraft. Lines of further investigations into non-cooperative object motion control are proposed.


Author(s):  
Sajad Badalkhani ◽  
Ramazan Havangi ◽  
Mohsen Farshad

There is an extensive literature on multi-robot simultaneous localization and mapping (MRSLAM). In most of this research, the environment is assumed to be static, while the dynamic parts of the environment degrade the estimation quality of SLAM algorithms and lead to inherently fragile systems. To enhance the performance and robustness of SLAM in dynamic environments (SLAMIDE), a novel cooperative approach named parallel-map (p-map) SLAM is introduced in this paper. The objective of the proposed method is to deal with the dynamics of the environment by detecting dynamic parts and preventing their inclusion in SLAM estimations. In this approach, each robot builds a limited map in its own vicinity, while the global map is built through a hybrid centralized MRSLAM. The restricted size of the local maps bounds the computational complexity and resources needed to handle a large-scale dynamic environment. Using a probabilistic index, the proposed method differentiates between stationary and moving landmarks based on their positions relative to other parts of the environment. Stationary landmarks are then used to refine a consistent map. The proposed method is evaluated with different levels of dynamism, and for each level, the performance is measured in terms of accuracy, robustness, and the hardware resources needed for implementation. The method is also evaluated on a publicly available real-world dataset. Experimental validation along with simulations indicates that the proposed method is able to perform consistent SLAM in a dynamic environment, suggesting its feasibility for MRSLAM applications.
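
The stationary/moving test based on relative positions can be sketched as a variance check: a landmark whose offset to the rest of the map drifts over time is labelled moving. The scalar-offset representation and the threshold below are assumptions for illustration:

```python
# A landmark is treated as stationary when the variance of its offset to a
# reference landmark set stays below a threshold across observations.

VAR_THRESH = 0.05  # assumed

def is_stationary(relative_positions):
    """relative_positions: samples over time of the landmark's scalar
    offset (distance) to a reference landmark set."""
    n = len(relative_positions)
    mean = sum(relative_positions) / n
    var = sum((d - mean) ** 2 for d in relative_positions) / n
    return var < VAR_THRESH

print(is_stationary([2.00, 2.01, 1.99, 2.00]))  # True  (stable offset)
print(is_stationary([2.0, 2.5, 3.1, 3.8]))      # False (drifting offset)
```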


2020 ◽  
Vol 12 (18) ◽  
pp. 3007 ◽  
Author(s):  
Bo Liu ◽  
Shihong Du ◽  
Shouji Du ◽  
Xiuyuan Zhang

The fast and accurate creation of land use/land cover maps from very-high-resolution (VHR) remote sensing imagery is crucial for urban planning and environmental monitoring. Geographic object-based image analysis (GEOBIA) methods provide an effective solution by using image objects instead of individual pixels in VHR remote sensing imagery analysis. Simultaneously, convolutional neural networks (CNN) have been widely used in the image processing field because of their powerful feature extraction capabilities. This study presents a patch-based strategy for integrating deep features into GEOBIA for VHR remote sensing imagery classification. To extract deep features from irregular image objects through a CNN, a patch-based approach is proposed for representing image objects and learning patch-based deep features, and a deep feature aggregation method is proposed for aggregating patch-based deep features into object-based deep features. Finally, both object and deep features are integrated into a GEOBIA paradigm for classifying image objects. We explored the influences of segmentation scales and patch sizes in our method and explored the effectiveness of deep and object features in classification. Moreover, we performed 5-fold stratified cross-validation 50 times to explore the uncertainty of our method. Additionally, we explored the importance of deep feature aggregation, and we evaluated our method by comparing it with three state-of-the-art methods on a Beijing dataset and a Zurich dataset. The results indicate that smaller segmentation scales were more conducive to VHR remote sensing imagery classification, and that patches should be neither too large nor too small, as the patch size should be determined by the imagery and its resolution. Moreover, we found that deep features are more effective than object features, while object features still matter for image classification, and deep feature aggregation is a critical step in our method. Finally, our method achieves the highest overall accuracies compared with the state-of-the-art methods: 91.21% for the Beijing dataset and 99.05% for the Zurich dataset.
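
The aggregation step can be illustrated as pooling per-patch CNN feature vectors into one object-level vector. Mean pooling is one plausible choice used here purely for illustration; the paper's exact aggregation scheme may differ:

```python
# An irregular image object is covered by fixed-size patches, each patch
# yields a CNN feature vector, and the object-level deep feature is taken
# here as their element-wise mean.

def aggregate_patch_features(patch_features):
    """patch_features: list of equal-length feature vectors (lists of floats)."""
    n = len(patch_features)
    dim = len(patch_features[0])
    return [sum(v[i] for v in patch_features) / n for i in range(dim)]

patches = [[1.0, 0.0, 2.0], [3.0, 4.0, 2.0]]
print(aggregate_patch_features(patches))  # [2.0, 2.0, 2.0]
```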


Electronics ◽  
2018 ◽  
Vol 7 (11) ◽  
pp. 305 ◽  
Author(s):  
Seyyed Hoseini ◽  
Peyman Kabiri

Camera tracking and the construction of a robust and accurate map in unknown environments are still challenging tasks in computer vision and robotic applications. Visual Simultaneous Localization and Mapping (SLAM) along with Augmented Reality (AR) are two important applications, and their performance is entirely dependent on the accuracy of the camera tracking routine. This paper presents a novel feature-based approach for the monocular SLAM problem using a hand-held camera in room-sized workspaces with a maximum scene depth of 4–5 m. At the core of the proposed method, there is a Particle Filter (PF) responsible for the estimation of extrinsic parameters of the camera. In addition, contrary to key-frame based methods, the proposed system tracks the camera frame by frame and constructs a robust and accurate map incrementally. Moreover, the proposed algorithm initially constructs a metric sparse map. To this end, a chessboard pattern with a known cell size has been placed in front of the camera for a few frames. This enables the algorithm to accurately compute the pose of the camera, and therefore the depths of the initially detected natural feature points are easily calculated. Afterwards, camera pose estimation for each new incoming frame is carried out in a framework that works solely with a set of visible natural landmarks. Moreover, to recover the depth of newly detected landmarks, a delayed approach based on linear triangulation is used. The proposed method is applied to a real-world VGA quality video (640 × 480 pixels) where the translation error of the camera pose is less than 2 cm on average and the orientation error is less than 3 degrees, which indicates the effectiveness and accuracy of the developed algorithm.
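
The delayed triangulation idea can be illustrated with a ray-intersection sketch: once a feature has been observed from two sufficiently separated camera poses, the landmark is placed where the two viewing rays come closest. A pure midpoint method stands in here for the paper's linear triangulation, and the geometry is an illustrative assumption:

```python
# Midpoint-of-closest-approach triangulation for rays p1 + t*d1 and
# p2 + s*d2, given two camera centers (p1, p2) and ray directions (d1, d2).

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def triangulate(p1, d1, p2, d2):
    """Return the midpoint of closest approach of the two rays."""
    a, b, c = dot(d1, d1), dot(d1, d2), dot(d2, d2)
    w0 = [x - y for x, y in zip(p1, p2)]
    d, e = dot(d1, w0), dot(d2, w0)
    denom = a * c - b * b          # zero only for parallel rays
    t = (b * e - c * d) / denom
    s = (a * e - b * d) / denom
    q1 = [p + t * u for p, u in zip(p1, d1)]   # closest point on ray 1
    q2 = [p + s * u for p, u in zip(p2, d2)]   # closest point on ray 2
    return [(x + y) / 2 for x, y in zip(q1, q2)]

# Two views of a landmark at (0, 0, 5): the rays intersect exactly.
print(triangulate([0, 0, 0], [0, 0, 1], [5, 0, 0], [-5, 0, 5]))  # [0.0, 0.0, 5.0]
```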

