scholarly journals Adaptive Quadruped Balance Control for Dynamic Environments Using Maximum-Entropy Reinforcement Learning

Sensors ◽  
2021 ◽  
Vol 21 (17) ◽  
pp. 5907
Author(s):  
Haoran Sun ◽  
Tingting Fu ◽  
Yuanhuai Ling ◽  
Chaoming He

External disturbance poses the primary threat to robot balance in dynamic environments. This paper provides a learning-based control architecture for quadrupedal self-balancing, which is adaptable to multiple unpredictable scenes of external continuous disturbance. Different from conventional methods which construct analytical models which explicitly reason the balancing process, our work utilized reinforcement learning and artificial neural network to avoid incomprehensible mathematical modeling. The control policy is composed of a neural network and a Tanh Gaussian policy, which implicitly establishes the fuzzy mapping from proprioceptive signals to action commands. During the training process, the maximum-entropy method (soft actor-critic algorithm) is employed to endow the policy with powerful exploration and generalization ability. The trained policy is validated in both simulations and realistic experiments with a customized quadruped robot. The results demonstrate that the policy can be easily transferred to the real world without elaborate configurations. Moreover, although this policy is trained in merely one specific vibration condition, it demonstrates robustness under conditions that were never encountered during training.

2021 ◽  
Vol 11 (7) ◽  
pp. 3257
Author(s):  
Chen-Huan Pi ◽  
Wei-Yuan Ye ◽  
Stone Cheng

In this paper, a novel control strategy is presented for reinforcement learning with disturbance compensation to solve the problem of quadrotor positioning under external disturbance. The proposed control scheme applies a trained neural-network-based reinforcement learning agent to control the quadrotor, and its output is directly mapped to four actuators in an end-to-end manner. The proposed control scheme constructs a disturbance observer to estimate the external forces exerted on the three axes of the quadrotor, such as wind gusts in an outdoor environment. By introducing an interference compensator into the neural network control agent, the tracking accuracy and robustness were significantly increased in indoor and outdoor experiments. The experimental results indicate that the proposed control strategy is highly robust to external disturbances. In the experiments, compensation improved control accuracy and reduced positioning error by 75%. To the best of our knowledge, this study is the first to achieve quadrotor positioning control through low-level reinforcement learning by using a global positioning system in an outdoor environment.


1996 ◽  
Vol 51 (5-6) ◽  
pp. 337-347 ◽  
Author(s):  
Mariusz Maćkowiak ◽  
Piotr Kątowski

Abstract Two-dimensional zero-field nutation NQR spectroscopy has been used to determine the full quadrupolar tensor of spin - 3/2 nuclei in serveral molecular crystals containing the 3 5 Cl and 7 5 As nuclei. The problems of reconstructing 2D-nutation NQR spectra using conventional methods and the advantages of using implementation of the maximum entropy method (MEM) are analyzed. It is shown that the replacement of conventional Fourier transform by an alternative data processing by MEM in 2D NQR spectroscopy leads to sensitivity improvement, reduction of instrumental artefacts and truncation errors, shortened data acquisition times and suppression of noise, while at the same time increasing the resolution. The effects of off-resonance irradiation in nutation experiments are demonstrated both experimentally and theoretically. It is shown that off-resonance nutation spectroscopy is a useful extension of the conventional on-resonance experiments, thus facilitating the determination of asymmetry parameters in multiple spectrum. The theoretical description of the off-resonance effects in 2D nutation NQR spectroscopy is given, and general exact formulas for the asymmetry parameter are obtained. In off-resonance conditions, the resolution of the nutation NQR spectrum decreases with the spectrometer offset. However, an enhanced resolution can be achieved by using the maximum entropy method in 2D-data reconstruction.


Geophysics ◽  
2003 ◽  
Vol 68 (4) ◽  
pp. 1417-1422 ◽  
Author(s):  
Danilo R. Velis

The distribution of primary reflection coefficients can be estimated by means of the maximum entropy method, giving rise to smooth nonparametric functions which are consistent with the data. Instead of using classical moments (e.g. skewness and kurtosis) to constraint the maximization, nonconventional sample statistics help to improve the quality of the estimates. Results using real log data from various wells located in the Neuquen Basin (Argentina) show the effectiveness of the method to estimate both robust and consistent distributions that may be used to simulate realistic sequences.


1987 ◽  
Vol 4 (1) ◽  
pp. 78-82 ◽  
Author(s):  
B. C. De Simone ◽  
F. De Luca ◽  
B. Maraviglia

Sign in / Sign up

Export Citation Format

Share Document