Differentially Private Actor and Its Eligibility Trace

Kanghyeon Seo; Jihoon Yang

doi:10.3390/electronics9091486

Differentially Private Actor and Its Eligibility Trace

Electronics ◽

10.3390/electronics9091486 ◽

2020 ◽

Vol 9 (9) ◽

pp. 1486

Author(s):

Kanghyeon Seo ◽

Jihoon Yang

Keyword(s):

Real World ◽

Autonomous Navigation ◽

Differential Privacy ◽

Cosine Similarity ◽

Gradient Algorithm ◽

Sensitive Data ◽

Private Actor ◽

Policy Gradient ◽

Eligibility Trace ◽

Real World Problems

We present a differentially private actor and its eligibility trace in an actor-critic approach, wherein an actor takes actions directly interacting with an environment; however, the critic estimates only the state values that are obtained through bootstrapping. In other words, the actor reflects the more detailed information about the sequence of taken actions on its parameter than the critic. Moreover, their corresponding eligibility traces have the same properties. Therefore, it is necessary to preserve the privacy of an actor and its eligibility trace while training on private or sensitive data. In this paper, we confirm the applicability of differential privacy methods to the actors updated using the policy gradient algorithm and discuss the advantages of such an approach with regard to differentially private critic learning. In addition, we measured the cosine similarity between the differentially private applied eligibility trace and the non-differentially private eligibility trace to analyze whether their anonymity is appropriately protected in the differentially private actor or the critic. We conducted the experiments considering two synthetic examples imitating real-world problems in medical and autonomous navigation domains, and the results confirmed the feasibility of the proposed method.

Download Full-text

A randomized block policy gradient algorithm with differential privacy in Content Centric Networks

International Journal of Distributed Sensor Networks ◽

10.1177/15501477211059934 ◽

2021 ◽

Vol 17 (12) ◽

pp. 155014772110599

Author(s):

Lin Wang ◽

Xingang Xu ◽

Xuhui Zhao ◽

Baozhu Li ◽

Ruijuan Zheng ◽

...

Keyword(s):

Privacy Protection ◽

Differential Privacy ◽

Effective Means ◽

High Dimensional Data ◽

Computational Cost ◽

Gradient Methods ◽

Multimedia Data ◽

Gradient Algorithm ◽

High Dimensional ◽

Policy Gradient

Policy gradient methods are effective means to solve the problems of mobile multimedia data transmission in Content Centric Networks. Current policy gradient algorithms impose high computational cost in processing high-dimensional data. Meanwhile, the issue of privacy disclosure has not been taken into account. However, privacy protection is important in data training. Therefore, we propose a randomized block policy gradient algorithm with differential privacy. In order to reduce computational complexity when processing high-dimensional data, we randomly select a block coordinate to update the gradients at each round. To solve the privacy protection problem, we add a differential privacy protection mechanism to the algorithm, and we prove that it preserves the [Formula: see text]-privacy level. We conduct extensive simulations in four environments, which are CartPole, Walker, HalfCheetah, and Hopper. Compared with the methods such as important-sampling momentum-based policy gradient, Hessian-Aided momentum-based policy gradient, REINFORCE, the experimental results of our algorithm show a faster convergence rate than others in the same environment.

Download Full-text

The Discrete Gaussian Expectation Maximization (Gradient) Algorithm for Differential Privacy

Computational Intelligence and Neuroscience ◽

10.1155/2021/7962489 ◽

2021 ◽

Vol 2021 ◽

pp. 1-13

Author(s):

Weisan Wu

Keyword(s):

Em Algorithm ◽

Expectation Maximization ◽

Differential Privacy ◽

High Dimensional Data ◽

Gradient Algorithm ◽

High Dimensional ◽

Sensitive Data ◽

The Difference

In this paper, we give a modified gradient EM algorithm; it can protect the privacy of sensitive data by adding discrete Gaussian mechanism noise. Specifically, it makes the high-dimensional data easier to process mainly by scaling, truncating, noise multiplication, and smoothing steps on the data. Since the variance of discrete Gaussian is smaller than that of the continuous Gaussian, the difference privacy of data can be guaranteed more effectively by adding the noise of the discrete Gaussian mechanism. Finally, the standard gradient EM algorithm, clipped algorithm, and our algorithm (DG-EM) are compared with the GMM model. The experiments show that our algorithm can effectively protect high-dimensional sensitive data.

Download Full-text

Multiagent Decision Making For Maritime Traffic Management

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33016171 ◽

2019 ◽

Vol 33 ◽

pp. 6171-6178

Author(s):

Arambam James Singh ◽

Duc Thien Nguyen ◽

Akshat Kumar ◽

Hoong Chuin Lau

Keyword(s):

Real World ◽

Traffic Management ◽

Scale Up ◽

Traffic Data ◽

Maritime Traffic ◽

Policy Gradient ◽

Collective Interactions ◽

Gradient Approach ◽

Domain Constraints ◽

Real World Problems

We address the problem of maritime traffic management in busy waterways to increase the safety of navigation by reducing congestion. We model maritime traffic as a large multiagent systems with individual vessels as agents, and VTS authority as the regulatory agent. We develop a maritime traffic simulator based on historical traffic data that incorporates realistic domain constraints such as uncertain and asynchronous movement of vessels. We also develop a traffic coordination approach that provides speed recommendation to vessels in different zones. We exploit the nature of collective interactions among agents to develop a scalable policy gradient approach that can scale up to real world problems. Empirical results on synthetic and real world problems show that our approach can significantly reduce congestion while keeping the traffic throughput high.

Download Full-text

How psychologists help solve real-world problems in multidisciplinary research teams: Introduction to the special issue.

American Psychologist ◽

10.1037/amp0000458 ◽

2019 ◽

Vol 74 (3) ◽

pp. 271-277 ◽

Cited By ~ 4

Author(s):

Robert W. Proctor ◽

Kim-Phuong L. Vu

Keyword(s):

Real World ◽

Special Issue ◽

Multidisciplinary Research ◽

Research Teams ◽

Real World Problems

Download Full-text

Psychological science applied to real world problems

PsycEXTRA Dataset ◽

10.1037/e597642012-001 ◽

2012 ◽

Author(s):

Danny Wedding

Keyword(s):

Real World ◽

Psychological Science ◽

Real World Problems

Download Full-text

Analysis of elementary school pre-service teachers' responses to real-world problems and their case studies: Focusing on finding octagonal pavilion floor area

Korean Association For Learner-Centered Curriculum And Instruction ◽

10.22251/jlcci.2020.20.10.1061 ◽

2020 ◽

Vol 20 (10) ◽

pp. 1061-1083

Author(s):

Sang Hun Song

Keyword(s):

Elementary School ◽

Case Studies ◽

Real World ◽

Floor Area ◽

Real World Problems

Download Full-text

Handrails through the Swamp? A Pilot to Test the Integration and Implementation Science Framework in Complex Real-World Research

Sustainability ◽

10.3390/su13105491 ◽

2021 ◽

Vol 13 (10) ◽

pp. 5491

Author(s):

Melissa Robson-Williams ◽

Bruce Small ◽

Roger Robson-Williams ◽

Nick Kirk

Keyword(s):

Pilot Study ◽

Case Studies ◽

Implementation Science ◽

Real World ◽

Environmental Problems ◽

Environmental Research ◽

Integrative Framework ◽

The World ◽

Science Framework ◽

Real World Problems

The socio-environmental challenges the world faces are ‘swamps’: situations that are messy, complex, and uncertain. The aim of this paper is to help disciplinary scientists navigate these swamps. To achieve this, the paper evaluates an integrative framework designed for researching complex real-world problems, the Integration and Implementation Science (i2S) framework. As a pilot study, we examine seven inter and transdisciplinary agri-environmental case studies against the concepts presented in the i2S framework, and we hypothesise that considering concepts in the i2S framework during the planning and delivery of agri-environmental research will increase the usefulness of the research for next users. We found that for the types of complex, real-world research done in the case studies, increasing attention to the i2S dimensions correlated with increased usefulness for the end users. We conclude that using the i2S framework could provide handrails for researchers, to help them navigate the swamps when engaging with the complexity of socio-environmental problems.

Download Full-text

Combined Games with Randomly Delayed Beginnings

Mathematics ◽

10.3390/math9050534 ◽

2021 ◽

Vol 9 (5) ◽

pp. 534

Author(s):

F. Thomas Bruss

Keyword(s):

Discrete Time ◽

Real World ◽

Optimal Stopping ◽

Random Variables ◽

Approximate Solutions ◽

Real World Problems

This paper presents two-person games involving optimal stopping. As far as we are aware, the type of problems we study are new. We confine our interest to such games in discrete time. Two players are to chose, with randomised choice-priority, between two games G1 and G2. Each game consists of two parts with well-defined targets. Each part consists of a sequence of random variables which determines when the decisive part of the game will begin. In each game, the horizon is bounded, and if the two parts are not finished within the horizon, the game is lost by definition. Otherwise the decisive part begins, on which each player is entitled to apply their or her strategy to reach the second target. If only one player achieves the two targets, this player is the winner. If both win or both lose, the outcome is seen as “deuce”. We motivate the interest of such problems in the context of real-world problems. A few representative problems are solved in detail. The main objective of this article is to serve as a preliminary manual to guide through possible approaches and to discuss under which circumstances we can obtain solutions, or approximate solutions.

Download Full-text

Review of Network Flow Algorithms David P. Williamson

ACM SIGACT News ◽

10.1145/3457588.3457592 ◽

2021 ◽

Vol 52 (1) ◽

pp. 12-15

Author(s):

S.V. Nagaraj

Keyword(s):

Graduate Students ◽

Real World ◽

Network Flow ◽

Network Flows ◽

Optimization Problems ◽

Capacity Constraints ◽

Incoming Flow ◽

Network Flow Problems ◽

Flow Problems ◽

Real World Problems

This book is on algorithms for network flows. Network flow problems are optimization problems where given a flow network, the aim is to construct a flow that respects the capacity constraints of the edges of the network, so that incoming flow equals the outgoing flow for all vertices of the network except designated vertices known as the source and the sink. Network flow algorithms solve many real-world problems. This book is intended to serve graduate students and as a reference. The book is also available in eBook (ISBN 9781316952894/US$ 32.00), and hardback (ISBN 9781107185890/US$99.99) formats. The book has a companion web site www.networkflowalgs.com where a pre-publication version of the book can be downloaded gratis.

Download Full-text

Help communities solve real-world problems with AI

AI Matters ◽

10.1145/3362077.3362080 ◽

2019 ◽

Vol 5 (3) ◽

pp. 12-14

Author(s):

Tara Chklovski

Keyword(s):

Real World ◽

Real World Problems

Download Full-text