scholarly journals A New Cache Update Scheme Using Reinforcement Learning for Coded Video Streaming Systems

Sensors ◽  
2021 ◽  
Vol 21 (8) ◽  
pp. 2867
Author(s):  
Yu-Sin Kim ◽  
Jeong-Min Lee ◽  
Jong-Yeol Ryu ◽  
Tae-Won Ban

As the demand for video streaming has been rapidly increasing recently, new technologies for improving the efficiency of video streaming have attracted much attention. In this paper, we thus investigate how to improve the efficiency of video streaming by using clients’ cache storage considering exclusive OR (XOR) coding-based video streaming where multiple different video contents can be simultaneously transmitted in one transmission as long as prerequisite conditions are satisfied, and the efficiency of video streaming can be thus significantly enhanced. We also propose a new cache update scheme using reinforcement learning. The proposed scheme uses a K-actor-critic (K-AC) network that can mitigate the disadvantage of actor-critic networks by yielding K candidate outputs and by selecting the final output with the highest value out of the K candidates. The K-AC exists in each client, and each client can train it by using only locally available information without any feedback or signaling so that the proposed cache update scheme is a completely decentralized scheme. The performance of the proposed cache update scheme was analyzed in terms of the average number of transmissions for XOR coding-based video streaming and was compared to that of conventional cache update schemes. Our numerical results show that the proposed cache update scheme can reduce the number of transmissions up to 24% when the number of videos is 100, the number of clients is 50, and the cache size is 5.

2011 ◽  
Vol 2011 ◽  
pp. 1-12 ◽  
Author(s):  
Karim El-Laithy ◽  
Martin Bogdan

An integration of both the Hebbian-based and reinforcement learning (RL) rules is presented for dynamic synapses. The proposed framework permits the Hebbian rule to update the hidden synaptic model parameters regulating the synaptic response rather than the synaptic weights. This is performed using both the value and the sign of the temporal difference in the reward signal after each trial. Applying this framework, a spiking network with spike-timing-dependent synapses is tested to learn the exclusive-OR computation on a temporally coded basis. Reward values are calculated with the distance between the output spike train of the network and a reference target one. Results show that the network is able to capture the required dynamics and that the proposed framework can reveal indeed an integrated version of Hebbian and RL. The proposed framework is tractable and less computationally expensive. The framework is applicable to a wide class of synaptic models and is not restricted to the used neural representation. This generality, along with the reported results, supports adopting the introduced approach to benefit from the biologically plausible synaptic models in a wide range of intuitive signal processing.


Author(s):  
Chung-wei Lee ◽  
Joshua L. Smith

Mobile video streaming is a natural augmentation to today’s thriving Internet video streaming service. With the rapid growth of the capability of mobile handheld devices and abundant bandwidth from high-speed wireless networks, it is expected that mobile video streaming service will soon become a lucrative business section and a thrust for technological advancement on computer and telecommunication industries. In this chapter, essential technical components for constructing mobile video streaming systems are introduced. They include the latest development on broadband wireless technology and video-capable mobile handheld devices. As many modern technologies are often driven by consumer demand, user experience and expectation are discussed from the perspective of mobile video streaming. At the end, several cutting-edge research and development breakthroughs are presented as they may change the future of mobile video streaming systems.


Sign in / Sign up

Export Citation Format

Share Document