Compacted CPU/GPU Data Compression via Modified Virtual Address Translation

Larry Seiler; Daqi Lin; Cem Yuksel

doi:10.1145/3406177

Compacted CPU/GPU Data Compression via Modified Virtual Address Translation

Proceedings of the ACM on Computer Graphics and Interactive Techniques ◽

10.1145/3406177 ◽

2020 ◽

Vol 3 (2) ◽

pp. 1-18

Author(s):

Larry Seiler ◽

Daqi Lin ◽

Cem Yuksel

Keyword(s):

Data Compression ◽

Random Access ◽

Fixed Rate ◽

Address Translation ◽

Compression Data ◽

Virtual Address ◽

Depth Buffer ◽

Compressed Data ◽

Fast Access ◽

Better Than

We propose a method to reduce the footprint of compressed data by using modified virtual address translation to permit random access to the data. This extends our prior work on using page translation to perform automatic decompression and deswizzling upon accesses to fixed rate lossy or lossless compressed data. Our compaction method allows a virtual address space the size of the uncompressed data to be used to efficiently access variable-size blocks of compressed data. Compression and decompression take place between the first and second level caches, which allows fast access to uncompressed data in the first level cache and provides data compaction at all other levels of the memory hierarchy. This improves performance and reduces power relative to compressed but uncompacted data. An important property of our method is that compression, decompression, and reallocation are automatically managed by the new hardware without operating system intervention and without storing compression data in the page tables. As a result, although some changes are required in the page manager, it does not need to know the specific compression algorithm and can use a single memory allocation unit size. We tested our method with two sample CPU algorithms. When performing depth buffer occlusion tests, our method reduces the memory footprint by 3.1x. When rendering into textures, our method reduces the footprint by 1.69x before rendering and 1.63x after. In both cases, the power and cycle time are better than for uncompacted compressed data, and significantly better than for accessing uncompressed data.

Download Full-text

P-Codec: Parallel Compressed File Decompression Algorithm for Hadoop

INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY ◽

10.24297/ijct.v15i8.1500 ◽

2016 ◽

Vol 15 (8) ◽

pp. 6991-6998

Author(s):

Idris Hanafi ◽

Amal Abdel-Raouf

Keyword(s):

Big Data ◽

Data Processing ◽

Data Compression ◽

Distributed File System ◽

Design And Implementation ◽

Average Improvement ◽

Hadoop Distributed File System ◽

Compression Data ◽

Compressed Data ◽

Data Analytic

The increasing amount and size of data being handled by data analytic applications running on Hadoop has created a need for faster data processing. One of the effective methods for handling big data sizes is compression. Data compression not only makes network I/O processing faster, but also provides better utilization of resources. However, this approach defeats one of Hadoopâ€™s main purposes, which is the parallelism of map and reduce tasks. The number of map tasks created is determined by the size of the file, so by compressing a large file, the number of mappers is reduced which in turn decreases parallelism. Consequently, standard Hadoop takes longer times to process. In this paper, we propose the design and implementation of a Parallel Compressed File Decompressor (P-Codec) that improves the performance of Hadoop when processing compressed data. P-Codec includes two modules; the first module decompresses data upon retrieval by a data node during the phase of uploading the data to the Hadoop Distributed File System (HDFS). This process reduces the runtime of a job by removing the burden of decompression during the MapReduce phase. The second P-Codec module is a decompressed map task divider that increases parallelism by dynamically changing the map task split sizes based on the size of the final decompressed block. Our experimental results using five different MapReduce benchmarks show an average improvement of approximately 80% compared to standard Hadoop.

Download Full-text

Automatic GPU Data Compression and Address Swizzling for CPUs via Modified Virtual Address Translation

Symposium on Interactive 3D Graphics and Games ◽

10.1145/3384382.3384533 ◽

2020 ◽

Author(s):

Larry Seiler ◽

Daqi Lin ◽

Cem Yuksel

Keyword(s):

Data Compression ◽

Address Translation ◽

Virtual Address

Download Full-text

ExypnoSteganos - A smarter approach to steganography

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189879 ◽

2021 ◽

pp. 1-12

Author(s):

Gaurav Sarraf ◽

Anirudh Ramesh Srivatsa ◽

MS Swetha

Keyword(s):

Neural Network ◽

Deep Learning ◽

Data Compression ◽

Building Block ◽

Communication Techniques ◽

Systems Security ◽

Learning Technique ◽

Critical Problems ◽

Sporadic Cases ◽

Compressed Data

With the ever-rising threat to security, multiple industries are always in search of safer communication techniques both in rest and transit. Multiple security institutions agree that any systems security can be modeled around three major concepts: Confidentiality, Availability, and Integrity. We try to reduce the holes in these concepts by developing a Deep Learning based Steganography technique. In our study, we have seen, data compression has to be at the heart of any sound steganography system. In this paper, we have shown that it is possible to compress and encode data efficiently to solve critical problems of steganography. The deep learning technique, which comprises an auto-encoder with Convolutional Neural Network as its building block, not only compresses the secret file but also learns how to hide the compressed data in the cover file efficiently. The proposed techniques can encode secret files of the same size as of cover, or in some sporadic cases, even larger files can be encoded. We have also shown that the same model architecture can theoretically be applied to any file type. Finally, we show that our proposed technique surreptitiously evades all popular steganalysis techniques.

Download Full-text

Joint Source-Channel Coding in Dictionary Methods of Lossless Data Compression

International Journal of Electronics and Telecommunications ◽

10.2478/v10177-010-0046-8 ◽

2010 ◽

Vol 56 (4) ◽

pp. 351-355

Author(s):

Marcin Rodziewicz

Keyword(s):

Data Compression ◽

Channel Coding ◽

Data Stream ◽

Noisy Channels ◽

Error Resiliency ◽

Joint Source Channel Coding ◽

Lossless Data Compression ◽

Compressed Data ◽

Source Channel Coding

Joint Source-Channel Coding in Dictionary Methods of Lossless Data Compression Limitations on memory and resources of communications systems require powerful data compression methods. Decompression of compressed data stream is very sensitive to errors which arise during transmission over noisy channels, therefore error correction coding is also required. One of the solutions to this problem is the application of joint source and channel coding. This paper contains a description of methods of joint source-channel coding based on the popular data compression algorithms LZ'77 and LZSS. These methods are capable of introducing some error resiliency into compressed stream of data without degradation of the compression ratio. We analyze joint source and channel coding algorithms based on these compression methods and present their novel extensions. We also present some simulation results showing usefulness and achievable quality of the analyzed algorithms.

Download Full-text

Lossless text compression using GPT-2 language model and Huffman coding

SHS Web of Conferences ◽

10.1051/shsconf/202110204013 ◽

2021 ◽

Vol 102 ◽

pp. 04013

Author(s):

Md. Atiqur Rahman ◽

Mohamed Hamada

Keyword(s):

Data Compression ◽

State Of The Art ◽

Language Model ◽

Huffman Coding ◽

Original Text ◽

Text Compression ◽

Compression Technique ◽

Daily Life Activities ◽

Burrows Wheeler Transform ◽

Compressed Data

Modern daily life activities produced lots of information for the advancement of telecommunication. It is a challenging issue to store them on a digital device or transmit it over the Internet, leading to the necessity for data compression. Thus, research on data compression to solve the issue has become a topic of great interest to researchers. Moreover, the size of compressed data is generally smaller than its original. As a result, data compression saves storage and increases transmission speed. In this article, we propose a text compression technique using GPT-2 language model and Huffman coding. In this proposed method, Burrows-Wheeler transform and a list of keys are used to reduce the original text file’s length. Finally, we apply GPT-2 language mode and then Huffman coding for encoding. This proposed method is compared with the state-of-the-art techniques used for text compression. Finally, we show that the proposed method demonstrates a gain in compression ratio compared to the other state-of-the-art methods.

Download Full-text

Data compression, data association and reduced complexity SLAM techniques for UUVs during transit

Autonomous Underwater Vehicles: Design and practice ◽

10.1049/sbra525e_ch7 ◽

2020 ◽

pp. 191-209

Keyword(s):

Data Compression ◽

Data Association ◽

Compression Data ◽

Reduced Complexity

Download Full-text

A Comparative Study of 6T and 8T SRAM Cell With Improved Read and Write Margins in 130 nm CMOS Technology

WSEAS TRANSACTIONS ON CIRCUITS AND SYSTEMS ◽

10.37394/23201.2020.19.2 ◽

2020 ◽

Vol 19 ◽

Keyword(s):

Comparative Analysis ◽

Random Access ◽

Cmos Technology ◽

Noise Margin ◽

Sram Cell ◽

Static Random Access Memories ◽

Node Voltage ◽

Improve Stability ◽

Static Noise ◽

Better Than

This paper examines the factors that affect the Static Noise Margin (SNM) of a Static Random Access memories which focus on optimizing Read and Write operation of 8T SRAM cell which is better than 6T SRAM cell Using Swing Restoration for Dual Node Voltage. The read and Write time and improve Stability. New 8T SRAM technique on the circuit or architecture level is required. In this paper Comparative Analysis of 6T and 8T SRAM Cells with Improved Read and Write Margin is done for 130 nm Technology with Cadence Virtuoso schematics Tool.

Download Full-text

Lora-WAN Powered by Renewable Energy, and Its Operation with Siri / Google Assistant

International Research Journal of Multidisciplinary Technovation ◽

10.34256/irjmt2055 ◽

2020 ◽

Vol 2 (5) ◽

pp. 26-34

Author(s):

Albin Joseph ◽

Balamurugan M

Keyword(s):

Renewable Energy ◽

Data Compression ◽

Communication Technology ◽

Energy Resource ◽

Small Data ◽

Dual Mode ◽

Data Packets ◽

Internet Connection ◽

Voice Control ◽

Better Than

LoRa WAN is a newly emerged game changing communication technology for sending small data packets of size 50 bytes or less, wirelessly over an area of up to 10 Km without the need of an internet connection. LoRa WAN has its own frequency band and the band is different for every country. This technology is now starring to boost WSN technology better than ever before. This paper aims to, power up a LoRa Enabled Device or a LoRa Gateway by using a reliable dual mode non-conventional energy resource for storage and utilization, find peak performances altering the data rate that can be achieved in a LoRa WAN Communication (using Indoor RAK Gateway), make use data compression techniques, data packet encoding / decoding, Coding Apple Shortcuts, setting up Siri and Google Assistant for voice control and future scope.

Download Full-text

Queryable Compression on Time-evolving Web and Social Networks with Streaming

ACM Transactions on the Web ◽

10.1145/3495012 ◽

2022 ◽

Vol 16 (2) ◽

pp. 1-21

Author(s):

Michael Nelson ◽

Sridhar Radhakrishnan ◽

Chandra Sekharan ◽

Amlan Chatterjee ◽

Sudhindra Gopal Krishna

Keyword(s):

Data Structure ◽

Adjacency Matrix ◽

Binary Tree ◽

Network Data ◽

Data Sets ◽

Undirected Graphs ◽

Sparse Graphs ◽

Compressed Data ◽

Over Time ◽

Better Than

Time-evolving web and social network graphs are modeled as a set of pages/individuals (nodes) and their arcs (links/relationships) that change over time. Due to their popularity, they have become increasingly massive in terms of their number of nodes, arcs, and lifetimes. However, these graphs are extremely sparse throughout their lifetimes. For example, it is estimated that Facebook has over a billion vertices, yet at any point in time, it has far less than 0.001% of all possible relationships. The space required to store these large sparse graphs may not fit in most main memories using underlying representations such as a series of adjacency matrices or adjacency lists. We propose building a compressed data structure that has a compressed binary tree corresponding to each row of each adjacency matrix of the time-evolving graph. We do not explicitly construct the adjacency matrix, and our algorithms take the time-evolving arc list representation as input for its construction. Our compressed structure allows for directed and undirected graphs, faster arc and neighborhood queries, as well as the ability for arcs and frames to be added and removed directly from the compressed structure (streaming operations). We use publicly available network data sets such as Flickr, Yahoo!, and Wikipedia in our experiments and show that our new technique performs as well or better than our benchmarks on all datasets in terms of compression size and other vital metrics.

Download Full-text

Enabling Efficient Random Access to Hierarchically-Compressed Data

2020 IEEE 36th International Conference on Data Engineering (ICDE) ◽

10.1109/icde48307.2020.00097 ◽

2020 ◽

Cited By ~ 1

Author(s):

Feng Zhang ◽

Jidong Zhai ◽

Xipeng Shen ◽

Onur Mutlu ◽

Xiaoyong Du

Keyword(s):

Random Access ◽

Compressed Data

Download Full-text