Optimizing Large File Transfer on Data Grid

Author(s):  
Teng Ma ◽  
Junzhou Luo
2013 ◽  
Vol 5 (1) ◽  
pp. 70-81 ◽  
Author(s):  
Mohammed K. Madi ◽  
Yuhanis Yusof ◽  
Suhaidi Hassan

Data Grid is an infrastructure that manages huge amounts of data files and provides intensive computational resources across geographically distributed collaborations. To increase resource availability and to ease resource sharing in such an environment, there is a need for replication services. Data replication is one of the methods used to improve the performance of data access in distributed systems: multiple copies of data files are stored at distributed sites. A replica placement mechanism is the process of identifying where to place copies of replicated data files in a Grid system. Existing work identifies suitable sites based on the number of requests and the read cost of the required file; such approaches consume large amounts of bandwidth and increase the computational time. The authors propose a replica placement strategy (RPS) that finds the best locations to store replicas based on four criteria, namely, 1) Read Cost, 2) File Transfer Time, 3) Sites’ Workload, and 4) Replication Sites. OptorSim is used to evaluate the performance of this replica placement strategy. The simulation results show that RPS requires less execution time and consumes less network usage than the existing approaches of Simple Optimizer and LFU (Least Frequently Used).
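A multi-criteria placement decision of this kind can be sketched as a ranking problem. The following is a minimal illustration only: the `Site` attributes, the equal-weight scoring formula, and the sample values are assumptions for demonstration, not the paper's actual RPS definitions.

```python
# Hypothetical sketch: rank candidate sites by four placement criteria.
# The attributes, the unweighted sum, and all sample values are
# illustrative assumptions, not the paper's RPS formulation.
from dataclasses import dataclass


@dataclass
class Site:
    name: str
    read_cost: float        # criterion 1: cost to read the file from this site
    transfer_time: float    # criterion 2: time to transfer the replica here
    workload: float         # criterion 3: current workload of the site
    existing_replicas: int  # criterion 4: replicas already placed nearby


def placement_score(site: Site) -> float:
    # Lower is better for every criterion in this toy model.
    return (site.read_cost + site.transfer_time
            + site.workload + site.existing_replicas)


def best_sites(candidates: list[Site], k: int = 1) -> list[Site]:
    # Choose the k candidate sites with the lowest combined score.
    return sorted(candidates, key=placement_score)[:k]


sites = [
    Site("A", read_cost=2.0, transfer_time=5.0, workload=0.3, existing_replicas=1),
    Site("B", read_cost=1.5, transfer_time=3.0, workload=0.7, existing_replicas=0),
    Site("C", read_cost=4.0, transfer_time=1.0, workload=0.1, existing_replicas=2),
]
print([s.name for s in best_sites(sites, k=2)])  # → ['B', 'C']
```

A real strategy would weight the criteria and normalize their units; this sketch only shows the shape of the ranking step.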


2010 ◽  
Vol 12 (1) ◽  
pp. 52-66 ◽  
Author(s):  
Hyun-Chul Kim ◽  
Dongman Lee ◽  
Kilnam Chon ◽  
Beakcheol Jang ◽  
Taekyoung Kwon ◽  
...  

2015 ◽  
Vol 4 (1) ◽  
pp. 163 ◽  
Author(s):  
Alireza Saleh ◽  
Reza Javidan ◽  
Mohammad Taghi FatehiKhajeh

<p>Nowadays, scientific applications generate huge amounts of data, in terabytes or petabytes. Data Grids have been proposed as a solution to large-scale data management problems, including efficient file transfer and replication. Data is typically replicated in a Data Grid to improve job response time and data availability. Choosing a reasonable number of replicas and the right locations for them has become a challenge in the Data Grid. In this paper, a four-phase dynamic data replication algorithm based on temporal and geographical locality is proposed. It includes: 1) evaluating and identifying popular data, and triggering a replication operation when a file's popularity passes a dynamic threshold; 2) analyzing and modeling the relationship between system availability and the number of replicas, and calculating a suitable number of new replicas; 3) evaluating and identifying the popular data in each site, and placing replicas among them; 4) removing the files with the least cost in average access time when there is insufficient space for replication. The algorithm was tested using a grid simulator, OptorSim, developed by the European DataGrid project. The simulation results show that the proposed algorithm performs better than other algorithms in terms of job execution time, effective network usage, and percentage of storage filled.</p>
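Phases 1 and 4 of such a popularity-driven policy can be sketched as follows. The mean-based threshold and the eviction cost are assumptions made for illustration; the paper defines its own dynamic threshold and cost model.

```python
# Illustrative sketch of a popularity-triggered replication policy:
# phase 1 (trigger when popularity passes a dynamic threshold) and
# phase 4 (evict the file whose loss costs least in average access time).
# The threshold formula and cost measure are assumptions, not the
# paper's exact definitions.
from collections import Counter

access_counts: Counter = Counter()


def record_access(file_id: str) -> None:
    access_counts[file_id] += 1


def dynamic_threshold() -> float:
    # Assumed threshold: the mean access count over all known files.
    total = sum(access_counts.values())
    return total / max(len(access_counts), 1)


def should_replicate(file_id: str) -> bool:
    # Phase 1: replicate a file once its popularity exceeds the threshold.
    return access_counts[file_id] > dynamic_threshold()


def evict_candidate(avg_access_time: dict) -> str:
    # Phase 4: choose the file whose removal costs least in average access time.
    return min(avg_access_time, key=avg_access_time.get)


for f in ["a", "a", "a", "b", "c"]:
    record_access(f)
print(should_replicate("a"), should_replicate("b"))  # → True False
```

The mean access count makes the threshold adapt as the workload shifts, which is the point of a *dynamic* threshold; a production policy would also decay old counts to capture temporal locality.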


2020 ◽  
Vol E103.B (4) ◽  
pp. 431-439 ◽
Author(s):  
Kazuhiko KINOSHITA ◽  
Masahiko AIHARA ◽  
Nariyoshi YAMAI ◽  
Takashi WATANABE

2018 ◽  
Vol E101.B (3) ◽  
pp. 763-771 ◽  
Author(s):  
Kazuhiko KINOSHITA ◽  
Masahiko AIHARA ◽  
Shiori KONO ◽  
Nariyoshi YAMAI ◽  
Takashi WATANABE
