Reducing the number of response time service level objective violations by a cloud-HPC convergence scheduler

Alessandro Kraemer; Carlos Maziero; Olivier Richard; Denis Trystram

doi:10.1002/cpe.4352

Solving a Location, Allocation, and Capacity Planning Problem with Dynamic Demand and Response Time Service Level

Mathematical Problems in Engineering ◽

10.1155/2014/492340 ◽

2014 ◽

Vol 2014 ◽

pp. 1-25 ◽

Cited By ~ 1

Author(s):

Carrie Ka Yuk Lin

Keyword(s):

Response Time ◽

Travel Time ◽

Service Level ◽

Mixed Integer ◽

Mixed Integer Program ◽

Planning Problem ◽

Time Requirement ◽

Time Service ◽

Location Allocation ◽

Total Capacity

Logistic systems with uncertain demand, travel time, and on-site processing time are studied here where sequential trip travel is allowed. The relationship between three levels of decisions: facility location, demand allocation, and resource capacity (number of service units), satisfying the response time requirement, is analysed. The problem is formulated as a stochastic mixed integer program. A simulation-based hybrid heuristic is developed to solve the dynamic problem under different response time service level. An initial solution is obtained from solving static location-allocation models, followed by iterative improvement of the three levels of decisions by ejection, reinsertion procedure with memory of feasible and infeasible service regions. Results indicate that a higher response time service level could be achieved by allocating a given resource under an appropriate decentralized policy. Given a response time requirement, the general trend is that the minimum total capacity initially decreases with more facilities. During this stage, variability in travel time has more impact on capacity than variability in demand arrivals. Thereafter, the total capacity remains stable and then gradually increases. When service level requirement is high, the dynamic dispatch based on first-come-first-serve rule requires smaller capacity than the one by nearest-neighbour rule.

Download Full-text

Response time service level agreements for cloud-hosted web applications

Proceedings of the Sixth ACM Symposium on Cloud Computing - SoCC '15 ◽

10.1145/2806777.2806842 ◽

2015 ◽

Cited By ~ 14

Author(s):

Hiranya Jayathilaka ◽

Chandra Krintz ◽

Rich Wolski

Keyword(s):

Response Time ◽

Web Applications ◽

Service Level ◽

Service Level Agreements ◽

Time Service

Download Full-text

Derivation of Response Time Service Level Objectives for Business Services

2007 2nd IEEE/IFIP International Workshop on Business-Driven IT Management ◽

10.1109/bdim.2007.375009 ◽

2007 ◽

Cited By ~ 12

Author(s):

David Breitgand ◽

Ealan A. Henis ◽

Onn Shehory ◽

John M. Lake

Keyword(s):

Response Time ◽

Service Level ◽

Business Services ◽

Time Service ◽

Service Level Objectives

Download Full-text

Service-Level Agreement Durability for Web Service Response Time

2015 IEEE 7th International Conference on Cloud Computing Technology and Science (CloudCom) ◽

10.1109/cloudcom.2015.37 ◽

2015 ◽

Cited By ~ 4

Author(s):

Hiranya Jayathilaka ◽

Chandra Krintz ◽

Rich Wolski

Keyword(s):

Response Time ◽

Web Service ◽

Service Level Agreement ◽

Service Level ◽

Service Response Time

Download Full-text

Analysis and Optimization of a Combined Make-to-Stock and Make-to-Order Multiproduct Manufacturing System

Journal of Applied Mathematics and Decision Sciences ◽

10.1155/2009/716059 ◽

2009 ◽

Vol 2009 ◽

pp. 1-27 ◽

Cited By ~ 3

Author(s):

Khaled Hadj Youssef ◽

Christian van Delft ◽

Yves Dallery

Keyword(s):

Lead Time ◽

High Volume ◽

Service Level ◽

Time Service ◽

Inventory Costs ◽

Make To Order ◽

End Products ◽

Low Volume ◽

Multiproduct Manufacturing ◽

The Impact

We consider a single-stage multiproduct manufacturing facility producing several end-products for delivery to customers with a required customer lead-time. The end-products can be split in two classes: few products with high volume demands and a large number of products with low-volume demands. In order to reduce inventory costs, it seems efficient to produce the high-volume products according to an MTS policy and the low volume products according to an MTO policy. The purpose of this paper is to analyze and compare the impact of the scheduling policy on the overall inventory costs, under customer lead-time service level constraints. We consider two policies: the classical FIFO policy and a priority policy (PR) which gives priority to low volume products over high volume products. We show that for some range of parameters, the PR rule can significantly outperform the FIFO rule. In these ranges, the service level constraints are satisfied by the PR rule with much lower inventory costs.

Download Full-text

Comparison of particular logistic models’ adoption in the Czech Republic

Open Engineering ◽

10.1515/eng-2016-0092 ◽

2016 ◽

Vol 6 (1) ◽

Author(s):

Petra Vrbová ◽

Václav Cempírek

Keyword(s):

Supply Chain ◽

Czech Republic ◽

Lead Time ◽

Logistic Model ◽

Logistic Models ◽

Service Level ◽

Time Service ◽

The Czech Republic ◽

Total Cost ◽

Challenging Tasks

Abstract Managing inventory is considered as one of the most challenging tasks facing supply chain managers and specialists. Decisions related to inventory locations along with level of inventory kept throughout the supply chain have a fundamental impact on the response time, service level, delivery lead-time and the total cost of the supply chain. The main objective of this paper is to identify and analyse the share of a particular logistic model adopted in the Czech Republic (Consignment stock, Buffer stock, Safety stock) and also compare their usage and adoption according to different industries. This paper also aims to specify possible reasons of particular logistic model preferences in comparison to the others. The analysis is based on quantitative survey held in the Czech Republic.

Download Full-text

Market-Based Joint Decisions on Price, Delivery Time, Service Level, and Investment or Supplier Selection

SSRN Electronic Journal ◽

10.2139/ssrn.1819301 ◽

2010 ◽

Author(s):

Li Qian

Keyword(s):

Supplier Selection ◽

Service Level ◽

Delivery Time ◽

Time Service ◽

Joint Decisions

Download Full-text

Dynamic Resource Allocation Method for Load Balance Scheduling Over Cloud Data Center Networks

Journal of Web Engineering ◽

10.13052/jwe1540-9589.2083 ◽

2021 ◽

Author(s):

Sakshi Chhabra ◽

Ashutosh Kumar Singh

Keyword(s):

Resource Allocation ◽

Load Balancing ◽

Response Time ◽

Resource Utilization ◽

Service Level Agreement ◽

Service Level ◽

Dynamic Resource Allocation ◽

Cloud Data Center ◽

Cloud Data ◽

Dynamic Resource

The cloud datacenter has numerous hosts as well as application requests where resources are dynamic. The demands placed on the resource allocation are diverse. These factors could lead to load imbalances, which affect scheduling efficiency and resource utilization. A scheduling method called Dynamic Resource Allocation for Load Balancing (DRALB) is proposed. The proposed solution constitutes two steps: First, the load manager analyzes the resource requirements such as CPU, Memory, Energy and Bandwidth usage and allocates an appropriate number of VMs for each application. Second, the resource information is collected and updated where resources are sorted into four queues according to the loads of resources i.e. CPU intensive, Memory intensive, Energy intensive and Bandwidth intensive. We demonstarate that SLA-aware scheduling not only facilitates the cloud consumers by resources availability and improves throughput, response time etc. but also maximizes the cloud profits with less resource utilization and SLA (Service Level Agreement) violation penalties. This method is based on diversity of client’s applications and searching the optimal resources for the particular deployment. Experiments were carried out based on following parameters i.e. average response time; resource utilization, SLA violation rate and load balancing. The experimental results demonstrate that this method can reduce the wastage of resources and reduces the traffic upto 44.89% and 58.49% in the network.

Download Full-text

Service Level Agreements for Real-Time Service-Oriented Infrastructures

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Achieving Real-Time in Distributed Computing ◽

10.4018/978-1-60960-827-9.ch008 ◽

2011 ◽

pp. 133-159

Author(s):

Roland Kübert ◽

Georgina Gallizo ◽

Theodoros Polychniatis ◽

Theodora Varvarigou ◽

Eduardo Oliveros ◽

...

Keyword(s):

Cloud Computing ◽

Grid Computing ◽

Real Time ◽

Service Level ◽

Service Level Agreements ◽

Real Time Systems ◽

Service Oriented Architectures ◽

Time Service ◽

Service Oriented ◽

Time Systems

Service Level Agreements (SLAs) are nowadays used as a cornerstone for building service-oriented architectures. SLAs have been closely investigated in the scope of distributed and Grid computing and are now gaining uptake in cloud computing as well. However, most solutions have been developed for specific purposes and are not applicable generally, even though the most approaches propose a general usability. Only rarely have SLAs been applied to real-time systems. The purpose of this chapter is to analyze different fields where SLAs are used, examine the proposed solutions, and investigate how these can be improved in order to better support the creation of real-time service-oriented architectures.

Download Full-text

Service Level Agreements for Real-Time Service-Oriented Infrastructures

Grid and Cloud Computing ◽

10.4018/978-1-4666-0879-5.ch807 ◽

2012 ◽

pp. 1836-1862 ◽

Cited By ~ 2

Author(s):

Roland Kübert ◽

Georgina Gallizo ◽

Thodoris Polychniatis ◽

Theodora Varvarigou ◽

Eduardo Oliveros ◽

...

Keyword(s):

Cloud Computing ◽

Grid Computing ◽

Real Time ◽

Service Level ◽

Service Level Agreements ◽

Real Time Systems ◽

Service Oriented Architectures ◽

Time Service ◽

Service Oriented ◽

Time Systems

Service Level Agreements (SLAs) are nowadays used as a cornerstone for building service-oriented architectures. SLAs have been closely investigated in the scope of distributed and Grid computing and are now gaining uptake in cloud computing as well. However, most solutions have been developed for specific purposes and are not applicable generally, even though the most approaches propose a general usability. Only rarely have SLAs been applied to real-time systems. The purpose of this chapter is to analyze different fields where SLAs are used, examine the proposed solutions, and investigate how these can be improved in order to better support the creation of real-time service-oriented architectures.

Download Full-text