Reducing the number of response time service level objective violations by a cloud-HPC convergence scheduler

2017 ◽  
Vol 30 (12) ◽  
pp. e4352 ◽  
Author(s):  
Alessandro Kraemer ◽  
Carlos Maziero ◽  
Olivier Richard ◽  
Denis Trystram
2014 ◽  
Vol 2014 ◽  
pp. 1-25 ◽  
Author(s):  
Carrie Ka Yuk Lin

Logistic systems with uncertain demand, travel time, and on-site processing time are studied here where sequential trip travel is allowed. The relationship between three levels of decisions: facility location, demand allocation, and resource capacity (number of service units), satisfying the response time requirement, is analysed. The problem is formulated as a stochastic mixed integer program. A simulation-based hybrid heuristic is developed to solve the dynamic problem under different response time service level. An initial solution is obtained from solving static location-allocation models, followed by iterative improvement of the three levels of decisions by ejection, reinsertion procedure with memory of feasible and infeasible service regions. Results indicate that a higher response time service level could be achieved by allocating a given resource under an appropriate decentralized policy. Given a response time requirement, the general trend is that the minimum total capacity initially decreases with more facilities. During this stage, variability in travel time has more impact on capacity than variability in demand arrivals. Thereafter, the total capacity remains stable and then gradually increases. When service level requirement is high, the dynamic dispatch based on first-come-first-serve rule requires smaller capacity than the one by nearest-neighbour rule.


2009 ◽  
Vol 2009 ◽  
pp. 1-27 ◽  
Author(s):  
Khaled Hadj Youssef ◽  
Christian van Delft ◽  
Yves Dallery

We consider a single-stage multiproduct manufacturing facility producing several end-products for delivery to customers with a required customer lead-time. The end-products can be split in two classes: few products with high volume demands and a large number of products with low-volume demands. In order to reduce inventory costs, it seems efficient to produce the high-volume products according to an MTS policy and the low volume products according to an MTO policy. The purpose of this paper is to analyze and compare the impact of the scheduling policy on the overall inventory costs, under customer lead-time service level constraints. We consider two policies: the classical FIFO policy and a priority policy (PR) which gives priority to low volume products over high volume products. We show that for some range of parameters, the PR rule can significantly outperform the FIFO rule. In these ranges, the service level constraints are satisfied by the PR rule with much lower inventory costs.


2016 ◽  
Vol 6 (1) ◽  
Author(s):  
Petra Vrbová ◽  
Václav Cempírek

Abstract Managing inventory is considered as one of the most challenging tasks facing supply chain managers and specialists. Decisions related to inventory locations along with level of inventory kept throughout the supply chain have a fundamental impact on the response time, service level, delivery lead-time and the total cost of the supply chain. The main objective of this paper is to identify and analyse the share of a particular logistic model adopted in the Czech Republic (Consignment stock, Buffer stock, Safety stock) and also compare their usage and adoption according to different industries. This paper also aims to specify possible reasons of particular logistic model preferences in comparison to the others. The analysis is based on quantitative survey held in the Czech Republic.


Author(s):  
Sakshi Chhabra ◽  
Ashutosh Kumar Singh

The cloud datacenter has numerous hosts as well as application requests where resources are dynamic. The demands placed on the resource allocation are diverse. These factors could lead to load imbalances, which affect scheduling efficiency and resource utilization. A scheduling method called Dynamic Resource Allocation for Load Balancing (DRALB) is proposed. The proposed solution constitutes two steps: First, the load manager analyzes the resource requirements such as CPU, Memory, Energy and Bandwidth usage and allocates an appropriate number of VMs for each application. Second, the resource information is collected and updated where resources are sorted into four queues according to the loads of resources i.e. CPU intensive, Memory intensive, Energy intensive and Bandwidth intensive. We demonstarate that SLA-aware scheduling not only facilitates the cloud consumers by resources availability and improves throughput, response time etc. but also maximizes the cloud profits with less resource utilization and SLA (Service Level Agreement) violation penalties. This method is based on diversity of client’s applications and searching the optimal resources for the particular deployment. Experiments were carried out based on following parameters i.e. average response time; resource utilization, SLA violation rate and load balancing. The experimental results demonstrate that this method can reduce the wastage of resources and reduces the traffic upto 44.89% and 58.49% in the network.


Author(s):  
Roland Kübert ◽  
Georgina Gallizo ◽  
Theodoros Polychniatis ◽  
Theodora Varvarigou ◽  
Eduardo Oliveros ◽  
...  

Service Level Agreements (SLAs) are nowadays used as a cornerstone for building service-oriented architectures. SLAs have been closely investigated in the scope of distributed and Grid computing and are now gaining uptake in cloud computing as well. However, most solutions have been developed for specific purposes and are not applicable generally, even though the most approaches propose a general usability. Only rarely have SLAs been applied to real-time systems. The purpose of this chapter is to analyze different fields where SLAs are used, examine the proposed solutions, and investigate how these can be improved in order to better support the creation of real-time service-oriented architectures.


2012 ◽  
pp. 1836-1862 ◽  
Author(s):  
Roland Kübert ◽  
Georgina Gallizo ◽  
Thodoris Polychniatis ◽  
Theodora Varvarigou ◽  
Eduardo Oliveros ◽  
...  

Service Level Agreements (SLAs) are nowadays used as a cornerstone for building service-oriented architectures. SLAs have been closely investigated in the scope of distributed and Grid computing and are now gaining uptake in cloud computing as well. However, most solutions have been developed for specific purposes and are not applicable generally, even though the most approaches propose a general usability. Only rarely have SLAs been applied to real-time systems. The purpose of this chapter is to analyze different fields where SLAs are used, examine the proposed solutions, and investigate how these can be improved in order to better support the creation of real-time service-oriented architectures.


Sign in / Sign up

Export Citation Format

Share Document