ORCID ID

0000-0001-6366-8874

Date Awarded

Spring 2017

Document Type

Dissertation

Degree Name

Doctor of Philosophy (Ph.D.)

Department

Computer Science

Advisor

Evgenia Smirni

Committee Member

Weizhen Mao

Committee Member

Gang Zhou

Committee Member

Xu Liu

Committee Member

Arif Merchant

Abstract

In large-scaled and distributed systems, like multi-tier storage systems and cloud data centers, resource sharing among workloads brings multiple benefits while introducing many performance challenges. The key to effective workload multiplexing is accurate workload prediction. This thesis focuses on how to capture the salient characteristics of the real-world workloads to develop workload prediction methods and to drive scheduling and resource allocation policies, in order to achieve efficient and in-time resource isolation among applications. For a multi-tier storage system, high-priority user work is often multiplexed with low-priority background work. This brings the challenge of how to strike a balance between maintaining the user performance and maximizing the amount of finished background work. In this thesis, we propose two resource isolation policies based on different workload prediction methods: one is a Markovian model-based and the other is a neural networks-based. These policies aim at, via workload prediction, discovering the opportune time to schedule background work with minimum impact on user performance. Trace-driven simulations verify the efficiency of the two pro- posed resource isolation policies. The Markovian model-based policy successfully schedules the background work at the appropriate periods with small impact on the user performance. The neural networks-based policy adaptively schedules user and background work, resulting in meeting both performance requirements consistently. This thesis also proposes an accurate while efficient neural networks-based pre- diction method for data center usage series, called PRACTISE. Different from the traditional neural networks for time series prediction, PRACTISE selects the most informative features from the past observations of the time series itself. Testing on a large set of usage series in production data centers illustrates the accuracy (e.g., prediction error) and efficiency (e.g., time cost) of PRACTISE. The superiority of the usage prediction also allows a proactive resource management in the highly virtualized cloud data centers. In this thesis, we analyze on the performance tickets in the cloud data centers, and propose an active sizing algorithm, named ATM, that predicts the usage workloads and re-allocates capacity to work- loads to avoid VM performance tickets. Moreover, driven by cheap prediction of usage tails, we also present TailGuard in this thesis, which dynamically clones VMs among co-located boxes, in order to efficiently reduce the performance violations of physical boxes in cloud data centers.

DOI

http://doi.org/10.21220/S2CM12

Rights

Recommended Citation

Xue, Ji, "Workload Prediction for Efficient Performance Isolation and System Reliability" (2017). Dissertations, Theses, and Masters Projects. William & Mary. Paper 1499450064.
http://doi.org/10.21220/S2CM12

Dissertations, Theses, and Masters Projects

Workload Prediction for Efficient Performance Isolation and System Reliability

ORCID ID

Date Awarded

Document Type

Degree Name

Department

Advisor

Committee Member

Committee Member

Committee Member

Committee Member

Abstract

DOI

Rights

Recommended Citation

Included in

Browse

Search

Author Corner

Links

About Scholarworks

Links

Dissertations, Theses, and Masters Projects

Workload Prediction for Efficient Performance Isolation and System Reliability

Author

ORCID ID

Date Awarded

Document Type

Degree Name

Department

Advisor

Committee Member

Committee Member

Committee Member

Committee Member

Abstract

DOI

Rights

Recommended Citation

Included in

Share

Browse

Search

Author Corner

Links

About Scholarworks

Links