Reining in Long Tails in Warehouse-Scale Computers with Quick Voltage boosting using Adrenaline 6

By: 4 0 16, [, ] | [, ] |
Contributor(s): ACM Transactions on Computer Systems.35:1 (2017). pp.2 5 6 [] |
Language: Unknown language code Summary language: Unknown language code Original language: Unknown language code Series: ; 46Edition: Description: Content type: text Media type: unmediated Carrier type: volumeISBN: ISSN: 2Other title: 6 []Uniform titles: | | Related works: 1 40 Chang-Hong Hsu 6 []Subject(s): -- 2 -- 0 -- -- | -- 2 -- 0 -- 6 -- | 2 0 -- | -- -- 20 -- | | -- -- DATACENTERS;ENERGY EFFICIENCY -- LATENCY-CRITICAL WORKLOADS;FINE - GRAINED DYNAMIC VOLTAGE/FREQUENCY SCALING -- TAIL QUERIES -- | -- -- -- 20 -- --Genre/Form: -- 2 -- Additional physical formats: DDC classification: | LOC classification: | | 2Other classification:
Contents:
Action note: In: Summary: Other editions:
Tags from this library: No tags from this library for this title. Log in to add tags.
    Average rating: 0.0 (0 votes)

ABSTRACT: Reducing the long tail of the query latency distribution in modern warehouse scale computers is critical for improving performance and quality of service (QoS) of workloads such as Web search and Memcached. Traditional turbo boost increases a processor's voltage and frequency during a coarse-grained sliding window, boosting all queries that are processed during that window. However, the inability of such a technique to pinpoint tail queries for boosting limits its tail reduction benefit. In this work, we propose Adrenaline, an approach to leverage finer-granularity (tens of nanoseconds) voltage boosting to effectively rein in the tail latency with query-level precision. Two key insights underlie this work. First, emerging finer granularity voltage/frequency boosting is an enabling mechanism for intelligent allocation of the power budget to precisely boost only the queries that contribute to the tail latency; second , per-query characteristics can be used to design indicators for proactively pinpointing these queries, triggering boosting accordingly. Based on these insights, Adrenaline effectively pinpoints and boosts queries that are likely to increase the tail distribution and can reap more benefit from the voltage/frequency boost. By evaluating inder various work-load configurations, we demonstrate the effectiveness of our methodology. We achieve up to a 2.50x tail latency improvement for memcached and up to a 3.03x for Web Search over coarse-grained dynamic voltage and frequency scaling (DVFS) given boosting power budget. When optimizing for energy reduction, adrenaline achieves up to a1.81x improvement for Memcached and up to a 1.99x for web search over coarse-grained DVFS. By using the carefully chosen thresholds, Adrenaline further improves the taillatency reduction to 4.82x over coarse-grained DVFS. 56

5

5

There are no comments for this item.

to post a comment.

© Copyright 2024 Phoenix Library Management System - Pinnacle Technologies, Inc. All Rights Reserved.