Please use this identifier to cite or link to this item: https://open.uns.ac.rs/handle/123456789/3850
Title: Efficient routing for cost effective scale-out data architectures
Authors: Narayan A.
Marković, Vuk 
Postawa N.
King A.
Morales A.
Kumar K.
Efstathopoulos P.
Issue Date: 5-Dec-2016
Journal: Proceedings - 2016 IEEE 24th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, MASCOTS 2016
Abstract: © 2016 IEEE. In large scale-out data architectures, data are distributed and replicated across several machines. Queries/tasks to such data architectures, are sent to a router which determines the machines containing the requested data. Ideally, to reduce the overall cost of analytics, the smallest set of machines required to satisfy the query should be returned by the router. Mathematically, this can be modeled as the set cover problem, which is NP-hard. Given large number of incoming queries in real-time, it is often impractical to compute set cover for each incoming query to perform routing. In this paper, we propose a novel technique to speedup the routing of a large number of real-time queries while minimizing the number of machines that each query touches (query span). We demonstrate that by analyzing the correlation between known queries and performing query clustering, we can reduce the set cover computation time, thereby significantly speeding up routing of unknown queries. Experiments show that our incremental set cover-based routing is 2.5 times faster and can return on average 50% fewer machines per query when compared to repeated greedy set cover and baseline routing techniques.
URI: https://open.uns.ac.rs/handle/123456789/3850
ISBN: 9781509034314
DOI: 10.1109/MASCOTS.2016.29
Appears in Collections:MDF Publikacije/Publications

Show full item record

Page view(s)

17
Last Week
15
Last month
0
checked on May 3, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.