DICL - Octopus

Distributed Tiered Storage for Cluster Computing

Improvements in memory, storage devices, and network technologies are constantly exploited by distributed systems in order to meet the increasing data storage and I/O demands of modern large-scale data analytics. We present OctopusFS, a novel distributed file system that is aware of storage media (e.g., memory, SSDs, HDDs, NAS) with different capacities and performance characteristics. The system offers a variety of pluggable policies for automating data management across both the storage tiers and cluster nodes. A new data placement policy employs multi-objective optimization techniques for making intelligent data management decisions based on the requirements of fault tolerance, data and load balancing, and throughput maximization. Moreover, machine learning is employed for tracking and predicting file access patterns, which are then used by data movement policies to decide when and which data to move up or down the storage tiers for increasing system performance. This approach uses incremental learning along with XGBoost to dynamically refine the models with new file accesses and improve the prediction performance of the models. At the same time, the storage media are explicitly exposed to users and applications, allowing them to choose the distribution, placement, and movement of replicas in the cluster based on their own performance and fault tolerance requirements.
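To make the placement idea concrete, the following is a minimal sketch of a multi-objective replica placement decision. It is not the OctopusFS policy itself: the `Media` fields, the three objectives chosen (free capacity, current I/O load, throughput), the equal weighting, and the "closest to the ideal point" ranking are all assumptions made for illustration. Fault tolerance is approximated by never placing two replicas on the same node.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass
class Media:
    """Hypothetical description of one storage medium on one node."""
    node: str
    tier: str
    free_fraction: float    # remaining capacity in [0, 1]
    active_io: int          # current number of I/O connections
    throughput_mbps: float  # sustained write throughput


def normalize(values, higher_is_better):
    """Scale values to [0, 1] so that 1.0 is always the best score."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [1.0] * len(values)
    scaled = [(v - lo) / (hi - lo) for v in values]
    return scaled if higher_is_better else [1.0 - s for s in scaled]


def rank_media(candidates):
    """Order candidates by Euclidean distance to the ideal point (1, 1, 1)."""
    free = normalize([m.free_fraction for m in candidates], True)
    load = normalize([m.active_io for m in candidates], False)
    thr = normalize([m.throughput_mbps for m in candidates], True)
    scored = []
    for m, f, l, t in zip(candidates, free, load, thr):
        distance = sqrt((1 - f) ** 2 + (1 - l) ** 2 + (1 - t) ** 2)
        scored.append((distance, m))
    scored.sort(key=lambda pair: pair[0])
    return [m for _, m in scored]


def place_replicas(candidates, replicas):
    """Pick the best-ranked media, using each node at most once."""
    chosen, used_nodes = [], set()
    for m in rank_media(candidates):
        if m.node in used_nodes:
            continue  # keep replicas on distinct nodes for fault tolerance
        chosen.append(m)
        used_nodes.add(m.node)
        if len(chosen) == replicas:
            break
    return chosen


candidates = [
    Media("n1", "MEMORY", 0.2, 4, 3000.0),
    Media("n1", "SSD", 0.6, 2, 500.0),
    Media("n2", "SSD", 0.7, 1, 500.0),
    Media("n2", "HDD", 0.9, 0, 150.0),
    Media("n3", "HDD", 0.8, 3, 150.0),
]
placement = place_replicas(candidates, replicas=3)
for m in placement:
    print(m.node, m.tier)
```

In this toy cluster the nearly full, heavily loaded memory tier on n1 ranks last despite its high throughput, which illustrates why a single objective such as raw speed is not enough to drive placement.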

Big Data Ecosystem with OctopusFS and Trident

While storage tiering is becoming popular in data-intensive compute clusters, current big data platforms (such as Hadoop and Spark) do not exploit the presence of storage tiers or the opportunities they present for performance optimization. Specifically, schedulers and prefetchers base their decisions only on data locality information and ignore the fact that local data are now stored on a variety of storage media with different performance characteristics. We propose Trident, a scheduling and prefetching framework designed to make task assignment, resource scheduling, and prefetching decisions based on both locality and storage tier information. Trident formulates task scheduling as a minimum cost maximum matching problem in a bipartite graph and uses two novel pruning algorithms to bound the size of the graph while still guaranteeing optimality. In addition, Trident extends YARN’s resource request model and proposes a new storage-tier-aware resource scheduling algorithm. Finally, Trident includes a cost-based data prefetching approach that coordinates with the schedulers to optimize prefetching operations.
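The matching formulation can be illustrated with a toy example. The sketch below is not Trident's algorithm: it finds the minimum cost assignment by exhaustive search rather than with a matching algorithm and pruning, and it assumes one slot per node and no more tasks than nodes. The tier costs, the remote-read cost, and all function names are hypothetical values chosen for illustration.

```python
from itertools import permutations

# Hypothetical relative read costs per storage tier (lower is faster).
TIER_COST = {"MEMORY": 1, "SSD": 3, "HDD": 10}
# Assumed cost when the node holds no local replica of the task's input.
REMOTE_COST = 20


def assignment_cost(task_replicas, node):
    """Cost of running a task on `node`, given {node: tier} for its input block."""
    tier = task_replicas.get(node)
    return TIER_COST[tier] if tier is not None else REMOTE_COST


def schedule(tasks, nodes):
    """Return (total_cost, {task: node}) with minimum total cost.

    Exhaustive search over all one-to-one assignments; assumes
    len(tasks) <= len(nodes) and a single slot per node.
    """
    names = list(tasks)
    best = None
    for chosen in permutations(nodes, len(names)):
        cost = sum(assignment_cost(tasks[t], n) for t, n in zip(names, chosen))
        if best is None or cost < best[0]:
            best = (cost, dict(zip(names, chosen)))
    return best


# Each task maps the nodes holding a replica of its input to the replica's tier.
tasks = {
    "t1": {"n1": "MEMORY", "n2": "HDD"},
    "t2": {"n1": "SSD", "n3": "HDD"},
    "t3": {"n2": "SSD"},
}
nodes = ["n1", "n2", "n3"]

total, plan = schedule(tasks, nodes)
print(total, plan)
```

Here a locality-only scheduler would treat n1 and n3 as equally good for t2, since both hold a local replica. Once tier costs are included, the cheapest complete assignment gives n1 to t1 (memory replica) and sends t2 to its HDD replica on n3, for a total cost of 14.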

Relevant Publications

H. Herodotou and E. Kakoulli. Cost-based Data Prefetching and Scheduling in Big Data Platforms over Tiered Storage Systems. ACM Transactions on Database Systems (TODS), Vol. 48, No. 4, Article 11, pp. 1-40, November 2023.
H. Herodotou and E. Kakoulli. Trident: Task Scheduling over Tiered Storage Systems in Big Data Platforms. Proc. of VLDB Endowment (PVLDB), Vol. 14, No. 9, pp. 1570-1582, May 2021.
H. Herodotou and E. Kakoulli. Automating Distributed Tiered Storage Management in Cluster Computing. Proc. of VLDB Endowment (PVLDB), Vol. 13, No. 1, pp. 43-56, September 2019.
H. Herodotou. AutoCache: Employing Machine Learning to Automate Caching in Distributed File Systems. In Proc. of the 35th IEEE Intl. Conf. on Data Engineering Workshops (ICDEW '19), pp. 133-139, April 2019.
E. Kakoulli, N. D. Karmiris, and H. Herodotou. OctopusFS in Action: Tiered Storage Management for Data Intensive Computing. Demo, Proc. of VLDB Endowment (PVLDB), Vol. 11, No. 12, pp. 1914-1917, August 2018.
E. Kakoulli and H. Herodotou. OctopusFS: A Distributed File System with Tiered Storage Management. In Proc. of the ACM Intl. Conf. on Management of Data (SIGMOD '17), May 2017.
H. Herodotou. Towards a Distributed Multi-tier File System for Cluster Computing. In Proc. of the Intl. Workshop on Big Data Management on Emerging Hardware (HardBD '16), May 2016.
H. Herodotou. A Distributed File System with Storage-Media Awareness. Poster in Proc. of the 2015 IEEE/ACM 8th Intl. Conf. on Utility and Cloud Computing (UCC '15), December 2015.

Software Releases

OCTOPUS: Distributed Tiered Storage for Cluster Computing, Apache License 2.0, December 2023

Funding

AWS Cloud Credits for Research Grant, Amazon Web Services, July 2018
Starting Grant, Cyprus University of Technology, May 2015 - Apr 2017