SUMMARY


Emerging hardware devices and technologies are challenging decades-old assumptions in the design of data-centric computer systems. Byte-addressable, persistent memory enables fast, non-volatile storage with near-DRAM performance. CXL is enabling the expansion of memory and efficient sharing of (disaggregated) resources with low latency. NVMe devices can now deliver millions of IOPS at microsecond scale. However, modern data centers (e.g., cloud, HPC supercomputers) still rely on legacy software stacks (e.g., kernel-based file systems; general-purpose storage, memory, and network systems), thus being unable to reap the performance benefits of these devices.

At the same time, to accommodate the exponential demand of I/O and GPU-intensive workloads, data centers have been growing in size every year, as well as their carbon footprint and power consumption. In fact, recent studies report that data centers are estimated to consume 8% to 13% of the world’s total electricity usage by 2030.

The main goal of DSR is to design a new generation of storage and operating system building blocks fitted for the performance, reliability, and energy consumption of modern large-scale infrastructures.

OBJECTIVES


Within this domain, our current areas of interest include:

  • Redesign storage building blocks for new hardware devices: We redesign existing storage building blocks (e.g., key-value stores, file systems, caching), which still rely on decades-old assumptions in computer systems, for emerging hardware and storage technologies. By consolidating the applications’ workloads with the inherent characteristics of novel hardware devices, we design specialized storage systems that can fully reap the performance benefits of emerging devices.
  • Maximize performance and resource utilization: We research and design new storage and OS primitives to improve modern infrastructures' performance and resource utilization. We are exploring techniques for mitigating resource under and overprovisioning, including resource composability and disaggregation, workload consolidation, and workload-aware scheduling.
  • Improve energy efficiency and sustainability: We research and design new sustainable and energy-efficient mechanisms for reducing the energy consumption and carbon footprint of large-scale infrastructures while maintaining the performance of deployed applications. We focus on developing energy-efficient solutions at all levels, from specific compute resources (e.g., CPU, GPU, memory) to the administration and maintenance of large-scale infrastructures (e.g., HPC supercomputers).

SELECTED PUBLICATIONS


  1. Keigo: Co-designing Log-Structured Merge Key-Value Stores with a Non-Volatile, Concurrency-aware Storage HierarchyAdão, R., Wu, Z., Zhou, C., Balmau, O., Paulo, J., & Macedo, R.Proceedings of the VLDB Endowment. 2025.

  1. PAIO: General, Portable I/O Optimizations With Minor Application ModificationsMacedo, R., Tanimura, Y., Haga, J., Chidambaram, V., Pereira, J., & Paulo, J.In 20th USENIX Conference on File and Storage Technologies (FAST). USENIX, 2022.

  1. S2Dedup: SGX-enabled Secure DeduplicationMiranda, M., Esteves, T., Portela, B., & Paulo, J.In 14th ACM International Conference on Systems and Storage (SYSTOR). ACM, 2021.

Check the full list of publications here.

RESPONSIBLE FOR THE DOMAIN


Ricardo Macedo

Assistant Researcher