2025-02-20

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Micro Blossom: Accelerated Minimum-Weight Perfect Matching Decoding for Quantum Error Correction	Yue Wu, Namitha Liyanage, Lin Zhong	2025-02-20	下载	Minimum-Weight Perfect Matching (MWPM) decoding is important to quantum error correction decoding because of its accuracy. However, many believe that it is difficult, if possible at all, to achieve th...
Leveraging Error Resilience of Iterative Algorithms for Energy Efficiency: from Concept to Implementation	G. A. Gillani, A. Krapukhin, A. B. J. Kokkeler	2025-02-20	下载	Iterative algorithms are widely used in digital signal processing applications. With the case study of radio astronomy calibration processing, this work contributes towards revealing and exploiting th...
Parallelizing a modern GPU simulator	Rodrigo Huerta, Antonio González	2025-02-20	下载	Simulators are a primary tool in computer architecture research but are extremely computationally intensive. Simulating modern architectures with increased core counts and recent workloads can be chal...
DeepRTL: Bridging Verilog Understanding and Generation with a Unified Representation Model	Yi Liu, Changran Xu, Yunhao Zhou, Zeju Li, Qiang Xu	2025-02-20	下载	Recent advancements in large language models (LLMs) have shown significant potential for automating hardware description language (HDL) code generation from high-level natural language instructions.
μRL: Discovering Transient Execution Vulnerabilities Using Reinforcement Learning	M. Caner Tol, Kemal Derya, Berk Sunar	2025-02-20	下载	We propose using reinforcement learning to address the challenges of discovering microarchitectural vulnerabilities, such as Spectre and Meltdown, which exploit subtle interactions in modern processor...
NDPage: Efficient Address Translation for Near-Data Processing Architectures via Tailored Page Table	Qingcai Jiang, Buxin Tu, Hong An	2025-02-20	下载	Near-Data Processing (NDP) has been a promising architectural paradigm to address the memory wall problem for data-intensive applications. Practical implementation of NDP architectures calls for syste...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention	Shang Yang, Junxian Guo, Haotian Tang, Qinghao Hu, Guangxuan Xiao, Jiaming Tang, Yujun Lin, Zhijian Liu, Yao Lu, Song Han	2025-02-20	下载	Large language models (LLMs) have shown remarkable potential in processing long sequences and complex reasoning tasks, yet efficiently serving these models remains challenging due to the quadratic com...
MadVoro: Parallel Construction of Voronoi Diagrams in Distributed Memory Systems	Maor Mizrachi, Barak Raveh, Elad Steinberg	2025-02-20	下载	Voronoi diagrams are essential geometrical structures with numerous applications, particularly astrophysics-driven finite volume methods. While serial algorithms for constructing these entities are we...
Byzantine Game Theory: Sun Tzus Boxes	Andrei Constantinescu, Roger Wattenhofer	2025-02-20	下载	We introduce the Byzantine Selection Problem, living at the intersection of game theory and fault-tolerant distributed computing. Here, an event organizer is presented with a group of $n$ agents, and ...
Parallelizing a modern GPU simulator	Rodrigo Huerta, Antonio González	2025-02-20	下载	Simulators are a primary tool in computer architecture research but are extremely computationally intensive. Simulating modern architectures with increased core counts and recent workloads can be chal...
SageServe: Optimizing LLM Serving on Cloud Data Centers with Forecast Aware Auto-Scaling	Shashwat Jaiswal, Kunal Jain, Yogesh Simmhan, Anjaly Parayil, Ankur Mallick, Rujia Wang, Renee St. Amant, Chetan Bansal, Victor Rühle, Anoop Kulkarni, Steve Kofsky, Saravan Rajmohan	2025-02-20	下载	Global cloud service providers handle inference workloads for Large Language Models (LLMs) that span latency-sensitive (e.g., chatbots) and insensitive (e.g.
madupite: A High-Performance Distributed Solver for Large-Scale Markov Decision Processes	Matilde Gargiani, Robin Sieber, Philip Pawlowsky, Václav Hapla, John Lygeros	2025-02-20	下载	This paper introduces madupite, a high-performance distributed solver for large-scale Markov Decision Processes (MDPs). MDPs are widely used to model complex dynamical systems in various fields, inclu...
LLM4FaaS: No-Code Application Development using LLMs and FaaS	Minghe Wang, Tobias Pfandzelter, Trever Schirmer, David Bermbach	2025-02-20	下载	Large language models (LLMs) show great capabilities in generating code from natural language descriptions, bringing programming power closer to non-technical users.
Optimizing the Longhorn Cloud-native Software Defined Storage Engine for High Performance	Konstantinos Kampadais, Antony Chazapis, Angelos Bilas	2025-02-20	下载	Longhorn is an open-source, cloud-native software-defined storage (SDS) engine that delivers distributed block storage management in Kubernetes environments.
A Parallel Hierarchical Approach for Community Detection on Large-scale Dynamic Networks	Grigoriy Bokov, Aleksandr Konovalov, Anna Uporova, Stanislav Moiseev, Ivan Safonov, Alexander Radionov	2025-02-20	下载	In this paper, we propose a novel parallel hierarchical Leiden-based algorithm for dynamic community detection. The algorithm, for a given batch update of edge insertions and deletions, partitions the...
It Takes Two to Tango: Serverless Workflow Serving via Bilaterally Engaged Resource Adaptation	Jing Wu, Lin Wang, Quanfeng Deng, Chen Yu, Dong Zhang, Bingheng Yan, Fangming Liu	2025-02-20	下载	Serverless platforms typically adopt an early-binding approach for function sizing, requiring developers to specify an immutable size for each function within a workflow beforehand.
Blockchain-based Framework for Scalable and Incentivized Federated Learning	Bijun Wu, Oshani Seneviratne	2025-02-20	下载	Federated Learning (FL) enables collaborative model training without sharing raw data, preserving privacy while harnessing distributed datasets.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
NeSt-VR: An Adaptive Bitrate Algorithm for Virtual Reality Streaming over Wi-Fi	Miguel Casasnovas, Ferran Maura, Isjtar Vandebroeck, Haryo Sukmawanto, Eric Joris, Boris Bellalta	2025-02-20	下载	Real-time interactive Virtual Reality (VR) streaming is a significantly challenging use case for Wi-Fi given its high throughput and low latency requirements, especially considering the constraints im...
Online Resource Management for the Uplink of Wideband Hybrid Beamforming System	Yuan Quan, Haseen Rahman, Catherine Rosenberg	2025-02-20	下载	This paper studies the radio resource management (RRM) for the uplink (UL) of a cellular system with codebook-based hybrid beamforming. We consider the often neglected but highly practical multi-chann...
Tracking and Assigning Jobs to a Markov Machine	Subhankar Banerjee, Sennur Ulukus	2025-02-20	下载	We consider a time-slotted communication system with a machine, a cloud server, and a sampler. Job requests from the users are queued on the server to be completed by the machine.
A Survey of Internet Censorship and its Measurement: Methodology, Trends, and Challenges	Steffen Wendzel, Simon Volpert, Sebastian Zillien, Julia Lenz, Philip Rünz, Luca Caviglione	2025-02-20	下载	Internet censorship limits the access of nodes residing within a specific network environment to the public Internet, and vice versa. During the last decade, techniques for conducting Internet censors...
Reinforcement Learning with Graph Attention for Routing and Wavelength Assignment with Lightpath Reuse	Michael Doherty, Alejandra Beghelli	2025-02-20	下载	Many works have investigated reinforcement learning (RL) for routing and spectrum assignment on flex-grid networks but only one work to date has examined RL for fixed-grid with flex-rate transponders,...
Counter Pools: Counter Representation for Efficient Stream Processing	Ran Ben Basat, Gil Einziger, Bilal Tyah, Shay Vargaftik	2025-02-20	下载	Due to the large data volume and number of distinct elements, space is often the bottleneck of many stream processing systems. The data structures used by these systems often consist of counters whose...
Optimal Popularity-based Transmission Range Selection for D2D-supported Content Delivery	Loreto Pescosolido, Andrea Passarella, Marco Conti	2025-02-20	下载	Considering device-to-device (D2D) wireless links as a virtual extension of 5G (and beyond) cellular networks to deliver popular contents has been proposed as an interesting approach to reduce energy ...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Taming and Controlling Performance and Energy Trade-offs Automatically in Network Applications	Han Dong, Yara Awad, Sanjay Arora, Orran Krieger, Jonathan Appavoo	2025-02-20	下载	In this paper, we demonstrate that a server running a single latency-sensitive application can be treated as a black box to reduce energy consumption while meeting an SLA target.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention	Shang Yang, Junxian Guo, Haotian Tang, Qinghao Hu, Guangxuan Xiao, Jiaming Tang, Yujun Lin, Zhijian Liu, Yao Lu, Song Han	2025-02-20	下载	Large language models (LLMs) have shown remarkable potential in processing long sequences and complex reasoning tasks, yet efficiently serving these models remains challenging due to the quadratic com...
Parallelizing a modern GPU simulator	Rodrigo Huerta, Antonio González	2025-02-20	下载	Simulators are a primary tool in computer architecture research but are extremely computationally intensive. Simulating modern architectures with increased core counts and recent workloads can be chal...