Skip to content

2025-04-25

cs.AR - Architecture

标题作者发布日期PDF摘要
Periodic Online Testing for Sparse Systolic Tensor ArraysChristodoulos Peltekis, Chrysostomos Nicopoulos, Giorgos Dimitrakopoulos2025-04-25下载Modern Machine Learning (ML) applications often benefit from structured sparsity, a technique that efficiently reduces model complexity and simplifies handling of sparse data in hardware.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
The Big Send-off: Scalable and Performant Collectives for Deep LearningSiddharth Singh, Keshav Pradeep, Mahua Singh, Cunyang Wei, Abhinav Bhatele2025-04-25下载Collective communication is becoming increasingly important in data center and supercomputer workloads with an increase in distributed AI related jobs.
Raptr: Prefix Consensus for Robust High-Performance BFTAndrei Tonkikh, Balaji Arun, Zhuolun Xiang, Zekun Li, Alexander Spiegelman2025-04-25下载In this paper, we present Raptr--a Byzantine fault-tolerant state machine replication (BFT SMR) protocol that combines strong robustness with high throughput, while attaining near-optimal theoretical ...
Dynamic Memory Management on GPUs with SYCLRussell K. Standish2025-04-25下载Dynamic memory allocation is not traditionally available in kernels running on GPUs. This work aims to build on Ouroboros, an efficient dynamic memory management library for CUDA applications, by port...
EcoServe: Enabling Cost-effective LLM Serving with Proactive Intra- and Inter-Instance OrchestrationJiangsu Du, Hongbin Zhang, Taosheng Wei, Zhenyi Zheng, Kaiyi Wu, Zhiguang Chen, Yutong Lu2025-04-25下载Existing LLM serving strategies can be categorized based on whether prefill and decode phases are disaggregated: non-disaggregated (NoDG) or fully disaggregated (FuDG).
Optimal Secure Coded Distributed Computation over all FieldsPedro Soto2025-04-25下载We construct optimal secure coded distributed schemes that extend the known optimal constructions over fields of characteristic 0 to all fields.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
GeoFINDR: Practical Approach to Verify Cloud Instances Geolocation in MulticloudSaid Ider, Maryline Laurent2025-04-25下载In multicloud environments, where legal obligations, technical constraints and economic interests are at stake, it is of interest to stakeholders to be able to locate cloud data or the cloud instance ...
Information Freshness in Dynamic Gossip NetworksArunabh Srivastava, Thomas Jacob Maranzatto, Sennur Ulukus2025-04-25下载We consider a source that shares updates with a network of nn gossiping nodes. The network's topology switches between two arbitrary topologies, with switching governed by a two-state continuous time...
FlexiNS: A SmartNIC-Centric, Line-Rate and Flexible Network StackXuzheng Chen, Jie Zhang, Baolin Zhu, Xueying Zhu, Zhongqing Chen, Shu Ma, Lingjun Zhu, Chao Shi, Yin Zhang, Zeke Wang2025-04-25下载As the gap between network and CPU speeds rapidly increases, the CPU-centric network stack proves inadequate due to excessive CPU and memory overhead.
Task-Oriented Semantic Compression for Localization at the Network EdgeZhengru Fang, Senkang Hu, Yu Guo, Yiqin Deng, Yuguang Fang2025-04-25下载Achieving precise visual localization in GPS-limited urban environments poses significant challenges for resource-constrained mobile platforms, particularly under strict bandwidth, memory, and process...
LLM-hRIC: LLM-empowered Hierarchical RAN Intelligent Control for O-RANLingyan Bao, Sinwoong Yun, Jemin Lee, Tony Q. S. Quek2025-04-25下载Despite recent advances in applying large language models (LLMs) and machine learning (ML) techniques to open radio access network (O-RAN), critical challenges remain, such as insufficient cooperation...
Joint Resource Estimation and Trajectory Optimization for eVTOL-involved CR network: A Monte Carlo Tree Search-based ApproachKai Xiong, Chenxin Yang, Yujie Qin, Wanzhi Ma, Chau Yuen2025-04-25下载Electric Vertical Take-Off and Landing (eVTOL) aircraft, pivotal to Advanced Air Mobility (AAM), are emerging as a transformative transportation paradigm with the potential to redefine urban and regio...
Integrating Explainable AI for Energy Efficient Open Radio Access NetworksL. Malakalapalli, V. Gudepu, B. Chirumamilla, S. J. Yadhunandan, K. Kondepu2025-04-25下载The Open Radio Access Network (Open RAN) is an emerging idea -- transforming the traditional Radio Access Networks (RAN) that are monolithic and inflexible into more flexible and innovative.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
From Good to Great: Improving Memory Tiering Performance Through Parameter TuningKonstantinos Kanellis, Sujay Yadalam, Fanchao Chen, Michael Swift, Shivaram Venkataraman2025-04-25下载Memory tiering systems achieve memory scaling by adding multiple tiers of memory wherein different tiers have different access latencies and bandwidth.

cs.PF - Performance

标题作者发布日期PDF摘要
Spatiotemporal Analysis of Parallelized Computing at the Extreme EdgeYasser Nabil, Mahmoud Abdelhadi, Sameh Sorour, Hesham ElSawy, Sara A. Elsayed, Hossam S. Hassanein2025-04-25下载Extreme Edge Computing (EEC) pushes computing even closer to end users than traditional Multi-access Edge Computing (MEC), harnessing the idle resources of Extreme Edge Devices (EEDs) to enable low-la...

基于 VitePress 构建