2025-10-21

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Hazel: Secure and Efficient Disaggregated Storage	Marcin Chrapek, Meni Orenbach, Ahmad Atamli, Marcin Copik, Mikhail Khalilov, Fritz Alder, Torsten Hoefler	2025-10-21	下载	Disaggregated storage with NVMe-over-Fabrics (NVMe-oF) has emerged as the standard solution in modern supercomputers and data center clusters, achieving superior performance, resource utilization, and...
DRsam: Detection of Fault-Based Microarchitectural Side-Channel Attacks in RISC-V Using Statistical Preprocessing and Association Rule Mining	Muhammad Hassan, Maria Mushtaq, Jaan Raik, Tara Ghasempouri	2025-10-21	下载	RISC-V processors are becoming ubiquitous in critical applications, but their susceptibility to microarchitectural side-channel attacks is a serious concern.
From Quarter to All: Accelerating Speculative LLM Decoding via Floating-Point Exponent Remapping and Parameter Sharing	Yushu Zhao, Yubin Qin, Yang Wang, Xiaolong Yang, Huiming Han, Shaojun Wei, Yang Hu, Shouyi Yin	2025-10-21	下载	Large language models achieve impressive performance across diverse tasks but exhibit high inference latency due to their large parameter sizes.
EdgeReasoning: Characterizing Reasoning LLM Deployment on Edge GPUs	Benjamin Kubwimana, Qijing Huang	2025-10-21	下载	Edge intelligence paradigm is increasingly demanded by the emerging autonomous systems, such as robotics. Beyond ensuring privacy-preserving operation and resilience in connectivity-limited environmen...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Learned Cost Model for Placement on Reconfigurable Dataflow Hardware	Etash Guha, Tianxiao Jiang, Andrew Deng, Jian Zhang, Muthu Annamalai	2025-10-21	下载	Mapping a dataflow-graph of an ML model onto a reconfigurable system is difficult, as different mappings have different throughputs and consume resource constraints differently.
Comparative analysis of large data processing in Apache Spark using Java, Python and Scala	Ivan Borodii, Illia Fedorovych, Halyna Osukhivska, Diana Velychko, Roman Butsii	2025-10-21	下载	During the study, the results of a comparative analysis of the process of handling large datasets using the Apache Spark platform in Java, Python, and Scala programming languages were obtained.
PCMS: Parallel Coupler For Multimodel Simulations	Jacob S. Merson, Cameron W. Smith, Mark S. Shephard, Fuad Hasan, Abhiyan Paudel, Angel Castillo-Crooke, Joyal Mathew, Mohammad Elahi	2025-10-21	下载	This paper presents the Parallel Coupler for Multimodel Simulations (PCMS), a new GPU accelerated generalized coupling framework for coupling simulation codes on leadership class supercomputers.
MTraining: Distributed Dynamic Sparse Attention for Efficient Ultra-Long Context Training	Wenxuan Li, Chengruidong Zhang, Huiqiang Jiang, Yucheng Li, Yuqing Yang, Lili Qiu	2025-10-21	下载	The adoption of long context windows has become a standard feature in Large Language Models (LLMs), as extended contexts significantly enhance their capacity for complex reasoning and broaden their ap...
Hazel: Secure and Efficient Disaggregated Storage	Marcin Chrapek, Meni Orenbach, Ahmad Atamli, Marcin Copik, Mikhail Khalilov, Fritz Alder, Torsten Hoefler	2025-10-21	下载	Disaggregated storage with NVMe-over-Fabrics (NVMe-oF) has emerged as the standard solution in modern supercomputers and data center clusters, achieving superior performance, resource utilization, and...
Towards an Optimized Benchmarking Platform for CI/CD Pipelines	Nils Japke, Sebastian Koch, Helmut Lukasczyk, David Bermbach	2025-10-21	下载	Performance regressions in large-scale software systems can lead to substantial resource inefficiencies, making their early detection critical.
Distributed Interactive Proofs for Planarity with Log-Star Communication	Yuval Gil, Merav Parter	2025-10-21	下载	We provide new communication-efficient distributed interactive proofs for planarity. The notion of a \emph{distributed interactive proof (DIP)} was introduced by Kol, Oshman, and Saxena (PODC 2018).
Tokencake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent Applications	Zhuohang Bian, Feiyang Wu, Teng Ma, Youwei Zhuo	2025-10-21	下载	Large Language Models (LLMs) are increasingly deployed in complex multi-agent applications that use external function calls. This workload creates severe performance challenges for the KV Cache: space...
Structural Analysis of Multi-Core Processor and Reliability Evaluation Model	S. Tsiramua, H. Meladze, T. Davitashvili, J. M. Sanchez, F. Criado-Aldeanueva	2025-10-21	下载	In the present paper, the models of structural analysis and evaluation of efficiency indicators (reliability, fault tolerance, viability, and flexibility) of a multi core processor with variable struc...
SLICE: SLO-Driven Scheduling for LLM Inference on Edge Computing Devices	Will Chow	2025-10-21	下载	Large Language Models (LLMs), as the foundational architecture for next-generation interactive AI applications, not only power intelligent dialogue systems but also drive the evolution of embodied int...
LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data Sources	Haichao Ji, Zibo Wang, Cheng Pan, Meng Han, Yifei Zhu, Dan Wang, Zhu Han	2025-10-21	下载	Large Language Models (LLMs) have shown great promise in automating data analytics tasks by interpreting natural language queries and generating multi-operation execution plans.
A Distributed Framework for Causal Modeling of Performance Variability in GPU Traces	Ankur Lahiry, Ayush Pokharel, Banooqa Banday, Seth Ockerman, Amal Gueroudji, Mohammad Zaeed, Tanzima Z. Islam, Line Pouchard	2025-10-21	下载	Large-scale GPU traces play a critical role in identifying performance bottlenecks within heterogeneous High-Performance Computing (HPC) architectures.
EdgeReasoning: Characterizing Reasoning LLM Deployment on Edge GPUs	Benjamin Kubwimana, Qijing Huang	2025-10-21	下载	Edge intelligence paradigm is increasingly demanded by the emerging autonomous systems, such as robotics. Beyond ensuring privacy-preserving operation and resilience in connectivity-limited environmen...
Distributed Allocation and Resource Scheduling Algorithms Resilient to Link Failure	Mohammadreza Doostmohammadian, Sergio Pequito	2025-10-21	下载	Distributed resource allocation (DRA) is fundamental to modern networked systems, spanning applications from economic dispatch in smart grids to CPU scheduling in data centers.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Hazel: Secure and Efficient Disaggregated Storage	Marcin Chrapek, Meni Orenbach, Ahmad Atamli, Marcin Copik, Mikhail Khalilov, Fritz Alder, Torsten Hoefler	2025-10-21	下载	Disaggregated storage with NVMe-over-Fabrics (NVMe-oF) has emerged as the standard solution in modern supercomputers and data center clusters, achieving superior performance, resource utilization, and...
Formal Methods for Mobile Ad Hoc Networks: A Survey	Wan Fokkink, Rob van Glabbeek	2025-10-21	下载	In a mobile ad hoc network (MANET), communication is wireless and nodes can move independently. Properly analyzing the functional correctness, performance, and security of MANET protocols is a challen...
Forward to Hell? On the Potentials of Misusing Transparent DNS Forwarders in Reflective Amplification Attacks	Maynard Koch, Florian Dolzmann, Thomas C. Schmidt, Matthias Wählisch	2025-10-21	下载	The DNS infrastructure is infamous for facilitating reflective amplification attacks. Various countermeasures such as server shielding, access control, rate limiting, and protocol restrictions have be...
JAUNT: Joint Alignment of User Intent and Network State for QoE-centric LLM Tool Routing	Enhan Li, Hongyang Du	2025-10-21	下载	Large Language Models (LLMs) increasingly rely on emerging protocols such as the Model Context Protocol (MCP) to invoke external tools and services.
On AI Verification in Open RAN	Rahul Soundrarajan, Claudio Fiandrino, Michele Polese, Salvatore D'Oro, Leonardo Bonati, Tommaso Melodia	2025-10-21	下载	Open RAN introduces a flexible, cloud-based architecture for the Radio Access Network (RAN), enabling Artificial Intelligence (AI)/Machine Learning (ML)-driven automation across heterogeneous, multi-v...
How2Compress: Scalable and Efficient Edge Video Analytics via Adaptive Granular Video Compression	Yuheng Wu, Thanh-Tung Nguyen, Lucas Liebe, Quang Tau, Pablo Espinosa Campos, Jinghan Cheng, Dongman Lee	2025-10-21	下载	With the rapid proliferation of the Internet of Things, video analytics has become a cornerstone application in wireless multimedia sensor networks.
Censorship Chokepoints: New Battlegrounds for Regional Surveillance, Censorship and Influence on the Internet	Yong Zhang, Nishanth Sastry	2025-10-21	下载	Undoubtedly, the Internet has become one of the most important conduits to information for the general public. Nonetheless, Internet access can be and has been limited systematically or blocked comple...
Revisiting RFID Missing Tag Identification	Kanghuai Liu, Lin Chen, Jihong Yu, Junyi Huang, Shiyuan Liu	2025-10-21	下载	We revisit the problem of missing tag identification in RFID networks by making three contributions. Firstly, we quantitatively compare and gauge the existing propositions spanning over a decade on mi...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Hazel: Secure and Efficient Disaggregated Storage	Marcin Chrapek, Meni Orenbach, Ahmad Atamli, Marcin Copik, Mikhail Khalilov, Fritz Alder, Torsten Hoefler	2025-10-21	下载	Disaggregated storage with NVMe-over-Fabrics (NVMe-oF) has emerged as the standard solution in modern supercomputers and data center clusters, achieving superior performance, resource utilization, and...
LatticeHashForest: An Efficient Data Structure for Repetitive Data and Operations	Anamitra Ghorui, Uday P. Khedker	2025-10-21	下载	Analysis of entire programs as a single unit, or whole-program analysis, involves propagation of large amounts of information through the control flow of the program.