Skip to content

2025-10-21

cs.AR - Architecture

标题作者发布日期PDF摘要
Hazel: Secure and Efficient Disaggregated StorageMarcin Chrapek, Meni Orenbach, Ahmad Atamli, Marcin Copik, Mikhail Khalilov, Fritz Alder, Torsten Hoefler2025-10-21下载Disaggregated storage with NVMe-over-Fabrics (NVMe-oF) has emerged as the standard solution in modern supercomputers and data center clusters, achieving superior performance, resource utilization, and...
DRsam: Detection of Fault-Based Microarchitectural Side-Channel Attacks in RISC-V Using Statistical Preprocessing and Association Rule MiningMuhammad Hassan, Maria Mushtaq, Jaan Raik, Tara Ghasempouri2025-10-21下载RISC-V processors are becoming ubiquitous in critical applications, but their susceptibility to microarchitectural side-channel attacks is a serious concern.
From Quarter to All: Accelerating Speculative LLM Decoding via Floating-Point Exponent Remapping and Parameter SharingYushu Zhao, Yubin Qin, Yang Wang, Xiaolong Yang, Huiming Han, Shaojun Wei, Yang Hu, Shouyi Yin2025-10-21下载Large language models achieve impressive performance across diverse tasks but exhibit high inference latency due to their large parameter sizes.
EdgeReasoning: Characterizing Reasoning LLM Deployment on Edge GPUsBenjamin Kubwimana, Qijing Huang2025-10-21下载Edge intelligence paradigm is increasingly demanded by the emerging autonomous systems, such as robotics. Beyond ensuring privacy-preserving operation and resilience in connectivity-limited environmen...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Learned Cost Model for Placement on Reconfigurable Dataflow HardwareEtash Guha, Tianxiao Jiang, Andrew Deng, Jian Zhang, Muthu Annamalai2025-10-21下载Mapping a dataflow-graph of an ML model onto a reconfigurable system is difficult, as different mappings have different throughputs and consume resource constraints differently.
Comparative analysis of large data processing in Apache Spark using Java, Python and ScalaIvan Borodii, Illia Fedorovych, Halyna Osukhivska, Diana Velychko, Roman Butsii2025-10-21下载During the study, the results of a comparative analysis of the process of handling large datasets using the Apache Spark platform in Java, Python, and Scala programming languages were obtained.
PCMS: Parallel Coupler For Multimodel SimulationsJacob S. Merson, Cameron W. Smith, Mark S. Shephard, Fuad Hasan, Abhiyan Paudel, Angel Castillo-Crooke, Joyal Mathew, Mohammad Elahi2025-10-21下载This paper presents the Parallel Coupler for Multimodel Simulations (PCMS), a new GPU accelerated generalized coupling framework for coupling simulation codes on leadership class supercomputers.
MTraining: Distributed Dynamic Sparse Attention for Efficient Ultra-Long Context TrainingWenxuan Li, Chengruidong Zhang, Huiqiang Jiang, Yucheng Li, Yuqing Yang, Lili Qiu2025-10-21下载The adoption of long context windows has become a standard feature in Large Language Models (LLMs), as extended contexts significantly enhance their capacity for complex reasoning and broaden their ap...
Hazel: Secure and Efficient Disaggregated StorageMarcin Chrapek, Meni Orenbach, Ahmad Atamli, Marcin Copik, Mikhail Khalilov, Fritz Alder, Torsten Hoefler2025-10-21下载Disaggregated storage with NVMe-over-Fabrics (NVMe-oF) has emerged as the standard solution in modern supercomputers and data center clusters, achieving superior performance, resource utilization, and...
Towards an Optimized Benchmarking Platform for CI/CD PipelinesNils Japke, Sebastian Koch, Helmut Lukasczyk, David Bermbach2025-10-21下载Performance regressions in large-scale software systems can lead to substantial resource inefficiencies, making their early detection critical.
Distributed Interactive Proofs for Planarity with Log-Star CommunicationYuval Gil, Merav Parter2025-10-21下载We provide new communication-efficient distributed interactive proofs for planarity. The notion of a \emph{distributed interactive proof (DIP)} was introduced by Kol, Oshman, and Saxena (PODC 2018).
Tokencake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent ApplicationsZhuohang Bian, Feiyang Wu, Teng Ma, Youwei Zhuo2025-10-21下载Large Language Models (LLMs) are increasingly deployed in complex multi-agent applications that use external function calls. This workload creates severe performance challenges for the KV Cache: space...
Structural Analysis of Multi-Core Processor and Reliability Evaluation ModelS. Tsiramua, H. Meladze, T. Davitashvili, J. M. Sanchez, F. Criado-Aldeanueva2025-10-21下载In the present paper, the models of structural analysis and evaluation of efficiency indicators (reliability, fault tolerance, viability, and flexibility) of a multi core processor with variable struc...
SLICE: SLO-Driven Scheduling for LLM Inference on Edge Computing DevicesWill Chow2025-10-21下载Large Language Models (LLMs), as the foundational architecture for next-generation interactive AI applications, not only power intelligent dialogue systems but also drive the evolution of embodied int...
LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data SourcesHaichao Ji, Zibo Wang, Cheng Pan, Meng Han, Yifei Zhu, Dan Wang, Zhu Han2025-10-21下载Large Language Models (LLMs) have shown great promise in automating data analytics tasks by interpreting natural language queries and generating multi-operation execution plans.
A Distributed Framework for Causal Modeling of Performance Variability in GPU TracesAnkur Lahiry, Ayush Pokharel, Banooqa Banday, Seth Ockerman, Amal Gueroudji, Mohammad Zaeed, Tanzima Z. Islam, Line Pouchard2025-10-21下载Large-scale GPU traces play a critical role in identifying performance bottlenecks within heterogeneous High-Performance Computing (HPC) architectures.
EdgeReasoning: Characterizing Reasoning LLM Deployment on Edge GPUsBenjamin Kubwimana, Qijing Huang2025-10-21下载Edge intelligence paradigm is increasingly demanded by the emerging autonomous systems, such as robotics. Beyond ensuring privacy-preserving operation and resilience in connectivity-limited environmen...
Distributed Allocation and Resource Scheduling Algorithms Resilient to Link FailureMohammadreza Doostmohammadian, Sergio Pequito2025-10-21下载Distributed resource allocation (DRA) is fundamental to modern networked systems, spanning applications from economic dispatch in smart grids to CPU scheduling in data centers.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Hazel: Secure and Efficient Disaggregated StorageMarcin Chrapek, Meni Orenbach, Ahmad Atamli, Marcin Copik, Mikhail Khalilov, Fritz Alder, Torsten Hoefler2025-10-21下载Disaggregated storage with NVMe-over-Fabrics (NVMe-oF) has emerged as the standard solution in modern supercomputers and data center clusters, achieving superior performance, resource utilization, and...
Formal Methods for Mobile Ad Hoc Networks: A SurveyWan Fokkink, Rob van Glabbeek2025-10-21下载In a mobile ad hoc network (MANET), communication is wireless and nodes can move independently. Properly analyzing the functional correctness, performance, and security of MANET protocols is a challen...
Forward to Hell? On the Potentials of Misusing Transparent DNS Forwarders in Reflective Amplification AttacksMaynard Koch, Florian Dolzmann, Thomas C. Schmidt, Matthias Wählisch2025-10-21下载The DNS infrastructure is infamous for facilitating reflective amplification attacks. Various countermeasures such as server shielding, access control, rate limiting, and protocol restrictions have be...
JAUNT: Joint Alignment of User Intent and Network State for QoE-centric LLM Tool RoutingEnhan Li, Hongyang Du2025-10-21下载Large Language Models (LLMs) increasingly rely on emerging protocols such as the Model Context Protocol (MCP) to invoke external tools and services.
On AI Verification in Open RANRahul Soundrarajan, Claudio Fiandrino, Michele Polese, Salvatore D'Oro, Leonardo Bonati, Tommaso Melodia2025-10-21下载Open RAN introduces a flexible, cloud-based architecture for the Radio Access Network (RAN), enabling Artificial Intelligence (AI)/Machine Learning (ML)-driven automation across heterogeneous, multi-v...
How2Compress: Scalable and Efficient Edge Video Analytics via Adaptive Granular Video CompressionYuheng Wu, Thanh-Tung Nguyen, Lucas Liebe, Quang Tau, Pablo Espinosa Campos, Jinghan Cheng, Dongman Lee2025-10-21下载With the rapid proliferation of the Internet of Things, video analytics has become a cornerstone application in wireless multimedia sensor networks.
Censorship Chokepoints: New Battlegrounds for Regional Surveillance, Censorship and Influence on the InternetYong Zhang, Nishanth Sastry2025-10-21下载Undoubtedly, the Internet has become one of the most important conduits to information for the general public. Nonetheless, Internet access can be and has been limited systematically or blocked comple...
Revisiting RFID Missing Tag IdentificationKanghuai Liu, Lin Chen, Jihong Yu, Junyi Huang, Shiyuan Liu2025-10-21下载We revisit the problem of missing tag identification in RFID networks by making three contributions. Firstly, we quantitatively compare and gauge the existing propositions spanning over a decade on mi...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Hazel: Secure and Efficient Disaggregated StorageMarcin Chrapek, Meni Orenbach, Ahmad Atamli, Marcin Copik, Mikhail Khalilov, Fritz Alder, Torsten Hoefler2025-10-21下载Disaggregated storage with NVMe-over-Fabrics (NVMe-oF) has emerged as the standard solution in modern supercomputers and data center clusters, achieving superior performance, resource utilization, and...
LatticeHashForest: An Efficient Data Structure for Repetitive Data and OperationsAnamitra Ghorui, Uday P. Khedker2025-10-21下载Analysis of entire programs as a single unit, or whole-program analysis, involves propagation of large amounts of information through the control flow of the program.

基于 VitePress 构建