2025-10-21
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Hazel: Secure and Efficient Disaggregated Storage | Marcin Chrapek, Meni Orenbach, Ahmad Atamli, Marcin Copik, Mikhail Khalilov, Fritz Alder, Torsten Hoefler | 2025-10-21 | Download | Disaggregated storage with NVMe-over-Fabrics (NVMe-oF) has emerged as the standard solution in modern supercomputers and data center clusters, achieving superior performance, resource utilization, and... |
| DRsam: Detection of Fault-Based Microarchitectural Side-Channel Attacks in RISC-V Using Statistical Preprocessing and Association Rule Mining | Muhammad Hassan, Maria Mushtaq, Jaan Raik, Tara Ghasempouri | 2025-10-21 | Download | RISC-V processors are becoming ubiquitous in critical applications, but their susceptibility to microarchitectural side-channel attacks is a serious concern. |
| From Quarter to All: Accelerating Speculative LLM Decoding via Floating-Point Exponent Remapping and Parameter Sharing | Yushu Zhao, Yubin Qin, Yang Wang, Xiaolong Yang, Huiming Han, Shaojun Wei, Yang Hu, Shouyi Yin | 2025-10-21 | Download | Large language models achieve impressive performance across diverse tasks but exhibit high inference latency due to their large parameter sizes. |
| EdgeReasoning: Characterizing Reasoning LLM Deployment on Edge GPUs | Benjamin Kubwimana, Qijing Huang | 2025-10-21 | Download | Edge intelligence paradigm is increasingly demanded by the emerging autonomous systems, such as robotics. Beyond ensuring privacy-preserving operation and resilience in connectivity-limited environmen... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Learned Cost Model for Placement on Reconfigurable Dataflow Hardware | Etash Guha, Tianxiao Jiang, Andrew Deng, Jian Zhang, Muthu Annamalai | 2025-10-21 | Download | Mapping a dataflow-graph of an ML model onto a reconfigurable system is difficult, as different mappings have different throughputs and consume resource constraints differently. |
| Comparative analysis of large data processing in Apache Spark using Java, Python and Scala | Ivan Borodii, Illia Fedorovych, Halyna Osukhivska, Diana Velychko, Roman Butsii | 2025-10-21 | Download | During the study, the results of a comparative analysis of the process of handling large datasets using the Apache Spark platform in Java, Python, and Scala programming languages were obtained. |
| PCMS: Parallel Coupler For Multimodel Simulations | Jacob S. Merson, Cameron W. Smith, Mark S. Shephard, Fuad Hasan, Abhiyan Paudel, Angel Castillo-Crooke, Joyal Mathew, Mohammad Elahi | 2025-10-21 | Download | This paper presents the Parallel Coupler for Multimodel Simulations (PCMS), a new GPU accelerated generalized coupling framework for coupling simulation codes on leadership class supercomputers. |
| MTraining: Distributed Dynamic Sparse Attention for Efficient Ultra-Long Context Training | Wenxuan Li, Chengruidong Zhang, Huiqiang Jiang, Yucheng Li, Yuqing Yang, Lili Qiu | 2025-10-21 | Download | The adoption of long context windows has become a standard feature in Large Language Models (LLMs), as extended contexts significantly enhance their capacity for complex reasoning and broaden their ap... |
| Hazel: Secure and Efficient Disaggregated Storage | Marcin Chrapek, Meni Orenbach, Ahmad Atamli, Marcin Copik, Mikhail Khalilov, Fritz Alder, Torsten Hoefler | 2025-10-21 | Download | Disaggregated storage with NVMe-over-Fabrics (NVMe-oF) has emerged as the standard solution in modern supercomputers and data center clusters, achieving superior performance, resource utilization, and... |
| Towards an Optimized Benchmarking Platform for CI/CD Pipelines | Nils Japke, Sebastian Koch, Helmut Lukasczyk, David Bermbach | 2025-10-21 | Download | Performance regressions in large-scale software systems can lead to substantial resource inefficiencies, making their early detection critical. |
| Distributed Interactive Proofs for Planarity with Log-Star Communication | Yuval Gil, Merav Parter | 2025-10-21 | Download | We provide new communication-efficient distributed interactive proofs for planarity. The notion of a *distributed interactive proof (DIP)* was introduced by Kol, Oshman, and Saxena (PODC 2018). |
| Tokencake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent Applications | Zhuohang Bian, Feiyang Wu, Teng Ma, Youwei Zhuo | 2025-10-21 | Download | Large Language Models (LLMs) are increasingly deployed in complex multi-agent applications that use external function calls. This workload creates severe performance challenges for the KV Cache: space... |
| Structural Analysis of Multi-Core Processor and Reliability Evaluation Model | S. Tsiramua, H. Meladze, T. Davitashvili, J. M. Sanchez, F. Criado-Aldeanueva | 2025-10-21 | Download | In the present paper, the models of structural analysis and evaluation of efficiency indicators (reliability, fault tolerance, viability, and flexibility) of a multi core processor with variable struc... |
| SLICE: SLO-Driven Scheduling for LLM Inference on Edge Computing Devices | Will Chow | 2025-10-21 | Download | Large Language Models (LLMs), as the foundational architecture for next-generation interactive AI applications, not only power intelligent dialogue systems but also drive the evolution of embodied int... |
| LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data Sources | Haichao Ji, Zibo Wang, Cheng Pan, Meng Han, Yifei Zhu, Dan Wang, Zhu Han | 2025-10-21 | Download | Large Language Models (LLMs) have shown great promise in automating data analytics tasks by interpreting natural language queries and generating multi-operation execution plans. |
| A Distributed Framework for Causal Modeling of Performance Variability in GPU Traces | Ankur Lahiry, Ayush Pokharel, Banooqa Banday, Seth Ockerman, Amal Gueroudji, Mohammad Zaeed, Tanzima Z. Islam, Line Pouchard | 2025-10-21 | Download | Large-scale GPU traces play a critical role in identifying performance bottlenecks within heterogeneous High-Performance Computing (HPC) architectures. |
| EdgeReasoning: Characterizing Reasoning LLM Deployment on Edge GPUs | Benjamin Kubwimana, Qijing Huang | 2025-10-21 | Download | Edge intelligence paradigm is increasingly demanded by the emerging autonomous systems, such as robotics. Beyond ensuring privacy-preserving operation and resilience in connectivity-limited environmen... |
| Distributed Allocation and Resource Scheduling Algorithms Resilient to Link Failure | Mohammadreza Doostmohammadian, Sergio Pequito | 2025-10-21 | Download | Distributed resource allocation (DRA) is fundamental to modern networked systems, spanning applications from economic dispatch in smart grids to CPU scheduling in data centers. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Hazel: Secure and Efficient Disaggregated Storage | Marcin Chrapek, Meni Orenbach, Ahmad Atamli, Marcin Copik, Mikhail Khalilov, Fritz Alder, Torsten Hoefler | 2025-10-21 | Download | Disaggregated storage with NVMe-over-Fabrics (NVMe-oF) has emerged as the standard solution in modern supercomputers and data center clusters, achieving superior performance, resource utilization, and... |
| Formal Methods for Mobile Ad Hoc Networks: A Survey | Wan Fokkink, Rob van Glabbeek | 2025-10-21 | Download | In a mobile ad hoc network (MANET), communication is wireless and nodes can move independently. Properly analyzing the functional correctness, performance, and security of MANET protocols is a challen... |
| Forward to Hell? On the Potentials of Misusing Transparent DNS Forwarders in Reflective Amplification Attacks | Maynard Koch, Florian Dolzmann, Thomas C. Schmidt, Matthias Wählisch | 2025-10-21 | Download | The DNS infrastructure is infamous for facilitating reflective amplification attacks. Various countermeasures such as server shielding, access control, rate limiting, and protocol restrictions have be... |
| JAUNT: Joint Alignment of User Intent and Network State for QoE-centric LLM Tool Routing | Enhan Li, Hongyang Du | 2025-10-21 | Download | Large Language Models (LLMs) increasingly rely on emerging protocols such as the Model Context Protocol (MCP) to invoke external tools and services. |
| On AI Verification in Open RAN | Rahul Soundrarajan, Claudio Fiandrino, Michele Polese, Salvatore D'Oro, Leonardo Bonati, Tommaso Melodia | 2025-10-21 | Download | Open RAN introduces a flexible, cloud-based architecture for the Radio Access Network (RAN), enabling Artificial Intelligence (AI)/Machine Learning (ML)-driven automation across heterogeneous, multi-v... |
| How2Compress: Scalable and Efficient Edge Video Analytics via Adaptive Granular Video Compression | Yuheng Wu, Thanh-Tung Nguyen, Lucas Liebe, Quang Tau, Pablo Espinosa Campos, Jinghan Cheng, Dongman Lee | 2025-10-21 | Download | With the rapid proliferation of the Internet of Things, video analytics has become a cornerstone application in wireless multimedia sensor networks. |
| Censorship Chokepoints: New Battlegrounds for Regional Surveillance, Censorship and Influence on the Internet | Yong Zhang, Nishanth Sastry | 2025-10-21 | Download | Undoubtedly, the Internet has become one of the most important conduits to information for the general public. Nonetheless, Internet access can be and has been limited systematically or blocked comple... |
| Revisiting RFID Missing Tag Identification | Kanghuai Liu, Lin Chen, Jihong Yu, Junyi Huang, Shiyuan Liu | 2025-10-21 | Download | We revisit the problem of missing tag identification in RFID networks by making three contributions. Firstly, we quantitatively compare and gauge the existing propositions spanning over a decade on mi... |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Hazel: Secure and Efficient Disaggregated Storage | Marcin Chrapek, Meni Orenbach, Ahmad Atamli, Marcin Copik, Mikhail Khalilov, Fritz Alder, Torsten Hoefler | 2025-10-21 | Download | Disaggregated storage with NVMe-over-Fabrics (NVMe-oF) has emerged as the standard solution in modern supercomputers and data center clusters, achieving superior performance, resource utilization, and... |
| LatticeHashForest: An Efficient Data Structure for Repetitive Data and Operations | Anamitra Ghorui, Uday P. Khedker | 2025-10-21 | Download | Analysis of entire programs as a single unit, or whole-program analysis, involves propagation of large amounts of information through the control flow of the program. |