Skip to content

2025-09-21

cs.AR - Architecture

标题作者发布日期PDF摘要
SnipSnap: A Joint Compression Format and Dataflow Co-Optimization Framework for Efficient Sparse LLM Accelerator DesignJunyi Wu, Chao Fang, Zhongfeng Wang2025-09-21下载The growing scale of large language models (LLMs) has intensified demands on computation and memory, making efficient inference a key challenge.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
MoA-Off: Adaptive Heterogeneous Modality-Aware Offloading with Edge-Cloud Collaboration for Efficient Multimodal LLM InferenceZheming Yang, Qi Guo, Yunqing Hu, Chang Zhao, Chang Zhang, Jian Zhao, Wen Ji2025-09-21下载Multimodal large language models (MLLMs) enable powerful cross-modal inference but impose significant computational and latency burdens, posing severe challenges for deployment in resource-constrained...
ShadowServe: Interference-Free KV Cache Fetching for Distributed Prefix CachingXingyu Xiang, Raj Joshi, Yuhan Liu, Jiayi Yao, Chenxingyu Zhao, Junchen Jiang, Yang Zhou, Eddie Kohler, Minlan Yu2025-09-21下载Distributed prefix caching accelerates long-context LLM serving by reusing KV cache entries for common context prefixes. However, KV cache fetches can become a bottleneck when network bandwidth is lim...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Task-Oriented Communications for 3D Scene Representation: Balancing Timeliness and FidelityXiangmin Xu, Zhen Meng, Kan Chen, Jiaming Yang, Emma Li, Philip G. Zhao, David Flynn2025-09-21下载Real-time Three-dimensional (3D) scene representation is a foundational element that supports a broad spectrum of cutting-edge applications, including digital manufacturing, Virtual, Augmented, and Mi...
Impact of Packetization on Network Calculus AnalysisYming Jiang2025-09-21下载For packet-switched networks, when the packetization effect is overlooked, network calculus analysis can produce faulty results. To exemplify, network calculus analysis is applied in this paper to two...
System Relaxation for Interpretable and Adaptive Network ControlZhiyuan Ren, Zhiliang Shuai, Wenchi Cheng2025-09-21下载Prevailing network control strategies, which rely on static shortest-path logic, suffer from catastrophic "stress concentration" on critical nodes.
Analysis of an Architecture for Integrated Sensing and Communication in 5G OpenRANDaniel Lindenschmitt, Tobias Jung, Prudhvi Kumar Kakani, Torsten Reissland, Norman Franchi, Hans D. Schotten2025-09-21下载This paper analyzes the functional requirements and architectural considerations for Integrated Sensing and Communication ( ISAC) in a 5G Open Radio Access Network (OpenRAN) environment, with emphasis...
BENNS: A Surrogate Model for Hybrid Online-Offline Evolution of SFC EmbeddingTheviyanthan Krishnamohan, Lauritz Thamsen, Paul Harvey2025-09-21下载Service Function Chains (SFCs) enable programmatic control of the functions and services in a computer network. By leveraging Software Defined Networking to control the links between virtualised netwo...

cs.PF - Performance

标题作者发布日期PDF摘要
Impact of RHIs and ipSIC on Active RIS-NOMA Systems with Low-Precision ADCsQianqian Li, Hua Li, Shiya Hao, Lintao Li, Xiaoming Dai2025-09-21下载This study evaluates the performance of an active reconfigurable intelligent surface (ARIS)-assisted non-orthogonal multiple access (NOMA) system employing low-precision analog-to-digital converters (...
Impact of Packetization on Network Calculus AnalysisYming Jiang2025-09-21下载For packet-switched networks, when the packetization effect is overlooked, network calculus analysis can produce faulty results. To exemplify, network calculus analysis is applied in this paper to two...

基于 VitePress 构建