2025-09-21
cs.AR - Hardware Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| SnipSnap: A Joint Compression Format and Dataflow Co-Optimization Framework for Efficient Sparse LLM Accelerator Design | Junyi Wu, Chao Fang, Zhongfeng Wang | 2025-09-21 | Download | The growing scale of large language models (LLMs) has intensified demands on computation and memory, making efficient inference a key challenge. |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| MoA-Off: Adaptive Heterogeneous Modality-Aware Offloading with Edge-Cloud Collaboration for Efficient Multimodal LLM Inference | Zheming Yang, Qi Guo, Yunqing Hu, Chang Zhao, Chang Zhang, Jian Zhao, Wen Ji | 2025-09-21 | Download | Multimodal large language models (MLLMs) enable powerful cross-modal inference but impose significant computational and latency burdens, posing severe challenges for deployment in resource-constrained... |
| ShadowServe: Interference-Free KV Cache Fetching for Distributed Prefix Caching | Xingyu Xiang, Raj Joshi, Yuhan Liu, Jiayi Yao, Chenxingyu Zhao, Junchen Jiang, Yang Zhou, Eddie Kohler, Minlan Yu | 2025-09-21 | Download | Distributed prefix caching accelerates long-context LLM serving by reusing KV cache entries for common context prefixes. However, KV cache fetches can become a bottleneck when network bandwidth is lim... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Task-Oriented Communications for 3D Scene Representation: Balancing Timeliness and Fidelity | Xiangmin Xu, Zhen Meng, Kan Chen, Jiaming Yang, Emma Li, Philip G. Zhao, David Flynn | 2025-09-21 | Download | Real-time Three-dimensional (3D) scene representation is a foundational element that supports a broad spectrum of cutting-edge applications, including digital manufacturing, Virtual, Augmented, and Mi... |
| Impact of Packetization on Network Calculus Analysis | Yming Jiang | 2025-09-21 | Download | For packet-switched networks, when the packetization effect is overlooked, network calculus analysis can produce faulty results. To exemplify, network calculus analysis is applied in this paper to two... |
| System Relaxation for Interpretable and Adaptive Network Control | Zhiyuan Ren, Zhiliang Shuai, Wenchi Cheng | 2025-09-21 | Download | Prevailing network control strategies, which rely on static shortest-path logic, suffer from catastrophic "stress concentration" on critical nodes. |
| Analysis of an Architecture for Integrated Sensing and Communication in 5G OpenRAN | Daniel Lindenschmitt, Tobias Jung, Prudhvi Kumar Kakani, Torsten Reissland, Norman Franchi, Hans D. Schotten | 2025-09-21 | Download | This paper analyzes the functional requirements and architectural considerations for Integrated Sensing and Communication (ISAC) in a 5G Open Radio Access Network (OpenRAN) environment, with emphasis... |
| BENNS: A Surrogate Model for Hybrid Online-Offline Evolution of SFC Embedding | Theviyanthan Krishnamohan, Lauritz Thamsen, Paul Harvey | 2025-09-21 | Download | Service Function Chains (SFCs) enable programmatic control of the functions and services in a computer network. By leveraging Software Defined Networking to control the links between virtualised netwo... |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Impact of RHIs and ipSIC on Active RIS-NOMA Systems with Low-Precision ADCs | Qianqian Li, Hua Li, Shiya Hao, Lintao Li, Xiaoming Dai | 2025-09-21 | Download | This study evaluates the performance of an active reconfigurable intelligent surface (ARIS)-assisted non-orthogonal multiple access (NOMA) system employing low-precision analog-to-digital converters (... |
| Impact of Packetization on Network Calculus Analysis | Yming Jiang | 2025-09-21 | Download | For packet-switched networks, when the packetization effect is overlooked, network calculus analysis can produce faulty results. To exemplify, network calculus analysis is applied in this paper to two... |