2025-08-17

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Accelerating LLM Inference via Dynamic KV Cache Placement in Heterogeneous Memory System	Yunhua Fang, Rui Xie, Asad Ul Haq, Linsen Ma, Kaoutar El Maghraoui, Naigang Wang, Meng Wang, Liu Liu, Tong Zhang	2025-08-17	下载	Large Language Model (LLM) inference is increasingly constrained by memory bandwidth, with frequent access to the key-value (KV) cache dominating data movement.
ATLAS: A Self-Supervised and Cross-Stage Netlist Power Model for Fine-Grained Time-Based Layout Power Analysis	Wenkai Li, Yao Lu, Wenji Fang, Jing Wang, Qijun Zhang, Zhiyao Xie	2025-08-17	下载	Accurate power prediction in VLSI design is crucial for effective power optimization, especially as designs get transformed from gate-level netlist to layout stages.
An ECC-based Fault Tolerance Approach for DNNs	Mohsen Raji, Mohammad Zaree, Kimia Soroush	2025-08-17	下载	Deep Neural Network (DNN) has achieve great success in solving a wide range of machine learning problems. Recently, they have been deployed in datacenters (potentially for business-critical or industr...
Soft Error Probability Estimation of Nano-scale Combinational Circuits	Ali Jockar, Mohsen Raji	2025-08-17	下载	As technology scales, nano-scale digital circuits face heightened susceptibility to single event upsets (SEUs) and transients (SETs) due to shrinking feature sizes and reduced operating voltages.
AutoPower: Automated Few-Shot Architecture-Level Power Modeling by Power Group Decoupling	Qijun Zhang, Yao Lu, Mengming Li, Zhiyao Xie	2025-08-17	下载	Power efficiency is a critical design objective in modern CPU design. Architects need a fast yet accurate architecture-level power evaluation tool to perform early-stage power estimation.
TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform	Jun Liu, Zhenglun Kong, Pu Zhao, Weihao Zeng, Hao Tang, Xuan Shen, Changdi Yang, Wenbin Zhang, Geng Yuan, Wei Niu, Xue Lin, Yanzhi Wang	2025-08-17	下载	Autonomous driving platforms encounter diverse driving scenarios, each with varying hardware resources and precision requirements. Given the computational limitations of embedded devices, it is crucia...
A Time- and Energy-Efficient CNN with Dense Connections on Memristor-Based Chips	Wenyong Zhou, Yuan Ren, Jiajun Zhou, Tianshu Hou, Ngai Wong	2025-08-17	下载	Designing lightweight convolutional neural network (CNN) models is an active research area in edge AI. Compute-in-memory (CIM) provides a new computing paradigm to alleviate time and energy consumptio...
Special Session: Sustainable Deployment of Deep Neural Networks on Non-Volatile Compute-in-Memory Accelerators	Yifan Qin, Zheyu Yan, Wujie Wen, Xiaobo Sharon Hu, Yiyu Shi	2025-08-17	下载	Non-volatile memory (NVM) based compute-in-memory (CIM) accelerators have emerged as a sustainable solution to significantly boost energy efficiency and minimize latency for Deep Neural Networks (DNNs...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
GPU Acceleration for Faster Evolutionary Spatial Cyclic Game Systems	Louie Sinadjan	2025-08-17	下载	This dissertation presents the design, implementation and evaluation of GPU-accelerated simulation frameworks for Evolutionary Spatial Cyclic Games (ESCGs), a class of agent-based models used to study...
Breaking the Aggregation Bottleneck in Federated Recommendation: A Personalized Model Merging Approach	Jundong Chen, Honglei Zhang, Chunxu Zhang, Fangyuan Luo, Yidong Li	2025-08-17	下载	Federated recommendation (FR) facilitates collaborative training by aggregating local models from massive devices, enabling client-specific personalization while ensuring privacy.
A Large-Scale Web Search Dataset for Federated Online Learning to Rank	Marcel Gregoriadis, Jingwei Kang, Johan Pouwelse	2025-08-17	下载	The centralized collection of search interaction logs for training ranking models raises significant privacy concerns. Federated Online Learning to Rank (FOLTR) offers a privacy-preserving alternative...
Proceedings 18th Interaction and Concurrency Experience	Clément Aubert, Cinzia Di Giusto, Simon Fowler, Violet Ka I Pun	2025-08-17	下载	This volume contains the proceedings of ICE'25, the 18th Interaction and Concurrency Experience, which was held on Friday 20th June 2025 at the École National Supérieure des Arts et Métiers in Lille, ...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
ChamaleoNet: Programmable Passive Probe for Enhanced Visibility on Erroneous Traffic	Zhihao Wang, Alessandro Cornacchia, Andrea Bianco, Idilio Drago, Paolo Giaccone, Dingde Jiang, Marco Mellia	2025-08-17	下载	Traffic visibility remains a key component for management and security operations. Observing unsolicited and erroneous traffic, such as unanswered traffic or errors, is fundamental to detect misconfig...
Cold-RL: Learning Cache Eviction with Offline Reinforcement Learning for NGINX	Aayush Gupta, Arpit Bhayani	2025-08-17	下载	Web proxies such as NGINX commonly rely on least-recently-used (LRU) eviction, which is size agnostic and can thrash under periodic bursts and mixed object sizes.
Straggler-Resilient Federated Learning over A Hybrid Conventional and Pinching Antenna Network	Bibo Wu, Fang Fang, Ming Zeng, Xianbin Wang	2025-08-17	下载	Leveraging pinching antennas in wireless network enabled federated learning (FL) can effectively mitigate the common "straggler" issue in FL by dynamically establishing strong line-of-sight (LoS) link...
Agent Communications toward Agentic AI at Edge -- A Case Study of the Agent2Agent Protocol	Qiang Duan, Zhihui Lu	2025-08-17	下载	The current evolution of artificial intelligence introduces a paradigm shift toward agentic AI built upon multi-agent systems (MAS). Agent communications serve as a key to effective agent interactions...
Better Together: Leveraging Multiple Digital Twins for Deployment Optimization of Airborne Base Stations	Mauro Belgiovine, Chris Dick, Kaushik Chowdhury	2025-08-17	下载	Airborne Base Stations (ABSs) allow for flexible geographical allocation of network resources with dynamically changing load as well as rapid deployment of alternate connectivity solutions during natu...
ATLAS: AI-Native Receiver Test-and-Measurement by Leveraging AI-Guided Search	Mauro Belgiovine, Suyash Pradhan, Johannes Lange, Michael Löhning, Kaushik Chowdhury	2025-08-17	下载	Industry adoption of Artificial Intelligence (AI)-native wireless receivers, or even modular, Machine Learning (ML)-aided wireless signal processing blocks, has been slow.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Accelerating LLM Inference via Dynamic KV Cache Placement in Heterogeneous Memory System	Yunhua Fang, Rui Xie, Asad Ul Haq, Linsen Ma, Kaoutar El Maghraoui, Naigang Wang, Meng Wang, Liu Liu, Tong Zhang	2025-08-17	下载	Large Language Model (LLM) inference is increasingly constrained by memory bandwidth, with frequent access to the key-value (KV) cache dominating data movement.