
2025-08-17

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Accelerating LLM Inference via Dynamic KV Cache Placement in Heterogeneous Memory System | Yunhua Fang, Rui Xie, Asad Ul Haq, Linsen Ma, Kaoutar El Maghraoui, Naigang Wang, Meng Wang, Liu Liu, Tong Zhang | 2025-08-17 | Download | Large Language Model (LLM) inference is increasingly constrained by memory bandwidth, with frequent access to the key-value (KV) cache dominating data movement. |
| ATLAS: A Self-Supervised and Cross-Stage Netlist Power Model for Fine-Grained Time-Based Layout Power Analysis | Wenkai Li, Yao Lu, Wenji Fang, Jing Wang, Qijun Zhang, Zhiyao Xie | 2025-08-17 | Download | Accurate power prediction in VLSI design is crucial for effective power optimization, especially as designs get transformed from gate-level netlist to layout stages. |
| An ECC-based Fault Tolerance Approach for DNNs | Mohsen Raji, Mohammad Zaree, Kimia Soroush | 2025-08-17 | Download | Deep Neural Networks (DNNs) have achieved great success in solving a wide range of machine learning problems. Recently, they have been deployed in datacenters (potentially for business-critical or industr... |
| Soft Error Probability Estimation of Nano-scale Combinational Circuits | Ali Jockar, Mohsen Raji | 2025-08-17 | Download | As technology scales, nano-scale digital circuits face heightened susceptibility to single event upsets (SEUs) and transients (SETs) due to shrinking feature sizes and reduced operating voltages. |
| AutoPower: Automated Few-Shot Architecture-Level Power Modeling by Power Group Decoupling | Qijun Zhang, Yao Lu, Mengming Li, Zhiyao Xie | 2025-08-17 | Download | Power efficiency is a critical design objective in modern CPU design. Architects need a fast yet accurate architecture-level power evaluation tool to perform early-stage power estimation. |
| TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform | Jun Liu, Zhenglun Kong, Pu Zhao, Weihao Zeng, Hao Tang, Xuan Shen, Changdi Yang, Wenbin Zhang, Geng Yuan, Wei Niu, Xue Lin, Yanzhi Wang | 2025-08-17 | Download | Autonomous driving platforms encounter diverse driving scenarios, each with varying hardware resources and precision requirements. Given the computational limitations of embedded devices, it is crucia... |
| A Time- and Energy-Efficient CNN with Dense Connections on Memristor-Based Chips | Wenyong Zhou, Yuan Ren, Jiajun Zhou, Tianshu Hou, Ngai Wong | 2025-08-17 | Download | Designing lightweight convolutional neural network (CNN) models is an active research area in edge AI. Compute-in-memory (CIM) provides a new computing paradigm to alleviate time and energy consumptio... |
| Special Session: Sustainable Deployment of Deep Neural Networks on Non-Volatile Compute-in-Memory Accelerators | Yifan Qin, Zheyu Yan, Wujie Wen, Xiaobo Sharon Hu, Yiyu Shi | 2025-08-17 | Download | Non-volatile memory (NVM) based compute-in-memory (CIM) accelerators have emerged as a sustainable solution to significantly boost energy efficiency and minimize latency for Deep Neural Networks (DNNs... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| GPU Acceleration for Faster Evolutionary Spatial Cyclic Game Systems | Louie Sinadjan | 2025-08-17 | Download | This dissertation presents the design, implementation and evaluation of GPU-accelerated simulation frameworks for Evolutionary Spatial Cyclic Games (ESCGs), a class of agent-based models used to study... |
| Breaking the Aggregation Bottleneck in Federated Recommendation: A Personalized Model Merging Approach | Jundong Chen, Honglei Zhang, Chunxu Zhang, Fangyuan Luo, Yidong Li | 2025-08-17 | Download | Federated recommendation (FR) facilitates collaborative training by aggregating local models from massive devices, enabling client-specific personalization while ensuring privacy. |
| A Large-Scale Web Search Dataset for Federated Online Learning to Rank | Marcel Gregoriadis, Jingwei Kang, Johan Pouwelse | 2025-08-17 | Download | The centralized collection of search interaction logs for training ranking models raises significant privacy concerns. Federated Online Learning to Rank (FOLTR) offers a privacy-preserving alternative... |
| Proceedings 18th Interaction and Concurrency Experience | Clément Aubert, Cinzia Di Giusto, Simon Fowler, Violet Ka I Pun | 2025-08-17 | Download | This volume contains the proceedings of ICE'25, the 18th Interaction and Concurrency Experience, which was held on Friday 20th June 2025 at the École Nationale Supérieure des Arts et Métiers in Lille, ... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| ChamaleoNet: Programmable Passive Probe for Enhanced Visibility on Erroneous Traffic | Zhihao Wang, Alessandro Cornacchia, Andrea Bianco, Idilio Drago, Paolo Giaccone, Dingde Jiang, Marco Mellia | 2025-08-17 | Download | Traffic visibility remains a key component for management and security operations. Observing unsolicited and erroneous traffic, such as unanswered traffic or errors, is fundamental to detect misconfig... |
| Cold-RL: Learning Cache Eviction with Offline Reinforcement Learning for NGINX | Aayush Gupta, Arpit Bhayani | 2025-08-17 | Download | Web proxies such as NGINX commonly rely on least-recently-used (LRU) eviction, which is size agnostic and can thrash under periodic bursts and mixed object sizes. |
| Straggler-Resilient Federated Learning over A Hybrid Conventional and Pinching Antenna Network | Bibo Wu, Fang Fang, Ming Zeng, Xianbin Wang | 2025-08-17 | Download | Leveraging pinching antennas in wireless network enabled federated learning (FL) can effectively mitigate the common "straggler" issue in FL by dynamically establishing strong line-of-sight (LoS) link... |
| Agent Communications toward Agentic AI at Edge -- A Case Study of the Agent2Agent Protocol | Qiang Duan, Zhihui Lu | 2025-08-17 | Download | The current evolution of artificial intelligence introduces a paradigm shift toward agentic AI built upon multi-agent systems (MAS). Agent communications serve as a key to effective agent interactions... |
| Better Together: Leveraging Multiple Digital Twins for Deployment Optimization of Airborne Base Stations | Mauro Belgiovine, Chris Dick, Kaushik Chowdhury | 2025-08-17 | Download | Airborne Base Stations (ABSs) allow for flexible geographical allocation of network resources with dynamically changing load as well as rapid deployment of alternate connectivity solutions during natu... |
| ATLAS: AI-Native Receiver Test-and-Measurement by Leveraging AI-Guided Search | Mauro Belgiovine, Suyash Pradhan, Johannes Lange, Michael Löhning, Kaushik Chowdhury | 2025-08-17 | Download | Industry adoption of Artificial Intelligence (AI)-native wireless receivers, or even modular, Machine Learning (ML)-aided wireless signal processing blocks, has been slow. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Accelerating LLM Inference via Dynamic KV Cache Placement in Heterogeneous Memory System | Yunhua Fang, Rui Xie, Asad Ul Haq, Linsen Ma, Kaoutar El Maghraoui, Naigang Wang, Meng Wang, Liu Liu, Tong Zhang | 2025-08-17 | Download | Large Language Model (LLM) inference is increasingly constrained by memory bandwidth, with frequent access to the key-value (KV) cache dominating data movement. |
