Appearance
2025-08-17
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Accelerating LLM Inference via Dynamic KV Cache Placement in Heterogeneous Memory System | Yunhua Fang, Rui Xie, Asad Ul Haq, Linsen Ma, Kaoutar El Maghraoui, Naigang Wang, Meng Wang, Liu Liu, Tong Zhang | 2025-08-17 | 下载 | Large Language Model (LLM) inference is increasingly constrained by memory bandwidth, with frequent access to the key-value (KV) cache dominating data movement. |
| ATLAS: A Self-Supervised and Cross-Stage Netlist Power Model for Fine-Grained Time-Based Layout Power Analysis | Wenkai Li, Yao Lu, Wenji Fang, Jing Wang, Qijun Zhang, Zhiyao Xie | 2025-08-17 | 下载 | Accurate power prediction in VLSI design is crucial for effective power optimization, especially as designs get transformed from gate-level netlist to layout stages. |
| An ECC-based Fault Tolerance Approach for DNNs | Mohsen Raji, Mohammad Zaree, Kimia Soroush | 2025-08-17 | 下载 | Deep Neural Network (DNN) has achieve great success in solving a wide range of machine learning problems. Recently, they have been deployed in datacenters (potentially for business-critical or industr... |
| Soft Error Probability Estimation of Nano-scale Combinational Circuits | Ali Jockar, Mohsen Raji | 2025-08-17 | 下载 | As technology scales, nano-scale digital circuits face heightened susceptibility to single event upsets (SEUs) and transients (SETs) due to shrinking feature sizes and reduced operating voltages. |
| AutoPower: Automated Few-Shot Architecture-Level Power Modeling by Power Group Decoupling | Qijun Zhang, Yao Lu, Mengming Li, Zhiyao Xie | 2025-08-17 | 下载 | Power efficiency is a critical design objective in modern CPU design. Architects need a fast yet accurate architecture-level power evaluation tool to perform early-stage power estimation. |
| TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform | Jun Liu, Zhenglun Kong, Pu Zhao, Weihao Zeng, Hao Tang, Xuan Shen, Changdi Yang, Wenbin Zhang, Geng Yuan, Wei Niu, Xue Lin, Yanzhi Wang | 2025-08-17 | 下载 | Autonomous driving platforms encounter diverse driving scenarios, each with varying hardware resources and precision requirements. Given the computational limitations of embedded devices, it is crucia... |
| A Time- and Energy-Efficient CNN with Dense Connections on Memristor-Based Chips | Wenyong Zhou, Yuan Ren, Jiajun Zhou, Tianshu Hou, Ngai Wong | 2025-08-17 | 下载 | Designing lightweight convolutional neural network (CNN) models is an active research area in edge AI. Compute-in-memory (CIM) provides a new computing paradigm to alleviate time and energy consumptio... |
| Special Session: Sustainable Deployment of Deep Neural Networks on Non-Volatile Compute-in-Memory Accelerators | Yifan Qin, Zheyu Yan, Wujie Wen, Xiaobo Sharon Hu, Yiyu Shi | 2025-08-17 | 下载 | Non-volatile memory (NVM) based compute-in-memory (CIM) accelerators have emerged as a sustainable solution to significantly boost energy efficiency and minimize latency for Deep Neural Networks (DNNs... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| GPU Acceleration for Faster Evolutionary Spatial Cyclic Game Systems | Louie Sinadjan | 2025-08-17 | 下载 | This dissertation presents the design, implementation and evaluation of GPU-accelerated simulation frameworks for Evolutionary Spatial Cyclic Games (ESCGs), a class of agent-based models used to study... |
| Breaking the Aggregation Bottleneck in Federated Recommendation: A Personalized Model Merging Approach | Jundong Chen, Honglei Zhang, Chunxu Zhang, Fangyuan Luo, Yidong Li | 2025-08-17 | 下载 | Federated recommendation (FR) facilitates collaborative training by aggregating local models from massive devices, enabling client-specific personalization while ensuring privacy. |
| A Large-Scale Web Search Dataset for Federated Online Learning to Rank | Marcel Gregoriadis, Jingwei Kang, Johan Pouwelse | 2025-08-17 | 下载 | The centralized collection of search interaction logs for training ranking models raises significant privacy concerns. Federated Online Learning to Rank (FOLTR) offers a privacy-preserving alternative... |
| Proceedings 18th Interaction and Concurrency Experience | Clément Aubert, Cinzia Di Giusto, Simon Fowler, Violet Ka I Pun | 2025-08-17 | 下载 | This volume contains the proceedings of ICE'25, the 18th Interaction and Concurrency Experience, which was held on Friday 20th June 2025 at the École National Supérieure des Arts et Métiers in Lille, ... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| ChamaleoNet: Programmable Passive Probe for Enhanced Visibility on Erroneous Traffic | Zhihao Wang, Alessandro Cornacchia, Andrea Bianco, Idilio Drago, Paolo Giaccone, Dingde Jiang, Marco Mellia | 2025-08-17 | 下载 | Traffic visibility remains a key component for management and security operations. Observing unsolicited and erroneous traffic, such as unanswered traffic or errors, is fundamental to detect misconfig... |
| Cold-RL: Learning Cache Eviction with Offline Reinforcement Learning for NGINX | Aayush Gupta, Arpit Bhayani | 2025-08-17 | 下载 | Web proxies such as NGINX commonly rely on least-recently-used (LRU) eviction, which is size agnostic and can thrash under periodic bursts and mixed object sizes. |
| Straggler-Resilient Federated Learning over A Hybrid Conventional and Pinching Antenna Network | Bibo Wu, Fang Fang, Ming Zeng, Xianbin Wang | 2025-08-17 | 下载 | Leveraging pinching antennas in wireless network enabled federated learning (FL) can effectively mitigate the common "straggler" issue in FL by dynamically establishing strong line-of-sight (LoS) link... |
| Agent Communications toward Agentic AI at Edge -- A Case Study of the Agent2Agent Protocol | Qiang Duan, Zhihui Lu | 2025-08-17 | 下载 | The current evolution of artificial intelligence introduces a paradigm shift toward agentic AI built upon multi-agent systems (MAS). Agent communications serve as a key to effective agent interactions... |
| Better Together: Leveraging Multiple Digital Twins for Deployment Optimization of Airborne Base Stations | Mauro Belgiovine, Chris Dick, Kaushik Chowdhury | 2025-08-17 | 下载 | Airborne Base Stations (ABSs) allow for flexible geographical allocation of network resources with dynamically changing load as well as rapid deployment of alternate connectivity solutions during natu... |
| ATLAS: AI-Native Receiver Test-and-Measurement by Leveraging AI-Guided Search | Mauro Belgiovine, Suyash Pradhan, Johannes Lange, Michael Löhning, Kaushik Chowdhury | 2025-08-17 | 下载 | Industry adoption of Artificial Intelligence (AI)-native wireless receivers, or even modular, Machine Learning (ML)-aided wireless signal processing blocks, has been slow. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Accelerating LLM Inference via Dynamic KV Cache Placement in Heterogeneous Memory System | Yunhua Fang, Rui Xie, Asad Ul Haq, Linsen Ma, Kaoutar El Maghraoui, Naigang Wang, Meng Wang, Liu Liu, Tong Zhang | 2025-08-17 | 下载 | Large Language Model (LLM) inference is increasingly constrained by memory bandwidth, with frequent access to the key-value (KV) cache dominating data movement. |