2026-01-29

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference	Yiren Zhao, Junyi Liu	2026-01-29	Download	AI agent inference is driving an inference-heavy datacenter future and exposes bottlenecks beyond compute, especially memory capacity, memory bandwidth, and high-speed interconnect.
PowerGenie: Analytically-Guided Evolutionary Discovery of Superior Reconfigurable Power Converters	Jian Gao, Yiwei Zou, Abhishek Pradhan, Wenhao Huang, Yumin Su, Kaiyuan Yang, Xuan Zhang	2026-01-29	Download	Discovering superior circuit topologies requires navigating an exponentially large design space, a challenge traditionally reserved for human experts.
Frequency as Aperture: Enabling Embeddable Near-Field Sensing for 6G Wireless Radios	Pin-Han Ho, Limei Peng, Yiming Miao, Xu Fan, Kairan Liang, Haoran Mei, Wei Duan	2026-01-29	Download	Integrated sensing and communication (ISAC) is expected to be natively supported by future 6G wireless radios, yet most mmWave sensing solutions still rely on dedicated radar hardware incompatible wit...
ChipBench: A Next-Step Benchmark for Evaluating LLM Performance in AI-Aided Chip Design	Zhongkai Yu, Chenyang Zhou, Yichen Lin, Hejia Zhang, Haotian Ye, Junxia Cui, Zaifeng Pan, Jishen Zhao, Yufei Ding	2026-01-29	Download	While Large Language Models (LLMs) show significant potential in hardware engineering, current benchmarks suffer from saturation and limited task diversity, failing to reflect LLMs' performance in rea...
FireFly-P: FPGA-Accelerated Spiking Neural Network Plasticity for Robust Adaptive Control	Tenglong Li, Jindong Li, Guobin Shen, Dongcheng Zhao, Qian Zhang, Yi Zeng	2026-01-29	Download	Spiking Neural Networks (SNNs) offer a biologically plausible learning mechanism through synaptic plasticity, enabling unsupervised adaptation without the computational overhead of backpropagation.

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
SAIR: Cost-Efficient Multi-Stage ML Pipeline Autoscaling via In-Context Reinforcement Learning	Jianchang Su, Yifan Zhang, Shengkai Lin, Shizhen Zhao, Yusheng Zheng, Yiwei Yang, Wei Zhang	2026-01-29	Download	Multi-stage ML inference pipelines are difficult to autoscale due to heterogeneous resources, cross-stage coupling, and dynamic bottleneck migration.
Learning Provably Correct Distributed Protocols Without Human Knowledge	Yujie Hui, Xiaoyi Lu, Andrew Perrault, Yang Wang	2026-01-29	Download	Provably correct distributed protocols, which are a critical component of modern distributed systems, are highly challenging to design and have often required decades of human effort.
ZK-HybridFL: Zero-Knowledge Proof-Enhanced Hybrid Ledger for Federated Learning	Amirhossein Taherpour, Xiaodong Wang	2026-01-29	Download	Federated learning (FL) enables collaborative model training while preserving data privacy, yet both centralized and decentralized approaches face challenges in scalability, security, and update valid...
A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine	Anran Li, Yuanyuan Chen, Wenjun Long, Yu Yin, Yan Hu, Hyunjae Kim, Weipeng Zhou, Yujia Zhou, Hongyi Peng, Yang Ren, Xuguang Ai, Zhenyue Qin, Ming Hu, Xiaoxiao Li, Han Yu, Yih-Chung Tham, Lucila Ohno-Machado, Hua Xu, Qingyu Chen	2026-01-29	Download	Large language models (LLMs) have demonstrated strong performance on medical benchmarks, including question answering and diagnosis. To enable their use in clinical settings, LLMs are typically furthe...
Where Do the Joules Go? Diagnosing Inference Energy Consumption	Jae-Won Chung, Ruofan Wu, Jeff J. Ma, Mosharaf Chowdhury	2026-01-29	Download	Energy is now a critical ML computing resource. While measuring energy consumption and observing trends is a valuable first step, accurately understanding and diagnosing why those differences occur is...
FedAdaVR: Adaptive Variance Reduction for Robust Federated Learning under Limited Client Participation	S M Ruhul Kabir Howlader, Xiao Chen, Yifei Xie, Lu Liu	2026-01-29	Download	Federated learning (FL) encounters substantial challenges due to heterogeneity, leading to gradient noise, client drift, and partial client participation errors, the last of which is the most pervasiv...
Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference	Yiren Zhao, Junyi Liu	2026-01-29	Download	AI agent inference is driving an inference-heavy datacenter future and exposes bottlenecks beyond compute, especially memory capacity, memory bandwidth, and high-speed interconnect.
Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic	Shuo Liu, Tianle Chen, Ryan Amiri, Christopher Amato	2026-01-29	Download	Recent work has explored optimizing LLM collaboration through Multi-Agent Reinforcement Learning (MARL). However, most MARL fine-tuning approaches rely on predefined execution protocols, which often r...
Belief Propagation Converges to Gaussian Distributions in Sparsely-Connected Factor Graphs	Tom Yates, Yuzhou Cheng, Ignacio Alzugaray, Danyal Akarca, Pedro A. M. Mediano, Andrew J. Davison	2026-01-29	Download	Belief Propagation (BP) is a powerful algorithm for distributed inference in probabilistic graphical models; however, it quickly becomes infeasible for practical compute and memory budgets.
Self-Adaptive Probabilistic Skyline Query Processing in Distributed Edge Computing via Deep Reinforcement Learning	Chuan-Chi Lai	2026-01-29	Download	In the era of the Internet of Everything (IoE), the exponential growth of sensor-generated data at the network edge renders efficient Probabilistic Skyline Query (PSKY) processing a critical challenge...
DASH: Deterministic Attention Scheduling for High-throughput Reproducible LLM Training	Xinwei Qiang, Hongmin Chen, Shixuan Sun, Jingwen Leng, Xin Liu, Minyi Guo	2026-01-29	Download	Determinism is indispensable for reproducibility in large language model (LLM) training, yet it often exacts a steep performance cost. In widely used attention implementations such as FlashAttention-3...
EWSJF: An Adaptive Scheduler with Hybrid Partitioning for Mixed-Workload LLM Inference	Bronislav Sidik, Chaya Levi, Joseph Kampeas	2026-01-29	Download	Serving Large Language Models (LLMs) under mixed workloads (short, latency-sensitive interactive queries alongside long, throughput-oriented batch requests) poses a fundamental scheduling challenge.
bigMICE: Multiple Imputation of Big Data	Hugo Morvan, Jonas Agholme, Bjorn Eliasson, Katarina Olofsson, Ludger Grote, Fredrik Iredahl, Oleg Sysoev	2026-01-29	Download	Missing data is a prevalent issue in many applications, including large medical registries such as the Swedish Healthcare Quality Registries, potentially leading to biased or inefficient analyses if n...
ScaleSim: Serving Large-Scale Multi-Agent Simulation with Invocation Distance-Based Memory Management	Zaifeng Pan, Yipeng Shen, Zhengding Hu, Zhuang Wang, Aninda Manocha, Zheng Wang, Zhongkai Yu, Yue Guan, Yufei Ding	2026-01-29	Download	LLM-based multi-agent simulations are increasingly adopted across application domains, but remain difficult to scale due to GPU memory pressure.
Nimbus: A Unified Embodied Synthetic Data Generation Framework	Zeyu He, Yuchang Zhang, Yuanzhen Zhou, Miao Tao, Hengjie Li, Hui Wang, Yang Tian, Jia Zeng, Tai Wang, Wenzhe Cai, Yilun Chen, Ning Gao, Jiangmiao Pang	2026-01-29	Download	Scaling data volume and diversity is critical for generalizing embodied intelligence. While synthetic data generation offers a scalable alternative to expensive physical data acquisition, existing pip...
Ira: Efficient Transaction Replay for Distributed Systems	Adithya Bhat, Harshal Bhadreshkumar Shah, Mohsen Minaei	2026-01-29	Download	In primary-backup replication, consensus latency is bounded by the time for backup nodes to replay (re-execute) transactions proposed by the primary.
ZipMoE: Efficient On-Device MoE Serving via Lossless Compression and Cache-Affinity Scheduling	Yuchen Yang, Yaru Zhao, Pu Yang, Shaowei Wang, Zhi-Hua Zhou	2026-01-29	Download	While Mixture-of-Experts (MoE) architectures substantially bolster the expressive power of large-language models, their prohibitive memory footprint severely impedes the practical deployment on resour...
Maxwait: A Generalized Mechanism for Distributed Time-Sensitive Systems	Francesco Paladino, Shulu Li, Edward A. Lee	2026-01-29	Download	Distributed time-sensitive systems must balance timing requirements (availability) and consistency in the presence of communication delays and synchronization uncertainty.

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Privacy-Preserving Sensor-Based Human Activity Recognition for Low-Resource Healthcare Using Classical Machine Learning	Ramakant Kumar, Pravin Kumar	2026-01-29	Download	Limited access to medical infrastructure forces elderly and vulnerable patients to rely on home-based care, often leading to neglect and poor adherence to therapeutic exercises such as yoga or physiot...
Securing SIM-Assisted Wireless Networks via Quantum Reinforcement Learning	Le-Hung Hoang, Quang-Trung Luu, Dinh Thai Hoang, Diep N. Nguyen, Van-Dinh Nguyen	2026-01-29	Download	Stacked intelligent metasurfaces (SIMs) have recently emerged as a powerful wave-domain technology that enables multi-stage manipulation of electromagnetic signals through multilayer programmable arch...
SIA: Symbolic Interpretability for Anticipatory Deep Reinforcement Learning in Network Control	MohammadErfan Jabbari, Abhishek Duttagupta, Claudio Fiandrino, Leonardo Bonati, Salvatore D'Oro, Michele Polese, Marco Fiore, Tommaso Melodia	2026-01-29	Download	Deep reinforcement learning (DRL) promises adaptive control for future mobile networks but conventional agents remain reactive: they act on past and current measurements and cannot leverage short-term...
SymbXRL: Symbolic Explainable Deep Reinforcement Learning for Mobile Networks	Abhishek Duttagupta, MohammadErfan Jabbari, Claudio Fiandrino, Marco Fiore, Joerg Widmer	2026-01-29	Download	The operation of future 6th-generation (6G) mobile networks will increasingly rely on the ability of deep reinforcement learning (DRL) to optimize network decisions in real-time.
Spatiotemporal Continual Learning for Mobile Edge UAV Networks: Mitigating Catastrophic Forgetting	Chuan-Chi Lai	2026-01-29	Download	This paper addresses catastrophic forgetting in mobile edge UAV networks within dynamic spatiotemporal environments. Conventional deep reinforcement learning often fails during task transitions, neces...
Self-Adaptive Probabilistic Skyline Query Processing in Distributed Edge Computing via Deep Reinforcement Learning	Chuan-Chi Lai	2026-01-29	Download	In the era of the Internet of Everything (IoE), the exponential growth of sensor-generated data at the network edge renders efficient Probabilistic Skyline Query (PSKY) processing a critical challenge...
Age Aware Content Fetching and Broadcast in a Sensing-as-a-Service System	Ankita Koley, Anu Krishna, Chandramani Singh, V Mahendran	2026-01-29	Download	We consider a Sensing-as-a-Service (S2aaS) system consisting of a sensor, a set of users, and a sensor cloud service provider (SCSP). The sensor updates its content each time it captures a new measure...
Authenticated encryption for space telemetry	Andrew Savchenko	2026-01-29	Download	We explore how command stack protection requirements outlined in NASA-STD-1006A can be satisfied within the context of emergency space telemetry.
KubeSpace: A Low-Latency and Stable Control Plane for LEO Satellite Container Orchestration	Zhiyuan Zhao, Jiasheng Wu, Shaojie Su, Wenjun Zhu, Yue Gao	2026-01-29	Download	Low Earth orbit (LEO) satellites play a pivotal role in global connectivity, delivering high-speed Internet, cellular coverage, and massive IoT support.
ViTMAlis: Towards Latency-Critical Mobile Video Analytics with Vision Transformers	Miao Zhang, Guanzhen Wu, Hao Fang, Yifei Zhu, Fangxin Wang, Ruixiao Zhang, Jiangchuan Liu	2026-01-29	Download	Edge-assisted mobile video analytics (MVA) applications are increasingly shifting from using vision models based on convolutional neural networks (CNNs) to those built on vision transformers (ViTs) to...