Appearance
2026-01-29
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference | Yiren Zhao, Junyi Liu | 2026-01-29 | 下载 | AI agent inference is driving an inference heavy datacenter future and exposes bottlenecks beyond compute - especially memory capacity, memory bandwidth and high-speed interconnect. |
| PowerGenie: Analytically-Guided Evolutionary Discovery of Superior Reconfigurable Power Converters | Jian Gao, Yiwei Zou, Abhishek Pradhan, Wenhao Huang, Yumin Su, Kaiyuan Yang, Xuan Zhang | 2026-01-29 | 下载 | Discovering superior circuit topologies requires navigating an exponentially large design space-a challenge traditionally reserved for human experts. |
| Frequency as Aperture: Enabling Embeddable Near-Field Sensing for 6G Wireless Radios | Pin-Han Ho, Limei Peng, Yiming Miao, Xu Fan, Kairan Liang, Haoran Mei, Wei Duan | 2026-01-29 | 下载 | Integrated sensing and communication (ISAC) is expected to be natively supported by future 6G wireless radios, yet most mmWave sensing solutions still rely on dedicated radar hardware incompatible wit... |
| ChipBench: A Next-Step Benchmark for Evaluating LLM Performance in AI-Aided Chip Design | Zhongkai Yu, Chenyang Zhou, Yichen Lin, Hejia Zhang, Haotian Ye, Junxia Cui, Zaifeng Pan, Jishen Zhao, Yufei Ding | 2026-01-29 | 下载 | While Large Language Models (LLMs) show significant potential in hardware engineering, current benchmarks suffer from saturation and limited task diversity, failing to reflect LLMs' performance in rea... |
| FireFly-P: FPGA-Accelerated Spiking Neural Network Plasticity for Robust Adaptive Control | Tenglong Li, Jindong Li, Guobin Shen, Dongcheng Zhao, Qian Zhang, Yi Zeng | 2026-01-29 | 下载 | Spiking Neural Networks (SNNs) offer a biologically plausible learning mechanism through synaptic plasticity, enabling unsupervised adaptation without the computational overhead of backpropagation. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| SAIR: Cost-Efficient Multi-Stage ML Pipeline Autoscaling via In-Context Reinforcement Learning | Jianchang Su, Yifan Zhang, Shengkai Lin, Shizhen Zhao, Yusheng Zheng, Yiwei Yang, Wei Zhang | 2026-01-29 | 下载 | Multi-stage ML inference pipelines are difficult to autoscale due to heterogeneous resources, cross-stage coupling, and dynamic bottleneck migration. |
| Learning Provably Correct Distributed Protocols Without Human Knowledge | Yujie Hui, Xiaoyi Lu, Andrew Perrault, Yang Wang | 2026-01-29 | 下载 | Provably correct distributed protocols, which are a critical component of modern distributed systems, are highly challenging to design and have often required decades of human effort. |
| ZK-HybridFL: Zero-Knowledge Proof-Enhanced Hybrid Ledger for Federated Learning | Amirhossein Taherpour, Xiaodong Wang | 2026-01-29 | 下载 | Federated learning (FL) enables collaborative model training while preserving data privacy, yet both centralized and decentralized approaches face challenges in scalability, security, and update valid... |
| A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine | Anran Li, Yuanyuan Chen, Wenjun Long, Yu Yin, Yan Hu, Hyunjae Kim, Weipeng Zhou, Yujia Zhou, Hongyi Peng, Yang Ren, Xuguang Ai, Zhenyue Qin, Ming Hu, Xiaoxiao Li, Han Yu, Yih-Chung Tham, Lucila Ohno-Machado, Hua Xu, Qingyu Chen | 2026-01-29 | 下载 | Large language models (LLMs) have demonstrated strong performance on medical benchmarks, including question answering and diagnosis. To enable their use in clinical settings, LLMs are typically furthe... |
| Where Do the Joules Go? Diagnosing Inference Energy Consumption | Jae-Won Chung, Ruofan Wu, Jeff J. Ma, Mosharaf Chowdhury | 2026-01-29 | 下载 | Energy is now a critical ML computing resource. While measuring energy consumption and observing trends is a valuable first step, accurately understanding and diagnosing why those differences occur is... |
| FedAdaVR: Adaptive Variance Reduction for Robust Federated Learning under Limited Client Participation | S M Ruhul Kabir Howlader, Xiao Chen, Yifei Xie, Lu Liu | 2026-01-29 | 下载 | Federated learning (FL) encounters substantial challenges due to heterogeneity, leading to gradient noise, client drift, and partial client participation errors, the last of which is the most pervasiv... |
| Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference | Yiren Zhao, Junyi Liu | 2026-01-29 | 下载 | AI agent inference is driving an inference heavy datacenter future and exposes bottlenecks beyond compute - especially memory capacity, memory bandwidth and high-speed interconnect. |
| Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic | Shuo Liu, Tianle Chen, Ryan Amiri, Christopher Amato | 2026-01-29 | 下载 | Recent work has explored optimizing LLM collaboration through Multi-Agent Reinforcement Learning (MARL). However, most MARL fine-tuning approaches rely on predefined execution protocols, which often r... |
| Belief Propagation Converges to Gaussian Distributions in Sparsely-Connected Factor Graphs | Tom Yates, Yuzhou Cheng, Ignacio Alzugaray, Danyal Akarca, Pedro A. M. Mediano, Andrew J. Davison | 2026-01-29 | 下载 | Belief Propagation (BP) is a powerful algorithm for distributed inference in probabilistic graphical models, however it quickly becomes infeasible for practical compute and memory budgets. |
| Self-Adaptive Probabilistic Skyline Query Processing in Distributed Edge Computing via Deep Reinforcement Learning | Chuan-Chi Lai | 2026-01-29 | 下载 | In the era of the Internet of Everything (IoE), the exponential growth of sensor-generated data at the network edge renders efficient Probabilistic Skyline Query (PSKY) processing a critical challenge... |
| DASH: Deterministic Attention Scheduling for High-throughput Reproducible LLM Training | Xinwei Qiang, Hongmin Chen, Shixuan Sun, Jingwen Leng, Xin Liu, Minyi Guo | 2026-01-29 | 下载 | Determinism is indispensable for reproducibility in large language model (LLM) training, yet it often exacts a steep performance cost. In widely used attention implementations such as FlashAttention-3... |
| EWSJF: An Adaptive Scheduler with Hybrid Partitioning for Mixed-Workload LLM Inference | Bronislav Sidik, Chaya Levi, Joseph Kampeas | 2026-01-29 | 下载 | Serving Large Language Models (LLMs) under mixed workloads--short, latency-sensitive interactive queries alongside long, throughput-oriented batch requests--poses a fundamental scheduling challenge. |
| bigMICE: Multiple Imputation of Big Data | Hugo Morvan, Jonas Agholme, Bjorn Eliasson, Katarina Olofsson, Ludger Grote, Fredrik Iredahl, Oleg Sysoev | 2026-01-29 | 下载 | Missing data is a prevalent issue in many applications, including large medical registries such as the Swedish Healthcare Quality Registries, potentially leading to biased or inefficient analyses if n... |
| ScaleSim: Serving Large-Scale Multi-Agent Simulation with Invocation Distance-Based Memory Management | Zaifeng Pan, Yipeng Shen, Zhengding Hu, Zhuang Wang, Aninda Manocha, Zheng Wang, Zhongkai Yu, Yue Guan, Yufei Ding | 2026-01-29 | 下载 | LLM-based multi-agent simulations are increasingly adopted across application domains, but remain difficult to scale due to GPU memory pressure. |
| Nimbus: A Unified Embodied Synthetic Data Generation Framework | Zeyu He, Yuchang Zhang, Yuanzhen Zhou, Miao Tao, Hengjie Li, Hui Wang, Yang Tian, Jia Zeng, Tai Wang, Wenzhe Cai, Yilun Chen, Ning Gao, Jiangmiao Pang | 2026-01-29 | 下载 | Scaling data volume and diversity is critical for generalizing embodied intelligence. While synthetic data generation offers a scalable alternative to expensive physical data acquisition, existing pip... |
| Ira: Efficient Transaction Replay for Distributed Systems | Adithya Bhat, Harshal Bhadreshkumar Shah, Mohsen Minaei | 2026-01-29 | 下载 | In primary-backup replication, consensus latency is bounded by the time for backup nodes to replay (re-execute) transactions proposed by the primary. |
| ZipMoE: Efficient On-Device MoE Serving via Lossless Compression and Cache-Affinity Scheduling | Yuchen Yang, Yaru Zhao, Pu Yang, Shaowei Wang, Zhi-Hua Zhou | 2026-01-29 | 下载 | While Mixture-of-Experts (MoE) architectures substantially bolster the expressive power of large-language models, their prohibitive memory footprint severely impedes the practical deployment on resour... |
| Maxwait: A Generalized Mechanism for Distributed Time-Sensitive Systems | Francesco Paladino, Shulu Li, Edward A. Lee | 2026-01-29 | 下载 | Distributed time-sensitive systems must balance timing requirements (availability) and consistency in the presence of communication delays and synchronization uncertainty. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Privacy-Preserving Sensor-Based Human Activity Recognition for Low-Resource Healthcare Using Classical Machine Learning | Ramakant Kumar, Pravin Kumar | 2026-01-29 | 下载 | Limited access to medical infrastructure forces elderly and vulnerable patients to rely on home-based care, often leading to neglect and poor adherence to therapeutic exercises such as yoga or physiot... |
| Securing SIM-Assisted Wireless Networks via Quantum Reinforcement Learning | Le-Hung Hoang, Quang-Trung Luu, Dinh Thai Hoang, Diep N. Nguyen, Van-Dinh Nguyen | 2026-01-29 | 下载 | Stacked intelligent metasurfaces (SIMs) have recently emerged as a powerful wave-domain technology that enables multi-stage manipulation of electromagnetic signals through multilayer programmable arch... |
| SIA: Symbolic Interpretability for Anticipatory Deep Reinforcement Learning in Network Control | MohammadErfan Jabbari, Abhishek Duttagupta, Claudio Fiandrino, Leonardo Bonati, Salvatore D'Oro, Michele Polese, Marco Fiore, Tommaso Melodia | 2026-01-29 | 下载 | Deep reinforcement learning (DRL) promises adaptive control for future mobile networks but conventional agents remain reactive: they act on past and current measurements and cannot leverage short-term... |
| SymbXRL: Symbolic Explainable Deep Reinforcement Learning for Mobile Networks | Abhishek Duttagupta, MohammadErfan Jabbari, Claudio Fiandrino, Marco Fiore, Joerg Widmer | 2026-01-29 | 下载 | The operation of future 6th-generation (6G) mobile networks will increasingly rely on the ability of deep reinforcement learning (DRL) to optimize network decisions in real-time. |
| Spatiotemporal Continual Learning for Mobile Edge UAV Networks: Mitigating Catastrophic Forgetting | Chuan-Chi Lai | 2026-01-29 | 下载 | This paper addresses catastrophic forgetting in mobile edge UAV networks within dynamic spatiotemporal environments. Conventional deep reinforcement learning often fails during task transitions, neces... |
| Self-Adaptive Probabilistic Skyline Query Processing in Distributed Edge Computing via Deep Reinforcement Learning | Chuan-Chi Lai | 2026-01-29 | 下载 | In the era of the Internet of Everything (IoE), the exponential growth of sensor-generated data at the network edge renders efficient Probabilistic Skyline Query (PSKY) processing a critical challenge... |
| Age Aware Content Fetching and Broadcast in a Sensing-as-a-Service System | Ankita Koley, Anu Krishna, Chandramani Singh, V Mahendran | 2026-01-29 | 下载 | We consider a Sensing-as-a-Service (S2aaS) system consisting of a sensor, a set of users, and a sensor cloud service provider (SCSP). The sensor updates its content each time it captures a new measure... |
| Authenticated encryption for space telemetry | Andrew Savchenko | 2026-01-29 | 下载 | We explore how command stack protection requirements outlined in NASA-STD-1006A can be satisfied within the context of emergency space telemetry. |
| KubeSpace: A Low-Latency and Stable Control Plane for LEO Satellite Container Orchestration | Zhiyuan Zhao, Jiasheng Wu, Shaojie Su, Wenjun Zhu, Yue Gao | 2026-01-29 | 下载 | Low Earth orbit (LEO) satellites play a pivotal role in global connectivity-delivering high-speed Internet, cellular coverage, and massive IoT support. |
| ViTMAlis: Towards Latency-Critical Mobile Video Analytics with Vision Transformers | Miao Zhang, Guanzhen Wu, Hao Fang, Yifei Zhu, Fangxin Wang, Ruixiao Zhang, Jiangchuan Liu | 2026-01-29 | 下载 | Edge-assisted mobile video analytics (MVA) applications are increasingly shifting from using vision models based on convolutional neural networks (CNNs) to those built on vision transformers (ViTs) to... |