Skip to content

2026-01-29

cs.AR - Architecture

标题作者发布日期PDF摘要
Heterogeneous Computing: The Key to Powering the Future of AI Agent InferenceYiren Zhao, Junyi Liu2026-01-29下载AI agent inference is driving an inference heavy datacenter future and exposes bottlenecks beyond compute - especially memory capacity, memory bandwidth and high-speed interconnect.
PowerGenie: Analytically-Guided Evolutionary Discovery of Superior Reconfigurable Power ConvertersJian Gao, Yiwei Zou, Abhishek Pradhan, Wenhao Huang, Yumin Su, Kaiyuan Yang, Xuan Zhang2026-01-29下载Discovering superior circuit topologies requires navigating an exponentially large design space-a challenge traditionally reserved for human experts.
Frequency as Aperture: Enabling Embeddable Near-Field Sensing for 6G Wireless RadiosPin-Han Ho, Limei Peng, Yiming Miao, Xu Fan, Kairan Liang, Haoran Mei, Wei Duan2026-01-29下载Integrated sensing and communication (ISAC) is expected to be natively supported by future 6G wireless radios, yet most mmWave sensing solutions still rely on dedicated radar hardware incompatible wit...
ChipBench: A Next-Step Benchmark for Evaluating LLM Performance in AI-Aided Chip DesignZhongkai Yu, Chenyang Zhou, Yichen Lin, Hejia Zhang, Haotian Ye, Junxia Cui, Zaifeng Pan, Jishen Zhao, Yufei Ding2026-01-29下载While Large Language Models (LLMs) show significant potential in hardware engineering, current benchmarks suffer from saturation and limited task diversity, failing to reflect LLMs' performance in rea...
FireFly-P: FPGA-Accelerated Spiking Neural Network Plasticity for Robust Adaptive ControlTenglong Li, Jindong Li, Guobin Shen, Dongcheng Zhao, Qian Zhang, Yi Zeng2026-01-29下载Spiking Neural Networks (SNNs) offer a biologically plausible learning mechanism through synaptic plasticity, enabling unsupervised adaptation without the computational overhead of backpropagation.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
SAIR: Cost-Efficient Multi-Stage ML Pipeline Autoscaling via In-Context Reinforcement LearningJianchang Su, Yifan Zhang, Shengkai Lin, Shizhen Zhao, Yusheng Zheng, Yiwei Yang, Wei Zhang2026-01-29下载Multi-stage ML inference pipelines are difficult to autoscale due to heterogeneous resources, cross-stage coupling, and dynamic bottleneck migration.
Learning Provably Correct Distributed Protocols Without Human KnowledgeYujie Hui, Xiaoyi Lu, Andrew Perrault, Yang Wang2026-01-29下载Provably correct distributed protocols, which are a critical component of modern distributed systems, are highly challenging to design and have often required decades of human effort.
ZK-HybridFL: Zero-Knowledge Proof-Enhanced Hybrid Ledger for Federated LearningAmirhossein Taherpour, Xiaodong Wang2026-01-29下载Federated learning (FL) enables collaborative model training while preserving data privacy, yet both centralized and decentralized approaches face challenges in scalability, security, and update valid...
A Federated and Parameter-Efficient Framework for Large Language Model Training in MedicineAnran Li, Yuanyuan Chen, Wenjun Long, Yu Yin, Yan Hu, Hyunjae Kim, Weipeng Zhou, Yujia Zhou, Hongyi Peng, Yang Ren, Xuguang Ai, Zhenyue Qin, Ming Hu, Xiaoxiao Li, Han Yu, Yih-Chung Tham, Lucila Ohno-Machado, Hua Xu, Qingyu Chen2026-01-29下载Large language models (LLMs) have demonstrated strong performance on medical benchmarks, including question answering and diagnosis. To enable their use in clinical settings, LLMs are typically furthe...
Where Do the Joules Go? Diagnosing Inference Energy ConsumptionJae-Won Chung, Ruofan Wu, Jeff J. Ma, Mosharaf Chowdhury2026-01-29下载Energy is now a critical ML computing resource. While measuring energy consumption and observing trends is a valuable first step, accurately understanding and diagnosing why those differences occur is...
FedAdaVR: Adaptive Variance Reduction for Robust Federated Learning under Limited Client ParticipationS M Ruhul Kabir Howlader, Xiao Chen, Yifei Xie, Lu Liu2026-01-29下载Federated learning (FL) encounters substantial challenges due to heterogeneity, leading to gradient noise, client drift, and partial client participation errors, the last of which is the most pervasiv...
Heterogeneous Computing: The Key to Powering the Future of AI Agent InferenceYiren Zhao, Junyi Liu2026-01-29下载AI agent inference is driving an inference heavy datacenter future and exposes bottlenecks beyond compute - especially memory capacity, memory bandwidth and high-speed interconnect.
Learning Decentralized LLM Collaboration with Multi-Agent Actor CriticShuo Liu, Tianle Chen, Ryan Amiri, Christopher Amato2026-01-29下载Recent work has explored optimizing LLM collaboration through Multi-Agent Reinforcement Learning (MARL). However, most MARL fine-tuning approaches rely on predefined execution protocols, which often r...
Belief Propagation Converges to Gaussian Distributions in Sparsely-Connected Factor GraphsTom Yates, Yuzhou Cheng, Ignacio Alzugaray, Danyal Akarca, Pedro A. M. Mediano, Andrew J. Davison2026-01-29下载Belief Propagation (BP) is a powerful algorithm for distributed inference in probabilistic graphical models, however it quickly becomes infeasible for practical compute and memory budgets.
Self-Adaptive Probabilistic Skyline Query Processing in Distributed Edge Computing via Deep Reinforcement LearningChuan-Chi Lai2026-01-29下载In the era of the Internet of Everything (IoE), the exponential growth of sensor-generated data at the network edge renders efficient Probabilistic Skyline Query (PSKY) processing a critical challenge...
DASH: Deterministic Attention Scheduling for High-throughput Reproducible LLM TrainingXinwei Qiang, Hongmin Chen, Shixuan Sun, Jingwen Leng, Xin Liu, Minyi Guo2026-01-29下载Determinism is indispensable for reproducibility in large language model (LLM) training, yet it often exacts a steep performance cost. In widely used attention implementations such as FlashAttention-3...
EWSJF: An Adaptive Scheduler with Hybrid Partitioning for Mixed-Workload LLM InferenceBronislav Sidik, Chaya Levi, Joseph Kampeas2026-01-29下载Serving Large Language Models (LLMs) under mixed workloads--short, latency-sensitive interactive queries alongside long, throughput-oriented batch requests--poses a fundamental scheduling challenge.
bigMICE: Multiple Imputation of Big DataHugo Morvan, Jonas Agholme, Bjorn Eliasson, Katarina Olofsson, Ludger Grote, Fredrik Iredahl, Oleg Sysoev2026-01-29下载Missing data is a prevalent issue in many applications, including large medical registries such as the Swedish Healthcare Quality Registries, potentially leading to biased or inefficient analyses if n...
ScaleSim: Serving Large-Scale Multi-Agent Simulation with Invocation Distance-Based Memory ManagementZaifeng Pan, Yipeng Shen, Zhengding Hu, Zhuang Wang, Aninda Manocha, Zheng Wang, Zhongkai Yu, Yue Guan, Yufei Ding2026-01-29下载LLM-based multi-agent simulations are increasingly adopted across application domains, but remain difficult to scale due to GPU memory pressure.
Nimbus: A Unified Embodied Synthetic Data Generation FrameworkZeyu He, Yuchang Zhang, Yuanzhen Zhou, Miao Tao, Hengjie Li, Hui Wang, Yang Tian, Jia Zeng, Tai Wang, Wenzhe Cai, Yilun Chen, Ning Gao, Jiangmiao Pang2026-01-29下载Scaling data volume and diversity is critical for generalizing embodied intelligence. While synthetic data generation offers a scalable alternative to expensive physical data acquisition, existing pip...
Ira: Efficient Transaction Replay for Distributed SystemsAdithya Bhat, Harshal Bhadreshkumar Shah, Mohsen Minaei2026-01-29下载In primary-backup replication, consensus latency is bounded by the time for backup nodes to replay (re-execute) transactions proposed by the primary.
ZipMoE: Efficient On-Device MoE Serving via Lossless Compression and Cache-Affinity SchedulingYuchen Yang, Yaru Zhao, Pu Yang, Shaowei Wang, Zhi-Hua Zhou2026-01-29下载While Mixture-of-Experts (MoE) architectures substantially bolster the expressive power of large-language models, their prohibitive memory footprint severely impedes the practical deployment on resour...
Maxwait: A Generalized Mechanism for Distributed Time-Sensitive SystemsFrancesco Paladino, Shulu Li, Edward A. Lee2026-01-29下载Distributed time-sensitive systems must balance timing requirements (availability) and consistency in the presence of communication delays and synchronization uncertainty.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Privacy-Preserving Sensor-Based Human Activity Recognition for Low-Resource Healthcare Using Classical Machine LearningRamakant Kumar, Pravin Kumar2026-01-29下载Limited access to medical infrastructure forces elderly and vulnerable patients to rely on home-based care, often leading to neglect and poor adherence to therapeutic exercises such as yoga or physiot...
Securing SIM-Assisted Wireless Networks via Quantum Reinforcement LearningLe-Hung Hoang, Quang-Trung Luu, Dinh Thai Hoang, Diep N. Nguyen, Van-Dinh Nguyen2026-01-29下载Stacked intelligent metasurfaces (SIMs) have recently emerged as a powerful wave-domain technology that enables multi-stage manipulation of electromagnetic signals through multilayer programmable arch...
SIA: Symbolic Interpretability for Anticipatory Deep Reinforcement Learning in Network ControlMohammadErfan Jabbari, Abhishek Duttagupta, Claudio Fiandrino, Leonardo Bonati, Salvatore D'Oro, Michele Polese, Marco Fiore, Tommaso Melodia2026-01-29下载Deep reinforcement learning (DRL) promises adaptive control for future mobile networks but conventional agents remain reactive: they act on past and current measurements and cannot leverage short-term...
SymbXRL: Symbolic Explainable Deep Reinforcement Learning for Mobile NetworksAbhishek Duttagupta, MohammadErfan Jabbari, Claudio Fiandrino, Marco Fiore, Joerg Widmer2026-01-29下载The operation of future 6th-generation (6G) mobile networks will increasingly rely on the ability of deep reinforcement learning (DRL) to optimize network decisions in real-time.
Spatiotemporal Continual Learning for Mobile Edge UAV Networks: Mitigating Catastrophic ForgettingChuan-Chi Lai2026-01-29下载This paper addresses catastrophic forgetting in mobile edge UAV networks within dynamic spatiotemporal environments. Conventional deep reinforcement learning often fails during task transitions, neces...
Self-Adaptive Probabilistic Skyline Query Processing in Distributed Edge Computing via Deep Reinforcement LearningChuan-Chi Lai2026-01-29下载In the era of the Internet of Everything (IoE), the exponential growth of sensor-generated data at the network edge renders efficient Probabilistic Skyline Query (PSKY) processing a critical challenge...
Age Aware Content Fetching and Broadcast in a Sensing-as-a-Service SystemAnkita Koley, Anu Krishna, Chandramani Singh, V Mahendran2026-01-29下载We consider a Sensing-as-a-Service (S2aaS) system consisting of a sensor, a set of users, and a sensor cloud service provider (SCSP). The sensor updates its content each time it captures a new measure...
Authenticated encryption for space telemetryAndrew Savchenko2026-01-29下载We explore how command stack protection requirements outlined in NASA-STD-1006A can be satisfied within the context of emergency space telemetry.
KubeSpace: A Low-Latency and Stable Control Plane for LEO Satellite Container OrchestrationZhiyuan Zhao, Jiasheng Wu, Shaojie Su, Wenjun Zhu, Yue Gao2026-01-29下载Low Earth orbit (LEO) satellites play a pivotal role in global connectivity-delivering high-speed Internet, cellular coverage, and massive IoT support.
ViTMAlis: Towards Latency-Critical Mobile Video Analytics with Vision TransformersMiao Zhang, Guanzhen Wu, Hao Fang, Yifei Zhu, Fangxin Wang, Ruixiao Zhang, Jiangchuan Liu2026-01-29下载Edge-assisted mobile video analytics (MVA) applications are increasingly shifting from using vision models based on convolutional neural networks (CNNs) to those built on vision transformers (ViTs) to...

基于 VitePress 构建