2026-01-13

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Annotated PIM Bibliography	Peter M. Kogge	2026-01-13	下载	Processing in Memory (PIM) and similar terms such as Compute In Memory (CIM), Logic in Memory (LIM), In Memory Computing (IMC), and Near Memory Computing (NMC) have gained attention recently as a pote...
Memory DisOrder: Memory Re-orderings as a Timerless Side-channel	Sean Siddens, Sanya Srivastava, Reese Levine, Josiah Dykstra, Tyler Sorensen	2026-01-13	下载	To improve efficiency, nearly all parallel processing units (CPUs and GPUs) implement relaxed memory models in which memory operations may be re-ordered, i.e., executed out-of-order.
Bio-RV: Low-Power Resource-Efficient RISC-V Processor for Biomedical Applications	Vijay Pratap Sharma, Annu Kumar, Mohd Faisal Khan, Mukul Lokhande, Santosh Kumar Vishvakarma	2026-01-13	下载	This work presents Bio-RV, a compact and resource-efficient RISC-V processor intended for biomedical control applications, such as accelerator-based biomedical SoCs and implantable pacemaker systems.
A New Tool to Find Lightweight (And, Xor) Implementations of Quadratic Vectorial Boolean Functions up to Dimension 9	Marie Bolzer, Sébastien Duval, Marine Minier	2026-01-13	下载	The problem of finding a minimal circuit to implement a given function is one of the oldest in electronics. It is known to be NP-hard. Still, many tools exist to find sub-optimal circuits to implement...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
MixServe: An Automatic Distributed Serving System for MoE Models with Hybrid Parallelism Based on Fused Communication Algorithm	Bowen Zhou, Jinrui Jia, Wenhao He, Yong Zhang, Fang Dong	2026-01-13	下载	The Mixture of Experts (MoE) models are emerging as the latest paradigm for Large Language Models (LLMs). However, due to memory constraints, MoE models with billions or even trillions of parameters c...
Multivariate Polynomial Codes for Efficient Matrix Chain Multiplication in Distributed Systems	Jesús Gómez-Vilardebò	2026-01-13	下载	We study the problem of computing matrix chain multiplications in a distributed computing cluster. In such systems, performance is often limited by the straggler problem, where the slowest worker domi...
Shifting the Sweet Spot: High-Performance Matrix-Free Method for High-Order Elasticity	Dali Chang, Chong Zhang, Kaiqi Zhang, Mingguan Yang, Huiyuan Li, Weiqiang Kong	2026-01-13	下载	In high-order finite element analysis for elasticity, matrix-free (PA) methods are a key technology for overcoming the memory bottleneck of traditional Full Assembly (FA).
Matrix-PIC: Harnessing Matrix Outer-product for High-Performance Particle-in-Cell Simulations	Yizhuo Rao, Xingjian Cui, Jiabin Xie, Shangzhi Pang, Guangnan Feng, Jinhui Wei, Zhiguang Chen, Yutong Lu	2026-01-13	下载	Particle-in-Cell (PIC) simulations spend most of their execution time on particle--grid interactions, where fine-grained atomic updates become a major bottleneck on traditional many-core CPUs.
Improving Zero-shot ADL Recognition with Large Language Models through Event-based Context and Confidence	Michele Fiori, Gabriele Civitarese, Marco Colussi, Claudio Bettini	2026-01-13	下载	Unobtrusive sensor-based recognition of Activities of Daily Living (ADLs) in smart homes by processing data collected from IoT sensing devices supports applications such as healthcare, safety, and ene...
Hierarchical Online-Scheduling for Energy-Efficient Split Inference with Progressive Transmission	Zengzipeng Tang, Yuxuan Sun, Wei Chen, Jianwen Ding, Bo Ai, Yulin Shao	2026-01-13	下载	Device-edge collaborative inference with Deep Neural Networks (DNNs) faces fundamental trade-offs among accuracy, latency and energy consumption.
Coordinated Cooling and Compute Management for AI Datacenters	Nardos Belay Abera, Yize Chen	2026-01-13	下载	The AI datacenters are currently being deployed on a large scale to support the training and deployment of power-intensive large-language models (LLMs).

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
ABE-VVS: Attribute-Based Encrypted Volumetric Video Streaming	Mohammad Waquas Usmani, Susmit Shannigrahi, Michael Zink	2026-01-13	下载	This work introduces ABE-VVS, a framework that performs attribute based selective coordinate encryption for point cloud based volumetric video streaming, enabling lightweight yet effective digital rig...
A decentralized academic certificate issuance system using smart contracts on the tron network	Ana Julia Evangelista Andrade, Flavio Cezar Amate	2026-01-13	下载	This paper presents the design, implementation, and evaluation of a decentralized system for issuing and verifying academic certificates based on blockchain technology.
Statistical Characterization and Prediction of E2E Latency over LEO Satellite Networks	Andreas Casparsen, Jonas Ellegaard Jakobsen, Jimmy Jessen Nielsen, Petar Popovski, Israel Leyva Mayorga	2026-01-13	下载	Low Earth Orbit (LEO) satellite networks are emerging as an essential communication infrastructure, with standardized 5G-based non-terrestrial networks and their integration with terrestrial systems e...
Streamlined Pathway (SP) Approach: An Efficient Load Balancer to Enhance Quality of Service	Aymen Hasan Alawadi	2026-01-13	下载	Efficient load-balancing mechanisms are critical for maximizing performance and increasing the quality of service (QoS) of data center networks (DCNs).
Unleashing Tool Engineering and Intelligence for Agentic AI in Next-Generation Communication Networks	Yinqiu Liu, Ruichen Zhang, Dusit Niyato, Abbas Jamalipour, Trung Q. Duong, Dong In Kim	2026-01-13	下载	Nowadays, agentic AI is emerging as a transformative paradigm for next-generation communication networks, promising to evolve large language models (LLMs) from passive chatbots into autonomous operato...
Tiny-Twin: A CPU-Native Full-stack Digital Twin for NextG Cellular Networks	Ali Mamaghani, Ushasi Ghosh, Ish Kumar Jain, Srinivas Shakkottai, Dinesh Bharadia	2026-01-13	下载	Modern wireless applications demand testing environments that capture the full complexity of next-generation (NextG) cellular networks. While digital twins promise realistic emulation, existing soluti...
Multi-Objective Optimization for Joint Communication and Sensing in Multi-user MIMO Systems: Characterizing the Pareto Boundary	Thakshila Perera, Amine Mezghani, Ekram Hossain	2026-01-13	下载	This paper investigates the Pareto boundary performance of a joint communication and sensing (JCAS) system that addresses both sensing and communication functions at the same time.
Joint Communication and Sensing in RIS-Assisted MIMO System Under Mutual Coupling	Dilki Wijekoon, Amine Mezghani, Ekram Hossain	2026-01-13	下载	This paper considers a downlink Reconfigurable Intelligent Surface (RIS)-assisted Joint Communication and Sensing (JCAS) system within a physically-consistent setting, accounting for the effect of mut...
Hierarchical Online-Scheduling for Energy-Efficient Split Inference with Progressive Transmission	Zengzipeng Tang, Yuxuan Sun, Wei Chen, Jianwen Ding, Bo Ai, Yulin Shao	2026-01-13	下载	Device-edge collaborative inference with Deep Neural Networks (DNNs) faces fundamental trade-offs among accuracy, latency and energy consumption.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
LookAhead: The Optimal Non-decreasing Index Policy for a Time-Varying Holding Cost problem	Keerthana Gurushankar, Zhouzi Li, Mor Harchol-Balter, Alan Scheller-Wolf	2026-01-13	下载	In practice, the cost of delaying a job can grow as the job waits. Such behavior is modeled by the Time-Varying Holding Cost (TVHC) problem, where each job's instantaneous holding cost increases with ...
Reducing Compute Waste in LLMs through Kernel-Level DVFS	Jeffrey Spaan, Kuan-Hsun Chen, Ana-Lucia Varbanescu	2026-01-13	下载	The rapid growth of AI has fueled the expansion of accelerator- or GPU-based data centers. However, the rising operational energy consumption has emerged as a critical bottleneck and a major sustainab...
Shifting the Sweet Spot: High-Performance Matrix-Free Method for High-Order Elasticity	Dali Chang, Chong Zhang, Kaiqi Zhang, Mingguan Yang, Huiyuan Li, Weiqiang Kong	2026-01-13	下载	In high-order finite element analysis for elasticity, matrix-free (PA) methods are a key technology for overcoming the memory bottleneck of traditional Full Assembly (FA).