
2026-01-13

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Annotated PIM Bibliography | Peter M. Kogge | 2026-01-13 | Download | Processing in Memory (PIM) and similar terms such as Compute In Memory (CIM), Logic in Memory (LIM), In Memory Computing (IMC), and Near Memory Computing (NMC) have gained attention recently as a pote... |
| Memory DisOrder: Memory Re-orderings as a Timerless Side-channel | Sean Siddens, Sanya Srivastava, Reese Levine, Josiah Dykstra, Tyler Sorensen | 2026-01-13 | Download | To improve efficiency, nearly all parallel processing units (CPUs and GPUs) implement relaxed memory models in which memory operations may be re-ordered, i.e., executed out-of-order. |
| Bio-RV: Low-Power Resource-Efficient RISC-V Processor for Biomedical Applications | Vijay Pratap Sharma, Annu Kumar, Mohd Faisal Khan, Mukul Lokhande, Santosh Kumar Vishvakarma | 2026-01-13 | Download | This work presents Bio-RV, a compact and resource-efficient RISC-V processor intended for biomedical control applications, such as accelerator-based biomedical SoCs and implantable pacemaker systems. |
| A New Tool to Find Lightweight (And, Xor) Implementations of Quadratic Vectorial Boolean Functions up to Dimension 9 | Marie Bolzer, Sébastien Duval, Marine Minier | 2026-01-13 | Download | The problem of finding a minimal circuit to implement a given function is one of the oldest in electronics. It is known to be NP-hard. Still, many tools exist to find sub-optimal circuits to implement... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| MixServe: An Automatic Distributed Serving System for MoE Models with Hybrid Parallelism Based on Fused Communication Algorithm | Bowen Zhou, Jinrui Jia, Wenhao He, Yong Zhang, Fang Dong | 2026-01-13 | Download | The Mixture of Experts (MoE) models are emerging as the latest paradigm for Large Language Models (LLMs). However, due to memory constraints, MoE models with billions or even trillions of parameters c... |
| Multivariate Polynomial Codes for Efficient Matrix Chain Multiplication in Distributed Systems | Jesús Gómez-Vilardebò | 2026-01-13 | Download | We study the problem of computing matrix chain multiplications in a distributed computing cluster. In such systems, performance is often limited by the straggler problem, where the slowest worker domi... |
| Shifting the Sweet Spot: High-Performance Matrix-Free Method for High-Order Elasticity | Dali Chang, Chong Zhang, Kaiqi Zhang, Mingguan Yang, Huiyuan Li, Weiqiang Kong | 2026-01-13 | Download | In high-order finite element analysis for elasticity, matrix-free (PA) methods are a key technology for overcoming the memory bottleneck of traditional Full Assembly (FA). |
| Matrix-PIC: Harnessing Matrix Outer-product for High-Performance Particle-in-Cell Simulations | Yizhuo Rao, Xingjian Cui, Jiabin Xie, Shangzhi Pang, Guangnan Feng, Jinhui Wei, Zhiguang Chen, Yutong Lu | 2026-01-13 | Download | Particle-in-Cell (PIC) simulations spend most of their execution time on particle-grid interactions, where fine-grained atomic updates become a major bottleneck on traditional many-core CPUs. |
| Improving Zero-shot ADL Recognition with Large Language Models through Event-based Context and Confidence | Michele Fiori, Gabriele Civitarese, Marco Colussi, Claudio Bettini | 2026-01-13 | Download | Unobtrusive sensor-based recognition of Activities of Daily Living (ADLs) in smart homes by processing data collected from IoT sensing devices supports applications such as healthcare, safety, and ene... |
| Hierarchical Online-Scheduling for Energy-Efficient Split Inference with Progressive Transmission | Zengzipeng Tang, Yuxuan Sun, Wei Chen, Jianwen Ding, Bo Ai, Yulin Shao | 2026-01-13 | Download | Device-edge collaborative inference with Deep Neural Networks (DNNs) faces fundamental trade-offs among accuracy, latency and energy consumption. |
| Coordinated Cooling and Compute Management for AI Datacenters | Nardos Belay Abera, Yize Chen | 2026-01-13 | Download | The AI datacenters are currently being deployed on a large scale to support the training and deployment of power-intensive large-language models (LLMs). |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| ABE-VVS: Attribute-Based Encrypted Volumetric Video Streaming | Mohammad Waquas Usmani, Susmit Shannigrahi, Michael Zink | 2026-01-13 | Download | This work introduces ABE-VVS, a framework that performs attribute based selective coordinate encryption for point cloud based volumetric video streaming, enabling lightweight yet effective digital rig... |
| A decentralized academic certificate issuance system using smart contracts on the tron network | Ana Julia Evangelista Andrade, Flavio Cezar Amate | 2026-01-13 | Download | This paper presents the design, implementation, and evaluation of a decentralized system for issuing and verifying academic certificates based on blockchain technology. |
| Statistical Characterization and Prediction of E2E Latency over LEO Satellite Networks | Andreas Casparsen, Jonas Ellegaard Jakobsen, Jimmy Jessen Nielsen, Petar Popovski, Israel Leyva Mayorga | 2026-01-13 | Download | Low Earth Orbit (LEO) satellite networks are emerging as an essential communication infrastructure, with standardized 5G-based non-terrestrial networks and their integration with terrestrial systems e... |
| Streamlined Pathway (SP) Approach: An Efficient Load Balancer to Enhance Quality of Service | Aymen Hasan Alawadi | 2026-01-13 | Download | Efficient load-balancing mechanisms are critical for maximizing performance and increasing the quality of service (QoS) of data center networks (DCNs). |
| Unleashing Tool Engineering and Intelligence for Agentic AI in Next-Generation Communication Networks | Yinqiu Liu, Ruichen Zhang, Dusit Niyato, Abbas Jamalipour, Trung Q. Duong, Dong In Kim | 2026-01-13 | Download | Nowadays, agentic AI is emerging as a transformative paradigm for next-generation communication networks, promising to evolve large language models (LLMs) from passive chatbots into autonomous operato... |
| Tiny-Twin: A CPU-Native Full-stack Digital Twin for NextG Cellular Networks | Ali Mamaghani, Ushasi Ghosh, Ish Kumar Jain, Srinivas Shakkottai, Dinesh Bharadia | 2026-01-13 | Download | Modern wireless applications demand testing environments that capture the full complexity of next-generation (NextG) cellular networks. While digital twins promise realistic emulation, existing soluti... |
| Multi-Objective Optimization for Joint Communication and Sensing in Multi-user MIMO Systems: Characterizing the Pareto Boundary | Thakshila Perera, Amine Mezghani, Ekram Hossain | 2026-01-13 | Download | This paper investigates the Pareto boundary performance of a joint communication and sensing (JCAS) system that addresses both sensing and communication functions at the same time. |
| Joint Communication and Sensing in RIS-Assisted MIMO System Under Mutual Coupling | Dilki Wijekoon, Amine Mezghani, Ekram Hossain | 2026-01-13 | Download | This paper considers a downlink Reconfigurable Intelligent Surface (RIS)-assisted Joint Communication and Sensing (JCAS) system within a physically-consistent setting, accounting for the effect of mut... |
| Hierarchical Online-Scheduling for Energy-Efficient Split Inference with Progressive Transmission | Zengzipeng Tang, Yuxuan Sun, Wei Chen, Jianwen Ding, Bo Ai, Yulin Shao | 2026-01-13 | Download | Device-edge collaborative inference with Deep Neural Networks (DNNs) faces fundamental trade-offs among accuracy, latency and energy consumption. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| LookAhead: The Optimal Non-decreasing Index Policy for a Time-Varying Holding Cost problem | Keerthana Gurushankar, Zhouzi Li, Mor Harchol-Balter, Alan Scheller-Wolf | 2026-01-13 | Download | In practice, the cost of delaying a job can grow as the job waits. Such behavior is modeled by the Time-Varying Holding Cost (TVHC) problem, where each job's instantaneous holding cost increases with ... |
| Reducing Compute Waste in LLMs through Kernel-Level DVFS | Jeffrey Spaan, Kuan-Hsun Chen, Ana-Lucia Varbanescu | 2026-01-13 | Download | The rapid growth of AI has fueled the expansion of accelerator- or GPU-based data centers. However, the rising operational energy consumption has emerged as a critical bottleneck and a major sustainab... |
| Shifting the Sweet Spot: High-Performance Matrix-Free Method for High-Order Elasticity | Dali Chang, Chong Zhang, Kaiqi Zhang, Mingguan Yang, Huiyuan Li, Weiqiang Kong | 2026-01-13 | Download | In high-order finite element analysis for elasticity, matrix-free (PA) methods are a key technology for overcoming the memory bottleneck of traditional Full Assembly (FA). |
