
2026-03-22

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
|---|---|---|---|---|
| Toward a Universal GPU Instruction Set Architecture: A Cross-Vendor Analysis of Hardware-Invariant Computational Primitives in Parallel Processors | Ojima Abraham, Onyinye Okoli | 2026-03-22 | Download | We present the first systematic cross-vendor analysis of GPU instruction set architectures spanning all four major GPU vendors: NVIDIA (PTX ISA v1.0 through v9. |
| DS2SC-Agent: A Multi-Agent Automated Pipeline for Rapid Chiplet Model Generation | Yiwei Wu, Yifan Wu, Yunhao Xiong, Dengwei Zhao, Jiaxuan Shen, Jianfei Jiang, Guanghui He, Shikui Tu, Yanan Sun | 2026-03-22 | Download | Constructing behavioral-level chiplet models (e.g., SystemC) is crucial for early-stage heterogeneous architecture exploration. Traditional manual modeling is notoriously time-consuming and error-pron... |
| PC2IM: An Efficient In-Memory Computing Accelerator for 3D Point Cloud | Dengfeng Wang, Shunqin Cai, Yanan Sun | 2026-03-22 | Download | 3D point cloud neural networks have significantly enhanced the perceptual capabilities of resource-limited mobile intelligent systems. However, despite the transformative impact, the point cloud algor... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
|---|---|---|---|---|
| Communication-Avoiding SpGEMM via Trident Partitioning on Hierarchical GPU Interconnects | Julian Bellavita, Lorenzo Pichetti, Thomas Pasquali, Flavio Vella, Giulia Guidi | 2026-03-22 | Download | The multiplication of two sparse matrices, known as SpGEMM, is a key kernel in scientific computing and large-scale data analytics, underpinning graph algorithms, machine learning, simulations, and co... |
| Decidability of Livelock Detection for Parameterized Self-Disabling Unidirectional Rings | Aly Farahat | 2026-03-22 | Download | We prove that livelock detection is decidable in polynomial time for parameterized symmetric unidirectional rings of self-disabling processes with bounded domain $\mathbb{Z}_m$. |
| The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the vLLM Semantic Router Project | Huamin Chen, Xunzhuo Liu, Bowei He, Fuyuan Lyu, Yankai Chen, Xue Liu, Yuhan Liu, Junchen Jiang | 2026-03-22 | Download | Over the past year, the vLLM Semantic Router project has released a series of work spanning: (1) core routing mechanisms -- signal-driven routing, context-length pool routing, router performance engin... |
| ARYA: A Physics-Constrained Composable & Deterministic World Model Architecture | Seth Dobrin, Lukasz Chmiel | 2026-03-22 | Download | This paper presents ARYA, a composable, physics-constrained, deterministic world model architecture built on five foundational principles: nano models, composability, causal reasoning, determinism, an... |
| Toward a Universal GPU Instruction Set Architecture: A Cross-Vendor Analysis of Hardware-Invariant Computational Primitives in Parallel Processors | Ojima Abraham, Onyinye Okoli | 2026-03-22 | Download | We present the first systematic cross-vendor analysis of GPU instruction set architectures spanning all four major GPU vendors: NVIDIA (PTX ISA v1.0 through v9. |
| CALVO: Improve Serving Efficiency for LLM Inferences with Intense Network Demands | Weiye Wang, Chen Chen, Junxue Zhang, Zhusheng Wang, Hui Yuan, Zixuan Guan, Xiaolong Zheng, Qizhen Weng, Yin Chen, Minyi Guo | 2026-03-22 | Download | Distributed prefix caching has become a core technique for efficient LLM serving. However, for long-context requests with high cache hit ratios, retrieving reusable KVCache blocks from remote servers ... |
| Parallel Gauss-Jordan Elimination and System Reduction for Efficient Circuit Simulation | Filip Noveski, Elena Hadzieva | 2026-03-22 | Download | For the purposes of electric circuit simulation, we consider an iterative simulation model based on solving systems of linear equations by Gauss-Jordan elimination (GJE) for individual moments in time... |
| NeSy-Edge: Neuro-Symbolic Trustworthy Self-Healing in the Computing Continuum | Peihan Ye, Alfreds Lapkovskis, Alaa Saleh, Qiyang Zhang, Praveen Kumar Donta | 2026-03-22 | Download | The computational demands of modern AI services are increasingly shifting execution beyond centralized clouds toward a computing continuum spanning edge and end devices. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
|---|---|---|---|---|
| WN-Wrangle: Wireless Network Data Wrangling Assistant | Anirudh Kamath, Dustin Maas, Jacobus Van der Merwe, Anna Fariha | 2026-03-22 | Download | Data wrangling continues to be the most time-consuming task in the data science pipeline and wireless network data is no exception. Prior approaches for automatic or assisted data-wrangling primarily ... |
| WirelessBench: A Tolerance-Aware LLM Agent Benchmark for Wireless Network Intelligence | Jingwen Tong, Fang Liu, Linkai Xv, Shiliang Lu, Kangqi Li, Yiqian Zhang, Yijie Song, Zeyang Xue, Jun Zhang | 2026-03-22 | Download | LLM agents are emerging as a key enabler for autonomous wireless network management. Reliably deploying them, however, demands benchmarks that reflect real engineering risk. |
| Security and Privacy in O-RAN for 6G: A Comprehensive Review of Threats and Mitigation Approaches | Lujia Liang, Lei Zhang | 2026-03-22 | Download | Open Radio Access Network (O-RAN) is a major advancement in the telecommunications field, providing standardized interfaces that promote interoperability between different vendors' technologies, there... |
| A lightweight Outlier Detection for Characterizing Radio- and Environment-Specific Link Quality Fluctuation in Low-Power Wireless Networks | Zegeye Mekasha Kidane, Waltenegus Dargie | 2026-03-22 | Download | The performance of low-power wireless sensing networks can be influenced by both external environmental factors and internal imperfections which often arise due to manufacturing tolerance during mass ... |
| Learning to Optimize Joint Source and RIS-assisted Channel Encoding for Multi-User Semantic Communication Systems | Haidong Wang, Songhan Zhao, Bo Gu, Shimin Gong, Hongyang Du, Ping Wang | 2026-03-22 | Download | In this paper, we explore a joint source and reconfigurable intelligent surface (RIS)-assisted channel encoding (JSRE) framework for multi-user semantic communications, where a deep neural network (DN... |
| DRL-driven Online Optimization for Joint Traffic Reshaping and Channel Reconfiguration in RIS-assisted Semantic NOMA Communications | Songhan Zhao, Shimin Gong, Bo Gu, Zehui Xiong, Ping Wang, Kaibin Huang | 2026-03-22 | Download | This paper explores a reconfigurable intelligent surface (RIS)-assisted and semantic-aware wireless network, where multiple semantic users (SUs) transmit semantic information to an access point (AP) u... |
| Generative Artificial Intelligence Assisted Multi-modal Semantic Extraction for NOMA-based Image Transmissions | Songhan Zhao, Shimin Gong, Bo Gu, Hongyang Du, Xidong Mu, Zehui Xiong, Yuming Fang | 2026-03-22 | Download | In this paper, we investigate a generative artificial intelligence (GAI)-assisted semantic communication framework for non-orthogonal multiple access (NOMA)-based image transmissions. |
| AnyPro: Preference-Preserving Anycast Optimization based on Strategic AS-Path Prepending | Minyuan Zhou, Yuning Chen, Jiaqi Zheng, Yifei Xu, Pan Hu, Yongping Tang, Wendong Yin, Jie Lin, Qingyan Yu, Yuanchao Su, Guihai Chen, Wanchun Dou, Songwu Lu, Wan Du | 2026-03-22 | Download | Operating large-scale anycast networks is challenging because client-to-site mappings often misalign with operator's expectation due to opaque inter-domain routing. |
| OrbitStream: Training-Free Adaptive 360-degree Video Streaming via Semantic Potential Fields | Aizierjiang Aiersilan, Zhangfei Yang | 2026-03-22 | Download | Adaptive 360° video streaming for teleoperation faces dual challenges: viewport prediction under uncertain gaze patterns and bitrate adaptation over volatile wireless channels. |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
|---|---|---|---|---|
| AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search | Jaber Jaber, Osama Jaber | 2026-03-22 | Download | Writing high-performance GPU kernels is among the most labor-intensive tasks in machine learning systems engineering. We present AutoKernel, an open-source framework that applies an autonomous agent l... |
| A lightweight Outlier Detection for Characterizing Radio- and Environment-Specific Link Quality Fluctuation in Low-Power Wireless Networks | Zegeye Mekasha Kidane, Waltenegus Dargie | 2026-03-22 | Download | The performance of low-power wireless sensing networks can be influenced by both external environmental factors and internal imperfections which often arise due to manufacturing tolerance during mass ... |
