Appearance
2026-03-22
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Toward a Universal GPU Instruction Set Architecture: A Cross-Vendor Analysis of Hardware-Invariant Computational Primitives in Parallel Processors | Ojima Abraham, Onyinye Okoli | 2026-03-22 | 下载 | We present the first systematic cross-vendor analysis of GPU instruction set architectures spanning all four major GPU vendors: NVIDIA (PTX ISA v1.0 through v9. |
| DS2SC-Agent: A Multi-Agent Automated Pipeline for Rapid Chiplet Model Generation | Yiwei Wu, Yifan Wu, Yunhao Xiong, Dengwei Zhao, Jiaxuan Shen, Jianfei Jiang, Guanghui He, Shikui Tu, Yanan Sun | 2026-03-22 | 下载 | Constructing behavioral-level chiplet models (e.g., SystemC) is crucial for early-stage heterogeneous architecture exploration. Traditional manual modeling is notoriously time-consuming and error-pron... |
| PC2IM: An Efficient In-Memory Computing Accelerator for 3D Point Cloud | Dengfeng Wang, Shunqin Cai, Yanan Sun | 2026-03-22 | 下载 | 3D point cloud neural networks have significantly enhanced the perceptual capabilities of resource-limited mobile intelligent systems. However, despite the transformative impact, the point cloud algor... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Communication-Avoiding SpGEMM via Trident Partitioning on Hierarchical GPU Interconnects | Julian Bellavita, Lorenzo Pichetti, Thomas Pasquali, Flavio Vella, Giulia Guidi | 2026-03-22 | 下载 | The multiplication of two sparse matrices, known as SpGEMM, is a key kernel in scientific computing and large-scale data analytics, underpinning graph algorithms, machine learning, simulations, and co... |
| Decidability of Livelock Detection for Parameterized Self-Disabling Unidirectional Rings | Aly Farahat | 2026-03-22 | 下载 | We prove that livelock detection is \emph{decidable in polynomial time} for parameterized symmetric unidirectional rings of self-disabling processes with bounded domain . |
| The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the vLLM Semantic Router Project | Huamin Chen, Xunzhuo Liu, Bowei He, Fuyuan Lyu, Yankai Chen, Xue Liu, Yuhan Liu, Junchen Jiang | 2026-03-22 | 下载 | Over the past year, the vLLM Semantic Router project has released a series of work spanning: (1) core routing mechanisms -- signal-driven routing, context-length pool routing, router performance engin... |
| ARYA: A Physics-Constrained Composable & Deterministic World Model Architecture | Seth Dobrin, Lukasz Chmiel | 2026-03-22 | 下载 | This paper presents ARYA, a composable, physics-constrained, deterministic world model architecture built on five foundational principles: nano models, composability, causal reasoning, determinism, an... |
| Toward a Universal GPU Instruction Set Architecture: A Cross-Vendor Analysis of Hardware-Invariant Computational Primitives in Parallel Processors | Ojima Abraham, Onyinye Okoli | 2026-03-22 | 下载 | We present the first systematic cross-vendor analysis of GPU instruction set architectures spanning all four major GPU vendors: NVIDIA (PTX ISA v1.0 through v9. |
| CALVO: Improve Serving Efficiency for LLM Inferences with Intense Network Demands | Weiye Wang, Chen Chen, Junxue Zhang, Zhusheng Wang, Hui Yuan, Zixuan Guan, Xiaolong Zheng, Qizhen Weng, Yin Chen, Minyi Guo | 2026-03-22 | 下载 | Distributed prefix caching has become a core technique for efficient LLM serving. However, for long-context requests with high cache hit ratios, retrieving reusable KVCache blocks from remote servers ... |
| Parallel Gauss-Jordan Elimination and System Reduction for Efficient Circuit Simulation | Filip Noveski, Elena Hadzieva | 2026-03-22 | 下载 | For the purposes of electric circuit simulation, we consider an iterative simulation model based on solving systems of linear equations by Gauss-Jordan elimination (GJE) for individual moments in time... |
| NeSy-Edge: Neuro-Symbolic Trustworthy Self-Healing in the Computing Continuum | Peihan Ye, Alfreds Lapkovskis, Alaa Saleh, Qiyang Zhang, Praveen Kumar Donta | 2026-03-22 | 下载 | The computational demands of modern AI services are increasingly shifting execution beyond centralized clouds toward a computing continuum spanning edge and end devices. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| WN-Wrangle: Wireless Network Data Wrangling Assistant | Anirudh Kamath, Dustin Maas, Jacobus Van der Merwe, Anna Fariha | 2026-03-22 | 下载 | Data wrangling continues to be the most time-consuming task in the data science pipeline and wireless network data is no exception. Prior approaches for automatic or assisted data-wrangling primarily ... |
| WirelessBench: A Tolerance-Aware LLM Agent Benchmark for Wireless Network Intelligence | Jingwen Tong, Fang Liu, Linkai Xv, Shiliang Lu, Kangqi Li, Yiqian Zhang, Yijie Song, Zeyang Xue, Jun Zhang | 2026-03-22 | 下载 | LLM agents are emerging as a key enabler for autonomous wireless network management. Reliably deploying them, however, demands benchmarks that reflect real engineering risk. |
| Security and Privacy in O-RAN for 6G: A Comprehensive Review of Threats and Mitigation Approaches | Lujia Liang, Lei Zhang | 2026-03-22 | 下载 | Open Radio Access Network (O-RAN) is a major advancement in the telecommunications field, providing standardized interfaces that promote interoperability between different vendors' technologies, there... |
| A lightweight Outlier Detection for Characterizing Radio- and Environment-Specific Link Quality Fluctuation in Low-Power Wireless Networks | Zegeye Mekasha Kidane, Waltenegus Dargie | 2026-03-22 | 下载 | The performance of low-power wireless sensing networks can be influenced by both external environmental factors and internal imperfections which often arise due to manufacturing tolerance during mass ... |
| Learning to Optimize Joint Source and RIS-assisted Channel Encoding for Multi-User Semantic Communication Systems | Haidong Wang, Songhan Zhao, Bo Gu, Shimin Gong, Hongyang Du, Ping Wang | 2026-03-22 | 下载 | In this paper, we explore a joint source and reconfigurable intelligent surface (RIS)-assisted channel encoding (JSRE) framework for multi-user semantic communications, where a deep neural network (DN... |
| DRL-driven Online Optimization for Joint Traffic Reshaping and Channel Reconfiguration in RIS-assisted Semantic NOMA Communications | Songhan Zhao, Shimin Gong, Bo Gu, Zehui Xiong, Ping Wang, Kaibin Huang | 2026-03-22 | 下载 | This paper explores a reconfigurable intelligent surface (RIS)-assisted and semantic-aware wireless network, where multiple semantic users (SUs) transmit semantic information to an access point (AP) u... |
| Generative Artificial Intelligence Assisted Multi-modal Semantic Extraction for NOMA-based Image Transmissions | Songhan Zhao, Shimin Gong, Bo Gu, Hongyang Du, Xidong Mu, Zehui Xiong, Yuming Fang | 2026-03-22 | 下载 | In this paper, we investigate a generative artificial intelligence (GAI)-assisted semantic communication framework for non-orthogonal multiple access (NOMA)-based image transmissions. |
| AnyPro: Preference-Preserving Anycast Optimization based on Strategic AS-Path Prepending | Minyuan Zhou, Yuning Chen, Jiaqi Zheng, Yifei Xu, Pan Hu, Yongping Tang, Wendong Yin, Jie Lin, Qingyan Yu, Yuanchao Su, Guihai Chen, Wanchun Dou, Songwu Lu, Wan Du | 2026-03-22 | 下载 | Operating large-scale anycast networks is challenging because client-to-site mappings often misalign with operator's expectation due to opaque inter-domain routing. |
| OrbitStream: Training-Free Adaptive 360-degree Video Streaming via Semantic Potential Fields | Aizierjiang Aiersilan, Zhangfei Yang | 2026-03-22 | 下载 | Adaptive 360° video streaming for teleoperation faces dual challenges: viewport prediction under uncertain gaze patterns and bitrate adaptation over volatile wireless channels. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search | Jaber Jaber, Osama Jaber | 2026-03-22 | 下载 | Writing high-performance GPU kernels is among the most labor-intensive tasks in machine learning systems engineering. We present AutoKernel, an open-source framework that applies an autonomous agent l... |
| A lightweight Outlier Detection for Characterizing Radio- and Environment-Specific Link Quality Fluctuation in Low-Power Wireless Networks | Zegeye Mekasha Kidane, Waltenegus Dargie | 2026-03-22 | 下载 | The performance of low-power wireless sensing networks can be influenced by both external environmental factors and internal imperfections which often arise due to manufacturing tolerance during mass ... |