2026-03-12

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
System-Technology Co-Optimization of Bitline Routing and Bonding Pathways in Monolithic 3D DRAM Architectures	Kiseok Lee, Sungwon Cho, Seongkwang Lim, Suman Datta, Shimeng Yu	2026-03-12	下载	3D DRAM has emerged as a promising approach for continued density scaling, but its viability is limited by routing and hybrid bonding constraints to periphery, which may degrade sensing margin, latenc...
DiscoRD: An Experimental Methodology for Quickly Discovering the Reliable Read Disturbance Threshold of Real DRAM Chips	Ataberk Olgun, F. Nisa Bostanci, Ismail Emir Yuksel, Haocong Luo, Minesh Patel, A. Giray Yaglikci, Onur Mutlu	2026-03-12	下载	State-of-the-art DRAM read disturbance mitigations rely on the read disturbance threshold (RDT) (e.g., the number of aggressor row activations needed to induce the first read disturbance bitflip) to s...
SNAP-V: A RISC-V SoC with Configurable Neuromorphic Acceleration for Small-Scale Spiking Neural Networks	Kanishka Gunawardana, Sanka Peeris, Kavishka Rambukwella, Thamish Wanduragala, Saadia Jameel, Roshan Ragel, Isuru Nawinne	2026-03-12	下载	Spiking Neural Networks (SNNs) have gained significant attention in edge computing due to their low power consumption and computational efficiency.
HyperCroc: End-to-End Open-Source RISC-V MCU with a Plug-In Interface for Domain-Specific Accelerators	Philippe Sauter, Thomas Benz, Paul Scheffler, Luca Benini	2026-03-12	下载	Domain-Specific architectures with accelerators for machine learning and signal processing require efficient bulk data movement and high-bandwidth access to large datasets.
Implementing and Optimizing an Open-Source SD-card Host Controller for RISC-V SoCs	Axel Vanoni, Philippe Sauter, Paul Scheffler, Anton Buchner, Micha Wehrli, Thomas Benz, Luca Benini	2026-03-12	下载	Recent announcements have shown the viability of end-to-end open-source (OS) Linux-capable RISC-V systems on chip (SoCs). However, practical application and software development platforms require effi...
Link Quality Aware Pathfinding for Chiplet Interconnects	Aaron Yen, Jooyeon Jeong, Puneet Gupta	2026-03-12	下载	As chiplet-based integration advances, designers must select among short-reach die-to-die interconnect technologies with widely varying shoreline and areal bandwidth density, energy per bit, reach, an...
AutoVeriFix+: High-Correctness RTL Generation via Trace-Aware Causal Fix and Semantic Redundancy Pruning	Yan Tan, Xiangchen Meng, Zijun Jiang, Yangdi Lyu	2026-03-12	下载	Large language models (LLMs) have demonstrated impressive capabilities in generating software code for high-level programming languages such as Python and C++.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
TaxBreak: Unmasking the Hidden Costs of LLM Inference Through Overhead Decomposition	Prabhu Vellaisamy, Shreesh Tripathi, Vignesh Natarajan, Surya Santhan Thenarasu, Shawn Blanton, John P. Shen	2026-03-12	下载	Large Language Model (LLM) inference is widely used in interactive assistants and agentic systems. In latency-sensitive deployments, inference time can become dominated by host-side overheads.
KernelFoundry: Hardware-aware evolutionary GPU kernel optimization	Nina Wiedemann, Quentin Leboutet, Michael Paulitsch, Diana Wofk, Benjamin Ummenhofer	2026-03-12	下载	Optimizing GPU kernels presents a significantly greater challenge for large language models (LLMs) than standard code generation tasks, as it requires understanding hardware architecture, parallel opt...
OpenDC-STEAM: Realistic Modeling and Systematic Exploration of Composable Techniques for Sustainable Datacenters	Dante Niewenhuis, Sacheendra Talluri, Alexandru Iosup, Tiziano de Matteis	2026-03-12	下载	The need to reduce datacenter carbon footprint is urgent. While many sustainability techniques have been proposed, they are often evaluated in isolation, using limited setups or analytical models that...
WORKSWORLD: A Domain for Integrated Numeric Planning and Scheduling of Distributed Pipelined Workflows	Taylor Paul, William Regli	2026-03-12	下载	This work pursues automated planning and scheduling of distributed data pipelines, or workflows. We develop a general workflow and resource graph representation that includes both data processing and ...
Cornserve: A Distributed Serving System for Any-to-Any Multimodal Models	Jae-Won Chung, Jeff J. Ma, Jisang Ahn, Yizhuo Liang, Akshay Jajoo, Myungjin Lee, Mosharaf Chowdhury	2026-03-12	下载	Any-to-Any models are an emerging class of multimodal models that accept combinations of multimodal data (e.g., text, image, video, audio) as input and generate them as output.
HPC Containers for EBRAINS: Towards Portable Cross-Domain Software Environment	Krishna Kant Singh, Eric Müller, Eleni Mathioulaki, Wouter Klijn, Lena Oden	2026-03-12	下载	Deploying complex, distributed scientific workflows across diverse HPC sites is often hindered by site-specific dependencies and complex build environments.
AGMARL-DKS: An Adaptive Graph-Enhanced Multi-Agent Reinforcement Learning for Dynamic Kubernetes Scheduling	Hamed Hamzeh	2026-03-12	下载	State-of-the-art cloud-native applications require intelligent schedulers that can effectively balance system stability, resource utilisation, and associated costs.
Decentralized Orchestration Architecture for Fluid Computing: A Secure Distributed AI Use Case	Diego Cajaraville-Aboy, Ana Fernández-Vilas, Rebeca P. Díaz-Redondo, Manuel Fernández-Veiga, Pablo Picallo-López	2026-03-12	下载	Distributed AI and IoT applications increasingly execute across heterogeneous resources spanning end devices, edge/fog infrastructure, and cloud platforms, often under different administrative domains...
Deep Learning-based Assessment of the Relation Between the Third Molar and Mandibular Canal on Panoramic Radiographs using Local, Centralized, and Federated Learning	Johan Andreas Balle Rubak, Sara Haghighat, Sanyam Jain, Mostafa Aldesoki, Akhilanand Chaurasia, Sarah Sadat Ehsani, Faezeh Dehghan Ghanatkaman, Ahmad Badruddin Ghazali, Julien Issa, Basel Khalil, Rishi Ramani, Ruben Pauwels	2026-03-12	下载	Impaction of the mandibular third molar in proximity to the mandibular canal increases the risk of inferior alveolar nerve injury. Panoramic radiography is routinely used to assess this relationship.
The Carnot Bound: Limits and Possibilities for Bandwidth-Efficient Consensus	Andrew Lewis-Pye, Patrick O'Grady	2026-03-12	下载	In leader-based protocols for State Machine Replication (SMR), the leader's outgoing bandwidth is a natural throughput bottleneck. Erasure coding can alleviate this by allowing the leader to send each...
Beyond BFS: A Comparative Study of Rooted Spanning Tree Algorithms on GPUs	Abhijeet Sahu, Srikar Vilas Donur	2026-03-12	下载	Rooted spanning trees (RSTs) are a core primitive in parallel graph analytics, underpinning algorithms such as biconnected components and planarity testing.
Subtime: Reversible Information Exchange and the Emergence of Classical Time	Paul L. Borrill	2026-03-12	下载	We formalize the concept of subtime -- a reversible mode of information interchange within entangled systems -- and show how classical time emerges as an asymptotic limit through decoherence.
NCCLbpf: Verified, Composable Policy Execution for GPU Collective Communication	Yusheng Zheng	2026-03-12	下载	NCCL is the de facto standard for collective GPU communication in large-scale distributed training, relying heavily on plugins to customize runtime behavior.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Keys on Doormats: Exposed API Credentials on the Web	Nurullah Demir, Yash Vekaria, Georgios Smaragdakis, Zakir Durumeric	2026-03-12	下载	Application programming interfaces (APIs) have become a central part of the modern IT environment, allowing developers to enrich the functionality of applications and interact with third parties such ...
RadEar: A Self-Supervised RF Backscatter System for Voice Eavesdropping and Separation	Qijun Wang, Peihao Yan, Chunqi Qian, Huacheng Zeng	2026-03-12	下载	Eavesdropping on voice conversations presents a growing threat to personal privacy and information security. In this paper, we present RadEar, a novel RF backscatter-based system designed to enable co...
Intelligent 6G Edge Connectivity: A Knowledge Driven Optimization Framework for Small Cell Selection	Tuğçe Bilen, Ian F. Akyildiz	2026-03-12	下载	Sixth-generation (6G) wireless networks are expected to support immersive and mission-critical applications requiring ultra-reliable communication, sub-second responsiveness, and multi-Gbps data rates...
Kraken*: Architecting Generative, Semantic, and Goal-Oriented Network Management for 6G Wireless Systems	Ian F. Akyildiz, Tuğçe Bilen	2026-03-12	下载	Sixth-generation (6G) wireless networks are expected to support autonomous, immersive, and mission-critical services that require not only extreme data rates and ultra-low latency but also adaptive re...
The Network That Thinks: Kraken* and the Dawn of Cognitive 6G	Ian F. Akyildiz, Tuğçe Bilen	2026-03-12	下载	Future sixth-generation (6G) networks must evolve beyond high-speed data delivery to support intelligent, context-aware services. Emerging applications such as autonomous transportation, immersive ext...
Direct-to-Device Connectivity for Integrated Communication, Navigation and Surveillance	Muhammad Asad Ullah, Davi Brilhante, Luís Eduardo Partichelli Potrich, José Suárez-Varela, Paul Almasan, Charles Cleary, Vadim Kramar	2026-03-12	下载	Sixth-generation (6G) communication systems are expected to support direct-to-device (D2D) connectivity, enabling standard user equipment (UE) to seamlessly transition to non-terrestrial network (NTN)...
Radio Radiance Field: The New Frontier of Spatial Wireless Channel Representation	Haijian Sun, Feng Ye	2026-03-12	下载	Massive MIMO, among other ground-breaking technologies, is being developed for the next-generation wireless systems to support requirements in terms of data rates, reliability, latency, intelligence, ...
Internet-Scale Measurement of React2Shell Exploitation Using an Active Network Telescope	Aakash Singh, Kuldeep Singh Yadav, Md Talib Hasan Ansari, V. Anil Kumar	2026-03-12	下载	The increasing adoption of server-side component-based web frameworks has introduced new application-layer attack surfaces that remain insufficiently understood at Internet scale.
Deep Learning Network-Temporal Models For Traffic Prediction	Yufeng Xin, Ethan Fan	2026-03-12	下载	Time series analysis is critical for emerging net- work intelligent control and management functions. However, existing statistical-based and shallow machine learning models have shown limited predict...
Efficient Cross-View Localization in 6G Space-Air-Ground Integrated Network	Min Hao, Yanbing Xu, Maoqiang Wu, Jinglin Huang, Chen Shang, Jiacheng Wang, Ruichen Zhang, Jiawen Kang, Dusit Niyato, Zhu Han, Wei Ni	2026-03-12	下载	Recently, visual localization has become an important supplement to improve localization reliability, and cross-view approaches can greatly enhance coverage and adaptability.
Agentic AI for Embodied-enhanced Beam Prediction in Low-Altitude Economy Networks	Min Hao, Zhizhuo Li, Zirui Zhang, Maoqiang Wu, Han Zhang, Rong Yu	2026-03-12	下载	Millimeter-wave or terahertz communications can meet demands of low-altitude economy networks for high-throughput sensing and real-time decision making.
SliceFed: Federated Constrained Multi-Agent DRL for Dynamic Spectrum Slicing in 6G	Hossein Mohammadi, Seyed Bagher Hashemi Natanzi, Ramak Nassiri, Jamshid Hassanpour, Bo Tang, Vuk Marojevic	2026-03-12	下载	Dynamic spectrum slicing is a critical enabler for 6G Radio Access Networks (RANs), allowing the coexistence of heterogeneous services. However, optimizing resource allocation in dense, interference-l...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
NCCLbpf: Verified, Composable Policy Execution for GPU Collective Communication	Yusheng Zheng	2026-03-12	下载	NCCL is the de facto standard for collective GPU communication in large-scale distributed training, relying heavily on plugins to customize runtime behavior.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
TaxBreak: Unmasking the Hidden Costs of LLM Inference Through Overhead Decomposition	Prabhu Vellaisamy, Shreesh Tripathi, Vignesh Natarajan, Surya Santhan Thenarasu, Shawn Blanton, John P. Shen	2026-03-12	下载	Large Language Model (LLM) inference is widely used in interactive assistants and agentic systems. In latency-sensitive deployments, inference time can become dominated by host-side overheads.