
2026-03-12

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| System-Technology Co-Optimization of Bitline Routing and Bonding Pathways in Monolithic 3D DRAM Architectures | Kiseok Lee, Sungwon Cho, Seongkwang Lim, Suman Datta, Shimeng Yu | 2026-03-12 | Download | 3D DRAM has emerged as a promising approach for continued density scaling, but its viability is limited by routing and hybrid bonding constraints to periphery, which may degrade sensing margin, latenc... |
| DiscoRD: An Experimental Methodology for Quickly Discovering the Reliable Read Disturbance Threshold of Real DRAM Chips | Ataberk Olgun, F. Nisa Bostanci, Ismail Emir Yuksel, Haocong Luo, Minesh Patel, A. Giray Yaglikci, Onur Mutlu | 2026-03-12 | Download | State-of-the-art DRAM read disturbance mitigations rely on the read disturbance threshold (RDT) (e.g., the number of aggressor row activations needed to induce the first read disturbance bitflip) to s... |
| SNAP-V: A RISC-V SoC with Configurable Neuromorphic Acceleration for Small-Scale Spiking Neural Networks | Kanishka Gunawardana, Sanka Peeris, Kavishka Rambukwella, Thamish Wanduragala, Saadia Jameel, Roshan Ragel, Isuru Nawinne | 2026-03-12 | Download | Spiking Neural Networks (SNNs) have gained significant attention in edge computing due to their low power consumption and computational efficiency. |
| HyperCroc: End-to-End Open-Source RISC-V MCU with a Plug-In Interface for Domain-Specific Accelerators | Philippe Sauter, Thomas Benz, Paul Scheffler, Luca Benini | 2026-03-12 | Download | Domain-Specific architectures with accelerators for machine learning and signal processing require efficient bulk data movement and high-bandwidth access to large datasets. |
| Implementing and Optimizing an Open-Source SD-card Host Controller for RISC-V SoCs | Axel Vanoni, Philippe Sauter, Paul Scheffler, Anton Buchner, Micha Wehrli, Thomas Benz, Luca Benini | 2026-03-12 | Download | Recent announcements have shown the viability of end-to-end open-source (OS) Linux-capable RISC-V systems on chip (SoCs). However, practical application and software development platforms require effi... |
| Link Quality Aware Pathfinding for Chiplet Interconnects | Aaron Yen, Jooyeon Jeong, Puneet Gupta | 2026-03-12 | Download | As chiplet-based integration advances, designers must select among short-reach die-to-die interconnect technologies with widely varying shoreline and areal bandwidth density, energy per bit, reach, an... |
| AutoVeriFix+: High-Correctness RTL Generation via Trace-Aware Causal Fix and Semantic Redundancy Pruning | Yan Tan, Xiangchen Meng, Zijun Jiang, Yangdi Lyu | 2026-03-12 | Download | Large language models (LLMs) have demonstrated impressive capabilities in generating software code for high-level programming languages such as Python and C++. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| TaxBreak: Unmasking the Hidden Costs of LLM Inference Through Overhead Decomposition | Prabhu Vellaisamy, Shreesh Tripathi, Vignesh Natarajan, Surya Santhan Thenarasu, Shawn Blanton, John P. Shen | 2026-03-12 | Download | Large Language Model (LLM) inference is widely used in interactive assistants and agentic systems. In latency-sensitive deployments, inference time can become dominated by host-side overheads. |
| KernelFoundry: Hardware-aware evolutionary GPU kernel optimization | Nina Wiedemann, Quentin Leboutet, Michael Paulitsch, Diana Wofk, Benjamin Ummenhofer | 2026-03-12 | Download | Optimizing GPU kernels presents a significantly greater challenge for large language models (LLMs) than standard code generation tasks, as it requires understanding hardware architecture, parallel opt... |
| OpenDC-STEAM: Realistic Modeling and Systematic Exploration of Composable Techniques for Sustainable Datacenters | Dante Niewenhuis, Sacheendra Talluri, Alexandru Iosup, Tiziano de Matteis | 2026-03-12 | Download | The need to reduce datacenter carbon footprint is urgent. While many sustainability techniques have been proposed, they are often evaluated in isolation, using limited setups or analytical models that... |
| WORKSWORLD: A Domain for Integrated Numeric Planning and Scheduling of Distributed Pipelined Workflows | Taylor Paul, William Regli | 2026-03-12 | Download | This work pursues automated planning and scheduling of distributed data pipelines, or workflows. We develop a general workflow and resource graph representation that includes both data processing and ... |
| Cornserve: A Distributed Serving System for Any-to-Any Multimodal Models | Jae-Won Chung, Jeff J. Ma, Jisang Ahn, Yizhuo Liang, Akshay Jajoo, Myungjin Lee, Mosharaf Chowdhury | 2026-03-12 | Download | Any-to-Any models are an emerging class of multimodal models that accept combinations of multimodal data (e.g., text, image, video, audio) as input and generate them as output. |
| HPC Containers for EBRAINS: Towards Portable Cross-Domain Software Environment | Krishna Kant Singh, Eric Müller, Eleni Mathioulaki, Wouter Klijn, Lena Oden | 2026-03-12 | Download | Deploying complex, distributed scientific workflows across diverse HPC sites is often hindered by site-specific dependencies and complex build environments. |
| AGMARL-DKS: An Adaptive Graph-Enhanced Multi-Agent Reinforcement Learning for Dynamic Kubernetes Scheduling | Hamed Hamzeh | 2026-03-12 | Download | State-of-the-art cloud-native applications require intelligent schedulers that can effectively balance system stability, resource utilisation, and associated costs. |
| Decentralized Orchestration Architecture for Fluid Computing: A Secure Distributed AI Use Case | Diego Cajaraville-Aboy, Ana Fernández-Vilas, Rebeca P. Díaz-Redondo, Manuel Fernández-Veiga, Pablo Picallo-López | 2026-03-12 | Download | Distributed AI and IoT applications increasingly execute across heterogeneous resources spanning end devices, edge/fog infrastructure, and cloud platforms, often under different administrative domains... |
| Deep Learning-based Assessment of the Relation Between the Third Molar and Mandibular Canal on Panoramic Radiographs using Local, Centralized, and Federated Learning | Johan Andreas Balle Rubak, Sara Haghighat, Sanyam Jain, Mostafa Aldesoki, Akhilanand Chaurasia, Sarah Sadat Ehsani, Faezeh Dehghan Ghanatkaman, Ahmad Badruddin Ghazali, Julien Issa, Basel Khalil, Rishi Ramani, Ruben Pauwels | 2026-03-12 | Download | Impaction of the mandibular third molar in proximity to the mandibular canal increases the risk of inferior alveolar nerve injury. Panoramic radiography is routinely used to assess this relationship. |
| The Carnot Bound: Limits and Possibilities for Bandwidth-Efficient Consensus | Andrew Lewis-Pye, Patrick O'Grady | 2026-03-12 | Download | In leader-based protocols for State Machine Replication (SMR), the leader's outgoing bandwidth is a natural throughput bottleneck. Erasure coding can alleviate this by allowing the leader to send each... |
| Beyond BFS: A Comparative Study of Rooted Spanning Tree Algorithms on GPUs | Abhijeet Sahu, Srikar Vilas Donur | 2026-03-12 | Download | Rooted spanning trees (RSTs) are a core primitive in parallel graph analytics, underpinning algorithms such as biconnected components and planarity testing. |
| Subtime: Reversible Information Exchange and the Emergence of Classical Time | Paul L. Borrill | 2026-03-12 | Download | We formalize the concept of subtime -- a reversible mode of information interchange within entangled systems -- and show how classical time emerges as an asymptotic limit through decoherence. |
| NCCLbpf: Verified, Composable Policy Execution for GPU Collective Communication | Yusheng Zheng | 2026-03-12 | Download | NCCL is the de facto standard for collective GPU communication in large-scale distributed training, relying heavily on plugins to customize runtime behavior. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Keys on Doormats: Exposed API Credentials on the Web | Nurullah Demir, Yash Vekaria, Georgios Smaragdakis, Zakir Durumeric | 2026-03-12 | Download | Application programming interfaces (APIs) have become a central part of the modern IT environment, allowing developers to enrich the functionality of applications and interact with third parties such ... |
| RadEar: A Self-Supervised RF Backscatter System for Voice Eavesdropping and Separation | Qijun Wang, Peihao Yan, Chunqi Qian, Huacheng Zeng | 2026-03-12 | Download | Eavesdropping on voice conversations presents a growing threat to personal privacy and information security. In this paper, we present RadEar, a novel RF backscatter-based system designed to enable co... |
| Intelligent 6G Edge Connectivity: A Knowledge Driven Optimization Framework for Small Cell Selection | Tuğçe Bilen, Ian F. Akyildiz | 2026-03-12 | Download | Sixth-generation (6G) wireless networks are expected to support immersive and mission-critical applications requiring ultra-reliable communication, sub-second responsiveness, and multi-Gbps data rates... |
| Kraken*: Architecting Generative, Semantic, and Goal-Oriented Network Management for 6G Wireless Systems | Ian F. Akyildiz, Tuğçe Bilen | 2026-03-12 | Download | Sixth-generation (6G) wireless networks are expected to support autonomous, immersive, and mission-critical services that require not only extreme data rates and ultra-low latency but also adaptive re... |
| The Network That Thinks: Kraken* and the Dawn of Cognitive 6G | Ian F. Akyildiz, Tuğçe Bilen | 2026-03-12 | Download | Future sixth-generation (6G) networks must evolve beyond high-speed data delivery to support intelligent, context-aware services. Emerging applications such as autonomous transportation, immersive ext... |
| Direct-to-Device Connectivity for Integrated Communication, Navigation and Surveillance | Muhammad Asad Ullah, Davi Brilhante, Luís Eduardo Partichelli Potrich, José Suárez-Varela, Paul Almasan, Charles Cleary, Vadim Kramar | 2026-03-12 | Download | Sixth-generation (6G) communication systems are expected to support direct-to-device (D2D) connectivity, enabling standard user equipment (UE) to seamlessly transition to non-terrestrial network (NTN)... |
| Radio Radiance Field: The New Frontier of Spatial Wireless Channel Representation | Haijian Sun, Feng Ye | 2026-03-12 | Download | Massive MIMO, among other ground-breaking technologies, is being developed for the next-generation wireless systems to support requirements in terms of data rates, reliability, latency, intelligence, ... |
| Internet-Scale Measurement of React2Shell Exploitation Using an Active Network Telescope | Aakash Singh, Kuldeep Singh Yadav, Md Talib Hasan Ansari, V. Anil Kumar | 2026-03-12 | Download | The increasing adoption of server-side component-based web frameworks has introduced new application-layer attack surfaces that remain insufficiently understood at Internet scale. |
| Deep Learning Network-Temporal Models For Traffic Prediction | Yufeng Xin, Ethan Fan | 2026-03-12 | Download | Time series analysis is critical for emerging network intelligent control and management functions. However, existing statistical-based and shallow machine learning models have shown limited predict... |
| Efficient Cross-View Localization in 6G Space-Air-Ground Integrated Network | Min Hao, Yanbing Xu, Maoqiang Wu, Jinglin Huang, Chen Shang, Jiacheng Wang, Ruichen Zhang, Jiawen Kang, Dusit Niyato, Zhu Han, Wei Ni | 2026-03-12 | Download | Recently, visual localization has become an important supplement to improve localization reliability, and cross-view approaches can greatly enhance coverage and adaptability. |
| Agentic AI for Embodied-enhanced Beam Prediction in Low-Altitude Economy Networks | Min Hao, Zhizhuo Li, Zirui Zhang, Maoqiang Wu, Han Zhang, Rong Yu | 2026-03-12 | Download | Millimeter-wave or terahertz communications can meet demands of low-altitude economy networks for high-throughput sensing and real-time decision making. |
| SliceFed: Federated Constrained Multi-Agent DRL for Dynamic Spectrum Slicing in 6G | Hossein Mohammadi, Seyed Bagher Hashemi Natanzi, Ramak Nassiri, Jamshid Hassanpour, Bo Tang, Vuk Marojevic | 2026-03-12 | Download | Dynamic spectrum slicing is a critical enabler for 6G Radio Access Networks (RANs), allowing the coexistence of heterogeneous services. However, optimizing resource allocation in dense, interference-l... |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| NCCLbpf: Verified, Composable Policy Execution for GPU Collective Communication | Yusheng Zheng | 2026-03-12 | Download | NCCL is the de facto standard for collective GPU communication in large-scale distributed training, relying heavily on plugins to customize runtime behavior. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| TaxBreak: Unmasking the Hidden Costs of LLM Inference Through Overhead Decomposition | Prabhu Vellaisamy, Shreesh Tripathi, Vignesh Natarajan, Surya Santhan Thenarasu, Shawn Blanton, John P. Shen | 2026-03-12 | Download | Large Language Model (LLM) inference is widely used in interactive assistants and agentic systems. In latency-sensitive deployments, inference time can become dominated by host-side overheads. |
