Skip to content

2026-04-01

cs.AR - Architecture

标题作者发布日期PDF摘要
LightGuard: Transparent WiFi Security via Physical-Layer LiFi Key BootstrappingShiqi Xu, Yuyang Du, Mingyue Zhang, Hongwei Cui, Soung Chang Liew2026-04-01下载WiFi is inherently vulnerable to eavesdropping because RF signals may penetrate many physical boundaries, such as walls and floors. LiFi, by contrast, is an optical method confined to line-of-sight an...
Escaping Flatland: A Placement Flow for Enabling 3D FPGAsCong Hao, Andrew B. Kahng, Bodhisatta Pramanik, Ismael Youssef2026-04-01下载3D field-programmable gate arrays (FPGAs) promise higher performance through vertical integration. However, existing placement tools, largely inherited from 2D frameworks, fail to capture the unique d...
Highly-Parallel Atom-Detection Accelerator for Tweezer-Based Neutral Atom Quantum ComputersJonas Winklmann, Yian Yu, Xiaorang Guo, Korbinian Staudacher, Martin Schulz2026-04-01下载Neutral atom quantum computers (NAQCs) are among the most promising computational platforms for quantum computing. Controlling and measuring individual atoms and their states, which often requires mul...
RePart: Efficient Hypergraph Partitioning with Logic Replication Optimization for Multi-FPGA SystemZizhuo Fu, Yifan Zhou, Zhaoxin Lu, Guangyu Sun, Runsheng Wang, Meng Li, Yibo Lin2026-04-01下载Multi-FPGA systems (MFS) are widely adopted for VLSI emulation and rapid prototyping. In an MFS, FPGAs connect only to a limited number of neighbors through bandwidth-constrained links, so inter-FPGA ...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTeTara Saba, Anne Ouyang, Xujie Si, Fan Long2026-04-01下载High-performance GPU kernels are critical to modern machine learning systems, yet developing efficient implementations remains a challenging, expert-driven process due to the tight coupling between al...
Distributed Variational Quantum Linear SolverTong Shen, Zeru Zhu, Ji Liu2026-04-01下载This paper develops a distributed variational quantum algorithm for solving large-scale linear equations. For a linear system of the form Ax=bAx=b, the large square matrix AA is partitioned into smalle...
EXaCTz: Guaranteed Extremum Graph and Contour Tree Preservation for Distributed- and GPU-Parallel Lossy CompressionYuxiao Li, Mingze Xia, Xin Liang, Bei Wang, Hanqi Guo2026-04-01下载This paper introduces EXaCTz, a parallel algorithm that concurrently preserves extremum graphs and contour trees in lossy-compressed scalar field data.
EmbedPart: Embedding-Driven Graph Partitioning for Scalable Graph Neural Network TrainingNikolai Merkel, Ruben Mayer, Volker Markl, Hans-Arno Jacobsen2026-04-01下载Graph Neural Networks (GNNs) are widely used for learning on graph-structured data, but scaling GNN training to massive graphs remains challenging.
Scalable Pretraining of Large Mixture of Experts Language Models on Aurora Super ComputerDharma Teja Vooturi, Dhiraj Kalamkar, Dipankar Das, Bharat Kaul2026-04-01下载Pretraining Large Language Models (LLMs) from scratch requires massive amount of compute. Aurora super computer is an ExaScale machine with 127,488 Intel PVC (Ponte Vechio) GPU tiles.
Is RISC-V Ready for Machine Learning? Portable Gaussian Processes Using Asynchronous TasksAlexander Strack, Patrick Diehl, Dirk Pflüger2026-04-01下载Gaussian processes are widely used in machine learning domains but remain computationally demanding, limiting their efficient scalability across diverse hardware platforms.
Fast Deterministic Distributed Degree SplittingYannic Maus, Alexandre Nolin, Florian Schager2026-04-01下载We obtain better algorithms for computing more balanced orientations and degree splits in LOCAL. Important to our result is a connection to the hypergraph sinkless orientation problem [BMNSU, SODA'25]...
MPI-Q: A Message Communication Library for Large-Scale Classical-Quantum Heterogeneous Hybrid Distributed ComputingFeng Wang, Junchao Wang, Zeyuan Wang, Lei Li, Hang Lian, Yangyang Fei, Jinyang Yao, Xuyan Qi, Fudong Liu, Yifan Hou, Shibo Liang, Zheng Shan2026-04-01下载The classical-quantum system heterogeneity (different data characteristics, execution paradigms and synchronization mechanism etc.) renders existing distributed communication mechanisms (e.g.
Reclaiming Idle CPU Cycles on Kubernetes: Sparse-Domain Multiplexing for Concurrent MPI-CFD SimulationsTianfang Xie2026-04-01下载When MPI-parallel simulations run on shared Kubernetes clusters, conventional CPU scheduling leaves the vast majority of provisioned cycles idle at synchronization barriers.
TENT: A Declarative Slice Spraying Engine for Performant and Resilient Data Movement in Disaggregated LLM ServingFeng Ren, Ruoyu Qin, Teng Ma, Shangming Cai, Zheng Liu, Chao Lei, Dejiang Zhu, Ke Yang, Zheming Li, Jialei Cui, Weixiao Huang, Yikai Zhao, Yineng Zhang, Hao Wu, Xiang Gao, Yuhao Fu, Jinlei Jiang, Yongwei Wu, Mingxing Zhang2026-04-01下载Modern GPU clusters are built upon a complex hierarchy of heterogeneous interconnects, ranging from multi-rail RDMA to proprietary fabrics such as Multi-Node NVLink and Ascend UB.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
LightGuard: Transparent WiFi Security via Physical-Layer LiFi Key BootstrappingShiqi Xu, Yuyang Du, Mingyue Zhang, Hongwei Cui, Soung Chang Liew2026-04-01下载WiFi is inherently vulnerable to eavesdropping because RF signals may penetrate many physical boundaries, such as walls and floors. LiFi, by contrast, is an optical method confined to line-of-sight an...
POLARIS: PHY-Aware Spectrum Steering for Dynamic Spectrum SharingStavros Dimou, Guevara Noubir2026-04-01下载Dynamic Spectrum Sharing (DSS) enables flexible activation of additional spectrum resources but leaves open a key runtime question: once new spectrum becomes available, which steering mechanism should...
Agentic AI-Empowered Wireless Agent Networks With Semantic-Aware Collaboration via ILACZhouxiang Zhao, Jiaxiang Wang, Zhaohui Yang, Kun Yang, Zhaoyang Zhang, Mingzhe Chen, Kaibin Huang2026-04-01下载The rapid development of agentic artificial intelligence (AI) is driving future wireless networks to evolve from passive data pipes into intelligent collaborative ecosystems under the emerging paradig...
Adversarial Attacks in AI-Driven RAN Slicing: SLA Violations and RecoveryDeemah H. Tashman, Soumaya Cherkaoui2026-04-01下载Next-generation (NextG) cellular networks are designed to support emerging applications with diverse data rate and latency requirements, such as immersive multimedia services and large-scale Internet ...
Cardinality is Not Enough: Super Host Detection via Segmented Cardinality EstimationYilin Zhao, Jiawei Huang, Xianshi Su, Weihe Li, Xin Li, Yan Liu, Jiacheng Xie, Qichen Su, Jin Ye, Wanchun Jiang, Jianxin Wang2026-04-01下载Accurately detecting super host that establishes connections to a large number of distinct peers is significant for mitigating web attacks and ensuring high quality of web service.
Optimal Sampling and Actuation Policies of a Markov Source over a Wireless ChannelMehrdad Salimnejad, Anthony Ephremides, Marios Kountouris, Nikolaos Pappas2026-04-01下载This paper studies efficient data management and timely information dissemination for real-time monitoring of an NN-state Markov process, enabling accurate state estimation and reliable actuation dec...
Online Network Slice Deployment across Multiple Domains under Trust ConstraintsJulien Ali El Amine, Nour El Houda Nouar, Olivier Brun2026-04-01下载Network slicing across multiple administrative domains raises two coupled challenges: enforcing slice-specific trust constraints while enabling fast online admission and placement decisions.
Birdcast: Interest-aware BEV Multicasting for Infrastructure-assisted Collaborative PerceptionYanan Ma, Zhengru Fang, Yihang Tao, Yu Guo, Yiqin Deng, Xianhao Chen, Yuguang Fang2026-04-01下载Vehicle-to-infrastructure collaborative perception (V2I-CP) leverages a high-vantage node to transmit supplementary information, i.e., bird's-eye-view (BEV) feature maps, to vehicles, effectively over...
Hybrid Classical--Quantum Optimization of Wireless Routing Using QAOA and Quantum WalksEric Howard, Hardique Dasore, Hom Nath Dhungana, Radhika Kuttala, Samuel Murphy, Emma Soo, Shah Haque2026-04-01下载Routing in wireless communication networks is shaped by mobility, interference, congestion, and competing service requirements, making route selection a high-dimensional constrained optimization probl...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Generative Profiling for Soft Real-Time Systems and its Applications to Resource AllocationGeorgiy A. Bondar, Abigail Eisenklam, Yifan Cai, Robert Gifford, Tushar Sial, Linh Thi Xuan Phan, Abhishek Halder2026-04-01下载Modern real-time systems require accurate characterization of task timing behavior to ensure predictable performance, particularly on complex hardware architectures.

cs.PF - Performance

标题作者发布日期PDF摘要
CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTeTara Saba, Anne Ouyang, Xujie Si, Fan Long2026-04-01下载High-performance GPU kernels are critical to modern machine learning systems, yet developing efficient implementations remains a challenging, expert-driven process due to the tight coupling between al...
Dual-Select FMA Butterfly for FFT: Eliminating Twiddle Factor Singularities with Bounded Precomputed RatiosMohamed Amine Bergach2026-04-01下载The fused multiply-add (FMA) instruction enables the radix-2 FFT butterfly to be computed in 6~FMA operations -- the proven minimum. The classical factorization by Linzer and Feig~\cite{linzer1993} pr...

基于 VitePress 构建