2026-03-31

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Computer Architecture's AlphaZero Moment: Automated Discovery in an Encircled World | Karthikeyan Sankaralingam | 2026-03-31 | Download | The end of Moore's Law and Dennard scaling has fundamentally changed the economics of computer architecture. With transistor scaling delivering diminishing returns, architectural innovation is now the... |
| A Security-Aware Nonlinearity Study of FPGA-Based Time-to-Digital Converters for Quantum Key Distribution Systems | Kun Qin, Carsten Trinitis | 2026-03-31 | Download | Intrinsic nonlinearity in FPGA-based time-to-digital converters (TDCs) is often treated as a calibration issue and evaluated mainly through post-correction metrics. |
| SISA: A Scale-In Systolic Array for GEMM Acceleration | Luigi Altamura, Alessio Cicero, Mateo Vázquez Maceiras, Mohammad Ali Maleki, Pedro Trancoso | 2026-03-31 | Download | The currently dominant AI/ML workloads, such as Large Language Models (LLMs), rely on the efficient execution of General Matrix-Matrix Multiplication (GEMM) operations. |
| HLC: A High-Quality Lightweight Mezzanine Codec Featuring High-Throughput Palette | Chenlong He, Leilei Huang, Wei Li, Hanyang Cui, Zhijian Hao, Xiaoyang Zeng, Yibo Fan | 2026-03-31 | Download | Existing mezzanine image codecs lack specialized screen content coding tools and therefore struggle to maintain high image quality under bandwidth constraints, especially in areas with dense text. |
| CXLRAMSim v1.0: System-Level Exploration of CXL Memory Expander Cards | Karan Pathak, David Atienza, Marina Zapater | 2026-03-31 | Download | The growing demands in the training and inference of Large Language Models (LLMs) are accelerating the adoption of scale-up systems that extend server shared memory through the use of Compute Express ... |
| Deep Learning-Based Anomaly Detection in Spacecraft Telemetry on Edge Devices | Christopher Goetze, Tim Schlippe, Daniel Lakey | 2026-03-31 | Download | Spacecraft anomaly detection is critical for mission safety, yet deploying sophisticated models on-board presents significant challenges due to hardware constraints. |
| AP-DRL: A Synergistic Algorithm-Hardware Framework for Automatic Task Partitioning of Deep Reinforcement Learning on Versal ACAP | Enlai Li, Zhe Lin, Sharad Sinha, Wei Zhang | 2026-03-31 | Download | Deep reinforcement learning has demonstrated remarkable success across various domains. However, the tight coupling between training and inference processes makes accelerating DRL training an essentia... |
| From Physics to Surrogate Intelligence: A Unified Electro-Thermo-Optimization Framework for TSV Networks | Mohamed Gharib, Leonid Popryho, Inna Partin-Vaisband | 2026-03-31 | Download | High-density through-substrate vias (TSVs) enable 2.5D/3D heterogeneous integration but introduce significant signal-integrity and thermal-reliability challenges due to electrical coupling, insertion ... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| From Skew to Symmetry: Node-Interconnect Multi-Path Balancing with Execution-time Planning for Modern GPU Clusters | Jinghan Yao, Kaushik Kandadi, Bharath Ramesh, Hari Subramoni, Dhabaleswar K. Panda | 2026-03-31 | Download | Modern GPU-based high-performance computing clusters offer unprecedented communication bandwidth through heterogeneous intra-node interconnects and inter-node networks. |
| MAC-Attention: a Match-Amend-Complete Scheme for Fast and Accurate Attention Computation | Jinghan Yao, Sam Adé Jacobs, Walid Krichene, Masahiro Tanaka, Dhabaleswar K Panda | 2026-03-31 | Download | Long-context decoding in LLMs is IO-bound: each token re-reads an ever-growing KV cache. Prior accelerations cut bytes via compression, which lowers fidelity, or selection/eviction, which restricts wh... |
| Source Known Identifiers: A Three-Tier Identity System for Distributed Applications | Duran Serkan Kılıç | 2026-03-31 | Download | Distributed applications need identifiers that satisfy storage efficiency, chronological sortability, origin metadata embedding, zero-lookup verifiability, confidentiality for external consumers, and ... |
| A Lightweight Hybrid Publish/Subscribe Event Fabric for IPC and Modular Distributed Systems | Dimitris Gkoulis | 2026-03-31 | Download | Modular software deployed on mini compute units in controlled distributed environments often needs two messaging paths: low-overhead in-process coordination and selective cross-node distribution. |
| Scalable AI-assisted Workflow Management for Detector Design Optimization Using Distributed Computing | Derek Anderson, Amit Bashyal, Markus Diefenthaler, Cristiano Fanelli, Wen Guan, Tanja Horn, Alex Jentsch, Meifeng Lin, Tadashi Maeno, Kei Nagai, Hemalata Nayak, Connor Pecar, Karthik Suresh, Fang-Ying Tsai, Anselm Vossen, Tianle Wang, Torre Wenaus | 2026-03-31 | Download | The Production and Distributed Analysis (PanDA) system, originally developed for the ATLAS experiment at the CERN Large Hadron Collider (LHC), has evolved into a robust platform for orchestrating larg... |
| A Precision Emulation Approach to the GPU Acceleration of Ab Initio Electronic Structure Calculations | Hang Liu, Junjie Li, Yinzhi Wang, Niraj K. Nepal, Yang Wang | 2026-03-31 | Download | This study explores the use of INT8-based emulation for accelerating traditional FP64-based HPC workloads on modern GPU architectures. Through SCILIB-Accel automatic BLAS offload tool for cache-cohere... |
| M3SA: Exploring Datacenter Performance and Climate-Impact with Multi- and Meta-Model Simulation and Analysis | Radu Nicolae, Dante Niewenhuis, Sacheendra Talluri, Alexandru Iosup | 2026-03-31 | Download | Datacenters are vital to our digital society, but consume a considerable fraction of global electricity and demand is projected to increase. To improve their sustainability and performance, we envisio... |
| Storing Less, Finding More: How Novelty Filtering Improves Cross-Modal Retrieval on Edge Cameras | Sherif Abdelwahab | 2026-03-31 | Download | Always-on edge cameras generate continuous video streams where redundant frames degrade cross-modal retrieval by crowding correct results out of top-k search. |
| Efficient Parallel Compilation and Profiling of Quantum Circuits at Large Scales | Jane Moore, Michael Hart, John McAllister | 2026-03-31 | Download | Compiling quantum circuits is a major bottleneck in quantum computing, and given the scale required in a few years, is likely to become infeasibly long. |
| Polynomial Time Local Decision Revisited | Laurent Feuilloley, Soumyadeep Paul, Ami Paz | 2026-03-31 | Download | We consider three classification systems for distributed decision tasks: With unbounded computation and certificates, defined by Balliu, D'Angelo, Fraigniaud, and Olivetti [JCSS'18], and with (two fla... |
| Exploration of Energy and Throughput Tradeoffs for Dataflow Networks | Abrarul Karim, Joachim Falk, Jürgen Teich | 2026-03-31 | Download | The introduction of dynamic power management strategies such as clock gating and power gating in dataflow networks has been shown to provide significant energy savings when applied during idle times. |
| Beyond Corner Patches: Semantics-Aware Backdoor Attack in Federated Learning | Kavindu Herath, Joshua Zhao, Saurabh Bagchi | 2026-03-31 | Download | Backdoor attacks on federated learning (FL) are most often evaluated with synthetic corner patches or out-of-distribution (OOD) patterns that are unlikely to arise in practice. |
| Downsides of Smartness Across Edge-Cloud Continuum in Modern Industry | Akhil Gupta Chigullapally, Sharvan Vittala, Razin Farhan Hussian, Mohsen Amini Salehi | 2026-03-31 | Download | The fast pace of modern AI is rapidly transforming traditional industrial systems into vast, intelligent and potentially unmanned autonomous operational environments driven by AI-based solutions. |
| 1.5 Million Messages Per Second on 3 Machines: Benchmarking and Latency Optimization of Apache Pulsar at Enterprise Scale | Muhamed Ramees Cheriya Mukkolakkal | 2026-03-31 | Download | This paper presents two independent contributions for Apache Pulsar practitioners. First, we validate 1,499,947 msg/s at 3.88 ms median publish latency on just three bare-metal Kubernetes nodes runnin... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| From Skew to Symmetry: Node-Interconnect Multi-Path Balancing with Execution-time Planning for Modern GPU Clusters | Jinghan Yao, Kaushik Kandadi, Bharath Ramesh, Hari Subramoni, Dhabaleswar K. Panda | 2026-03-31 | Download | Modern GPU-based high-performance computing clusters offer unprecedented communication bandwidth through heterogeneous intra-node interconnects and inter-node networks. |
| Making Sense of AI Agents Hype: Adoption, Architectures, and Takeaways from Practitioners | Ruoyu Su, Matteo Esposito, Roberta Capuano, Rafiullah Omar, June Sallou, Henry Muccini, Davide Taibi | 2026-03-31 | Download | To support practitioners in understanding how agentic systems are designed in real-world industrial practice, we present a review of practitioner conference talks on AI agents. |
| GreenFLag: A Green Agentic Approach for Energy-Efficient Federated Learning | Theodora Panagea, Nikolaos Koursioumpas, Lina Magoula, Ramin Khalili | 2026-03-31 | Download | Progressing toward a new generation of mobile networks, a clear focus on integrating distributed intelligence across the system is observed to drive performance, autonomy, and real-time adaptability. |
| 6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management | Jiao Chen, Jianhua Tang, Xiaotong Yang, Zuohong Lv | 2026-03-31 | Download | Autonomous 6G network management requires agents that can execute tools, observe the resulting state changes, and adapt their decisions accordingly. |
| Mean Masked Autoencoder with Flow-Mixing for Encrypted Traffic Classification | Xiao Liu, Xiaowei Fu, Fuxiang Huang, Lei Zhang | 2026-03-31 | Download | Network traffic classification using self-supervised pre-training models based on Masked Autoencoders (MAE) has demonstrated a huge potential. |
| TrafficMoE: Heterogeneity-aware Mixture of Experts for Encrypted Traffic Classification | Qing He, Xiaowei Fu, Lei Zhang | 2026-03-31 | Download | Encrypted traffic classification is a critical task for network security. While deep learning has advanced this field, the occlusion of payload semantics by encryption severely challenges standard mod... |
| Multi-AUV Cooperative Target Tracking Based on Supervised Diffusion-Aided Multi-Agent Reinforcement Learning | Jiaao Ma, Chuan Lin, Guangjie Han, Shengchao Zhu, Zhenyu Wang, Chen An | 2026-03-31 | Download | In recent years, advances in underwater networking and multi-agent reinforcement learning (MARL) have significantly expanded multi-autonomous underwater vehicle (AUV) applications in marine exploratio... |
| TORCH: Characterizing Invalid Route Filtering via Tunnelled Observation | Renrui Tian, Yahui Li, Xia Yin, Han Zhang, Xingang Shi, Zhiliang Wang | 2026-03-31 | Download | To mitigate BGP prefix hijacking, the Resource Public Key Infrastructure (RPKI) provides prefix origin authentication via Route Origin Validation (ROV). |
| Needle in a Haystack: Tracking UAVs from Massive Noise in Real-World 5G-A Base Station Data | Chengzhen Meng, Chenming He, Yidong Jiang, Xiaoran Fan, Dequan Wang, Lingyu Wang, Jianmin Ji, Yanyong Zhang | 2026-03-31 | Download | The potential usage of UAVs in daily life has made monitoring them essential. However, existing systems for monitoring UAVs typically rely on cameras, LiDARs, or radars, whose limited sensing range or... |
| Enabling Programmable Inference and ISAC at the 6GR Edge with dApps | Michele Polese, Rajeev Gangula, Tommaso Melodia | 2026-03-31 | Download | The convergence of communication, sensing, and Artificial Intelligence (AI) in the Radio Access Network (RAN) offers compelling economic advantages through shared spectrum and infrastructure. |
| A Multi-Sensor Fusion Parking Barrier System with Lightweight Vision on Edge | Yuwen Zhu, Feiyang Qi, Zhengzhe Xiang | 2026-03-31 | Download | To address the challenges of simultaneously satisfying detection accuracy, edge real-time performance, low-power operation, and end-to-end business linkage in parking scenarios, this paper proposes an... |
| 1.5 Million Messages Per Second on 3 Machines: Benchmarking and Latency Optimization of Apache Pulsar at Enterprise Scale | Muhamed Ramees Cheriya Mukkolakkal | 2026-03-31 | Download | This paper presents two independent contributions for Apache Pulsar practitioners. First, we validate 1,499,947 msg/s at 3.88 ms median publish latency on just three bare-metal Kubernetes nodes runnin... |
| LoRaWAN Gateway Placement for Network Planning Using Ray Tracing-based Channel Models | Cláudio Modesto, Lucas Mozart, Glauco Gonçalves, Cleverson Nahum, Bruno Castro, Aldebaro Klautau | 2026-03-31 | Download | Network planning is a fundamental task in wireless communications, primarily focused on guaranteeing adequate coverage for every network device. |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Risk-Aware Batch Testing for Performance Regression Detection | Ali Sayedsalehi, Peter C. Rigby, Gregory Mierzwinski | 2026-03-31 | Download | Performance regression testing is essential in large-scale continuous-integration (CI) systems, yet executing full performance suites for every commit is prohibitively expensive. |
| An Empirical Study on How Architectural Topology Affects Microservice Performance and Energy Usage | Irena Ristova, Vincenzo Stoico | 2026-03-31 | Download | Microservice architectures form the backbone of modern software systems for their scalability, resilience, and maintainability, but their rise in cloud-native environments raises energy efficiency con... |
| A Precision Emulation Approach to the GPU Acceleration of Ab Initio Electronic Structure Calculations | Hang Liu, Junjie Li, Yinzhi Wang, Niraj K. Nepal, Yang Wang | 2026-03-31 | Download | This study explores the use of INT8-based emulation for accelerating traditional FP64-based HPC workloads on modern GPU architectures. Through SCILIB-Accel automatic BLAS offload tool for cache-cohere... |
| SysOM-AI: Continuous Cross-Layer Performance Diagnosis for Production AI Training | Yusheng Zheng, Wenan Mao, Shuyi Cheng, Fuqiu Feng, Guangshui Li, Zhaoyan Liao, Yongzhuo Huang, Zhenwei Xiao, Yuqing Li, Andi Quinn, Tao Ma | 2026-03-31 | Download | Performance diagnosis in production-scale AI training is challenging because subtle OS-level issues can trigger cascading GPU delays and network slowdowns, degrading training efficiency across thousan... |
| Closed-Loop Integrated Sensing, Communication, and Control for Efficient Drone Flight | Jingli Li, Yiyan Ma, Bo Ai, Wei Chen, Weijie Yuan, Qingqing Cheng, Tongyang Xu, Guoyu Ma, Mi Yang, Yunlong Lu, Wenwei Yue, Christos Masouros, Zhangdui Zhong | 2026-03-31 | Download | Low-altitude wireless networks (LAWN) require drones to follow specific trajectories controlled by ground base stations (GBSs). However, given complex low-altitude channel conditions and limited spect... |