2026-03-19

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Exploring the Agentic Frontier of Verilog Code Generation	Patrick Yubeaton, Siddharth Garg, Chinmay Hegde	2026-03-19	下载	Large language models (LLMs) have made rapid advancements in code generation for popular languages such as Python and C++. Many of these recent gains can be attributed to the use of ``agents'' that wr...
Mitigating the Bandwidth Wall via Data-Streaming System-Accelerator Co-Design	Qunyou Liu, Marina Zapater, David Atienza	2026-03-19	下载	Transformers have revolutionized AI in natural language processing and computer vision, but their large computation and memory demands pose major challenges for hardware acceleration.
Sequence-Aware Split Heuristic to Mitigate SM Underutilization in FlashAttention-3 Low-Head-Count Decoding	Martí Llopart Font, Javier Hernando, Cristina España-Bonet	2026-03-19	下载	The standard FlashAttention-3 heuristic exhibits a GPU occupancy bottleneck in low-head-count decoding configurations because it disables sequence splitting based on sequence length alone, underutiliz...
Benchmarking NIST-Standardised ML-KEM and ML-DSA on ARM Cortex-M0+: Performance, Memory, and Energy on the RP2040	Rojin Chhetri	2026-03-19	下载	The migration to post-quantum cryptography is urgent for Internet of Things devices with 10--20 year lifespans, yet no systematic benchmarks exist for the finalised NIST standards on the most constrai...
Brain-inspired AI for Edge Intelligence: a systematic review	Yingchao Cheng, Meijia Wang, Zhifeng Hao, Rajkumar Buyya	2026-03-19	下载	While Spiking Neural Networks (SNNs) promise to circumvent the severe Size, Weight, and Power (SWaP) constraints of edge intelligence, the field currently faces a "Deployment Paradox" where theoretica...
WarPGNN: A Parametric Thermal Warpage Analysis Framework with Physics-aware Graph Neural Network	Haotian Lu, Jincong Lu, Sachin Sachdeva, Sheldon X. -D. Tan	2026-03-19	下载	With the advent of system-in-package (SiP) chiplet-based design and heterogeneous 2.5D/3D integration, thermal-induced warpage has become a critical reliability concern.
POET: Power-Oriented Evolutionary Tuning for LLM-Based RTL PPA Optimization	Heng Ping, Peiyu Zhang, Zhenkun Wang, Shixuan Li, Anzhe Cheng, Wei Yang, Paul Bogdan, Shahin Nazarian	2026-03-19	下载	Applying large language models (LLMs) to RTL code optimization for improved power, performance, and area (PPA) faces two key challenges: ensuring functional correctness of optimized designs despite LL...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Which Workloads Belong in Orbit? A Workload-First Framework for Orbital Data Centers Using Semantic Abstraction	Durgendra Narayan Singh	2026-03-19	下载	Space-based compute is becoming plausible as launch costs fall and data-intensive AI workloads grow. This paper proposes a workload-centric framework for deciding which tasks belong in orbit versus te...
Non-trivial automata networks do exist that solve the global majority problem with the local majority rule	Pedro Paulo Balbi, Kévin Perrot, Marius Rolland, Eurico Ruivo	2026-03-19	下载	The global majority problem, often referred to as the Density Classification Task, is a classical benchmark in the context of probing the computational capabilities of automata networks.
SWARM+: Scalable and Resilient Multi-Agent Consensus for Fully-Decentralized Data-Aware Workload Management	Komal Thareja, Krishnan Raghavan, Anirban Mandal, Ewa Deelman	2026-03-19	下载	Distributed scientific workflows increasingly span heterogeneous compute clusters, edge resources, and geo-distributed data repositories. In these environments, a centralized orchestrator is an archit...
Speculative Policy Orchestration: A Latency-Resilient Framework for Cloud-Robotic Manipulation	Chanh Nguyen, Shutong Jin, Florian T. Pokorny, Erik Elmroth	2026-03-19	下载	Cloud robotics enables robots to offload high-dimensional motion planning and reasoning to remote servers. However, for continuous manipulation tasks requiring high-frequency control, network latency ...
The Bilateral Efficiency of Ethernet: Recalibrating Metcalfe and Boggs After Fifty Years	Paul Borrill	2026-03-19	下载	In July 1976, Metcalfe and Boggs published their foundational paper on Ethernet in Communications of the ACM. Their efficiency model -- E = (P/C)/(P/C + W*T) -- measures the fraction of Ether time car...
cuGenOpt: A GPU-Accelerated General-Purpose Metaheuristic Framework for Combinatorial Optimization	Yuyang Liu	2026-03-19	下载	Combinatorial optimization problems arise in logistics, scheduling, and resource allocation, yet existing approaches face a fundamental trade-off among generality, performance, and usability.
A Pipelined Collaborative Speculative Decoding Framework for Efficient Edge-Cloud LLM Inference	Yida Zhang, Zhiyong Gao, Shuaibing Yue, Jie Li, Rui Wang	2026-03-19	下载	Recent advancements and widespread adoption of Large Language Models (LLMs) in both industry and academia have catalyzed significant demand for LLM serving.
FedTrident: Resilient Road Condition Classification Against Poisoning Attacks in Federated Learning	Sheng Liu, Panos Papadimitratos	2026-03-19	下载	FL has emerged as a transformative paradigm for ITS, notably camera-based Road Condition Classification (RCC). However, by enabling collaboration, FL-based RCC exposes the system to adversarial partic...
Why Synchronized Time is a Fiction: Daylight Saving Time, Leap Seconds, and the Guillotine Sharpened for Nothing	Paul Borrill	2026-03-19	下载	Civilization maintains an elaborate infrastructure devoted to the maintenance of synchronized time. Governments mandate daylight saving time. Standards bodies insert leap seconds into Coordinated Univ...
Literature Study on Operational Data Analytics Frameworks in Large-scale Computing Infrastructures	Shekhar Suman, Xiaoyu Chu, Alexandru Iosup	2026-03-19	下载	By 2025, there are zettabytes of data generated every year. The size and complexity of modern large-scale computing infrastructures like High-Performance Computing (HPC) systems continue to evolve and...
Act While Thinking: Accelerating LLM Agents via Pattern-Aware Speculative Tool Execution	Yifan Sui, Han Zhao, Rui Ma, Zhiyuan He, Hao Wang, Jianxun Li, Yuqing Yang	2026-03-19	下载	LLM-powered agents are emerging as a dominant paradigm for autonomous task solving. Unlike standard inference workloads, agents operate in a strictly serial "LLM-tool" loop, where the LLM must wait fo...
Sequence-Aware Split Heuristic to Mitigate SM Underutilization in FlashAttention-3 Low-Head-Count Decoding	Martí Llopart Font, Javier Hernando, Cristina España-Bonet	2026-03-19	下载	The standard FlashAttention-3 heuristic exhibits a GPU occupancy bottleneck in low-head-count decoding configurations because it disables sequence splitting based on sequence length alone, underutiliz...
High-Performance Portable GPU Primitives for Arbitrary Types and Operators in Julia	Emmanuel Pilliat	2026-03-19	下载	Portable GPU frameworks such as Kokkos and RAJA reduce the burden of cross-architecture development but typically incur measurable overhead on fundamental parallel primitives relative to vendor-optimi...
Comprehensive Plugin-Based Monitoring of Nexflow Workflow Executions	Sami Kharma, Tobias Wies, Florian Schintke	2026-03-19	下载	Nextflow is a workflow management system commonly used in fields like bioinformatics and earth observation. It coordinates distributed data processing of various tools as an acyclic sequence of tasks ...
From Servers to Sites: Compositional Power Trace Generation of LLM Inference for Infrastructure Planning	Grant Wilkins, Fiodar Kazhamiaka, Ram Rajagopal	2026-03-19	下载	Datacenter operators and electrical utilities rely on power traces at different spatiotemporal scales. Operators use fine-grained traces for provisioning, facility management, and scheduling, while ut...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Which Workloads Belong in Orbit? A Workload-First Framework for Orbital Data Centers Using Semantic Abstraction	Durgendra Narayan Singh	2026-03-19	下载	Space-based compute is becoming plausible as launch costs fall and data-intensive AI workloads grow. This paper proposes a workload-centric framework for deciding which tasks belong in orbit versus te...
The Bilateral Efficiency of Ethernet: Recalibrating Metcalfe and Boggs After Fifty Years	Paul Borrill	2026-03-19	下载	In July 1976, Metcalfe and Boggs published their foundational paper on Ethernet in Communications of the ACM. Their efficiency model -- E = (P/C)/(P/C + W*T) -- measures the fraction of Ether time car...
Dynamic Mask Enhanced Intelligent Multi-UAV Deployment for Urban Vehicular Networks	Gaoxiang Cao, Wenke Yuan, Yunpeng Hou, Huasen He, Quan Zheng, Jian Yang	2026-03-19	下载	Vehicular Ad Hoc Networks (VANETs) play a crucial role in realizing vehicle-road collaboration and intelligent transportation. However, urban VANETs often face challenges such as frequent link disconn...
Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs	Gaoxiang Cao, Wenke Yuan, Huasen He, Yunpeng Hou, Xiaofeng Jiang, Shuangwu Chen, Jian Yang	2026-03-19	下载	Vehicular Ad-hoc Networks (VANETs) are the digital cornerstone of autonomous driving, yet they suffer from severe network fragmentation in urban environments due to physical obstructions.
Holistic Energy Performance Management: Enablers, Capabilities, and Features	Meysam Masoudi, Milad Ganjalizadeh, Tahar Zanouda, Pal Frenger	2026-03-19	下载	Energy consumption is a significant concern for mobile network operators, and to enable further network energy improvements it is also an important target when developing the emerging 6G standard.
Masking Intent, Sustaining Equilibrium: Risk-Aware Potential Game-empowered Two-Stage Mobile Crowdsensing	Houyi Qi, Minghui Liwang, Kaiwen Tan, Wenyong Wang, Sai Zou, Yiguang Hong, Xianbin Wang, Wei Ni	2026-03-19	下载	Beyond data collection, future mobile crowdsensing (MCS) in complex applications must satisfy diverse requirements, including reliable task completion, budget and quality constraints, and fluctuating ...
AutORAN: LLM-driven Natural Language Programming for Agile xApp Development	Xin Li, Shiming Yu, Leming Shen, Jianing Zhang, Yuanqing Zheng, Yaxiong Xie	2026-03-19	下载	Traditional RAN systems are closed and monolithic, stifling innovation. The openness and programmability enabled by Open Radio Access Network (O-RAN) are envisioned to revolutionize cellular networks ...
Cross-Layer Traffic Allocation and Contention Window Optimization for Wi-Fi 7 MLO: When DRL Meets LSTM	Zhang Liu, Xianbin Wang, Shumin Lian, Lianfen Huang, Liqun Fu, Ying-Jun Angela Zhang	2026-03-19	下载	To support future diverse applications, multi-link operation (MLO) has been introduced in the Wi-Fi 7 standard (IEEE 802.11be) to enable concurrent communication over multiple frequency bands.
RUBICONe: Wireless RAFT-Unified Behaviors for Intervehicular Cooperative Operations and Negotiations	Zhenghua Hu, Tairan Dan, Zeyu Tao, Jiacheng Qian, Amedeo Morat, Lorenzo Romano, Alessandro Massafra, Hao Xu	2026-03-19	下载	Just as Caesar declared "alea iacta est" (the die is cast) upon crossing the Rubicone river, lane change decisions in autonomous vehicles also represent critical points of no return.
iSatCR: Graph-Empowered Joint Onboard Computing and Routing for LEO Data Delivery	Jiangtao Luo, Bingbing Xu, Shaohua Xia, Yongyi Ran	2026-03-19	下载	Sending massive Earth observation data produced by low Earth orbit (LEO) satellites back to the ground for processing consumes a large amount of on-orbit bandwidth and exacerbates the space-to-ground ...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Brain-inspired AI for Edge Intelligence: a systematic review	Yingchao Cheng, Meijia Wang, Zhifeng Hao, Rajkumar Buyya	2026-03-19	下载	While Spiking Neural Networks (SNNs) promise to circumvent the severe Size, Weight, and Power (SWaP) constraints of edge intelligence, the field currently faces a "Deployment Paradox" where theoretica...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Benchmarking NIST-Standardised ML-KEM and ML-DSA on ARM Cortex-M0+: Performance, Memory, and Energy on the RP2040	Rojin Chhetri	2026-03-19	下载	The migration to post-quantum cryptography is urgent for Internet of Things devices with 10--20 year lifespans, yet no systematic benchmarks exist for the finalised NIST standards on the most constrai...
High-Performance Portable GPU Primitives for Arbitrary Types and Operators in Julia	Emmanuel Pilliat	2026-03-19	下载	Portable GPU frameworks such as Kokkos and RAJA reduce the burden of cross-architecture development but typically incur measurable overhead on fundamental parallel primitives relative to vendor-optimi...
TurboMem: High-Performance Lock-Free Memory Pool with Transparent Huge Page Auto-Merging for DPDK	Junyi Yang	2026-03-19	下载	High-speed packet processing on multicore CPUs places extreme demands on memory allocators. In systems like DPDK, fixed-size memory pools back packet buffers (mbufs) to avoid costly dynamic allocation...
Comprehensive Plugin-Based Monitoring of Nexflow Workflow Executions	Sami Kharma, Tobias Wies, Florian Schintke	2026-03-19	下载	Nextflow is a workflow management system commonly used in fields like bioinformatics and earth observation. It coordinates distributed data processing of various tools as an acyclic sequence of tasks ...