
2026-03-19

cs.AR - Architecture

| Title | Authors | Date | Abstract |
| --- | --- | --- | --- |
| Exploring the Agentic Frontier of Verilog Code Generation | Patrick Yubeaton, Siddharth Garg, Chinmay Hegde | 2026-03-19 | Large language models (LLMs) have made rapid advancements in code generation for popular languages such as Python and C++. Many of these recent gains can be attributed to the use of "agents" that wr... |
| Mitigating the Bandwidth Wall via Data-Streaming System-Accelerator Co-Design | Qunyou Liu, Marina Zapater, David Atienza | 2026-03-19 | Transformers have revolutionized AI in natural language processing and computer vision, but their large computation and memory demands pose major challenges for hardware acceleration. |
| Sequence-Aware Split Heuristic to Mitigate SM Underutilization in FlashAttention-3 Low-Head-Count Decoding | Martí Llopart Font, Javier Hernando, Cristina España-Bonet | 2026-03-19 | The standard FlashAttention-3 heuristic exhibits a GPU occupancy bottleneck in low-head-count decoding configurations because it disables sequence splitting based on sequence length alone, underutiliz... |
| Benchmarking NIST-Standardised ML-KEM and ML-DSA on ARM Cortex-M0+: Performance, Memory, and Energy on the RP2040 | Rojin Chhetri | 2026-03-19 | The migration to post-quantum cryptography is urgent for Internet of Things devices with 10-20 year lifespans, yet no systematic benchmarks exist for the finalised NIST standards on the most constrai... |
| Brain-inspired AI for Edge Intelligence: a systematic review | Yingchao Cheng, Meijia Wang, Zhifeng Hao, Rajkumar Buyya | 2026-03-19 | While Spiking Neural Networks (SNNs) promise to circumvent the severe Size, Weight, and Power (SWaP) constraints of edge intelligence, the field currently faces a "Deployment Paradox" where theoretica... |
| WarPGNN: A Parametric Thermal Warpage Analysis Framework with Physics-aware Graph Neural Network | Haotian Lu, Jincong Lu, Sachin Sachdeva, Sheldon X.-D. Tan | 2026-03-19 | With the advent of system-in-package (SiP) chiplet-based design and heterogeneous 2.5D/3D integration, thermal-induced warpage has become a critical reliability concern. |
| POET: Power-Oriented Evolutionary Tuning for LLM-Based RTL PPA Optimization | Heng Ping, Peiyu Zhang, Zhenkun Wang, Shixuan Li, Anzhe Cheng, Wei Yang, Paul Bogdan, Shahin Nazarian | 2026-03-19 | Applying large language models (LLMs) to RTL code optimization for improved power, performance, and area (PPA) faces two key challenges: ensuring functional correctness of optimized designs despite LL... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | Abstract |
| --- | --- | --- | --- |
| Which Workloads Belong in Orbit? A Workload-First Framework for Orbital Data Centers Using Semantic Abstraction | Durgendra Narayan Singh | 2026-03-19 | Space-based compute is becoming plausible as launch costs fall and data-intensive AI workloads grow. This paper proposes a workload-centric framework for deciding which tasks belong in orbit versus te... |
| Non-trivial automata networks do exist that solve the global majority problem with the local majority rule | Pedro Paulo Balbi, Kévin Perrot, Marius Rolland, Eurico Ruivo | 2026-03-19 | The global majority problem, often referred to as the Density Classification Task, is a classical benchmark in the context of probing the computational capabilities of automata networks. |
| SWARM+: Scalable and Resilient Multi-Agent Consensus for Fully-Decentralized Data-Aware Workload Management | Komal Thareja, Krishnan Raghavan, Anirban Mandal, Ewa Deelman | 2026-03-19 | Distributed scientific workflows increasingly span heterogeneous compute clusters, edge resources, and geo-distributed data repositories. In these environments, a centralized orchestrator is an archit... |
| Speculative Policy Orchestration: A Latency-Resilient Framework for Cloud-Robotic Manipulation | Chanh Nguyen, Shutong Jin, Florian T. Pokorny, Erik Elmroth | 2026-03-19 | Cloud robotics enables robots to offload high-dimensional motion planning and reasoning to remote servers. However, for continuous manipulation tasks requiring high-frequency control, network latency ... |
| The Bilateral Efficiency of Ethernet: Recalibrating Metcalfe and Boggs After Fifty Years | Paul Borrill | 2026-03-19 | In July 1976, Metcalfe and Boggs published their foundational paper on Ethernet in Communications of the ACM. Their efficiency model, E = (P/C)/(P/C + W*T), measures the fraction of Ether time car... |
| cuGenOpt: A GPU-Accelerated General-Purpose Metaheuristic Framework for Combinatorial Optimization | Yuyang Liu | 2026-03-19 | Combinatorial optimization problems arise in logistics, scheduling, and resource allocation, yet existing approaches face a fundamental trade-off among generality, performance, and usability. |
| A Pipelined Collaborative Speculative Decoding Framework for Efficient Edge-Cloud LLM Inference | Yida Zhang, Zhiyong Gao, Shuaibing Yue, Jie Li, Rui Wang | 2026-03-19 | Recent advancements and widespread adoption of Large Language Models (LLMs) in both industry and academia have catalyzed significant demand for LLM serving. |
| FedTrident: Resilient Road Condition Classification Against Poisoning Attacks in Federated Learning | Sheng Liu, Panos Papadimitratos | 2026-03-19 | FL has emerged as a transformative paradigm for ITS, notably camera-based Road Condition Classification (RCC). However, by enabling collaboration, FL-based RCC exposes the system to adversarial partic... |
| Why Synchronized Time is a Fiction: Daylight Saving Time, Leap Seconds, and the Guillotine Sharpened for Nothing | Paul Borrill | 2026-03-19 | Civilization maintains an elaborate infrastructure devoted to the maintenance of synchronized time. Governments mandate daylight saving time. Standards bodies insert leap seconds into Coordinated Univ... |
| Literature Study on Operational Data Analytics Frameworks in Large-scale Computing Infrastructures | Shekhar Suman, Xiaoyu Chu, Alexandru Iosup | 2026-03-19 | By 2025, there are zettabytes of data generated every year. The size and complexity of modern large-scale computing infrastructures like High-Performance Computing (HPC) systems continue to evolve and... |
| Act While Thinking: Accelerating LLM Agents via Pattern-Aware Speculative Tool Execution | Yifan Sui, Han Zhao, Rui Ma, Zhiyuan He, Hao Wang, Jianxun Li, Yuqing Yang | 2026-03-19 | LLM-powered agents are emerging as a dominant paradigm for autonomous task solving. Unlike standard inference workloads, agents operate in a strictly serial "LLM-tool" loop, where the LLM must wait fo... |
| Sequence-Aware Split Heuristic to Mitigate SM Underutilization in FlashAttention-3 Low-Head-Count Decoding | Martí Llopart Font, Javier Hernando, Cristina España-Bonet | 2026-03-19 | The standard FlashAttention-3 heuristic exhibits a GPU occupancy bottleneck in low-head-count decoding configurations because it disables sequence splitting based on sequence length alone, underutiliz... |
| High-Performance Portable GPU Primitives for Arbitrary Types and Operators in Julia | Emmanuel Pilliat | 2026-03-19 | Portable GPU frameworks such as Kokkos and RAJA reduce the burden of cross-architecture development but typically incur measurable overhead on fundamental parallel primitives relative to vendor-optimi... |
| Comprehensive Plugin-Based Monitoring of Nexflow Workflow Executions | Sami Kharma, Tobias Wies, Florian Schintke | 2026-03-19 | Nextflow is a workflow management system commonly used in fields like bioinformatics and earth observation. It coordinates distributed data processing of various tools as an acyclic sequence of tasks ... |
| From Servers to Sites: Compositional Power Trace Generation of LLM Inference for Infrastructure Planning | Grant Wilkins, Fiodar Kazhamiaka, Ram Rajagopal | 2026-03-19 | Datacenter operators and electrical utilities rely on power traces at different spatiotemporal scales. Operators use fine-grained traces for provisioning, facility management, and scheduling, while ut... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | Abstract |
| --- | --- | --- | --- |
| Which Workloads Belong in Orbit? A Workload-First Framework for Orbital Data Centers Using Semantic Abstraction | Durgendra Narayan Singh | 2026-03-19 | Space-based compute is becoming plausible as launch costs fall and data-intensive AI workloads grow. This paper proposes a workload-centric framework for deciding which tasks belong in orbit versus te... |
| The Bilateral Efficiency of Ethernet: Recalibrating Metcalfe and Boggs After Fifty Years | Paul Borrill | 2026-03-19 | In July 1976, Metcalfe and Boggs published their foundational paper on Ethernet in Communications of the ACM. Their efficiency model, E = (P/C)/(P/C + W*T), measures the fraction of Ether time car... |
| Dynamic Mask Enhanced Intelligent Multi-UAV Deployment for Urban Vehicular Networks | Gaoxiang Cao, Wenke Yuan, Yunpeng Hou, Huasen He, Quan Zheng, Jian Yang | 2026-03-19 | Vehicular Ad Hoc Networks (VANETs) play a crucial role in realizing vehicle-road collaboration and intelligent transportation. However, urban VANETs often face challenges such as frequent link disconn... |
| Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs | Gaoxiang Cao, Wenke Yuan, Huasen He, Yunpeng Hou, Xiaofeng Jiang, Shuangwu Chen, Jian Yang | 2026-03-19 | Vehicular Ad-hoc Networks (VANETs) are the digital cornerstone of autonomous driving, yet they suffer from severe network fragmentation in urban environments due to physical obstructions. |
| Holistic Energy Performance Management: Enablers, Capabilities, and Features | Meysam Masoudi, Milad Ganjalizadeh, Tahar Zanouda, Pal Frenger | 2026-03-19 | Energy consumption is a significant concern for mobile network operators, and to enable further network energy improvements it is also an important target when developing the emerging 6G standard. |
| Masking Intent, Sustaining Equilibrium: Risk-Aware Potential Game-empowered Two-Stage Mobile Crowdsensing | Houyi Qi, Minghui Liwang, Kaiwen Tan, Wenyong Wang, Sai Zou, Yiguang Hong, Xianbin Wang, Wei Ni | 2026-03-19 | Beyond data collection, future mobile crowdsensing (MCS) in complex applications must satisfy diverse requirements, including reliable task completion, budget and quality constraints, and fluctuating ... |
| AutORAN: LLM-driven Natural Language Programming for Agile xApp Development | Xin Li, Shiming Yu, Leming Shen, Jianing Zhang, Yuanqing Zheng, Yaxiong Xie | 2026-03-19 | Traditional RAN systems are closed and monolithic, stifling innovation. The openness and programmability enabled by Open Radio Access Network (O-RAN) are envisioned to revolutionize cellular networks ... |
| Cross-Layer Traffic Allocation and Contention Window Optimization for Wi-Fi 7 MLO: When DRL Meets LSTM | Zhang Liu, Xianbin Wang, Shumin Lian, Lianfen Huang, Liqun Fu, Ying-Jun Angela Zhang | 2026-03-19 | To support future diverse applications, multi-link operation (MLO) has been introduced in the Wi-Fi 7 standard (IEEE 802.11be) to enable concurrent communication over multiple frequency bands. |
| RUBICONe: Wireless RAFT-Unified Behaviors for Intervehicular Cooperative Operations and Negotiations | Zhenghua Hu, Tairan Dan, Zeyu Tao, Jiacheng Qian, Amedeo Morat, Lorenzo Romano, Alessandro Massafra, Hao Xu | 2026-03-19 | Just as Caesar declared "alea iacta est" (the die is cast) upon crossing the Rubicone river, lane change decisions in autonomous vehicles also represent critical points of no return. |
| iSatCR: Graph-Empowered Joint Onboard Computing and Routing for LEO Data Delivery | Jiangtao Luo, Bingbing Xu, Shaohua Xia, Yongyi Ran | 2026-03-19 | Sending massive Earth observation data produced by low Earth orbit (LEO) satellites back to the ground for processing consumes a large amount of on-orbit bandwidth and exacerbates the space-to-ground ... |

cs.OS - Operating Systems

| Title | Authors | Date | Abstract |
| --- | --- | --- | --- |
| Brain-inspired AI for Edge Intelligence: a systematic review | Yingchao Cheng, Meijia Wang, Zhifeng Hao, Rajkumar Buyya | 2026-03-19 | While Spiking Neural Networks (SNNs) promise to circumvent the severe Size, Weight, and Power (SWaP) constraints of edge intelligence, the field currently faces a "Deployment Paradox" where theoretica... |

cs.PF - Performance

| Title | Authors | Date | Abstract |
| --- | --- | --- | --- |
| Benchmarking NIST-Standardised ML-KEM and ML-DSA on ARM Cortex-M0+: Performance, Memory, and Energy on the RP2040 | Rojin Chhetri | 2026-03-19 | The migration to post-quantum cryptography is urgent for Internet of Things devices with 10-20 year lifespans, yet no systematic benchmarks exist for the finalised NIST standards on the most constrai... |
| High-Performance Portable GPU Primitives for Arbitrary Types and Operators in Julia | Emmanuel Pilliat | 2026-03-19 | Portable GPU frameworks such as Kokkos and RAJA reduce the burden of cross-architecture development but typically incur measurable overhead on fundamental parallel primitives relative to vendor-optimi... |
| TurboMem: High-Performance Lock-Free Memory Pool with Transparent Huge Page Auto-Merging for DPDK | Junyi Yang | 2026-03-19 | High-speed packet processing on multicore CPUs places extreme demands on memory allocators. In systems like DPDK, fixed-size memory pools back packet buffers (mbufs) to avoid costly dynamic allocation... |
| Comprehensive Plugin-Based Monitoring of Nexflow Workflow Executions | Sami Kharma, Tobias Wies, Florian Schintke | 2026-03-19 | Nextflow is a workflow management system commonly used in fields like bioinformatics and earth observation. It coordinates distributed data processing of various tools as an acyclic sequence of tasks ... |
