2025-12-04

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
DABench-LLM: Standardized and In-Depth Benchmarking of Post-Moore Dataflow AI Accelerators for LLMs	Ziyu Hu, Zhiqing Zhong, Weijian Zheng, Zhijing Ye, Xuwei Tan, Xueru Zhang, Zheng Xie, Rajkumar Kettimuthu, Xiaodong Yu	2025-12-04	下载	The exponential growth of large language models has outpaced the capabilities of traditional CPU and GPU architectures due to the slowdown of Moore's Law.
ARCAS: An Augmented Reality Collision Avoidance System with SLAM-Based Tracking for Enhancing VRU Safety	Ahmad Yehia, Jiseop Byeon, Tianyi Wang, Huihai Wang, Yiming Xu, Junfeng Jiao, Christian Claudel	2025-12-04	下载	Vulnerable road users (VRUs) face high collision risks in mixed traffic, yet most existing safety systems prioritize driver or vehicle assistance over direct VRU support.
David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design?	Shashwat Shankar, Subhranshu Pandey, Innocent Dengkhw Mochahari, Bhabesh Mali, Animesh Basak Chowdhury, Sukanta Bhattacharjee, Chandan Karfa	2025-12-04	下载	Large Language Model(LLM) inference demands massive compute and energy, making domain-specific tasks expensive and unsustainable. As foundation models keep scaling, we ask: Is bigger always better for...
Declarative Synthesis and Multi-Objective Optimization of Stripboard Circuit Layouts Using Answer Set Programming	Fang Li	2025-12-04	下载	This paper presents a novel approach to automated stripboard circuit layout design using Answer Set Programming (ASP). The work formulates the layout problem as both a synthesis and multi-objective op...
Functional Stability of Software-Hardware Neural Network Implementation The NeuroComp Project	Bychkov Oleksii, Senysh Taras	2025-12-04	下载	This paper presents an innovative approach to ensuring functional stability of neural networks through hardware redundancy at the individual neuron level.
Hardware-Algorithm Co-Optimization of Early-Exit Neural Networks for Multi-Core Edge Accelerators	Alaa Zniber, Arne Symons, Ouassim Karrakchou, Marian Verhelst, Mounir Ghogho	2025-12-04	下载	Deployment of dynamic neural networks on edge accelerators requires careful consideration of hardware constraints beyond conventional complexity metrics such as Multiply-Accumulate operations.
FLEX: Leveraging FPGA-CPU Synergy for Mixed-Cell-Height Legalization Acceleration	Xingyu Liu, Jiawei Liang, Linfeng Du, Yipu Zhang, Chaofang Ma, Hanwei Fan, Jiang Xu, Wei Zhang	2025-12-04	下载	In this work, we present FLEX, an FPGA-CPU accelerator for mixed-cell-height legalization tasks. We address challenges from the following perspectives.
Context-Aware Mixture-of-Experts Inference on CXL-Enabled GPU-NDP Systems	Zehao Fan, Zhenyu Liu, Yunzhen Liu, Yayue Hou, Hadjer Benmeziane, Kaoutar El Maghraoui, Liu Liu	2025-12-04	下载	Mixture-of-Experts (MoE) models scale large language models through conditional computation, but inference becomes memory-bound once expert weights exceed the capacity of GPU memory.
RRAM-Based Analog Matrix Computing for Massive MIMO Signal Processing: A Review	Pushen Zuo, Zhong Sun	2025-12-04	下载	Resistive random-access memory (RRAM) provides an excellent platform for analog matrix computing (AMC), enabling both matrix-vector multiplication (MVM) and the solution of matrix equations through op...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
DABench-LLM: Standardized and In-Depth Benchmarking of Post-Moore Dataflow AI Accelerators for LLMs	Ziyu Hu, Zhiqing Zhong, Weijian Zheng, Zhijing Ye, Xuwei Tan, Xueru Zhang, Zheng Xie, Rajkumar Kettimuthu, Xiaodong Yu	2025-12-04	下载	The exponential growth of large language models has outpaced the capabilities of traditional CPU and GPU architectures due to the slowdown of Moore's Law.
NVLang: Unified Static Typing for Actor-Based Concurrency on the BEAM	Miguel de Oliveira Guerreiro	2025-12-04	下载	Actor-based systems like Erlang/OTP power critical infrastructure -- from telecommunications to messaging platforms -- handling millions of concurrent connections with legendary reliability.
Federated Learning for Terahertz Wireless Communication	O. Tansel Baydas, Ozgur B. Akan	2025-12-04	下载	The convergence of Terahertz (THz) communications and Federated Learning (FL) promises ultra-fast distributed learning, yet the impact of realistic wideband impairments on optimization dynamics remain...
FLEX: Leveraging FPGA-CPU Synergy for Mixed-Cell-Height Legalization Acceleration	Xingyu Liu, Jiawei Liang, Linfeng Du, Yipu Zhang, Chaofang Ma, Hanwei Fan, Jiang Xu, Wei Zhang	2025-12-04	下载	In this work, we present FLEX, an FPGA-CPU accelerator for mixed-cell-height legalization tasks. We address challenges from the following perspectives.
Offloading to CXL-based Computational Memory	Suyeon Lee, Kangkyu Park, Kwangsik Shin, Ada Gavrilovska	2025-12-04	下载	CXL-based Computational Memory (CCM) enables near-memory processing within expanded remote memory, presenting opportunities to address data movement costs associated with disaggregated memory systems ...
Reducing Fragmentation and Starvation in GPU Clusters through Dynamic Multi-Objective Scheduling	Akhmadillo Mamirov	2025-12-04	下载	GPU clusters have become essential for training and deploying modern AI systems, yet real deployments continue to report average utilization near 50%.
A Structure-Aware Irregular Blocking Method for Sparse LU Factorization	Zhen Hu, Dongliang Xiong, Kai Huang, Changjun Wu, Xiaowen Jiang	2025-12-04	下载	In sparse LU factorization, nonzero elements after symbolic factorization tend to distribute in diagonal and right-bottom region of sparse matrices.
Counting Without Running: Evaluating LLMs' Reasoning About Code Complexity	Gregory Bolet, Giorgis Georgakoudis, Konstantinos Parasyris, Harshitha Menon, Niranjan Hasabnis, Kirk W. Cameron, Gal Oren	2025-12-04	下载	Modern GPU software stacks demand developers who can anticipate performance bottlenecks before ever launching a kernel; misjudging floating-point workloads upstream can derail tuning, scheduling, and ...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Hierarchical Reinforcement Learning for the Dynamic VNE with Alternatives Problem	Ali Al Housseini, Cristina Rottondi, Omran Ayoub	2025-12-04	下载	Virtual Network Embedding (VNE) is a key enabler of network slicing, yet most formulations assume that each Virtual Network Request (VNR) has a fixed topology.
MuMeNet: A Network Simulator for Musical Metaverse Communications	Ali Al Housseini, Jaime Llorca, Luca Turchet, Tiziano Leidi, Cristina Rottondi, Omran Ayoub	2025-12-04	下载	The Metaverse, a shared and spatially organized digital continuum, is transforming various industries, with music emerging as a leading use case.
Deadline-Aware Scheduling of Distributed Quantum Circuits in Near-Term Quantum Cloud	Nour Dehaini, Christia Chahoud, Mahdi Chehimi	2025-12-04	下载	Distributed quantum computing (DQC) enables scalable quantum computations by distributing large quantum circuits on multiple quantum processing units (QPUs) in the quantum cloud.
Timely Information for Strategic Persuasion	Ahmet Bugra Gundogan, Melih Bastopcu	2025-12-04	下载	This work investigates a dynamic variant of Bayesian persuasion, in which a strategic sender seeks to influence a receiver's belief over time through controlling the timing of the information disclosu...
Vision and Causal Learning Based Channel Estimation for THz Communications	Kitae Kim, Yan Kyaw Tun, Md. Shirajum Munir, Chirsto Kurisummoottil Thomas, Walid Saad, Choong Seon Hong	2025-12-04	下载	The use of terahertz (THz) communications with massive multiple input multiple output (MIMO) systems in 6G can potentially provide high data rates and low latency communications.
Making Cellular Networks Crisis-Proof: Towards Island-Ready, Resilient-By-Design 6G Communication Network	Leon Janzen, Matthias Hollick	2025-12-04	下载	5G and 5G-Advanced cellular networks are vulnerable to regional outages resulting from disasters or targeted attacks. This fragility stems from the reliance on the central core network involved for mo...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
DABench-LLM: Standardized and In-Depth Benchmarking of Post-Moore Dataflow AI Accelerators for LLMs	Ziyu Hu, Zhiqing Zhong, Weijian Zheng, Zhijing Ye, Xuwei Tan, Xueru Zhang, Zheng Xie, Rajkumar Kettimuthu, Xiaodong Yu	2025-12-04	下载	The exponential growth of large language models has outpaced the capabilities of traditional CPU and GPU architectures due to the slowdown of Moore's Law.
AutoGuard: A Self-Healing Proactive Security Layer for DevSecOps Pipelines Using Reinforcement Learning	Praveen Anugula, Avdhesh Kumar Bhardwaj, Navin Chhibber, Rohit Tewari, Sunil Khemka, Piyush Ranjan	2025-12-04	下载	Contemporary DevSecOps pipelines have to deal with the evolution of security in an ever-continuously integrated and deployed environment. Existing methods,such as rule-based intrusion detection and st...
Counting Without Running: Evaluating LLMs' Reasoning About Code Complexity	Gregory Bolet, Giorgis Georgakoudis, Konstantinos Parasyris, Harshitha Menon, Niranjan Hasabnis, Kirk W. Cameron, Gal Oren	2025-12-04	下载	Modern GPU software stacks demand developers who can anticipate performance bottlenecks before ever launching a kernel; misjudging floating-point workloads upstream can derail tuning, scheduling, and ...