
2025-12-04

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| DABench-LLM: Standardized and In-Depth Benchmarking of Post-Moore Dataflow AI Accelerators for LLMs | Ziyu Hu, Zhiqing Zhong, Weijian Zheng, Zhijing Ye, Xuwei Tan, Xueru Zhang, Zheng Xie, Rajkumar Kettimuthu, Xiaodong Yu | 2025-12-04 | Download | The exponential growth of large language models has outpaced the capabilities of traditional CPU and GPU architectures due to the slowdown of Moore's Law. |
| ARCAS: An Augmented Reality Collision Avoidance System with SLAM-Based Tracking for Enhancing VRU Safety | Ahmad Yehia, Jiseop Byeon, Tianyi Wang, Huihai Wang, Yiming Xu, Junfeng Jiao, Christian Claudel | 2025-12-04 | Download | Vulnerable road users (VRUs) face high collision risks in mixed traffic, yet most existing safety systems prioritize driver or vehicle assistance over direct VRU support. |
| David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design? | Shashwat Shankar, Subhranshu Pandey, Innocent Dengkhw Mochahari, Bhabesh Mali, Animesh Basak Chowdhury, Sukanta Bhattacharjee, Chandan Karfa | 2025-12-04 | Download | Large Language Model (LLM) inference demands massive compute and energy, making domain-specific tasks expensive and unsustainable. As foundation models keep scaling, we ask: Is bigger always better for... |
| Declarative Synthesis and Multi-Objective Optimization of Stripboard Circuit Layouts Using Answer Set Programming | Fang Li | 2025-12-04 | Download | This paper presents a novel approach to automated stripboard circuit layout design using Answer Set Programming (ASP). The work formulates the layout problem as both a synthesis and multi-objective op... |
| Functional Stability of Software-Hardware Neural Network Implementation The NeuroComp Project | Bychkov Oleksii, Senysh Taras | 2025-12-04 | Download | This paper presents an innovative approach to ensuring functional stability of neural networks through hardware redundancy at the individual neuron level. |
| Hardware-Algorithm Co-Optimization of Early-Exit Neural Networks for Multi-Core Edge Accelerators | Alaa Zniber, Arne Symons, Ouassim Karrakchou, Marian Verhelst, Mounir Ghogho | 2025-12-04 | Download | Deployment of dynamic neural networks on edge accelerators requires careful consideration of hardware constraints beyond conventional complexity metrics such as Multiply-Accumulate operations. |
| FLEX: Leveraging FPGA-CPU Synergy for Mixed-Cell-Height Legalization Acceleration | Xingyu Liu, Jiawei Liang, Linfeng Du, Yipu Zhang, Chaofang Ma, Hanwei Fan, Jiang Xu, Wei Zhang | 2025-12-04 | Download | In this work, we present FLEX, an FPGA-CPU accelerator for mixed-cell-height legalization tasks. We address challenges from the following perspectives. |
| Context-Aware Mixture-of-Experts Inference on CXL-Enabled GPU-NDP Systems | Zehao Fan, Zhenyu Liu, Yunzhen Liu, Yayue Hou, Hadjer Benmeziane, Kaoutar El Maghraoui, Liu Liu | 2025-12-04 | Download | Mixture-of-Experts (MoE) models scale large language models through conditional computation, but inference becomes memory-bound once expert weights exceed the capacity of GPU memory. |
| RRAM-Based Analog Matrix Computing for Massive MIMO Signal Processing: A Review | Pushen Zuo, Zhong Sun | 2025-12-04 | Download | Resistive random-access memory (RRAM) provides an excellent platform for analog matrix computing (AMC), enabling both matrix-vector multiplication (MVM) and the solution of matrix equations through op... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| DABench-LLM: Standardized and In-Depth Benchmarking of Post-Moore Dataflow AI Accelerators for LLMs | Ziyu Hu, Zhiqing Zhong, Weijian Zheng, Zhijing Ye, Xuwei Tan, Xueru Zhang, Zheng Xie, Rajkumar Kettimuthu, Xiaodong Yu | 2025-12-04 | Download | The exponential growth of large language models has outpaced the capabilities of traditional CPU and GPU architectures due to the slowdown of Moore's Law. |
| NVLang: Unified Static Typing for Actor-Based Concurrency on the BEAM | Miguel de Oliveira Guerreiro | 2025-12-04 | Download | Actor-based systems like Erlang/OTP power critical infrastructure -- from telecommunications to messaging platforms -- handling millions of concurrent connections with legendary reliability. |
| Federated Learning for Terahertz Wireless Communication | O. Tansel Baydas, Ozgur B. Akan | 2025-12-04 | Download | The convergence of Terahertz (THz) communications and Federated Learning (FL) promises ultra-fast distributed learning, yet the impact of realistic wideband impairments on optimization dynamics remain... |
| FLEX: Leveraging FPGA-CPU Synergy for Mixed-Cell-Height Legalization Acceleration | Xingyu Liu, Jiawei Liang, Linfeng Du, Yipu Zhang, Chaofang Ma, Hanwei Fan, Jiang Xu, Wei Zhang | 2025-12-04 | Download | In this work, we present FLEX, an FPGA-CPU accelerator for mixed-cell-height legalization tasks. We address challenges from the following perspectives. |
| Offloading to CXL-based Computational Memory | Suyeon Lee, Kangkyu Park, Kwangsik Shin, Ada Gavrilovska | 2025-12-04 | Download | CXL-based Computational Memory (CCM) enables near-memory processing within expanded remote memory, presenting opportunities to address data movement costs associated with disaggregated memory systems ... |
| Reducing Fragmentation and Starvation in GPU Clusters through Dynamic Multi-Objective Scheduling | Akhmadillo Mamirov | 2025-12-04 | Download | GPU clusters have become essential for training and deploying modern AI systems, yet real deployments continue to report average utilization near 50%. |
| A Structure-Aware Irregular Blocking Method for Sparse LU Factorization | Zhen Hu, Dongliang Xiong, Kai Huang, Changjun Wu, Xiaowen Jiang | 2025-12-04 | Download | In sparse LU factorization, nonzero elements after symbolic factorization tend to distribute in diagonal and right-bottom region of sparse matrices. |
| Counting Without Running: Evaluating LLMs' Reasoning About Code Complexity | Gregory Bolet, Giorgis Georgakoudis, Konstantinos Parasyris, Harshitha Menon, Niranjan Hasabnis, Kirk W. Cameron, Gal Oren | 2025-12-04 | Download | Modern GPU software stacks demand developers who can anticipate performance bottlenecks before ever launching a kernel; misjudging floating-point workloads upstream can derail tuning, scheduling, and ... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Hierarchical Reinforcement Learning for the Dynamic VNE with Alternatives Problem | Ali Al Housseini, Cristina Rottondi, Omran Ayoub | 2025-12-04 | Download | Virtual Network Embedding (VNE) is a key enabler of network slicing, yet most formulations assume that each Virtual Network Request (VNR) has a fixed topology. |
| MuMeNet: A Network Simulator for Musical Metaverse Communications | Ali Al Housseini, Jaime Llorca, Luca Turchet, Tiziano Leidi, Cristina Rottondi, Omran Ayoub | 2025-12-04 | Download | The Metaverse, a shared and spatially organized digital continuum, is transforming various industries, with music emerging as a leading use case. |
| Deadline-Aware Scheduling of Distributed Quantum Circuits in Near-Term Quantum Cloud | Nour Dehaini, Christia Chahoud, Mahdi Chehimi | 2025-12-04 | Download | Distributed quantum computing (DQC) enables scalable quantum computations by distributing large quantum circuits on multiple quantum processing units (QPUs) in the quantum cloud. |
| Timely Information for Strategic Persuasion | Ahmet Bugra Gundogan, Melih Bastopcu | 2025-12-04 | Download | This work investigates a dynamic variant of Bayesian persuasion, in which a strategic sender seeks to influence a receiver's belief over time through controlling the timing of the information disclosu... |
| Vision and Causal Learning Based Channel Estimation for THz Communications | Kitae Kim, Yan Kyaw Tun, Md. Shirajum Munir, Chirsto Kurisummoottil Thomas, Walid Saad, Choong Seon Hong | 2025-12-04 | Download | The use of terahertz (THz) communications with massive multiple input multiple output (MIMO) systems in 6G can potentially provide high data rates and low latency communications. |
| Making Cellular Networks Crisis-Proof: Towards Island-Ready, Resilient-By-Design 6G Communication Network | Leon Janzen, Matthias Hollick | 2025-12-04 | Download | 5G and 5G-Advanced cellular networks are vulnerable to regional outages resulting from disasters or targeted attacks. This fragility stems from the reliance on the central core network involved for mo... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| DABench-LLM: Standardized and In-Depth Benchmarking of Post-Moore Dataflow AI Accelerators for LLMs | Ziyu Hu, Zhiqing Zhong, Weijian Zheng, Zhijing Ye, Xuwei Tan, Xueru Zhang, Zheng Xie, Rajkumar Kettimuthu, Xiaodong Yu | 2025-12-04 | Download | The exponential growth of large language models has outpaced the capabilities of traditional CPU and GPU architectures due to the slowdown of Moore's Law. |
| AutoGuard: A Self-Healing Proactive Security Layer for DevSecOps Pipelines Using Reinforcement Learning | Praveen Anugula, Avdhesh Kumar Bhardwaj, Navin Chhibber, Rohit Tewari, Sunil Khemka, Piyush Ranjan | 2025-12-04 | Download | Contemporary DevSecOps pipelines have to deal with the evolution of security in an ever-continuously integrated and deployed environment. Existing methods, such as rule-based intrusion detection and st... |
| Counting Without Running: Evaluating LLMs' Reasoning About Code Complexity | Gregory Bolet, Giorgis Georgakoudis, Konstantinos Parasyris, Harshitha Menon, Niranjan Hasabnis, Kirk W. Cameron, Gal Oren | 2025-12-04 | Download | Modern GPU software stacks demand developers who can anticipate performance bottlenecks before ever launching a kernel; misjudging floating-point workloads upstream can derail tuning, scheduling, and ... |
