Appearance
2025-12-04
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| DABench-LLM: Standardized and In-Depth Benchmarking of Post-Moore Dataflow AI Accelerators for LLMs | Ziyu Hu, Zhiqing Zhong, Weijian Zheng, Zhijing Ye, Xuwei Tan, Xueru Zhang, Zheng Xie, Rajkumar Kettimuthu, Xiaodong Yu | 2025-12-04 | 下载 | The exponential growth of large language models has outpaced the capabilities of traditional CPU and GPU architectures due to the slowdown of Moore's Law. |
| ARCAS: An Augmented Reality Collision Avoidance System with SLAM-Based Tracking for Enhancing VRU Safety | Ahmad Yehia, Jiseop Byeon, Tianyi Wang, Huihai Wang, Yiming Xu, Junfeng Jiao, Christian Claudel | 2025-12-04 | 下载 | Vulnerable road users (VRUs) face high collision risks in mixed traffic, yet most existing safety systems prioritize driver or vehicle assistance over direct VRU support. |
| David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design? | Shashwat Shankar, Subhranshu Pandey, Innocent Dengkhw Mochahari, Bhabesh Mali, Animesh Basak Chowdhury, Sukanta Bhattacharjee, Chandan Karfa | 2025-12-04 | 下载 | Large Language Model(LLM) inference demands massive compute and energy, making domain-specific tasks expensive and unsustainable. As foundation models keep scaling, we ask: Is bigger always better for... |
| Declarative Synthesis and Multi-Objective Optimization of Stripboard Circuit Layouts Using Answer Set Programming | Fang Li | 2025-12-04 | 下载 | This paper presents a novel approach to automated stripboard circuit layout design using Answer Set Programming (ASP). The work formulates the layout problem as both a synthesis and multi-objective op... |
| Functional Stability of Software-Hardware Neural Network Implementation The NeuroComp Project | Bychkov Oleksii, Senysh Taras | 2025-12-04 | 下载 | This paper presents an innovative approach to ensuring functional stability of neural networks through hardware redundancy at the individual neuron level. |
| Hardware-Algorithm Co-Optimization of Early-Exit Neural Networks for Multi-Core Edge Accelerators | Alaa Zniber, Arne Symons, Ouassim Karrakchou, Marian Verhelst, Mounir Ghogho | 2025-12-04 | 下载 | Deployment of dynamic neural networks on edge accelerators requires careful consideration of hardware constraints beyond conventional complexity metrics such as Multiply-Accumulate operations. |
| FLEX: Leveraging FPGA-CPU Synergy for Mixed-Cell-Height Legalization Acceleration | Xingyu Liu, Jiawei Liang, Linfeng Du, Yipu Zhang, Chaofang Ma, Hanwei Fan, Jiang Xu, Wei Zhang | 2025-12-04 | 下载 | In this work, we present FLEX, an FPGA-CPU accelerator for mixed-cell-height legalization tasks. We address challenges from the following perspectives. |
| Context-Aware Mixture-of-Experts Inference on CXL-Enabled GPU-NDP Systems | Zehao Fan, Zhenyu Liu, Yunzhen Liu, Yayue Hou, Hadjer Benmeziane, Kaoutar El Maghraoui, Liu Liu | 2025-12-04 | 下载 | Mixture-of-Experts (MoE) models scale large language models through conditional computation, but inference becomes memory-bound once expert weights exceed the capacity of GPU memory. |
| RRAM-Based Analog Matrix Computing for Massive MIMO Signal Processing: A Review | Pushen Zuo, Zhong Sun | 2025-12-04 | 下载 | Resistive random-access memory (RRAM) provides an excellent platform for analog matrix computing (AMC), enabling both matrix-vector multiplication (MVM) and the solution of matrix equations through op... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| DABench-LLM: Standardized and In-Depth Benchmarking of Post-Moore Dataflow AI Accelerators for LLMs | Ziyu Hu, Zhiqing Zhong, Weijian Zheng, Zhijing Ye, Xuwei Tan, Xueru Zhang, Zheng Xie, Rajkumar Kettimuthu, Xiaodong Yu | 2025-12-04 | 下载 | The exponential growth of large language models has outpaced the capabilities of traditional CPU and GPU architectures due to the slowdown of Moore's Law. |
| NVLang: Unified Static Typing for Actor-Based Concurrency on the BEAM | Miguel de Oliveira Guerreiro | 2025-12-04 | 下载 | Actor-based systems like Erlang/OTP power critical infrastructure -- from telecommunications to messaging platforms -- handling millions of concurrent connections with legendary reliability. |
| Federated Learning for Terahertz Wireless Communication | O. Tansel Baydas, Ozgur B. Akan | 2025-12-04 | 下载 | The convergence of Terahertz (THz) communications and Federated Learning (FL) promises ultra-fast distributed learning, yet the impact of realistic wideband impairments on optimization dynamics remain... |
| FLEX: Leveraging FPGA-CPU Synergy for Mixed-Cell-Height Legalization Acceleration | Xingyu Liu, Jiawei Liang, Linfeng Du, Yipu Zhang, Chaofang Ma, Hanwei Fan, Jiang Xu, Wei Zhang | 2025-12-04 | 下载 | In this work, we present FLEX, an FPGA-CPU accelerator for mixed-cell-height legalization tasks. We address challenges from the following perspectives. |
| Offloading to CXL-based Computational Memory | Suyeon Lee, Kangkyu Park, Kwangsik Shin, Ada Gavrilovska | 2025-12-04 | 下载 | CXL-based Computational Memory (CCM) enables near-memory processing within expanded remote memory, presenting opportunities to address data movement costs associated with disaggregated memory systems ... |
| Reducing Fragmentation and Starvation in GPU Clusters through Dynamic Multi-Objective Scheduling | Akhmadillo Mamirov | 2025-12-04 | 下载 | GPU clusters have become essential for training and deploying modern AI systems, yet real deployments continue to report average utilization near 50%. |
| A Structure-Aware Irregular Blocking Method for Sparse LU Factorization | Zhen Hu, Dongliang Xiong, Kai Huang, Changjun Wu, Xiaowen Jiang | 2025-12-04 | 下载 | In sparse LU factorization, nonzero elements after symbolic factorization tend to distribute in diagonal and right-bottom region of sparse matrices. |
| Counting Without Running: Evaluating LLMs' Reasoning About Code Complexity | Gregory Bolet, Giorgis Georgakoudis, Konstantinos Parasyris, Harshitha Menon, Niranjan Hasabnis, Kirk W. Cameron, Gal Oren | 2025-12-04 | 下载 | Modern GPU software stacks demand developers who can anticipate performance bottlenecks before ever launching a kernel; misjudging floating-point workloads upstream can derail tuning, scheduling, and ... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Hierarchical Reinforcement Learning for the Dynamic VNE with Alternatives Problem | Ali Al Housseini, Cristina Rottondi, Omran Ayoub | 2025-12-04 | 下载 | Virtual Network Embedding (VNE) is a key enabler of network slicing, yet most formulations assume that each Virtual Network Request (VNR) has a fixed topology. |
| MuMeNet: A Network Simulator for Musical Metaverse Communications | Ali Al Housseini, Jaime Llorca, Luca Turchet, Tiziano Leidi, Cristina Rottondi, Omran Ayoub | 2025-12-04 | 下载 | The Metaverse, a shared and spatially organized digital continuum, is transforming various industries, with music emerging as a leading use case. |
| Deadline-Aware Scheduling of Distributed Quantum Circuits in Near-Term Quantum Cloud | Nour Dehaini, Christia Chahoud, Mahdi Chehimi | 2025-12-04 | 下载 | Distributed quantum computing (DQC) enables scalable quantum computations by distributing large quantum circuits on multiple quantum processing units (QPUs) in the quantum cloud. |
| Timely Information for Strategic Persuasion | Ahmet Bugra Gundogan, Melih Bastopcu | 2025-12-04 | 下载 | This work investigates a dynamic variant of Bayesian persuasion, in which a strategic sender seeks to influence a receiver's belief over time through controlling the timing of the information disclosu... |
| Vision and Causal Learning Based Channel Estimation for THz Communications | Kitae Kim, Yan Kyaw Tun, Md. Shirajum Munir, Chirsto Kurisummoottil Thomas, Walid Saad, Choong Seon Hong | 2025-12-04 | 下载 | The use of terahertz (THz) communications with massive multiple input multiple output (MIMO) systems in 6G can potentially provide high data rates and low latency communications. |
| Making Cellular Networks Crisis-Proof: Towards Island-Ready, Resilient-By-Design 6G Communication Network | Leon Janzen, Matthias Hollick | 2025-12-04 | 下载 | 5G and 5G-Advanced cellular networks are vulnerable to regional outages resulting from disasters or targeted attacks. This fragility stems from the reliance on the central core network involved for mo... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| DABench-LLM: Standardized and In-Depth Benchmarking of Post-Moore Dataflow AI Accelerators for LLMs | Ziyu Hu, Zhiqing Zhong, Weijian Zheng, Zhijing Ye, Xuwei Tan, Xueru Zhang, Zheng Xie, Rajkumar Kettimuthu, Xiaodong Yu | 2025-12-04 | 下载 | The exponential growth of large language models has outpaced the capabilities of traditional CPU and GPU architectures due to the slowdown of Moore's Law. |
| AutoGuard: A Self-Healing Proactive Security Layer for DevSecOps Pipelines Using Reinforcement Learning | Praveen Anugula, Avdhesh Kumar Bhardwaj, Navin Chhibber, Rohit Tewari, Sunil Khemka, Piyush Ranjan | 2025-12-04 | 下载 | Contemporary DevSecOps pipelines have to deal with the evolution of security in an ever-continuously integrated and deployed environment. Existing methods,such as rule-based intrusion detection and st... |
| Counting Without Running: Evaluating LLMs' Reasoning About Code Complexity | Gregory Bolet, Giorgis Georgakoudis, Konstantinos Parasyris, Harshitha Menon, Niranjan Hasabnis, Kirk W. Cameron, Gal Oren | 2025-12-04 | 下载 | Modern GPU software stacks demand developers who can anticipate performance bottlenecks before ever launching a kernel; misjudging floating-point workloads upstream can derail tuning, scheduling, and ... |