2025-02-23

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Optimizing Coverage-Driven Verification Using Machine Learning and PyUVM: A Novel Approach	Suruchi Kumari, Deepak Narayan Gadde, Aman Kumar	2025-02-23	下载	The escalating complexity of System-on-Chip (SoC) designs has created a bottleneck in verification, with traditional techniques struggling to achieve complete coverage.
A Quarter of a Century of Neuromorphic Architectures on FPGAs -- an Overview	Wiktor J. Szczerek, Artur Podobas	2025-02-23	下载	Neuromorphic computing is a relatively new discipline of computer science, where the principles of biological brain's computation and memory are used to create a new way of processing information, bas...
D2S-FLOW: Automated Parameter Extraction from Datasheets for SPICE Model Generation Using Large Language Models	Hong Cai Chen, Yi Pin Xu, Yang Zhang	2025-02-23	下载	In electronic design, engineers often manually search through extensive documents to retrieve component parameters required for constructing SPICE models, a process that is both labor-intensive and ti...
TerEffic: Highly Efficient Ternary LLM Inference on FPGA	Chenyang Yin, Zhenyu Bai, Pranav Venkatram, Shivam Aggarwal, Zhaoying Li, Tulika Mitra	2025-02-23	下载	Deploying Large Language Models (LLMs) efficiently on edge devices is often constrained by limited memory capacity and high power consumption.
Bancroft: Genomics Acceleration Beyond On-Device Memory	Se-Min Lim, Seongyoung Kang, Sang-Woo Jun	2025-02-23	下载	This paper presents Bancroft, a computational genomics acceleration platform that provides the illusion of practically infinite on-device memory capacity by compressing genomic data movement over PCIe...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Toward Responsible Federated Large Language Models: Leveraging a Safety Filter and Constitutional AI	Eunchung Noh, Jeonghun Baek	2025-02-23	下载	Recent research has increasingly focused on training large language models (LLMs) using federated learning, known as FedLLM. However, responsible AI (RAI), which aims to ensure safe responses, remains...
CRIUgpu: Transparent Checkpointing of GPU-Accelerated Workloads	Radostin Stoyanov, Viktória Spišaková, Jesus Ramos, Steven Gurfinkel, Andrei Vagin, Adrian Reber, Wesley Armour, Rodrigo Bruno	2025-02-23	下载	Deep learning training at scale is resource-intensive and time-consuming, often running across hundreds or thousands of GPUs for weeks or months.
SUperman: Efficient Permanent Computation on GPUs	Deniz Elbek, Fatih Taşyaran, Bora Uçar, Kamer Kaya	2025-02-23	下载	The permanent is a function, defined for a square matrix, with applications in various domains including quantum computing, statistical physics, complexity theory, combinatorics, and graph theory.
An Analytical Overview Of Virtual Machine Load Balancing Scheduling Algorithms with their Comparative Case Study	Priyank Vaidya, Abhinav Sharma, Murli Patel	2025-02-23	下载	Efficient virtual machine load balancing scheduling is crucial in cloud computing to optimize resource utilization and system performance. To address this issue, several load balancing scheduling algo...
Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM	Yao Zhang, Yuyi Mao, Hui Wang, Zhiwen Yu, Song Guo, Jun Zhang, Liang Wang, Bin Guo	2025-02-23	下载	Visual Simultaneous Localization and Mapping (vSLAM) is a prevailing technology for many emerging robotic applications. Achieving real-time SLAM on mobile robotic systems with limited computational re...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Optimal Indoor AP Placement: A Case Study	Farah Natiq, Qutaiba I. Ali	2025-02-23	下载	Wireless networks in a room are strongly affected by interferences. To alleviate these effects and enhance the performance of the wireless networks, some optimization was carried out.
Combining Heuristic and Reinforcement Learning to Achieve the Low-latency and High-throughput Receiver-side Congestion Control	Xianliang Jiang, Guanghui Gong, Guang Jin	2025-02-23	下载	Traditional congestion control algorithms struggle to maintain the consistent and satisfactory data transmission performance over time-varying networking condition.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification	Arshia Kermani, Ehsan Zeraatkar, Habib Irani	2025-02-23	下载	The increasing computational demands of transformer models in time series classification necessitate effective optimization strategies for energy-efficient deployment.
Annotation-guided AoS-to-SoA conversions and GPU offloading with data views in C++	Pawel K. Radtke, Tobias Weinzierl	2025-02-23	下载	The C++ programming language provides classes and structs as fundamental modeling entities. Consequently, C++ code tends to favour array-of-structs (AoS) for encoding data sequences, even though struc...