Skip to content

2025-02-23

cs.AR - Architecture

标题作者发布日期PDF摘要
Optimizing Coverage-Driven Verification Using Machine Learning and PyUVM: A Novel ApproachSuruchi Kumari, Deepak Narayan Gadde, Aman Kumar2025-02-23下载The escalating complexity of System-on-Chip (SoC) designs has created a bottleneck in verification, with traditional techniques struggling to achieve complete coverage.
A Quarter of a Century of Neuromorphic Architectures on FPGAs -- an OverviewWiktor J. Szczerek, Artur Podobas2025-02-23下载Neuromorphic computing is a relatively new discipline of computer science, where the principles of biological brain's computation and memory are used to create a new way of processing information, bas...
D2S-FLOW: Automated Parameter Extraction from Datasheets for SPICE Model Generation Using Large Language ModelsHong Cai Chen, Yi Pin Xu, Yang Zhang2025-02-23下载In electronic design, engineers often manually search through extensive documents to retrieve component parameters required for constructing SPICE models, a process that is both labor-intensive and ti...
TerEffic: Highly Efficient Ternary LLM Inference on FPGAChenyang Yin, Zhenyu Bai, Pranav Venkatram, Shivam Aggarwal, Zhaoying Li, Tulika Mitra2025-02-23下载Deploying Large Language Models (LLMs) efficiently on edge devices is often constrained by limited memory capacity and high power consumption.
Bancroft: Genomics Acceleration Beyond On-Device MemorySe-Min Lim, Seongyoung Kang, Sang-Woo Jun2025-02-23下载This paper presents Bancroft, a computational genomics acceleration platform that provides the illusion of practically infinite on-device memory capacity by compressing genomic data movement over PCIe...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Toward Responsible Federated Large Language Models: Leveraging a Safety Filter and Constitutional AIEunchung Noh, Jeonghun Baek2025-02-23下载Recent research has increasingly focused on training large language models (LLMs) using federated learning, known as FedLLM. However, responsible AI (RAI), which aims to ensure safe responses, remains...
CRIUgpu: Transparent Checkpointing of GPU-Accelerated WorkloadsRadostin Stoyanov, Viktória Spišaková, Jesus Ramos, Steven Gurfinkel, Andrei Vagin, Adrian Reber, Wesley Armour, Rodrigo Bruno2025-02-23下载Deep learning training at scale is resource-intensive and time-consuming, often running across hundreds or thousands of GPUs for weeks or months.
SUperman: Efficient Permanent Computation on GPUsDeniz Elbek, Fatih Taşyaran, Bora Uçar, Kamer Kaya2025-02-23下载The permanent is a function, defined for a square matrix, with applications in various domains including quantum computing, statistical physics, complexity theory, combinatorics, and graph theory.
An Analytical Overview Of Virtual Machine Load Balancing Scheduling Algorithms with their Comparative Case StudyPriyank Vaidya, Abhinav Sharma, Murli Patel2025-02-23下载Efficient virtual machine load balancing scheduling is crucial in cloud computing to optimize resource utilization and system performance. To address this issue, several load balancing scheduling algo...
Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAMYao Zhang, Yuyi Mao, Hui Wang, Zhiwen Yu, Song Guo, Jun Zhang, Liang Wang, Bin Guo2025-02-23下载Visual Simultaneous Localization and Mapping (vSLAM) is a prevailing technology for many emerging robotic applications. Achieving real-time SLAM on mobile robotic systems with limited computational re...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Optimal Indoor AP Placement: A Case StudyFarah Natiq, Qutaiba I. Ali2025-02-23下载Wireless networks in a room are strongly affected by interferences. To alleviate these effects and enhance the performance of the wireless networks, some optimization was carried out.
Combining Heuristic and Reinforcement Learning to Achieve the Low-latency and High-throughput Receiver-side Congestion ControlXianliang Jiang, Guanghui Gong, Guang Jin2025-02-23下载Traditional congestion control algorithms struggle to maintain the consistent and satisfactory data transmission performance over time-varying networking condition.

cs.PF - Performance

标题作者发布日期PDF摘要
Energy-Efficient Transformer Inference: Optimization Strategies for Time Series ClassificationArshia Kermani, Ehsan Zeraatkar, Habib Irani2025-02-23下载The increasing computational demands of transformer models in time series classification necessitate effective optimization strategies for energy-efficient deployment.
Annotation-guided AoS-to-SoA conversions and GPU offloading with data views in C++Pawel K. Radtke, Tobias Weinzierl2025-02-23下载The C++ programming language provides classes and structs as fundamental modeling entities. Consequently, C++ code tends to favour array-of-structs (AoS) for encoding data sequences, even though struc...

基于 VitePress 构建