Appearance
2025-02-23
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Optimizing Coverage-Driven Verification Using Machine Learning and PyUVM: A Novel Approach | Suruchi Kumari, Deepak Narayan Gadde, Aman Kumar | 2025-02-23 | 下载 | The escalating complexity of System-on-Chip (SoC) designs has created a bottleneck in verification, with traditional techniques struggling to achieve complete coverage. |
| A Quarter of a Century of Neuromorphic Architectures on FPGAs -- an Overview | Wiktor J. Szczerek, Artur Podobas | 2025-02-23 | 下载 | Neuromorphic computing is a relatively new discipline of computer science, where the principles of biological brain's computation and memory are used to create a new way of processing information, bas... |
| D2S-FLOW: Automated Parameter Extraction from Datasheets for SPICE Model Generation Using Large Language Models | Hong Cai Chen, Yi Pin Xu, Yang Zhang | 2025-02-23 | 下载 | In electronic design, engineers often manually search through extensive documents to retrieve component parameters required for constructing SPICE models, a process that is both labor-intensive and ti... |
| TerEffic: Highly Efficient Ternary LLM Inference on FPGA | Chenyang Yin, Zhenyu Bai, Pranav Venkatram, Shivam Aggarwal, Zhaoying Li, Tulika Mitra | 2025-02-23 | 下载 | Deploying Large Language Models (LLMs) efficiently on edge devices is often constrained by limited memory capacity and high power consumption. |
| Bancroft: Genomics Acceleration Beyond On-Device Memory | Se-Min Lim, Seongyoung Kang, Sang-Woo Jun | 2025-02-23 | 下载 | This paper presents Bancroft, a computational genomics acceleration platform that provides the illusion of practically infinite on-device memory capacity by compressing genomic data movement over PCIe... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Toward Responsible Federated Large Language Models: Leveraging a Safety Filter and Constitutional AI | Eunchung Noh, Jeonghun Baek | 2025-02-23 | 下载 | Recent research has increasingly focused on training large language models (LLMs) using federated learning, known as FedLLM. However, responsible AI (RAI), which aims to ensure safe responses, remains... |
| CRIUgpu: Transparent Checkpointing of GPU-Accelerated Workloads | Radostin Stoyanov, Viktória Spišaková, Jesus Ramos, Steven Gurfinkel, Andrei Vagin, Adrian Reber, Wesley Armour, Rodrigo Bruno | 2025-02-23 | 下载 | Deep learning training at scale is resource-intensive and time-consuming, often running across hundreds or thousands of GPUs for weeks or months. |
| SUperman: Efficient Permanent Computation on GPUs | Deniz Elbek, Fatih Taşyaran, Bora Uçar, Kamer Kaya | 2025-02-23 | 下载 | The permanent is a function, defined for a square matrix, with applications in various domains including quantum computing, statistical physics, complexity theory, combinatorics, and graph theory. |
| An Analytical Overview Of Virtual Machine Load Balancing Scheduling Algorithms with their Comparative Case Study | Priyank Vaidya, Abhinav Sharma, Murli Patel | 2025-02-23 | 下载 | Efficient virtual machine load balancing scheduling is crucial in cloud computing to optimize resource utilization and system performance. To address this issue, several load balancing scheduling algo... |
| Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM | Yao Zhang, Yuyi Mao, Hui Wang, Zhiwen Yu, Song Guo, Jun Zhang, Liang Wang, Bin Guo | 2025-02-23 | 下载 | Visual Simultaneous Localization and Mapping (vSLAM) is a prevailing technology for many emerging robotic applications. Achieving real-time SLAM on mobile robotic systems with limited computational re... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Optimal Indoor AP Placement: A Case Study | Farah Natiq, Qutaiba I. Ali | 2025-02-23 | 下载 | Wireless networks in a room are strongly affected by interferences. To alleviate these effects and enhance the performance of the wireless networks, some optimization was carried out. |
| Combining Heuristic and Reinforcement Learning to Achieve the Low-latency and High-throughput Receiver-side Congestion Control | Xianliang Jiang, Guanghui Gong, Guang Jin | 2025-02-23 | 下载 | Traditional congestion control algorithms struggle to maintain the consistent and satisfactory data transmission performance over time-varying networking condition. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification | Arshia Kermani, Ehsan Zeraatkar, Habib Irani | 2025-02-23 | 下载 | The increasing computational demands of transformer models in time series classification necessitate effective optimization strategies for energy-efficient deployment. |
| Annotation-guided AoS-to-SoA conversions and GPU offloading with data views in C++ | Pawel K. Radtke, Tobias Weinzierl | 2025-02-23 | 下载 | The C++ programming language provides classes and structs as fundamental modeling entities. Consequently, C++ code tends to favour array-of-structs (AoS) for encoding data sequences, even though struc... |