2025-05-31

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Processing-in-memory for genomics workloads	William Andrew Simon, Leonid Yavits, Konstantina Koliogeorgi, Yann Falevoz, Yoshihiro Shibuya, Dominique Lavenier, Irem Boybat, Klea Zambaku, Berkan Şahin, Mohammad Sadrosadati, Onur Mutlu, Abu Sebastian, Rayan Chikhi, The BioPIM Consortium, Can Alkan	2025-05-31	下载	Low-cost, high-throughput DNA and RNA sequencing (HTS) data is the backbone of the life sciences. Genome sequencing is now becoming a part of Predictive, Preventive, Personalized, and Participatory (t...
Bridging the Gap between Hardware Fuzzing and Industrial Verification	Ruiyang Ma, Tianhao Wei, Jiaxi Zhang, Chun Yang, Jiangfang Yi, Guojie Luo	2025-05-31	下载	As hardware design complexity increases, hardware fuzzing emerges as a promising tool for automating the verification process. However, a significant gap still exists before it can be applied in indus...
PointODE: Lightweight Point Cloud Learning with Neural Ordinary Differential Equations on Edge	Keisuke Sugiura, Mizuki Yasuda, Hiroki Matsutani	2025-05-31	下载	Embedded edge devices are often used as a computing platform to run real-world point cloud applications, but recent deep learning-based methods may not fit on such devices due to limited resources.
COGNATE: Acceleration of Sparse Tensor Programs on Emerging Hardware using Transfer Learning	Chamika Sudusinghe, Gerasimos Gerogiannis, Damitha Lenadora, Charles Block, Josep Torrellas, Charith Mendis	2025-05-31	下载	Sparse tensor programs are essential in deep learning and graph analytics, driving the need for optimized processing. To meet this demand, specialized hardware accelerators are being developed.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
The workflow motif: a widely-useful performance diagnosis abstraction for distributed applications	Mania Abdi, Peter Desnoyers, Mark Crovella, Raja R. Sambasivan	2025-05-31	下载	Diagnosing problems in deployed distributed applications continues to grow more challenging. A significant reason is the extreme mismatch between the powerful abstractions developers have available to...
Assortment of Attention Heads: Accelerating Federated PEFT with Head Pruning and Strategic Client Selection	Yeshwanth Venkatesha, Souvik Kundu, Priyadarshini Panda	2025-05-31	下载	Parameter Efficient Fine-Tuning (PEFT) has become the de-facto approach in adapting Large Language Models (LLMs) for downstream tasks in Natural Language Processing.
Federated learning framework for collaborative remaining useful life prognostics: an aircraft engine case study	Diogo Landau, Ingeborg de Pater, Mihaela Mitici, Nishant Saurabh	2025-05-31	下载	Complex systems such as aircraft engines are continuously monitored by sensors. In predictive aircraft maintenance, the collected sensor measurements are used to estimate the health condition and the ...
Learning Semantics, Not Addresses: Runtime Neural Prefetching for Far Memory	Yutong Huang, Zhiyuan Guo, Yiying Zhang	2025-05-31	下载	Memory prefetching has long boosted CPU caches and is increasingly vital for far-memory systems, where large portions of memory are offloaded to cheaper, remote tiers.
Enabling Secure and Ephemeral AI Workloads in Data Mesh Environments	Chinkit Patel, Kee Siong Ng	2025-05-31	下载	Many large enterprises that operate highly governed and complex ICT environments have no efficient and effective way to support their Data and AI teams in rapidly spinning up and tearing down self-ser...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Pinching Antenna-Aided Wireless Powered Communication Networks	Yixuan Li, Hongbo Xu, Ming Zeng, Yuanwei Liu	2025-05-31	下载	In this letter, we investigate a novel pinching antenna (PA)-aided wireless powered communication network (WPCN), in which multiple PAs are activated along a waveguide to establish robust line-of-sigh...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Learning Semantics, Not Addresses: Runtime Neural Prefetching for Far Memory	Yutong Huang, Zhiyuan Guo, Yiying Zhang	2025-05-31	下载	Memory prefetching has long boosted CPU caches and is increasingly vital for far-memory systems, where large portions of memory are offloaded to cheaper, remote tiers.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Accelerating Diffusion LLMs via Adaptive Parallel Decoding	Daniel Israel, Guy Van den Broeck, Aditya Grover	2025-05-31	下载	The generation speed of LLMs are bottlenecked by autoregressive decoding, where tokens are predicted sequentially one by one. Alternatively, diffusion large language models (dLLMs) theoretically allow...