Skip to content

2025-05-31

cs.AR - Architecture

标题作者发布日期PDF摘要
Processing-in-memory for genomics workloadsWilliam Andrew Simon, Leonid Yavits, Konstantina Koliogeorgi, Yann Falevoz, Yoshihiro Shibuya, Dominique Lavenier, Irem Boybat, Klea Zambaku, Berkan Şahin, Mohammad Sadrosadati, Onur Mutlu, Abu Sebastian, Rayan Chikhi, The BioPIM Consortium, Can Alkan2025-05-31下载Low-cost, high-throughput DNA and RNA sequencing (HTS) data is the backbone of the life sciences. Genome sequencing is now becoming a part of Predictive, Preventive, Personalized, and Participatory (t...
Bridging the Gap between Hardware Fuzzing and Industrial VerificationRuiyang Ma, Tianhao Wei, Jiaxi Zhang, Chun Yang, Jiangfang Yi, Guojie Luo2025-05-31下载As hardware design complexity increases, hardware fuzzing emerges as a promising tool for automating the verification process. However, a significant gap still exists before it can be applied in indus...
PointODE: Lightweight Point Cloud Learning with Neural Ordinary Differential Equations on EdgeKeisuke Sugiura, Mizuki Yasuda, Hiroki Matsutani2025-05-31下载Embedded edge devices are often used as a computing platform to run real-world point cloud applications, but recent deep learning-based methods may not fit on such devices due to limited resources.
COGNATE: Acceleration of Sparse Tensor Programs on Emerging Hardware using Transfer LearningChamika Sudusinghe, Gerasimos Gerogiannis, Damitha Lenadora, Charles Block, Josep Torrellas, Charith Mendis2025-05-31下载Sparse tensor programs are essential in deep learning and graph analytics, driving the need for optimized processing. To meet this demand, specialized hardware accelerators are being developed.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
The workflow motif: a widely-useful performance diagnosis abstraction for distributed applicationsMania Abdi, Peter Desnoyers, Mark Crovella, Raja R. Sambasivan2025-05-31下载Diagnosing problems in deployed distributed applications continues to grow more challenging. A significant reason is the extreme mismatch between the powerful abstractions developers have available to...
Assortment of Attention Heads: Accelerating Federated PEFT with Head Pruning and Strategic Client SelectionYeshwanth Venkatesha, Souvik Kundu, Priyadarshini Panda2025-05-31下载Parameter Efficient Fine-Tuning (PEFT) has become the de-facto approach in adapting Large Language Models (LLMs) for downstream tasks in Natural Language Processing.
Federated learning framework for collaborative remaining useful life prognostics: an aircraft engine case studyDiogo Landau, Ingeborg de Pater, Mihaela Mitici, Nishant Saurabh2025-05-31下载Complex systems such as aircraft engines are continuously monitored by sensors. In predictive aircraft maintenance, the collected sensor measurements are used to estimate the health condition and the ...
Learning Semantics, Not Addresses: Runtime Neural Prefetching for Far MemoryYutong Huang, Zhiyuan Guo, Yiying Zhang2025-05-31下载Memory prefetching has long boosted CPU caches and is increasingly vital for far-memory systems, where large portions of memory are offloaded to cheaper, remote tiers.
Enabling Secure and Ephemeral AI Workloads in Data Mesh EnvironmentsChinkit Patel, Kee Siong Ng2025-05-31下载Many large enterprises that operate highly governed and complex ICT environments have no efficient and effective way to support their Data and AI teams in rapidly spinning up and tearing down self-ser...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Pinching Antenna-Aided Wireless Powered Communication NetworksYixuan Li, Hongbo Xu, Ming Zeng, Yuanwei Liu2025-05-31下载In this letter, we investigate a novel pinching antenna (PA)-aided wireless powered communication network (WPCN), in which multiple PAs are activated along a waveguide to establish robust line-of-sigh...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Learning Semantics, Not Addresses: Runtime Neural Prefetching for Far MemoryYutong Huang, Zhiyuan Guo, Yiying Zhang2025-05-31下载Memory prefetching has long boosted CPU caches and is increasingly vital for far-memory systems, where large portions of memory are offloaded to cheaper, remote tiers.

cs.PF - Performance

标题作者发布日期PDF摘要
Accelerating Diffusion LLMs via Adaptive Parallel DecodingDaniel Israel, Guy Van den Broeck, Aditya Grover2025-05-31下载The generation speed of LLMs are bottlenecked by autoregressive decoding, where tokens are predicted sequentially one by one. Alternatively, diffusion large language models (dLLMs) theoretically allow...

基于 VitePress 构建