2025-11-15

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Pushing the Memory Bandwidth Wall with CXL-enabled Idle I/O Bandwidth Harvesting	Divya Kiran Kadiyala, Alexandros Daglis	2025-11-15	下载	The continual increase of cores on server-grade CPUs raises demands on memory systems, which are constrained by limited off-chip pin and data transfer rate scalability.
Sangam: Chiplet-Based DRAM-PIM Accelerator with CXL Integration for LLM Inferencing	Khyati Kiyawat, Zhenxing Fan, Yasas Seneviratne, Morteza Baradaran, Akhil Shekar, Zihan Xia, Mingu Kang, Kevin Skadron	2025-11-15	下载	Large Language Models (LLMs) are becoming increasingly data-intensive due to growing model sizes, and they are becoming memory-bound as the context length and, consequently, the key-value (KV) cache s...
eFPE: Design, Implementation, and Evaluation of a Lightweight Format-Preserving Encryption Algorithm for Embedded Systems	Nishant Vasantkumar Hegde, Suneesh Bare, K B Ramesh, Aamir Ibrahim	2025-11-15	下载	Resource-constrained embedded systems demand secure yet lightweight data protection, particularly when data formats must be preserved. This paper introduces eFPE (Enhanced Format-Preserving Encryption...
A Digital SRAM-Based Compute-In-Memory Macro for Weight-Stationary Dynamic Matrix Multiplication in Transformer Attention Score Computation	Jianyi Yu, Tengxiao Wang, Yuxuan Wang, Xiang Fu, Fei Qiao, Ying Wang, Rui Yuan, Liyuan Liu, Cong Shi	2025-11-15	下载	Compute-in-memory (CIM) techniques are widely employed in energy-efficient artificial intelligent (AI) processors. They alleviate power and latency bottlenecks caused by extensive data movements betwe...
TIMERIPPLE: Accelerating vDiTs by Understanding the Spatio-Temporal Correlations in Latent Space	Wenxuan Miao, Yulin Sun, Aiyue Chen, Jing Lin, Yiwu Yao, Yiming Gan, Jieru Zhao, Jingwen Leng, Mingyi Guo, Yu Feng	2025-11-15	下载	The recent surge in video generation has shown the growing demand for high-quality video synthesis using large vision models. Existing video generation models are predominantly based on the video diff...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
A novel strategy for multi-resource load balancing in agent-based systems	Leszek Sliwko, Aleksander Zgrzywa	2025-11-15	下载	The paper presents a multi-resource load balancing strategy which can be utilised within an agent-based system. This approach can assist system designers in their attempts to optimise the structure fo...
Distributed Seasonal Temporal Pattern Mining	Van Ho-Long, Nguyen Ho, Anh-Vu Dinh-Duc, Ha Manh Tran, Ky Trung Nguyen, Tran Dung Pham, Quoc Viet Hung Nguyen	2025-11-15	下载	The explosive growth of IoT-enabled sensors is producing enormous amounts of time series data across many domains, offering valuable opportunities to extract insights through temporal pattern mining.
Combining Serverless and High-Performance Computing Paradigms to support ML Data-Intensive Applications	Mills Staylor, Arup Kumar Sarker, Gregor von Laszewski, Geoffrey Fox, Yue Cheng, Judy Fox	2025-11-15	下载	Data is found everywhere, from health and human infrastructure to the surge of sensors and the proliferation of internet-connected devices. To meet this challenge, the data engineering field has expan...
PipeDiT: Accelerating Diffusion Transformers in Video Generation with Task Pipelining and Model Decoupling	Sijie Wang, Qiang Wang, Shaohuai Shi	2025-11-15	下载	Video generation has been advancing rapidly, and diffusion transformer (DiT) based models have demonstrated remark- able capabilities. However, their practical deployment is of- ten hindered by slow i...
Striking the Right Balance between Compute and Copy: Improving LLM Inferencing Under Speculative Decoding	Arun Ramachandran, Ramaswamy Govindarajan, Murali Annavaram, Prakash Raghavendra, Hossein Entezari Zarch, Lei Gao, Chaoyi Jiang	2025-11-15	下载	With the skyrocketing costs of GPUs and their virtual instances in the cloud, there is a significant desire to use CPUs for large language model (LLM) inference.
A Quick and Exact Method for Distributed Quantile Computation	Ivan Cao, Jaromir J. Saloni, David A. G. Harrison	2025-11-15	下载	Quantile computation is a core primitive in large-scale data analytics. In Spark, practitioners typically rely on the Greenwald-Khanna (GK) Sketch, an approximate method.
High-Performance N-Queens Solver on GPU: Iterative DFS with Zero Bank Conflicts	Guangchao Yao, Yali Li	2025-11-15	下载	The counting of solutions to the N-Queens problem is a classic NP-complete problem with extremely high computational complexity. As of now, the academic community has rigorously verified the number of...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Joint Optimization of RU Allocation and C-SR in Multi-AP Coordinated Wi-Fi Systems	Md Rahat Hasan, Kazi Ahmed Akbar Munim, Md. Forkan Uddin	2025-11-15	下载	We formulate an optimization problem for joint RU allocation and C-SR to maximize the throughput of a multi-AP coordinated WiFi system. The optimization problem is found to be a non-linear integer pro...
SenseRay-3D: Generalizable and Physics-Informed Framework for End-to-End Indoor Propagation Modeling	Yu Zheng, Kezhi Wang, Wenji Xi, Gang Yu, Jiming Chen, Jie Zhang	2025-11-15	下载	Modeling indoor radio propagation is crucial for wireless network planning and optimization. However, existing approaches often rely on labor-intensive manual modeling of geometry and material propert...
A Bio-Inspired Leader-based Energy Management System for Drone Fleets	Rosario Napoli, Antonio Celesti, Massimo Villari, Maria Fazio	2025-11-15	下载	Drones are embedded systems (ES) used across a wide range of fields, from photography to shipments and even during crisis management for searching, rescuing and damage assessment activities.
LithoSeg: A Coarse-to-Fine Framework for High-Precision Lithography Segmentation	Xinyu He, Botong Zhao, Bingbing Li, Shujing Lyu, Jiwei Shen, Yue Lu	2025-11-15	下载	Accurate segmentation and measurement of lithography scanning electron microscope (SEM) images are crucial for ensuring precise process control, optimizing device performance, and advancing semiconduc...