2025-11-15
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Pushing the Memory Bandwidth Wall with CXL-enabled Idle I/O Bandwidth Harvesting | Divya Kiran Kadiyala, Alexandros Daglis | 2025-11-15 | Download | The continual increase of cores on server-grade CPUs raises demands on memory systems, which are constrained by limited off-chip pin and data transfer rate scalability. |
| Sangam: Chiplet-Based DRAM-PIM Accelerator with CXL Integration for LLM Inferencing | Khyati Kiyawat, Zhenxing Fan, Yasas Seneviratne, Morteza Baradaran, Akhil Shekar, Zihan Xia, Mingu Kang, Kevin Skadron | 2025-11-15 | Download | Large Language Models (LLMs) are becoming increasingly data-intensive due to growing model sizes, and they are becoming memory-bound as the context length and, consequently, the key-value (KV) cache s... |
| eFPE: Design, Implementation, and Evaluation of a Lightweight Format-Preserving Encryption Algorithm for Embedded Systems | Nishant Vasantkumar Hegde, Suneesh Bare, K B Ramesh, Aamir Ibrahim | 2025-11-15 | Download | Resource-constrained embedded systems demand secure yet lightweight data protection, particularly when data formats must be preserved. This paper introduces eFPE (Enhanced Format-Preserving Encryption... |
| A Digital SRAM-Based Compute-In-Memory Macro for Weight-Stationary Dynamic Matrix Multiplication in Transformer Attention Score Computation | Jianyi Yu, Tengxiao Wang, Yuxuan Wang, Xiang Fu, Fei Qiao, Ying Wang, Rui Yuan, Liyuan Liu, Cong Shi | 2025-11-15 | Download | Compute-in-memory (CIM) techniques are widely employed in energy-efficient artificial intelligence (AI) processors. They alleviate power and latency bottlenecks caused by extensive data movements betwe... |
| TIMERIPPLE: Accelerating vDiTs by Understanding the Spatio-Temporal Correlations in Latent Space | Wenxuan Miao, Yulin Sun, Aiyue Chen, Jing Lin, Yiwu Yao, Yiming Gan, Jieru Zhao, Jingwen Leng, Mingyi Guo, Yu Feng | 2025-11-15 | Download | The recent surge in video generation has shown the growing demand for high-quality video synthesis using large vision models. Existing video generation models are predominantly based on the video diff... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| A novel strategy for multi-resource load balancing in agent-based systems | Leszek Sliwko, Aleksander Zgrzywa | 2025-11-15 | Download | The paper presents a multi-resource load balancing strategy which can be utilised within an agent-based system. This approach can assist system designers in their attempts to optimise the structure fo... |
| Distributed Seasonal Temporal Pattern Mining | Van Ho-Long, Nguyen Ho, Anh-Vu Dinh-Duc, Ha Manh Tran, Ky Trung Nguyen, Tran Dung Pham, Quoc Viet Hung Nguyen | 2025-11-15 | Download | The explosive growth of IoT-enabled sensors is producing enormous amounts of time series data across many domains, offering valuable opportunities to extract insights through temporal pattern mining. |
| Combining Serverless and High-Performance Computing Paradigms to support ML Data-Intensive Applications | Mills Staylor, Arup Kumar Sarker, Gregor von Laszewski, Geoffrey Fox, Yue Cheng, Judy Fox | 2025-11-15 | Download | Data is found everywhere, from health and human infrastructure to the surge of sensors and the proliferation of internet-connected devices. To meet this challenge, the data engineering field has expan... |
| PipeDiT: Accelerating Diffusion Transformers in Video Generation with Task Pipelining and Model Decoupling | Sijie Wang, Qiang Wang, Shaohuai Shi | 2025-11-15 | Download | Video generation has been advancing rapidly, and diffusion transformer (DiT) based models have demonstrated remarkable capabilities. However, their practical deployment is often hindered by slow i... |
| Striking the Right Balance between Compute and Copy: Improving LLM Inferencing Under Speculative Decoding | Arun Ramachandran, Ramaswamy Govindarajan, Murali Annavaram, Prakash Raghavendra, Hossein Entezari Zarch, Lei Gao, Chaoyi Jiang | 2025-11-15 | Download | With the skyrocketing costs of GPUs and their virtual instances in the cloud, there is a significant desire to use CPUs for large language model (LLM) inference. |
| A Quick and Exact Method for Distributed Quantile Computation | Ivan Cao, Jaromir J. Saloni, David A. G. Harrison | 2025-11-15 | Download | Quantile computation is a core primitive in large-scale data analytics. In Spark, practitioners typically rely on the Greenwald-Khanna (GK) Sketch, an approximate method. |
| High-Performance N-Queens Solver on GPU: Iterative DFS with Zero Bank Conflicts | Guangchao Yao, Yali Li | 2025-11-15 | Download | The counting of solutions to the N-Queens problem is a classic NP-complete problem with extremely high computational complexity. As of now, the academic community has rigorously verified the number of... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Joint Optimization of RU Allocation and C-SR in Multi-AP Coordinated Wi-Fi Systems | Md Rahat Hasan, Kazi Ahmed Akbar Munim, Md. Forkan Uddin | 2025-11-15 | Download | We formulate an optimization problem for joint RU allocation and C-SR to maximize the throughput of a multi-AP coordinated WiFi system. The optimization problem is found to be a non-linear integer pro... |
| SenseRay-3D: Generalizable and Physics-Informed Framework for End-to-End Indoor Propagation Modeling | Yu Zheng, Kezhi Wang, Wenji Xi, Gang Yu, Jiming Chen, Jie Zhang | 2025-11-15 | Download | Modeling indoor radio propagation is crucial for wireless network planning and optimization. However, existing approaches often rely on labor-intensive manual modeling of geometry and material propert... |
| A Bio-Inspired Leader-based Energy Management System for Drone Fleets | Rosario Napoli, Antonio Celesti, Massimo Villari, Maria Fazio | 2025-11-15 | Download | Drones are embedded systems (ES) used across a wide range of fields, from photography to shipments and even during crisis management for searching, rescuing and damage assessment activities. |
| LithoSeg: A Coarse-to-Fine Framework for High-Precision Lithography Segmentation | Xinyu He, Botong Zhao, Bingbing Li, Shujing Lyu, Jiwei Shen, Yue Lu | 2025-11-15 | Download | Accurate segmentation and measurement of lithography scanning electron microscope (SEM) images are crucial for ensuring precise process control, optimizing device performance, and advancing semiconduc... |