2025-02-18

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
HARP: A Taxonomy for Heterogeneous and Hierarchical Processors for Mixed-reuse Workloads	Raveesh Garg, Michael Pellauer, Tushar Krishna	2025-02-18	Download	Artificial intelligence (AI) application domains consist of a mix of tensor operations with high and low arithmetic intensities (aka reuse). Hierarchical (i.e.
Variable Read Disturbance: An Experimental Analysis of Temporal Variation in DRAM Read Disturbance	Ataberk Olgun, F. Nisa Bostanci, Ismail Emir Yuksel, Oguzhan Canpolat, Haocong Luo, Geraldo F. Oliveira, A. Giray Yaglikci, Minesh Patel, Onur Mutlu	2025-02-18	Download	Modern DRAM chips are subject to read disturbance errors. State-of-the-art read disturbance mitigations rely on accurate and exhaustive characterization of the read disturbance threshold (RDT) (e.g.
Ariadne: A Hotness-Aware and Size-Adaptive Compressed Swap Technique for Fast Application Relaunch and Reduced CPU Usage on Mobile Devices	Yu Liang, Aofeng Shen, Chun Jason Xue, Riwei Pan, Haiyu Mao, Nika Mansouri Ghiasi, Qingcai Jiang, Rakesh Nadig, Lei Li, Rachata Ausavarungnirun, Mohammad Sadrosadati, Onur Mutlu	2025-02-18	Download	Growing application memory demands and concurrent usage are making mobile device memory scarce. When memory pressure is high, current mobile systems use a RAM-based compressed swap scheme (called ZRAM...
Chronus: Understanding and Securing the Cutting-Edge Industry Solutions to DRAM Read Disturbance	Oğuzhan Canpolat, A. Giray Yağlıkçı, Geraldo F. Oliveira, Ataberk Olgun, Nisa Bostancı, İsmail Emir Yüksel, Haocong Luo, Oğuz Ergin, Onur Mutlu	2025-02-18	Download	We 1) present the first rigorous security, performance, energy, and cost analyses of the state-of-the-art on-DRAM-die read disturbance mitigation method, Per Row Activation Counting (PRAC) and 2) prop...
SparAMX: Accelerating Compressed LLMs Token Generation on AMX-powered CPUs	Ahmed F. AbouElhamayed, Jordan Dotzel, Yash Akhauri, Chi-Chih Chang, Sameh Gobriel, J. Pablo Muñoz, Vui Seng Chua, Nilesh Jain, Mohamed S. Abdelfattah	2025-02-18	Download	Large language models have high compute, latency, and memory requirements. While specialized accelerators such as GPUs and TPUs typically run these workloads, CPUs are more widely available and consum...

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
HARP: A Taxonomy for Heterogeneous and Hierarchical Processors for Mixed-reuse Workloads	Raveesh Garg, Michael Pellauer, Tushar Krishna	2025-02-18	Download	Artificial intelligence (AI) application domains consist of a mix of tensor operations with high and low arithmetic intensities (aka reuse). Hierarchical (i.e.
The Early Days of the Ethereum Blob Fee Market and Lessons Learnt	Lioba Heimbach, Jason Milionis	2025-02-18	Download	Ethereum has adopted a rollup-centric roadmap to scale by making rollups (layer 2 scaling solutions) the primary method for handling transactions.
Performance Trade-offs of High Order Meshless Approximation on Distributed Memory Systems	Jon Vehovar, Miha Rot, Gregor Kosec	2025-02-18	Download	Meshless methods approximate operators in a specific node as a weighted sum of values in its neighbours. Higher order approximations of derivatives provide more accurate solutions with better converge...
Atomic Smart Contract Interoperability with High Efficiency via Cross-Chain Integrated Execution	Chaoyue Yin, Mingzhe Li, Jin Zhang, You Lin, Qingsong Wei, Siow Mong Rick Goh	2025-02-18	Download	With the development of Ethereum, numerous blockchains compatible with Ethereum's execution environment (i.e., Ethereum Virtual Machine, EVM) have emerged.
SparkAttention: High-Performance Multi-Head Attention for Large Models on Volta GPU Architecture	Youxuan Xu, Tong Wu, Shigang Li, Xueying Wang, Jingjing Wang	2025-02-18	Download	Transformers are widely used in various fields such as natural language processing and computer vision. However, the training time for large Transformer models can be challenging due to the Multi-Head ...
FedHC: A Hierarchical Clustered Federated Learning Framework for Satellite Networks	Zhuocheng Liu, Zhishu Shen, Pan Zhou, Qiushi Zheng, Jiong Jin	2025-02-18	Download	With the proliferation of data-driven services, the volume of data that needs to be processed by satellite networks has significantly increased.
Surrogate Modeling for Scalable Evaluation of Distributed Computing Systems for HEP Applications	Larissa Schmid, Maximilian Horzela, Valerii Zhyla, Manuel Giffels, Günter Quast, Anne Koziolek	2025-02-18	Download	The Worldwide LHC Computing Grid (WLCG) provides the robust computing infrastructure essential for the LHC experiments by integrating global computing resources into a cohesive entity.
Minimalist Leader Election Under Weak Communication	Robin Vacus, Isabella Ziccardi	2025-02-18	Download	We propose a protocol to solve Leader Election within weak communication models such as the beeping model or the stone-age model. Unlike most previous work, our algorithm operates on only six states, ...
Distributed On-Device LLM Inference With Over-the-Air Computation	Kai Zhang, Hengtao He, Shenghui Song, Jun Zhang, Khaled B. Letaief	2025-02-18	Download	Large language models (LLMs) have achieved remarkable success across various artificial intelligence tasks. However, their enormous sizes and computational demands pose significant challenges for the ...
KiSS: A Novel Container Size-Aware Memory Management Policy for Serverless in Edge-Cloud Continuum	Sabyasachi Gupta, Paul Gratz, John Lusher	2025-02-18	Download	Serverless computing has revolutionized cloud architectures by enabling developers to deploy event-driven applications via lightweight, self-contained virtualized containers.
Min-Max Correlation Clustering via Neighborhood Similarity	Nairen Cao, Steven Roche, Hsin-Hao Su	2025-02-18	Download	We present an efficient algorithm for the min-max correlation clustering problem. The input is a complete graph where edges are labeled as either positive $(+)$ or negative $(-)$, and the objective is...

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Impact of Cross Technology Interference on Time Synchronization and Join Time in Low-Power Wireless Networks	Zegeye Mekasha Kidane, Waltenegus Dargie	2025-02-18	Download	Low-power and low-cost wireless sensor networks enable scalable and affordable sensing and can be deployed in different environments to monitor various physical parameters.
5G Integrated Communications, Navigation, and Surveillance: A Vision and Future Research Perspectives	Muhammad Asad Ullah, Vadim Kramar, Hamada Alshaer, Charles Cleary, Davi Brilhante, Vasilii Semkin, Ville-Aleksi Kaariaho, Giovanni Geraci	2025-02-18	Download	Communication, Navigation, and Surveillance (CNS) is the backbone of the Air Traffic Management (ATM) and Unmanned Aircraft System (UAS) Traffic Management (UTM) systems, ensuring safe and efficient o...
Universal Embedding Function for Traffic Classification via QUIC Domain Recognition Pretraining: A Transfer Learning Success	Jan Luxemburk, Karel Hynek, Richard Plný, Tomáš Čejka	2025-02-18	Download	Encrypted traffic classification (TC) methods must adapt to new protocols and extensions as well as to advancements in other machine learning fields.
A Survey on DRL based UAV Communications and Networking: DRL Fundamentals, Applications and Implementations	Wei Zhao, Shaoxin Cui, Wen Qiu, Zhiqiang He, Zhi Liu, Xiao Zheng, Bomin Mao, Nei Kato	2025-02-18	Download	Unmanned aerial vehicles (UAVs) are playing an increasingly pivotal role in modern communication networks, offering flexibility and enhanced coverage for a variety of applications.
NTP-INT: Network Traffic Prediction-Driven In-band Network Telemetry for High-load Switches	Penghui Zhang, Hua Zhang, Yuqi Dai, Cheng Zeng, Jingyu Wang, Jianxin Liao	2025-02-18	Download	In-band network telemetry (INT) is essential to network management due to its real-time visibility. However, because of the rapid increase in network devices and services, it has become crucial to hav...
Reinforcement Learning for Dynamic Resource Allocation in Optical Networks: Hype or Hope?	Michael Doherty, Robin Matzner, Rasoul Sadeghi, Polina Bayvel, Alejandra Beghelli	2025-02-18	Download	The application of reinforcement learning (RL) to dynamic resource allocation in optical networks has been the focus of intense research activity in recent years, with almost 100 peer-reviewed papers.
Seamless Graph Task Scheduling over Dynamic Vehicular Clouds: A Hybrid Methodology for Integrating Pilot and Instantaneous Decisions	Bingshuo Guo, Minghui Liwang, Xiaoyu Xia, Li Li, Zhenzhen Jiao, Seyyedali Hosseinalipour, Xianbin Wang	2025-02-18	Download	Vehicular clouds (VCs) play a crucial role in the Internet-of-Vehicles (IoV) ecosystem by securing essential computing resources for a wide range of tasks.
KiSS: A Novel Container Size-Aware Memory Management Policy for Serverless in Edge-Cloud Continuum	Sabyasachi Gupta, Paul Gratz, John Lusher	2025-02-18	Download	Serverless computing has revolutionized cloud architectures by enabling developers to deploy event-driven applications via lightweight, self-contained virtualized containers.

cs.OS - Operating Systems

Title	Authors	Published	PDF	Abstract
Ariadne: A Hotness-Aware and Size-Adaptive Compressed Swap Technique for Fast Application Relaunch and Reduced CPU Usage on Mobile Devices	Yu Liang, Aofeng Shen, Chun Jason Xue, Riwei Pan, Haiyu Mao, Nika Mansouri Ghiasi, Qingcai Jiang, Rakesh Nadig, Lei Li, Rachata Ausavarungnirun, Mohammad Sadrosadati, Onur Mutlu	2025-02-18	Download	Growing application memory demands and concurrent usage are making mobile device memory scarce. When memory pressure is high, current mobile systems use a RAM-based compressed swap scheme (called ZRAM...

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
Surrogate Modeling for Scalable Evaluation of Distributed Computing Systems for HEP Applications	Larissa Schmid, Maximilian Horzela, Valerii Zhyla, Manuel Giffels, Günter Quast, Anne Koziolek	2025-02-18	Download	The Worldwide LHC Computing Grid (WLCG) provides the robust computing infrastructure essential for the LHC experiments by integrating global computing resources into a cohesive entity.
Efficient Hybrid Amplitude-Phase Quantization for Multi-Antenna Relay System	Changdae Kim, Xianglan Jin	2025-02-18	Download	This letter explores relay quantization in multi-antenna quantize-forward (QF) relay systems. Existing methods, such as uniform phase quantization (U-PQ) and uniform amplitude-phase quantization (U-AP...
SparAMX: Accelerating Compressed LLMs Token Generation on AMX-powered CPUs	Ahmed F. AbouElhamayed, Jordan Dotzel, Yash Akhauri, Chi-Chih Chang, Sameh Gobriel, J. Pablo Muñoz, Vui Seng Chua, Nilesh Jain, Mohamed S. Abdelfattah	2025-02-18	Download	Large language models have high compute, latency, and memory requirements. While specialized accelerators such as GPUs and TPUs typically run these workloads, CPUs are more widely available and consum...