Skip to content

2025-02-18

cs.AR - Architecture

标题作者发布日期PDF摘要
HARP: A Taxonomy for Heterogeneous and Hierarchical Processors for Mixed-reuse WorkloadsRaveesh Garg, Michael Pellauer, Tushar Krishna2025-02-18下载Artificial intelligence (AI) application domains consist of a mix of tensor operations with high and low arithmetic intensities (aka reuse). Hierarchical (i.e.
Variable Read Disturbance: An Experimental Analysis of Temporal Variation in DRAM Read DisturbanceAtaberk Olgun, F. Nisa Bostanci, Ismail Emir Yuksel, Oguzhan Canpolat, Haocong Luo, Geraldo F. Oliveira, A. Giray Yaglikci, Minesh Patel, Onur Mutlu2025-02-18下载Modern DRAM chips are subject to read disturbance errors. State-of-the-art read disturbance mitigations rely on accurate and exhaustive characterization of the read disturbance threshold (RDT) (e.g.
Ariadne: A Hotness-Aware and Size-Adaptive Compressed Swap Technique for Fast Application Relaunch and Reduced CPU Usage on Mobile DevicesYu Liang, Aofeng Shen, Chun Jason Xue, Riwei Pan, Haiyu Mao, Nika Mansouri Ghiasi, Qingcai Jiang, Rakesh Nadig, Lei Li, Rachata Ausavarungnirun, Mohammad Sadrosadati, Onur Mutlu2025-02-18下载Growing application memory demands and concurrent usage are making mobile device memory scarce. When memory pressure is high, current mobile systems use a RAM-based compressed swap scheme (called ZRAM...
Chronus: Understanding and Securing the Cutting-Edge Industry Solutions to DRAM Read DisturbanceOğuzhan Canpolat, A. Giray Yağlıkçı, Geraldo F. Oliveira, Ataberk Olgun, Nisa Bostancı, İsmail Emir Yüksel, Haocong Luo, Oğuz Ergin, Onur Mutlu2025-02-18下载We 1) present the first rigorous security, performance, energy, and cost analyses of the state-of-the-art on-DRAM-die read disturbance mitigation method, Per Row Activation Counting (PRAC) and 2) prop...
SparAMX: Accelerating Compressed LLMs Token Generation on AMX-powered CPUsAhmed F. AbouElhamayed, Jordan Dotzel, Yash Akhauri, Chi-Chih Chang, Sameh Gobriel, J. Pablo Muñoz, Vui Seng Chua, Nilesh Jain, Mohamed S. Abdelfattah2025-02-18下载Large language models have high compute, latency, and memory requirements. While specialized accelerators such as GPUs and TPUs typically run these workloads, CPUs are more widely available and consum...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
HARP: A Taxonomy for Heterogeneous and Hierarchical Processors for Mixed-reuse WorkloadsRaveesh Garg, Michael Pellauer, Tushar Krishna2025-02-18下载Artificial intelligence (AI) application domains consist of a mix of tensor operations with high and low arithmetic intensities (aka reuse). Hierarchical (i.e.
The Early Days of the Ethereum Blob Fee Market and Lessons LearntLioba Heimbach, Jason Milionis2025-02-18下载Ethereum has adopted a rollup-centric roadmap to scale by making rollups (layer 2 scaling solutions) the primary method for handling transactions.
Performance Trade-offs of High Order Meshless Approximation on Distributed Memory SystemsJon Vehovar, Miha Rot, Gregor Kosec2025-02-18下载Meshless methods approximate operators in a specific node as a weighted sum of values in its neighbours. Higher order approximations of derivatives provide more accurate solutions with better converge...
Atomic Smart Contract Interoperability with High Efficiency via Cross-Chain Integrated ExecutionChaoyue Yin, Mingzhe Li, Jin Zhang, You Lin, Qingsong Wei, Siow Mong Rick Goh2025-02-18下载With the development of Ethereum, numerous blockchains compatible with Ethereum's execution environment (i.e., Ethereum Virtual Machine, EVM) have emerged.
SparkAttention: High-Performance Multi-Head Attention for Large Models on Volta GPU ArchitectureYouxuan Xu, Tong Wu, Shigang Li, Xueying Wang, Jingjing Wang2025-02-18下载Transformer are widely used in various fields such as natural language processing and computer vision. However, the training time for large Transformer models can be challenging due to the Multi-Head ...
FedHC: A Hierarchical Clustered Federated Learning Framework for Satellite NetworksZhuocheng Liu, Zhishu Shen, Pan Zhou, Qiushi Zheng, Jiong Jin2025-02-18下载With the proliferation of data-driven services, the volume of data that needs to be processed by satellite networks has significantly increased.
Surrogate Modeling for Scalable Evaluation of Distributed Computing Systems for HEP ApplicationsLarissa Schmid, Maximilian Horzela, Valerii Zhyla, Manuel Giffels, Günter Quast, Anne Koziolek2025-02-18下载The Worldwide LHC Computing Grid (WLCG) provides the robust computing infrastructure essential for the LHC experiments by integrating global computing resources into a cohesive entity.
Minimalist Leader Election Under Weak CommunicationRobin Vacus, Isabella Ziccardi2025-02-18下载We propose a protocol to solve Leader Election within weak communication models such as the beeping model or the stone-age model. Unlike most previous work, our algorithm operates on only six states, ...
Distributed On-Device LLM Inference With Over-the-Air ComputationKai Zhang, Hengtao He, Shenghui Song, Jun Zhang, Khaled B. Letaief2025-02-18下载Large language models (LLMs) have achieved remarkable success across various artificial intelligence tasks. However, their enormous sizes and computational demands pose significant challenges for the ...
KiSS: A Novel Container Size-Aware Memory Management Policy for Serverless in Edge-Cloud ContinuumSabyasachi Gupta, Paul Gratz, John Lusher2025-02-18下载Serverless computing has revolutionized cloud architectures by enabling developers to deploy event-driven applications via lightweight, self-contained virtualized containers.
Min-Max Correlation Clustering via Neighborhood SimilarityNairen Cao, Steven Roche, Hsin-Hao Su2025-02-18下载We present an efficient algorithm for the min-max correlation clustering problem. The input is a complete graph where edges are labeled as either positive (+)(+) or negative ()(-), and the objective is...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Impact of Cross Technology Interference on Time Synchronization and Join Time in Low-Power Wireless NetworksZegeye Mekasha Kidane, Waltenegus Dargie2025-02-18下载Low-power and low-cost wireless sensor networks enable scalable and affordable sensing and can be deployed in different environments to monitor various physical parameters.
5G Integrated Communications, Navigation, and Surveillance: A Vision and Future Research PerspectivesMuhammad Asad Ullah, Vadim Kramar, Hamada Alshaer, Charles Cleary, Davi Brilhante, Vasilii Semkin, Ville-Aleksi Kaariaho, Giovanni Geraci2025-02-18下载Communication, Navigation, and Surveillance (CNS) is the backbone of the Air Traffic Management (ATM) and Unmanned Aircraft System (UAS) Traffic Management (UTM) systems, ensuring safe and efficient o...
Universal Embedding Function for Traffic Classification via QUIC Domain Recognition Pretraining: A Transfer Learning SuccessJan Luxemburk, Karel Hynek, Richard Plný, Tomáš Čejka2025-02-18下载Encrypted traffic classification (TC) methods must adapt to new protocols and extensions as well as to advancements in other machine learning fields.
A Survey on DRL based UAV Communications and Networking: DRL Fundamentals, Applications and ImplementationsWei Zhao, Shaoxin Cui, Wen Qiu, Zhiqiang He, Zhi Liu, Xiao Zheng, Bomin Mao, Nei Kato2025-02-18下载Unmanned aerial vehicles (UAVs) are playing an increasingly pivotal role in modern communication networks,offering flexibility and enhanced coverage for a variety of applica-tions.
NTP-INT: Network Traffic Prediction-Driven In-band Network Telemetry for High-load SwitchesPenghui Zhang, Hua Zhang, Yuqi Dai, Cheng Zeng, Jingyu Wang, Jianxin Liao2025-02-18下载In-band network telemetry (INT) is essential to network management due to its real-time visibility. However, because of the rapid increase in network devices and services, it has become crucial to hav...
Reinforcement Learning for Dynamic Resource Allocation in Optical Networks: Hype or Hope?Michael Doherty, Robin Matzner, Rasoul Sadeghi, Polina Bayvel, Alejandra Beghelli2025-02-18下载The application of reinforcement learning (RL) to dynamic resource allocation in optical networks has been the focus of intense research activity in recent years, with almost 100 peer-reviewed papers.
Seamless Graph Task Scheduling over Dynamic Vehicular Clouds: A Hybrid Methodology for Integrating Pilot and Instantaneous DecisionsBingshuo Guo, Minghui Liwang, Xiaoyu Xia, Li Li, Zhenzhen Jiao, Seyyedali Hosseinalipour, Xianbin Wang2025-02-18下载Vehicular clouds (VCs) play a crucial role in the Internet-of-Vehicles (IoV) ecosystem by securing essential computing resources for a wide range of tasks.
KiSS: A Novel Container Size-Aware Memory Management Policy for Serverless in Edge-Cloud ContinuumSabyasachi Gupta, Paul Gratz, John Lusher2025-02-18下载Serverless computing has revolutionized cloud architectures by enabling developers to deploy event-driven applications via lightweight, self-contained virtualized containers.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Ariadne: A Hotness-Aware and Size-Adaptive Compressed Swap Technique for Fast Application Relaunch and Reduced CPU Usage on Mobile DevicesYu Liang, Aofeng Shen, Chun Jason Xue, Riwei Pan, Haiyu Mao, Nika Mansouri Ghiasi, Qingcai Jiang, Rakesh Nadig, Lei Li, Rachata Ausavarungnirun, Mohammad Sadrosadati, Onur Mutlu2025-02-18下载Growing application memory demands and concurrent usage are making mobile device memory scarce. When memory pressure is high, current mobile systems use a RAM-based compressed swap scheme (called ZRAM...

cs.PF - Performance

标题作者发布日期PDF摘要
Surrogate Modeling for Scalable Evaluation of Distributed Computing Systems for HEP ApplicationsLarissa Schmid, Maximilian Horzela, Valerii Zhyla, Manuel Giffels, Günter Quast, Anne Koziolek2025-02-18下载The Worldwide LHC Computing Grid (WLCG) provides the robust computing infrastructure essential for the LHC experiments by integrating global computing resources into a cohesive entity.
Efficient Hybrid Amplitude-Phase Quantization for Multi-Antenna Relay SystemChangdae Kim, Xianglan Jin2025-02-18下载This letter explores relay quantization in multi-antenna quantize-forward (QF) relay systems. Existing methods, such as uniform phase quantization (U-PQ) and uniform amplitude-phase quantization (U-AP...
SparAMX: Accelerating Compressed LLMs Token Generation on AMX-powered CPUsAhmed F. AbouElhamayed, Jordan Dotzel, Yash Akhauri, Chi-Chih Chang, Sameh Gobriel, J. Pablo Muñoz, Vui Seng Chua, Nilesh Jain, Mohamed S. Abdelfattah2025-02-18下载Large language models have high compute, latency, and memory requirements. While specialized accelerators such as GPUs and TPUs typically run these workloads, CPUs are more widely available and consum...

基于 VitePress 构建