2025-04-05

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Multi-Phase Coupled CMOS Ring Oscillator based Potts Machine | Yilmaz Ege Gonul, Baris Taskin | 2025-04-05 | Download | This paper presents a coupled ring oscillator based Potts machine to solve NP-hard combinatorial optimization problems (COPs). Potts model is a generalization of the Ising model, capturing multiva... |
| Learning Cache Coherence Traffic for NoC Routing Design | Guochu Xiong, Xiangzhong Luo, Weichen Liu | 2025-04-05 | Download | The rapid growth of multi-core systems highlights the need for efficient Network-on-Chip (NoC) design to ensure seamless communication. Cache coherence, essential for data consistency, substantially r... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| oneDAL Optimization for ARM Scalable Vector Extension: Maximizing Efficiency for High-Performance Data Science | Chandan Sharma, Rakshith GB, Ajay Kumar Patel, Dhanus M Lal, Darshan Patel, Ragesh Hajela, Masahiro Doteguchi, Priyanka Sharma | 2025-04-05 | Download | The evolution of ARM-based architectures, particularly those incorporating Scalable Vector Extension (SVE), has introduced transformative opportunities for high-performance computing (HPC) and machine... |
| SLOs-Serve: Optimized Serving of Multi-SLO LLMs | Siyuan Chen, Zhipeng Jia, Samira Khan, Arvind Krishnamurthy, Phillip B. Gibbons | 2025-04-05 | Download | This paper introduces SLOs-Serve, a system designed for serving multi-stage large language model (LLM) requests with application- and stage-specific service level objectives (SLOs). |
| Corrected with the Latest Version: Make Robust Asynchronous Federated Learning Possible | Chaoyi Lu, Yiding Sun, Pengbo Li, Zhichuan Yang | 2025-04-05 | Download | As an emerging paradigm of federated learning, asynchronous federated learning offers significant speed advantages over traditional synchronous federated learning. |
| Obfuscated Consensus | James Aspnes, Shlomi Dolev, Amit Hendin | 2025-04-05 | Download | The classic Fischer, Lynch, and Paterson impossibility proof demonstrates that any deterministic protocol for consensus in either a message-passing or shared-memory system must violate at least one of... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A Fast Solver-Free Algorithm for Traffic Engineering in Large-Scale Data Center Network | Yingming Mao, Qiaozhu Zhai, Ximeng Liu, Zhen Yao, Xia Zhu, Yuzhou Zhou | 2025-04-05 | Download | Rapid growth of data center networks (DCNs) poses significant challenges for large-scale traffic engineering (TE). Existing acceleration strategies, which rely on commercial solvers or deep learning, ... |
| Tiny Neural Networks for Session-Level Traffic Classification | Adel Chehade, Edoardo Ragusa, Paolo Gastaldo, Rodolfo Zunino | 2025-04-05 | Download | This paper presents a system for session-level traffic classification on endpoint devices, developed using a Hardware-aware Neural Architecture Search (HW-NAS) framework. |
| Learning Cache Coherence Traffic for NoC Routing Design | Guochu Xiong, Xiangzhong Luo, Weichen Liu | 2025-04-05 | Download | The rapid growth of multi-core systems highlights the need for efficient Network-on-Chip (NoC) design to ensure seamless communication. Cache coherence, essential for data consistency, substantially r... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| oneDAL Optimization for ARM Scalable Vector Extension: Maximizing Efficiency for High-Performance Data Science | Chandan Sharma, Rakshith GB, Ajay Kumar Patel, Dhanus M Lal, Darshan Patel, Ragesh Hajela, Masahiro Doteguchi, Priyanka Sharma | 2025-04-05 | Download | The evolution of ARM-based architectures, particularly those incorporating Scalable Vector Extension (SVE), has introduced transformative opportunities for high-performance computing (HPC) and machine... |
