2025-04-05
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Multi-Phase Coupled CMOS Ring Oscillator based Potts Machine | Yilmaz Ege Gonul, Baris Taskin | 2025-04-05 | Download | This paper presents a coupled ring oscillator based Potts machine to solve NP-hard combinatorial optimization problems (COPs). The Potts model is a generalization of the Ising model, capturing multiva... |
| Learning Cache Coherence Traffic for NoC Routing Design | Guochu Xiong, Xiangzhong Luo, Weichen Liu | 2025-04-05 | Download | The rapid growth of multi-core systems highlights the need for efficient Network-on-Chip (NoC) design to ensure seamless communication. Cache coherence, essential for data consistency, substantially r... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| oneDAL Optimization for ARM Scalable Vector Extension: Maximizing Efficiency for High-Performance Data Science | Chandan Sharma, Rakshith GB, Ajay Kumar Patel, Dhanus M Lal, Darshan Patel, Ragesh Hajela, Masahiro Doteguchi, Priyanka Sharma | 2025-04-05 | Download | The evolution of ARM-based architectures, particularly those incorporating Scalable Vector Extension (SVE), has introduced transformative opportunities for high-performance computing (HPC) and machine... |
| SLOs-Serve: Optimized Serving of Multi-SLO LLMs | Siyuan Chen, Zhipeng Jia, Samira Khan, Arvind Krishnamurthy, Phillip B. Gibbons | 2025-04-05 | Download | This paper introduces SLOs-Serve, a system designed for serving multi-stage large language model (LLM) requests with application- and stage-specific service level objectives (SLOs). |
| Corrected with the Latest Version: Make Robust Asynchronous Federated Learning Possible | Chaoyi Lu, Yiding Sun, Pengbo Li, Zhichuan Yang | 2025-04-05 | Download | As an emerging paradigm of federated learning, asynchronous federated learning offers significant speed advantages over traditional synchronous federated learning. |
| Obfuscated Consensus | James Aspnes, Shlomi Dolev, Amit Hendin | 2025-04-05 | Download | The classic Fischer, Lynch, and Paterson impossibility proof demonstrates that any deterministic protocol for consensus in either a message-passing or shared-memory system must violate at least one of... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| A Fast Solver-Free Algorithm for Traffic Engineering in Large-Scale Data Center Network | Yingming Mao, Qiaozhu Zhai, Ximeng Liu, Zhen Yao, Xia Zhu, Yuzhou Zhou | 2025-04-05 | Download | Rapid growth of data center networks (DCNs) poses significant challenges for large-scale traffic engineering (TE). Existing acceleration strategies, which rely on commercial solvers or deep learning, ... |
| Tiny Neural Networks for Session-Level Traffic Classification | Adel Chehade, Edoardo Ragusa, Paolo Gastaldo, Rodolfo Zunino | 2025-04-05 | Download | This paper presents a system for session-level traffic classification on endpoint devices, developed using a Hardware-aware Neural Architecture Search (HW-NAS) framework. |
| Learning Cache Coherence Traffic for NoC Routing Design | Guochu Xiong, Xiangzhong Luo, Weichen Liu | 2025-04-05 | Download | The rapid growth of multi-core systems highlights the need for efficient Network-on-Chip (NoC) design to ensure seamless communication. Cache coherence, essential for data consistency, substantially r... |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| oneDAL Optimization for ARM Scalable Vector Extension: Maximizing Efficiency for High-Performance Data Science | Chandan Sharma, Rakshith GB, Ajay Kumar Patel, Dhanus M Lal, Darshan Patel, Ragesh Hajela, Masahiro Doteguchi, Priyanka Sharma | 2025-04-05 | Download | The evolution of ARM-based architectures, particularly those incorporating Scalable Vector Extension (SVE), has introduced transformative opportunities for high-performance computing (HPC) and machine... |