Appearance
2025-12-28
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| TYTAN: Taylor-series based Non-Linear Activation Engine for Deep Learning Accelerators | Soham Pramanik, Vimal William, Arnab Raha, Debayan Das, Amitava Mukherjee, Janet L. Paluh | 2025-12-28 | 下载 | The rapid advancement in AI architectures and the proliferation of AI-enabled systems have intensified the need for domain-specific architectures that enhance both the acceleration and energy efficien... |
| Enabling Long FFT Convolutions on Memory-Constrained FPGAs via Chunking | Peter Wang, Neelesh Gupta, Viktor Prasanna | 2025-12-28 | 下载 | The need for long-context reasoning has led to alternative neural network architectures besides Transformers and self-attention, a popular model being Hyena, which employs causal 1D-convolutions imple... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Osmotic Learning: A Self-Supervised Paradigm for Decentralized Contextual Data Representation | Mario Colosi, Reza Farahani, Maria Fazio, Radu Prodan, Massimo Villari | 2025-12-28 | 下载 | Data within a specific context gains deeper significance beyond its isolated interpretation. In distributed systems, interdependent data sources reveal hidden relationships and latent structures, repr... |
| Federated Learning With L0 Constraint Via Probabilistic Gates For Sparsity | Krishna Harsha Kovelakuntla Huthasana, Alireza Olama, Andreas Lundell | 2025-12-28 | 下载 | Federated Learning (FL) is a distributed machine learning setting that requires multiple clients to collaborate on training a model while maintaining data privacy. |
| Viability and Performance of a Private LLM Server for SMBs: A Benchmark Analysis of Qwen3-30B on Consumer-Grade Hardware | Alex Khalil, Guillaume Heilles, Maria Parraga, Simon Heilles | 2025-12-28 | 下载 | The proliferation of Large Language Models (LLMs) has been accompanied by a reliance on cloud-based, proprietary systems, raising significant concerns regarding data privacy, operational sovereignty, ... |
| A Domain Decomposition-based Solver for Acoustic Wave propagation in Two-Dimensional Random Media | Sudhi Sharma Padillath Vasudevan | 2025-12-28 | 下载 | An acoustic wave propagation problem with a log normal random field approximation for wave speed is solved using a sampling-free intrusive stochastic Galerkin approach. |
| Revisiting finite Abelian hidden subgroup problem and its distributed exact quantum algorithm | Ziyuan Dong, Xiang Fan, Tengxun Zhong, Daowen Qiu | 2025-12-28 | 下载 | We revisit the finite Abelian hidden subgroup problem (AHSP) from a mathematical perspective and make the following contributions. First, by employing amplitude amplification, we present an exact quan... |
| Argus: Token Aware Distributed LLM Inference Optimization | Panlong Wu, Yifei Zhong, Danyang Chen, Ting Wang, Fangxin Wang | 2025-12-28 | 下载 | Large Language Models (LLMs) are rapidly being integrated into real-world applications, yet their autoregressive architectures introduce significant inference time variability, especially when deploye... |
| Two-Robot Computational Landscape: A Complete Characterization of Model Power in Minimal Mobile Robot Systems | Naoki Kitamura, Yuichi Sudo, Koichi Wada | 2025-12-28 | 下载 | The computational power of autonomous mobile robots under the Look-Compute-Move (LCM) model has been widely studied through an extensive hierarchy of robot models defined by the presence of memory, co... |
| OptiNIC: A Resilient and Tail-Optimal RDMA NIC for Distributed ML Workloads | Ertza Warraich, Ali Imran, Annus Zulfiqar, Shay Vargaftik, Sonia Fahmy, Muhammad Shahbaz | 2025-12-28 | 下载 | As distributed machine learning (ML) workloads scale to thousands of GPUs connected by high-speed interconnects, tail latency in collective communication has become a major bottleneck. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Drift-Based Dataset Stability Benchmark | Dominik Soukup, Richard Plný, Daniel Vašata, Tomáš Čejka | 2025-12-28 | 下载 | Machine learning (ML) represents an efficient and popular approach for network traffic classification. However, network traffic classification is a challenging domain, and trained models may degrade s... |
| Multiverse: A Simulator for Evaluating Entanglement Routing in Quantum Networks | Amar Abane, Junxiao Shi, Van Sy Mai, Abderrahim Amlou, Abdella Battou | 2025-12-28 | 下载 | We present MQNS, a discrete-event simulator for rapid evaluation of entanglement routing under dynamic, heterogeneous configurations. MQNS supports runtime-configurable purification, swapping, memory ... |
| OptiNIC: A Resilient and Tail-Optimal RDMA NIC for Distributed ML Workloads | Ertza Warraich, Ali Imran, Annus Zulfiqar, Shay Vargaftik, Sonia Fahmy, Muhammad Shahbaz | 2025-12-28 | 下载 | As distributed machine learning (ML) workloads scale to thousands of GPUs connected by high-speed interconnects, tail latency in collective communication has become a major bottleneck. |