2025-12-28

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| TYTAN: Taylor-series based Non-Linear Activation Engine for Deep Learning Accelerators | Soham Pramanik, Vimal William, Arnab Raha, Debayan Das, Amitava Mukherjee, Janet L. Paluh | 2025-12-28 | Download | The rapid advancement in AI architectures and the proliferation of AI-enabled systems have intensified the need for domain-specific architectures that enhance both the acceleration and energy efficien... |
| Enabling Long FFT Convolutions on Memory-Constrained FPGAs via Chunking | Peter Wang, Neelesh Gupta, Viktor Prasanna | 2025-12-28 | Download | The need for long-context reasoning has led to alternative neural network architectures besides Transformers and self-attention, a popular model being Hyena, which employs causal 1D-convolutions imple... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Osmotic Learning: A Self-Supervised Paradigm for Decentralized Contextual Data Representation | Mario Colosi, Reza Farahani, Maria Fazio, Radu Prodan, Massimo Villari | 2025-12-28 | Download | Data within a specific context gains deeper significance beyond its isolated interpretation. In distributed systems, interdependent data sources reveal hidden relationships and latent structures, repr... |
| Federated Learning With L0 Constraint Via Probabilistic Gates For Sparsity | Krishna Harsha Kovelakuntla Huthasana, Alireza Olama, Andreas Lundell | 2025-12-28 | Download | Federated Learning (FL) is a distributed machine learning setting that requires multiple clients to collaborate on training a model while maintaining data privacy. |
| Viability and Performance of a Private LLM Server for SMBs: A Benchmark Analysis of Qwen3-30B on Consumer-Grade Hardware | Alex Khalil, Guillaume Heilles, Maria Parraga, Simon Heilles | 2025-12-28 | Download | The proliferation of Large Language Models (LLMs) has been accompanied by a reliance on cloud-based, proprietary systems, raising significant concerns regarding data privacy, operational sovereignty, ... |
| A Domain Decomposition-based Solver for Acoustic Wave propagation in Two-Dimensional Random Media | Sudhi Sharma Padillath Vasudevan | 2025-12-28 | Download | An acoustic wave propagation problem with a log normal random field approximation for wave speed is solved using a sampling-free intrusive stochastic Galerkin approach. |
| Revisiting finite Abelian hidden subgroup problem and its distributed exact quantum algorithm | Ziyuan Dong, Xiang Fan, Tengxun Zhong, Daowen Qiu | 2025-12-28 | Download | We revisit the finite Abelian hidden subgroup problem (AHSP) from a mathematical perspective and make the following contributions. First, by employing amplitude amplification, we present an exact quan... |
| Argus: Token Aware Distributed LLM Inference Optimization | Panlong Wu, Yifei Zhong, Danyang Chen, Ting Wang, Fangxin Wang | 2025-12-28 | Download | Large Language Models (LLMs) are rapidly being integrated into real-world applications, yet their autoregressive architectures introduce significant inference time variability, especially when deploye... |
| Two-Robot Computational Landscape: A Complete Characterization of Model Power in Minimal Mobile Robot Systems | Naoki Kitamura, Yuichi Sudo, Koichi Wada | 2025-12-28 | Download | The computational power of autonomous mobile robots under the Look-Compute-Move (LCM) model has been widely studied through an extensive hierarchy of robot models defined by the presence of memory, co... |
| OptiNIC: A Resilient and Tail-Optimal RDMA NIC for Distributed ML Workloads | Ertza Warraich, Ali Imran, Annus Zulfiqar, Shay Vargaftik, Sonia Fahmy, Muhammad Shahbaz | 2025-12-28 | Download | As distributed machine learning (ML) workloads scale to thousands of GPUs connected by high-speed interconnects, tail latency in collective communication has become a major bottleneck. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Drift-Based Dataset Stability Benchmark | Dominik Soukup, Richard Plný, Daniel Vašata, Tomáš Čejka | 2025-12-28 | Download | Machine learning (ML) represents an efficient and popular approach for network traffic classification. However, network traffic classification is a challenging domain, and trained models may degrade s... |
| Multiverse: A Simulator for Evaluating Entanglement Routing in Quantum Networks | Amar Abane, Junxiao Shi, Van Sy Mai, Abderrahim Amlou, Abdella Battou | 2025-12-28 | Download | We present MQNS, a discrete-event simulator for rapid evaluation of entanglement routing under dynamic, heterogeneous configurations. MQNS supports runtime-configurable purification, swapping, memory ... |
| OptiNIC: A Resilient and Tail-Optimal RDMA NIC for Distributed ML Workloads | Ertza Warraich, Ali Imran, Annus Zulfiqar, Shay Vargaftik, Sonia Fahmy, Muhammad Shahbaz | 2025-12-28 | Download | As distributed machine learning (ML) workloads scale to thousands of GPUs connected by high-speed interconnects, tail latency in collective communication has become a major bottleneck. |
