2026-01-25

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Memory-Efficient FPGA Implementation of Stochastic Simulated Annealing	Duckgyu Shin, Naoya Onizawa, Warren J. Gross, Takahiro Hanyu	2026-01-25	下载	Simulated annealing (SA) is a well-known algorithm for solving combinatorial optimization problems. However, the computation time of SA increases rapidly, as the size of the problem grows.
Late Breaking Results: Boosting Efficient Dual-Issue Execution on Lightweight RISC-V Cores	Luca Colagrande, Luca Benini	2026-01-25	下载	Large-scale ML accelerators rely on large numbers of PEs, imposing strict bounds on the area and energy budget of each PE. Prior work demonstrates that limited dual-issue capabilities can be efficient...
@NTT: Algorithm-Targeted NTT hardware acceleration via Design-Time Constant Optimization	Mohammed Nabeel, Mahmoud Hafez, Michail Maniatakos	2026-01-25	下载	The Number Theoretic Transform (NTT) is a critical computational bottleneck in many lattice-based postquantum cryptographic (PQC) algorithms. By leveraging the Fast Fourier Transform (FFT) algorithm, ...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
MultiChain Blockchain Data Provenance for Deterministic Stream Processing with Kafka Streams: A Weather Data Case Study	Niaz Mohammad Ramaki, Florian Schintke	2026-01-25	下载	Auditability and reproducibility still are critical challenges for real-time data streams pipelines. Streaming engines are highly dependent on runtime scheduling, window triggers, arrival orders, and ...
Types for Grassroots Logic Programs	Ehud Shapiro	2026-01-25	下载	Grassroots Logic Programs (GLP) is a concurrent logic programming language in which logic variables are partitioned into paired readers and writers.
A Universal Load Balancing Principle and Its Application to Large Language Model Serving	Zixi Chen, Tianci Bu, Chendong Song, Xin Lu, Yinyu Ye, Zijie Zhou	2026-01-25	下载	Over 40% of computational power in Large Language Model (LLM) serving systems can be systematically wasted - not from hardware limits, but from load imbalance in barrier-synchronized parallel processi...
On the Extension of Private Distributed Matrix Multiplication Schemes to the Grid Partition	Christoph Hofmeister, Razane Tajeddine, Antonia Wachter-Zeh, Rawad Bitar	2026-01-25	下载	We consider polynomial codes for private distributed matrix multiplication (PDMM/SDMM). Existing codes for PDMM are either specialized for the outer product partitioning (OPP), or inner product partit...
CondenseGraph: Communication-Efficient Distributed GNN Training via On-the-Fly Graph Condensation	Zizhao Zhang, Yihan Xue, Haotian Zhu, Sijia Li, Zhijun Wang, Yujie Xiao	2026-01-25	下载	Distributed Graph Neural Network (GNN) training suffers from substantial communication overhead due to the inherent neighborhood dependency in graph-structured data.
LLM-42: Enabling Determinism in LLM Inference with Verified Speculation	Raja Gond, Aditya K Kamath, Ramachandran Ramjee, Ashish Panwar	2026-01-25	下载	In LLM inference, the same prompt may yield different outputs across different runs. At the system level, this non-determinism arises from floating-point non-associativity combined with dynamic batchi...
An MLIR Lowering Pipeline for Stencils at Wafer-Scale	Nicolai Stawinoga, David Katz, Anton Lydike, Justs Zarins, Nick Brown, George Bisbas, Tobias Grosser	2026-01-25	下载	The Cerebras Wafer-Scale Engine (WSE) delivers performance at an unprecedented scale of over 900,000 compute units, all connected via a single-wafer on-chip interconnect.
Faramesh: A Protocol-Agnostic Execution Control Plane for Autonomous Agent Systems	Amjad Fatmi	2026-01-25	下载	Autonomous agent systems increasingly trigger real-world side effects: deploying infrastructure, modifying databases, moving money, and executing workflows.
Multi-core & GPU-based Balanced Butterfly Counting in Signed Bipartite Graphs	Mekala Kiran, Apurba Das, Suman Banerjee, Tathagata Ray	2026-01-25	下载	Balanced butterfly counting, corresponding to counting balanced (2, 2)-bicliques, is a fundamental primitive in the analysis of signed bipartite graphs and provides a basis for studying higher-order s...
Kareus: Joint Reduction of Dynamic and Static Energy in Large Model Training	Ruofan Wu, Jae-Won Chung, Mosharaf Chowdhury	2026-01-25	下载	The computing demand of AI is growing at an unprecedented rate, but energy supply is not keeping pace. As a result, energy has become an expensive, contended resource that requires explicit management...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Multi-Agent Collaborative Intrusion Detection for Low-Altitude Economy IoT: An LLM-Enhanced Agentic AI Framework	Hongjuan Li, Hui Kang, Jiahui Li, Geng Sun, Ruichen Zhang, Jiacheng Wang, Dusit Niyato, Wei Ni, Abbas Jamalipour	2026-01-25	下载	The rapid expansion of low-altitude economy Internet of Things (LAE-IoT) networks has created unprecedented security challenges due to dynamic three-dimensional mobility patterns, distributed autonomo...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Credit Fairness: Online Fairness In Shared Resource Pools	Seyed Majid Zahedi, Rupert Freeman	2026-01-25	下载	We consider a setting in which a group of agents share resources that must be allocated among them in each discrete time period. Agents have time-varying demands and derive constant marginal utility f...