Appearance
2026-01-25
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Memory-Efficient FPGA Implementation of Stochastic Simulated Annealing | Duckgyu Shin, Naoya Onizawa, Warren J. Gross, Takahiro Hanyu | 2026-01-25 | 下载 | Simulated annealing (SA) is a well-known algorithm for solving combinatorial optimization problems. However, the computation time of SA increases rapidly, as the size of the problem grows. |
| Late Breaking Results: Boosting Efficient Dual-Issue Execution on Lightweight RISC-V Cores | Luca Colagrande, Luca Benini | 2026-01-25 | 下载 | Large-scale ML accelerators rely on large numbers of PEs, imposing strict bounds on the area and energy budget of each PE. Prior work demonstrates that limited dual-issue capabilities can be efficient... |
| @NTT: Algorithm-Targeted NTT hardware acceleration via Design-Time Constant Optimization | Mohammed Nabeel, Mahmoud Hafez, Michail Maniatakos | 2026-01-25 | 下载 | The Number Theoretic Transform (NTT) is a critical computational bottleneck in many lattice-based postquantum cryptographic (PQC) algorithms. By leveraging the Fast Fourier Transform (FFT) algorithm, ... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| MultiChain Blockchain Data Provenance for Deterministic Stream Processing with Kafka Streams: A Weather Data Case Study | Niaz Mohammad Ramaki, Florian Schintke | 2026-01-25 | 下载 | Auditability and reproducibility still are critical challenges for real-time data streams pipelines. Streaming engines are highly dependent on runtime scheduling, window triggers, arrival orders, and ... |
| Types for Grassroots Logic Programs | Ehud Shapiro | 2026-01-25 | 下载 | Grassroots Logic Programs (GLP) is a concurrent logic programming language in which logic variables are partitioned into paired readers and writers. |
| A Universal Load Balancing Principle and Its Application to Large Language Model Serving | Zixi Chen, Tianci Bu, Chendong Song, Xin Lu, Yinyu Ye, Zijie Zhou | 2026-01-25 | 下载 | Over 40% of computational power in Large Language Model (LLM) serving systems can be systematically wasted - not from hardware limits, but from load imbalance in barrier-synchronized parallel processi... |
| On the Extension of Private Distributed Matrix Multiplication Schemes to the Grid Partition | Christoph Hofmeister, Razane Tajeddine, Antonia Wachter-Zeh, Rawad Bitar | 2026-01-25 | 下载 | We consider polynomial codes for private distributed matrix multiplication (PDMM/SDMM). Existing codes for PDMM are either specialized for the outer product partitioning (OPP), or inner product partit... |
| CondenseGraph: Communication-Efficient Distributed GNN Training via On-the-Fly Graph Condensation | Zizhao Zhang, Yihan Xue, Haotian Zhu, Sijia Li, Zhijun Wang, Yujie Xiao | 2026-01-25 | 下载 | Distributed Graph Neural Network (GNN) training suffers from substantial communication overhead due to the inherent neighborhood dependency in graph-structured data. |
| LLM-42: Enabling Determinism in LLM Inference with Verified Speculation | Raja Gond, Aditya K Kamath, Ramachandran Ramjee, Ashish Panwar | 2026-01-25 | 下载 | In LLM inference, the same prompt may yield different outputs across different runs. At the system level, this non-determinism arises from floating-point non-associativity combined with dynamic batchi... |
| An MLIR Lowering Pipeline for Stencils at Wafer-Scale | Nicolai Stawinoga, David Katz, Anton Lydike, Justs Zarins, Nick Brown, George Bisbas, Tobias Grosser | 2026-01-25 | 下载 | The Cerebras Wafer-Scale Engine (WSE) delivers performance at an unprecedented scale of over 900,000 compute units, all connected via a single-wafer on-chip interconnect. |
| Faramesh: A Protocol-Agnostic Execution Control Plane for Autonomous Agent Systems | Amjad Fatmi | 2026-01-25 | 下载 | Autonomous agent systems increasingly trigger real-world side effects: deploying infrastructure, modifying databases, moving money, and executing workflows. |
| Multi-core & GPU-based Balanced Butterfly Counting in Signed Bipartite Graphs | Mekala Kiran, Apurba Das, Suman Banerjee, Tathagata Ray | 2026-01-25 | 下载 | Balanced butterfly counting, corresponding to counting balanced (2, 2)-bicliques, is a fundamental primitive in the analysis of signed bipartite graphs and provides a basis for studying higher-order s... |
| Kareus: Joint Reduction of Dynamic and Static Energy in Large Model Training | Ruofan Wu, Jae-Won Chung, Mosharaf Chowdhury | 2026-01-25 | 下载 | The computing demand of AI is growing at an unprecedented rate, but energy supply is not keeping pace. As a result, energy has become an expensive, contended resource that requires explicit management... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Multi-Agent Collaborative Intrusion Detection for Low-Altitude Economy IoT: An LLM-Enhanced Agentic AI Framework | Hongjuan Li, Hui Kang, Jiahui Li, Geng Sun, Ruichen Zhang, Jiacheng Wang, Dusit Niyato, Wei Ni, Abbas Jamalipour | 2026-01-25 | 下载 | The rapid expansion of low-altitude economy Internet of Things (LAE-IoT) networks has created unprecedented security challenges due to dynamic three-dimensional mobility patterns, distributed autonomo... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Credit Fairness: Online Fairness In Shared Resource Pools | Seyed Majid Zahedi, Rupert Freeman | 2026-01-25 | 下载 | We consider a setting in which a group of agents share resources that must be allocated among them in each discrete time period. Agents have time-varying demands and derive constant marginal utility f... |