Appearance
2026-02-21
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| HillInfer: Efficient Long-Context LLM Inference on the Edge with Hierarchical KV Eviction using SmartSSD | He Sun, Shinan Liu, Li Li, Mingjun Xiao | 2026-02-21 | 下载 | Deploying Large Language Models (LLMs) on memory-constrained AI Personal Computers (AIPCs) enables low-latency, privacy-preserving inference, but long-context generation is fundamentally bottlenecked ... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| WANSpec: Leveraging Global Compute Capacity for LLM Inference | Noah Martin, Fahad Dogar | 2026-02-21 | 下载 | Data centers capable of running large language models (LLMs) are spread across the globe. Some have high end GPUs for running the most advanced models (100B+ parameters), and others are only suitable ... |
| Carbon-aware decentralized dynamic task offloading in MIMO-MEC networks via multi-agent reinforcement learning | Mubshra Zulfiqar, Muhammad Ayzed Mirza, Basit Qureshi | 2026-02-21 | 下载 | Massive internet of things microservices require integrating renewable energy harvesting into mobile edge computing (MEC) for sustainable eScience infrastructures. |
| DualScale: Energy-Efficient Disaggregated LLM Serving via Phase-Aware Placement and DVFS | Omar Basit, Yunzhao Liu, Z. Jonny Kong, Y. Charlie Hu | 2026-02-21 | 下载 | Prefill/decode disaggregation is increasingly adopted in LLM serving to improve the latency-throughput tradeoff and meet strict TTFT and TPOT SLOs. |
| What Distributed Computing Got Wrong: The Category Mistake That Turned Design Choices into Laws of Nature | Paul Borrill | 2026-02-21 | 下载 | The foundational impossibility results of distributed computing -- the Fischer-Lynch-Paterson theorem, the Two Generals Problem, the CAP theorem -- are widely understood as discoveries about the physi... |
| When Coordination Is Avoidable: A Monotonicity Analysis of Organizational Tasks | Harang Ju | 2026-02-21 | 下载 | Organizations devote substantial resources to coordination, yet which tasks actually require it for correctness remains unclear. The problem is acute in multi-agent AI systems, where coordination over... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| EdgeSketch: Efficient Analysis of Massive Graph Streams | Jakub Lemiesz, Dingqi Yang, Philippe Cudré-Mauroux | 2026-02-21 | 下载 | We introduce EdgeSketch, a compact graph representation for efficient analysis of massive graph streams. EdgeSketch provides unbiased estimators for key graph properties with controllable variance and... |
| Towards Green Connectivity: An AI-Driven Mesh Architecture for Sustainable and Scalable Wireless Networks | Muhammad Ahmed Mohsin, Muhammad Jazib, Muhammad Saad, Ayesha Mohsin | 2026-02-21 | 下载 | Traditional macro-cell and micro-cell infrastructures suffer from severe inefficiencies, with current macro-cell networks operating at less than 5 percent energy efficiency, leading to nearly 95 perce... |