Skip to content

2026-02-21

cs.AR - Architecture

标题作者发布日期PDF摘要
HillInfer: Efficient Long-Context LLM Inference on the Edge with Hierarchical KV Eviction using SmartSSDHe Sun, Shinan Liu, Li Li, Mingjun Xiao2026-02-21下载Deploying Large Language Models (LLMs) on memory-constrained AI Personal Computers (AIPCs) enables low-latency, privacy-preserving inference, but long-context generation is fundamentally bottlenecked ...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
WANSpec: Leveraging Global Compute Capacity for LLM InferenceNoah Martin, Fahad Dogar2026-02-21下载Data centers capable of running large language models (LLMs) are spread across the globe. Some have high end GPUs for running the most advanced models (100B+ parameters), and others are only suitable ...
Carbon-aware decentralized dynamic task offloading in MIMO-MEC networks via multi-agent reinforcement learningMubshra Zulfiqar, Muhammad Ayzed Mirza, Basit Qureshi2026-02-21下载Massive internet of things microservices require integrating renewable energy harvesting into mobile edge computing (MEC) for sustainable eScience infrastructures.
DualScale: Energy-Efficient Disaggregated LLM Serving via Phase-Aware Placement and DVFSOmar Basit, Yunzhao Liu, Z. Jonny Kong, Y. Charlie Hu2026-02-21下载Prefill/decode disaggregation is increasingly adopted in LLM serving to improve the latency-throughput tradeoff and meet strict TTFT and TPOT SLOs.
What Distributed Computing Got Wrong: The Category Mistake That Turned Design Choices into Laws of NaturePaul Borrill2026-02-21下载The foundational impossibility results of distributed computing -- the Fischer-Lynch-Paterson theorem, the Two Generals Problem, the CAP theorem -- are widely understood as discoveries about the physi...
When Coordination Is Avoidable: A Monotonicity Analysis of Organizational TasksHarang Ju2026-02-21下载Organizations devote substantial resources to coordination, yet which tasks actually require it for correctness remains unclear. The problem is acute in multi-agent AI systems, where coordination over...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
EdgeSketch: Efficient Analysis of Massive Graph StreamsJakub Lemiesz, Dingqi Yang, Philippe Cudré-Mauroux2026-02-21下载We introduce EdgeSketch, a compact graph representation for efficient analysis of massive graph streams. EdgeSketch provides unbiased estimators for key graph properties with controllable variance and...
Towards Green Connectivity: An AI-Driven Mesh Architecture for Sustainable and Scalable Wireless NetworksMuhammad Ahmed Mohsin, Muhammad Jazib, Muhammad Saad, Ayesha Mohsin2026-02-21下载Traditional macro-cell and micro-cell infrastructures suffer from severe inefficiencies, with current macro-cell networks operating at less than 5 percent energy efficiency, leading to nearly 95 perce...

基于 VitePress 构建