2026-02-15

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
ABI: A tightly integrated, unified, sparsity-aware, reconfigurable, compute near-register file/cache GPU architecture with light-weight softmax for deep learning, linear algebra, and Ising compute	Siddhartha Raman Sundara Raman, Jaydeep P. Kulkarni	2026-02-15	Download	We present a tightly integrated and unified near-memory GPU architecture that delivers 6 to 16 times speedup and 6 to 13 times energy savings across Convolutional Neural Networks, Graph Convolutional ...
Probabilistic approximate optimization using single-photon avalanche diode arrays	Ziyad Alswaidan, Abdelrahman S. Abdelrahman, Md Sakibur Sajal, Shuvro Chowdhury, Kai-Chun Lin, Hunter Guthrie, Sanjay Seshan, Shawn Blanton, Flaviano Morone, Marc Dandin, Kerem Y. Camsari, Tathagata Srimani	2026-02-15	Download	Combinatorial optimization problems are central to science and engineering and specialized hardware from quantum annealers to classical Ising machines are being actively developed to address them.

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Floe: Federated Specialization for Real-Time LLM-SLM Inference	Chunlin Tian, Kahou Tam, Yebo Wu, Shuaihang Zhong, Li Li, Nicholas D. Lane, Chengzhong Xu	2026-02-15	Download	Deploying large language models (LLMs) in real-time systems remains challenging due to their substantial computational demands and privacy concerns.
Parallel Sparse and Data-Sparse Factorization-based Linear Solvers	Xiaoye Sherry Li, Yang Liu	2026-02-15	Download	Efficient solutions of large-scale, ill-conditioned and indefinite algebraic equations are ubiquitously needed in numerous computational fields, including multiphysics simulations, machine learning, a...
ML-ECS: A Collaborative Multimodal Learning Framework for Edge-Cloud Synergies	Yuze Liu, Shibo Chu, Tiehua Zhang, Hao Zhou, Zhishu Shen, Jinze Wang, Jianzhong Qi, Feng Xia	2026-02-15	Download	Edge-cloud synergies provide a promising paradigm for privacy-preserving deployment of foundation models, where lightweight on-device models adapt to domain-specific data and cloud-hosted models coord...
FedEMA-Distill: Exponential Moving Average Guided Knowledge Distillation for Robust Federated Learning	Hamza Reguieg, Mohamed El Kamili, Essaid Sabir	2026-02-15	Download	Federated learning (FL) often degrades when clients hold heterogeneous non-Independent and Identically Distributed (non-IID) data and when some clients behave adversarially, leading to client drift, s...

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
MILD: Multi-Intent Learning and Disambiguation for Proactive Failure Prediction in Intent-based Networking	Md. Kamrul Hossain, Walid Aljoby	2026-02-15	Download	In multi-intent intent-based networks, a single fault can trigger co-drift where multiple intents exhibit symptomatic KPI degradation, creating ambiguity about the true root-cause intent.
Toward Autonomous O-RAN: A Multi-Scale Agentic AI Framework for Real-Time Network Control and Management	Hojjat Navidan, Mohammad Cheraghinia, Jaron Fontaine, Mohamed Seif, Eli De Poorter, H. Vincent Poor, Ingrid Moerman, Adnan Shahid	2026-02-15	Download	Open Radio Access Networks (O-RAN) promise flexible 6G network access through disaggregated, software-driven components and open interfaces, but this programmability also increases operational complex...

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
Dual-Signal Adaptive KV-Cache Optimization for Long-Form Video Understanding in Vision-Language Models	Vishnu Sai, Dheeraj Sai, Srinath B, Girish Varma, Priyesh Shukla	2026-02-15	Download	Vision-Language Models (VLMs) face a critical memory bottleneck when processing long-form video content due to the linear growth of the Key-Value (KV) cache with sequence length.