2026-02-15

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| ABI: A tightly integrated, unified, sparsity-aware, reconfigurable, compute near-register file/cache GPU architecture with light-weight softmax for deep learning, linear algebra, and Ising compute | Siddhartha Raman Sundara Raman, Jaydeep P. Kulkarni | 2026-02-15 | Download | We present a tightly integrated and unified near-memory GPU architecture that delivers 6 to 16 times speedup and 6 to 13 times energy savings across Convolutional Neural Networks, Graph Convolutional ... |
| Probabilistic approximate optimization using single-photon avalanche diode arrays | Ziyad Alswaidan, Abdelrahman S. Abdelrahman, Md Sakibur Sajal, Shuvro Chowdhury, Kai-Chun Lin, Hunter Guthrie, Sanjay Seshan, Shawn Blanton, Flaviano Morone, Marc Dandin, Kerem Y. Camsari, Tathagata Srimani | 2026-02-15 | Download | Combinatorial optimization problems are central to science and engineering and specialized hardware from quantum annealers to classical Ising machines are being actively developed to address them. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Floe: Federated Specialization for Real-Time LLM-SLM Inference | Chunlin Tian, Kahou Tam, Yebo Wu, Shuaihang Zhong, Li Li, Nicholas D. Lane, Chengzhong Xu | 2026-02-15 | Download | Deploying large language models (LLMs) in real-time systems remains challenging due to their substantial computational demands and privacy concerns. |
| Parallel Sparse and Data-Sparse Factorization-based Linear Solvers | Xiaoye Sherry Li, Yang Liu | 2026-02-15 | Download | Efficient solutions of large-scale, ill-conditioned and indefinite algebraic equations are ubiquitously needed in numerous computational fields, including multiphysics simulations, machine learning, a... |
| ML-ECS: A Collaborative Multimodal Learning Framework for Edge-Cloud Synergies | Yuze Liu, Shibo Chu, Tiehua Zhang, Hao Zhou, Zhishu Shen, Jinze Wang, Jianzhong Qi, Feng Xia | 2026-02-15 | Download | Edge-cloud synergies provide a promising paradigm for privacy-preserving deployment of foundation models, where lightweight on-device models adapt to domain-specific data and cloud-hosted models coord... |
| FedEMA-Distill: Exponential Moving Average Guided Knowledge Distillation for Robust Federated Learning | Hamza Reguieg, Mohamed El Kamili, Essaid Sabir | 2026-02-15 | Download | Federated learning (FL) often degrades when clients hold heterogeneous non-Independent and Identically Distributed (non-IID) data and when some clients behave adversarially, leading to client drift, s... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| MILD: Multi-Intent Learning and Disambiguation for Proactive Failure Prediction in Intent-based Networking | Md. Kamrul Hossain, Walid Aljoby | 2026-02-15 | Download | In multi-intent intent-based networks, a single fault can trigger co-drift where multiple intents exhibit symptomatic KPI degradation, creating ambiguity about the true root-cause intent. |
| Toward Autonomous O-RAN: A Multi-Scale Agentic AI Framework for Real-Time Network Control and Management | Hojjat Navidan, Mohammad Cheraghinia, Jaron Fontaine, Mohamed Seif, Eli De Poorter, H. Vincent Poor, Ingrid Moerman, Adnan Shahid | 2026-02-15 | Download | Open Radio Access Networks (O-RAN) promise flexible 6G network access through disaggregated, software-driven components and open interfaces, but this programmability also increases operational complex... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Dual-Signal Adaptive KV-Cache Optimization for Long-Form Video Understanding in Vision-Language Models | Vishnu Sai, Dheeraj Sai, Srinath B, Girish Varma, Priyesh Shukla | 2026-02-15 | Download | Vision-Language Models (VLMs) face a critical memory bottleneck when processing long-form video content due to the linear growth of the Key-Value (KV) cache with sequence length. |
