2025-12-25

cs.AR - Hardware Architecture

Title	Authors	Published	PDF	Abstract
Online Learning Extreme Learning Machine with Low-Complexity Predictive Plasticity Rule and FPGA Implementation	Zhenya Zang, Xingda Li, David Day Uei Li	2025-12-25	Download	We propose a simplified, biologically inspired predictive local learning rule that eliminates the need for global backpropagation in conventional neural networks and membrane integration in event-base...
Analysis of LLM Vulnerability to GPU Soft Errors: An Instruction-Level Fault Injection Study	Duo Chai, Zizhen Liu, Shuhuai Wang, Songwei Pei, Cheng Liu, Huawei Li, Shangguang Wang	2025-12-25	Download	Large language models (LLMs) are highly compute- and memory-intensive, posing significant demands on high-performance GPUs. At the same time, advances in GPU technology driven by shrinking transistor ...

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Smart IoT-Based Leak Forecasting and Detection for Energy-Efficient Liquid Cooling in AI Data Centers	Krishna Chaitanya Sunkara, Rambabu Konakanchi	2025-12-25	Download	AI data centers which are GPU centric, have adopted liquid cooling to handle extreme heat loads, but coolant leaks result in substantial energy loss through unplanned shutdowns and extended repair per...
Hyperion: Low-Latency Ultra-HD Video Analytics via Collaborative Vision Transformer Inference	Linyi Jiang, Yifei Zhu, Hao Yin, Bo Li	2025-12-25	Download	Recent advancements in array-camera videography enable real-time capturing of ultra-high-definition (Ultra-HD) videos, providing rich visual information in a large field of view.
LEFT-RS: A Lock-Free Fault-Tolerant Resource Sharing Protocol for Multicore Real-Time Systems	Nan Chen, Xiaotian Dai, Tong Cheng, Alan Burns, Iain Bate, Shuai Zhao	2025-12-25	Download	Emerging real-time applications have driven the transition to multicore embedded systems, where tasks must share resources due to functional demands and limited availability.
Embedding Samples Dispatching for Recommendation Model Training in Edge Environments	Guopeng Li, Haisheng Tan, Chi Zhang, Hongqiu Ni, Zilong Wang, Xinyue Zhang, Yang Xu, Han Tian	2025-12-25	Download	Training deep learning recommendation models (DLRMs) on edge workers brings several benefits, particularly in terms of data privacy protection, low latency and personalization.
nncase: An End-to-End Compiler for Efficient LLM Deployment on Heterogeneous Storage Architectures	Hui Guo, Qihang Zheng, Chenghai Huo, Dongliang Guo, Haoqi Yang, Yang Zhang	2025-12-25	Download	The efficient deployment of large language models (LLMs) is hindered by memory architecture heterogeneity, where traditional compilers suffer from fragmented workflows and high adaptation costs.
Valori: A Deterministic Memory Substrate for AI Systems	Varshith Gudur	2025-12-25	Download	Modern AI systems rely on vector embeddings stored and searched using floating-point arithmetic. While effective for approximate similarity search, this design introduces fundamental non-determinism: ...
Efficient MoE Inference with Fine-Grained Scheduling of Disaggregated Expert Parallelism	Xinglin Pan, Shaohuai Shi, Wenxiang Lin, Yuxin Wang, Zhenheng Tang, Wei Wang, Xiaowen Chu	2025-12-25	Download	The mixture-of-experts (MoE) architecture scales model size with sublinear computational increase but suffers from memory-intensive inference due to KV caches and sparse expert activation.
Demystifying ARM SME to Optimize General Matrix Multiplications	Chencheng Deng, Weiling Yang, Jianbin Fang, Dezun Dong	2025-12-25	Download	General Matrix Multiplication (GEMM) is a critical kernel in high-performance computing and deep learning. While modern architectures like ARM's Scalable Matrix Extension (SME) introduce dedicated har...

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Smart IoT-Based Leak Forecasting and Detection for Energy-Efficient Liquid Cooling in AI Data Centers	Krishna Chaitanya Sunkara, Rambabu Konakanchi	2025-12-25	Download	AI data centers which are GPU centric, have adopted liquid cooling to handle extreme heat loads, but coolant leaks result in substantial energy loss through unplanned shutdowns and extended repair per...
Multiconnectivity for SAGIN: Current Trends, Challenges, AI-driven Solutions, and Opportunities	Abd Ullah Khan, Adnan Shahid, Haejoon Jung, Hyundong Shin	2025-12-25	Download	Space-air-ground-integrated network (SAGIN)-enabled multiconnectivity (MC) is emerging as a key enabler for next-generation networks, enabling users to simultaneously utilize multiple links across mul...
Physics-informed Diffusion Models for Multi-scale Prediction of Reference Signal Received Power in Wireless Networks	Xiaoqian Qi, Haoye Chai, Yue Wang, Zhaocheng Wang, Yong Li	2025-12-25	Download	The Reference Signal Received Power (RSRP) is a crucial factor that determines communication performance in mobile networks. Accurately predicting the RSRP can help network operators perceive user exp...

cs.OS - Operating Systems

Title	Authors	Published	PDF	Abstract
LEFT-RS: A Lock-Free Fault-Tolerant Resource Sharing Protocol for Multicore Real-Time Systems	Nan Chen, Xiaotian Dai, Tong Cheng, Alan Burns, Iain Bate, Shuai Zhao	2025-12-25	Download	Emerging real-time applications have driven the transition to multicore embedded systems, where tasks must share resources due to functional demands and limited availability.