
2025-12-25

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Online Learning Extreme Learning Machine with Low-Complexity Predictive Plasticity Rule and FPGA Implementation | Zhenya Zang, Xingda Li, David Day Uei Li | 2025-12-25 | Download | We propose a simplified, biologically inspired predictive local learning rule that eliminates the need for global backpropagation in conventional neural networks and membrane integration in event-base... |
| Analysis of LLM Vulnerability to GPU Soft Errors: An Instruction-Level Fault Injection Study | Duo Chai, Zizhen Liu, Shuhuai Wang, Songwei Pei, Cheng Liu, Huawei Li, Shangguang Wang | 2025-12-25 | Download | Large language models (LLMs) are highly compute- and memory-intensive, posing significant demands on high-performance GPUs. At the same time, advances in GPU technology driven by shrinking transistor ... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Smart IoT-Based Leak Forecasting and Detection for Energy-Efficient Liquid Cooling in AI Data Centers | Krishna Chaitanya Sunkara, Rambabu Konakanchi | 2025-12-25 | Download | AI data centers, which are GPU centric, have adopted liquid cooling to handle extreme heat loads, but coolant leaks result in substantial energy loss through unplanned shutdowns and extended repair per... |
| Hyperion: Low-Latency Ultra-HD Video Analytics via Collaborative Vision Transformer Inference | Linyi Jiang, Yifei Zhu, Hao Yin, Bo Li | 2025-12-25 | Download | Recent advancements in array-camera videography enable real-time capturing of ultra-high-definition (Ultra-HD) videos, providing rich visual information in a large field of view. |
| LEFT-RS: A Lock-Free Fault-Tolerant Resource Sharing Protocol for Multicore Real-Time Systems | Nan Chen, Xiaotian Dai, Tong Cheng, Alan Burns, Iain Bate, Shuai Zhao | 2025-12-25 | Download | Emerging real-time applications have driven the transition to multicore embedded systems, where tasks must share resources due to functional demands and limited availability. |
| Embedding Samples Dispatching for Recommendation Model Training in Edge Environments | Guopeng Li, Haisheng Tan, Chi Zhang, Hongqiu Ni, Zilong Wang, Xinyue Zhang, Yang Xu, Han Tian | 2025-12-25 | Download | Training deep learning recommendation models (DLRMs) on edge workers brings several benefits, particularly in terms of data privacy protection, low latency and personalization. |
| nncase: An End-to-End Compiler for Efficient LLM Deployment on Heterogeneous Storage Architectures | Hui Guo, Qihang Zheng, Chenghai Huo, Dongliang Guo, Haoqi Yang, Yang Zhang | 2025-12-25 | Download | The efficient deployment of large language models (LLMs) is hindered by memory architecture heterogeneity, where traditional compilers suffer from fragmented workflows and high adaptation costs. |
| Valori: A Deterministic Memory Substrate for AI Systems | Varshith Gudur | 2025-12-25 | Download | Modern AI systems rely on vector embeddings stored and searched using floating-point arithmetic. While effective for approximate similarity search, this design introduces fundamental non-determinism: ... |
| Efficient MoE Inference with Fine-Grained Scheduling of Disaggregated Expert Parallelism | Xinglin Pan, Shaohuai Shi, Wenxiang Lin, Yuxin Wang, Zhenheng Tang, Wei Wang, Xiaowen Chu | 2025-12-25 | Download | The mixture-of-experts (MoE) architecture scales model size with sublinear computational increase but suffers from memory-intensive inference due to KV caches and sparse expert activation. |
| Demystifying ARM SME to Optimize General Matrix Multiplications | Chencheng Deng, Weiling Yang, Jianbin Fang, Dezun Dong | 2025-12-25 | Download | General Matrix Multiplication (GEMM) is a critical kernel in high-performance computing and deep learning. While modern architectures like ARM's Scalable Matrix Extension (SME) introduce dedicated har... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Smart IoT-Based Leak Forecasting and Detection for Energy-Efficient Liquid Cooling in AI Data Centers | Krishna Chaitanya Sunkara, Rambabu Konakanchi | 2025-12-25 | Download | AI data centers, which are GPU centric, have adopted liquid cooling to handle extreme heat loads, but coolant leaks result in substantial energy loss through unplanned shutdowns and extended repair per... |
| Multiconnectivity for SAGIN: Current Trends, Challenges, AI-driven Solutions, and Opportunities | Abd Ullah Khan, Adnan Shahid, Haejoon Jung, Hyundong Shin | 2025-12-25 | Download | Space-air-ground-integrated network (SAGIN)-enabled multiconnectivity (MC) is emerging as a key enabler for next-generation networks, enabling users to simultaneously utilize multiple links across mul... |
| Physics-informed Diffusion Models for Multi-scale Prediction of Reference Signal Received Power in Wireless Networks | Xiaoqian Qi, Haoye Chai, Yue Wang, Zhaocheng Wang, Yong Li | 2025-12-25 | Download | The Reference Signal Received Power (RSRP) is a crucial factor that determines communication performance in mobile networks. Accurately predicting the RSRP can help network operators perceive user exp... |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| LEFT-RS: A Lock-Free Fault-Tolerant Resource Sharing Protocol for Multicore Real-Time Systems | Nan Chen, Xiaotian Dai, Tong Cheng, Alan Burns, Iain Bate, Shuai Zhao | 2025-12-25 | Download | Emerging real-time applications have driven the transition to multicore embedded systems, where tasks must share resources due to functional demands and limited availability. |
