2025-12-25
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Online Learning Extreme Learning Machine with Low-Complexity Predictive Plasticity Rule and FPGA Implementation | Zhenya Zang, Xingda Li, David Day Uei Li | 2025-12-25 | Download | We propose a simplified, biologically inspired predictive local learning rule that eliminates the need for global backpropagation in conventional neural networks and membrane integration in event-base... |
| Analysis of LLM Vulnerability to GPU Soft Errors: An Instruction-Level Fault Injection Study | Duo Chai, Zizhen Liu, Shuhuai Wang, Songwei Pei, Cheng Liu, Huawei Li, Shangguang Wang | 2025-12-25 | Download | Large language models (LLMs) are highly compute- and memory-intensive, posing significant demands on high-performance GPUs. At the same time, advances in GPU technology driven by shrinking transistor ... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Smart IoT-Based Leak Forecasting and Detection for Energy-Efficient Liquid Cooling in AI Data Centers | Krishna Chaitanya Sunkara, Rambabu Konakanchi | 2025-12-25 | Download | AI data centers which are GPU centric, have adopted liquid cooling to handle extreme heat loads, but coolant leaks result in substantial energy loss through unplanned shutdowns and extended repair per... |
| Hyperion: Low-Latency Ultra-HD Video Analytics via Collaborative Vision Transformer Inference | Linyi Jiang, Yifei Zhu, Hao Yin, Bo Li | 2025-12-25 | Download | Recent advancements in array-camera videography enable real-time capturing of ultra-high-definition (Ultra-HD) videos, providing rich visual information in a large field of view. |
| LEFT-RS: A Lock-Free Fault-Tolerant Resource Sharing Protocol for Multicore Real-Time Systems | Nan Chen, Xiaotian Dai, Tong Cheng, Alan Burns, Iain Bate, Shuai Zhao | 2025-12-25 | Download | Emerging real-time applications have driven the transition to multicore embedded systems, where tasks must share resources due to functional demands and limited availability. |
| Embedding Samples Dispatching for Recommendation Model Training in Edge Environments | Guopeng Li, Haisheng Tan, Chi Zhang, Hongqiu Ni, Zilong Wang, Xinyue Zhang, Yang Xu, Han Tian | 2025-12-25 | Download | Training deep learning recommendation models (DLRMs) on edge workers brings several benefits, particularly in terms of data privacy protection, low latency and personalization. |
| nncase: An End-to-End Compiler for Efficient LLM Deployment on Heterogeneous Storage Architectures | Hui Guo, Qihang Zheng, Chenghai Huo, Dongliang Guo, Haoqi Yang, Yang Zhang | 2025-12-25 | Download | The efficient deployment of large language models (LLMs) is hindered by memory architecture heterogeneity, where traditional compilers suffer from fragmented workflows and high adaptation costs. |
| Valori: A Deterministic Memory Substrate for AI Systems | Varshith Gudur | 2025-12-25 | Download | Modern AI systems rely on vector embeddings stored and searched using floating-point arithmetic. While effective for approximate similarity search, this design introduces fundamental non-determinism: ... |
| Efficient MoE Inference with Fine-Grained Scheduling of Disaggregated Expert Parallelism | Xinglin Pan, Shaohuai Shi, Wenxiang Lin, Yuxin Wang, Zhenheng Tang, Wei Wang, Xiaowen Chu | 2025-12-25 | Download | The mixture-of-experts (MoE) architecture scales model size with sublinear computational increase but suffers from memory-intensive inference due to KV caches and sparse expert activation. |
| Demystifying ARM SME to Optimize General Matrix Multiplications | Chencheng Deng, Weiling Yang, Jianbin Fang, Dezun Dong | 2025-12-25 | Download | General Matrix Multiplication (GEMM) is a critical kernel in high-performance computing and deep learning. While modern architectures like ARM's Scalable Matrix Extension (SME) introduce dedicated har... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Smart IoT-Based Leak Forecasting and Detection for Energy-Efficient Liquid Cooling in AI Data Centers | Krishna Chaitanya Sunkara, Rambabu Konakanchi | 2025-12-25 | Download | AI data centers which are GPU centric, have adopted liquid cooling to handle extreme heat loads, but coolant leaks result in substantial energy loss through unplanned shutdowns and extended repair per... |
| Multiconnectivity for SAGIN: Current Trends, Challenges, AI-driven Solutions, and Opportunities | Abd Ullah Khan, Adnan Shahid, Haejoon Jung, Hyundong Shin | 2025-12-25 | Download | Space-air-ground-integrated network (SAGIN)-enabled multiconnectivity (MC) is emerging as a key enabler for next-generation networks, enabling users to simultaneously utilize multiple links across mul... |
| Physics-informed Diffusion Models for Multi-scale Prediction of Reference Signal Received Power in Wireless Networks | Xiaoqian Qi, Haoye Chai, Yue Wang, Zhaocheng Wang, Yong Li | 2025-12-25 | Download | The Reference Signal Received Power (RSRP) is a crucial factor that determines communication performance in mobile networks. Accurately predicting the RSRP can help network operators perceive user exp... |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| LEFT-RS: A Lock-Free Fault-Tolerant Resource Sharing Protocol for Multicore Real-Time Systems | Nan Chen, Xiaotian Dai, Tong Cheng, Alan Burns, Iain Bate, Shuai Zhao | 2025-12-25 | Download | Emerging real-time applications have driven the transition to multicore embedded systems, where tasks must share resources due to functional demands and limited availability. |