2024-12-24

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
ReducedLUT: Table Decomposition with "Don't Care" Conditions	Oliver Cassidy, Marta Andronic, Samuel Coward, George A. Constantinides	2024-12-24	下载	Lookup tables (LUTs) are frequently used to efficiently store arrays of precomputed values for complex mathematical computations. When used in the context of neural networks, these functions exhibit a...
GCN-ABFT: Low-Cost Online Error Checking for Graph Convolutional Networks	Christodoulos Peltekis, Giorgos Dimitrakopoulos	2024-12-24	下载	Graph convolutional networks (GCNs) are popular for building machine-learning application for graph-structured data. This widespread adoption led to the development of specialized GCN hardware acceler...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Efficient Circuit Cutting and Scheduling in a Multi-Node Quantum System with Dynamic EPR Pairs	Zefan Du, Wenrui Zhang, Wenqi Wei, Juntao Chen, Tao Han, Zhiding Liang, Ying Mao	2024-12-24	下载	Despite advancements, current quantum hardware faces significant challenges, including limited qubit counts and high susceptibility to noise, which hinder the execution of large, complex algorithms.
Circuit Folding: Modular and Qubit-Level Workload Management in Quantum-Classical Systems	Shuwen Kan, Yanni Li, Hao Wang, Sara Mouradian, Ying Mao	2024-12-24	下载	Circuit cutting is a promising technique that leverages both quantum and classical computational resources, enabling the practical execution of large quantum circuits on noisy intermediate-scale quant...
TimelyLLM: Segmented LLM Serving System for Time-sensitive Robotic Applications	Neiwen Ling, Guojun Chen, Lin Zhong	2024-12-24	下载	Large Language Models (LLMs) such as GPT-4 and Llama3 can already comprehend complex commands and process diverse tasks. This advancement facilitates their application in controlling drones and robots...
Double Spending Analysis of Nakamoto Consensus for Time-Varying Mining Rates with Ruin Theory	Mustafa Doger, Sennur Ulukus, Nail Akar	2024-12-24	下载	Theoretical guarantees for double spending probabilities for the Nakamoto consensus under the $k$ -deep confirmation rule have been extensively studied for zero/bounded network delays and fixed mining ...
Pilot-Quantum: A Quantum-HPC Middleware for Resource, Workload and Task Management	Pradeep Mantha, Florian J. Kiwit, Nishant Saurabh, Shantenu Jha, Andre Luckow	2024-12-24	下载	As quantum hardware advances, integrating quantum processing units (QPUs) into HPC environments and managing diverse infrastructure and software stacks becomes increasingly essential.
Hardware-aware Circuit Cutting and Distributed Qubit Mapping for Connected Quantum Systems	Zefan Du, Yanni Li, Zijian Mo, Wenqi Wei, Juntao Chen, Rajkumar Buyya, Ying Mao	2024-12-24	下载	Quantum computing offers unparalleled computational capabilities but faces significant challenges, including limited qubit counts, diverse hardware topologies, and dynamic noise/error rates, which hin...
Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks	Changfu Xu, Jianxiong Guo, Wanyu Lin, Haodong Zou, Wentao Fan, Tian Wang, Xiaowen Chu, Jiannong Cao	2024-12-24	下载	Artificial Intelligence Generated Content (AIGC) has gained significant popularity for creating diverse content. Current AIGC models primarily focus on content quality within a centralized framework, ...
KunServe: Parameter-centric Memory Management for Efficient Memory Overloading Handling in LLM Serving	Rongxin Cheng, Yuxin Lai, Xingda Wei, Rong Chen, Haibo Chen	2024-12-24	下载	Serving LLMs with a cluster of GPUs is common nowadays, where the serving system must meet strict latency SLOs required by applications. However, the stateful nature of LLM serving requires maintainin...
XSema: A Novel Framework for Semantic Extraction of Cross-chain Transactions	Ziye Zheng, Jiajing Wu, Dan Lin, Quanzhong Li, Na Ruan	2024-12-24	下载	As the number of blockchain platforms continues to grow, the independence of these networks poses challenges for transferring assets and information across chains.
Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels	Mingcong Song, Xinru Tang, Fengfan Hou, Jing Li, Wei Wei, Yipeng Ma, Runqiu Xiao, Hongjie Si, Dingcheng Jiang, Shouyi Yin, Yang Hu, Guoping Long	2024-12-24	下载	Meeting growing demands for low latency and cost efficiency in production-grade large language model (LLM) serving systems requires integrating advanced optimization techniques.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
TPAoI: Ensuring Fresh Service Status at the Network Edge in Compute-First Networking	Haosheng He, Jianpeng Qi, Chao Liu, Junyu Dong, Yanwei Yu	2024-12-24	下载	In compute-first networking, maintaining fresh and accurate status information at the network edge is crucial for effective access to remote services.
A Large-Scale IPv6-Based Measurement of the Starlink Network	Bingsen Wang, Xiaohui Zhang, Shuai Wang, Li Chen, Jinwei Zhao, Dan Li, Yong Jiang	2024-12-24	下载	Low Earth Orbit (LEO) satellite networks have attracted considerable attention for their ability to deliver global, low-latency broadband Internet services.
Adapting Large Language Models for Improving TCP Fairness over WiFi	Shyam Kumar Shrestha, Shiva Raj Pokhrel, Jonathan Kua	2024-12-24	下载	The new transmission control protocol (TCP) relies on Deep Learning (DL) for prediction and optimization, but requires significant manual effort to design deep neural networks (DNNs) and struggles wit...
Energy Efficient Computation Offloading and Virtual Connection Control in Uplink Small Cell Networks	Davoud Yousefi, Hassan Yari, Farzad Osouli, Mohammad Ebrahimi, Somayeh Esmalifalak, Morteza Johari, Abbas Azarnezhad, Fatemeh Sadeghi, Rogayeh Mirzapour	2024-12-24	下载	Nowadays, the use of soft computational techniques in power systems under the umbrella of machine learning is increasing with good reception. In this paper, we first present a deep learning approach t...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Interference-free Operating System: A 6 Years' Experience in Mitigating Cross-Core Interference in Linux	Zhaomeng Deng, Ziqi Zhang, Ding Li, Yao Guo, Yunfeng Ye, Yuxin Ren, Ning Jia, Xinwei Hu	2024-12-24	下载	Real-time operating systems employ spatial and temporal isolation to guarantee predictability and schedulability of real-time systems on multi-core processors.