Skip to content

2024-12-24

cs.AR - Architecture

标题作者发布日期PDF摘要
ReducedLUT: Table Decomposition with "Don't Care" ConditionsOliver Cassidy, Marta Andronic, Samuel Coward, George A. Constantinides2024-12-24下载Lookup tables (LUTs) are frequently used to efficiently store arrays of precomputed values for complex mathematical computations. When used in the context of neural networks, these functions exhibit a...
GCN-ABFT: Low-Cost Online Error Checking for Graph Convolutional NetworksChristodoulos Peltekis, Giorgos Dimitrakopoulos2024-12-24下载Graph convolutional networks (GCNs) are popular for building machine-learning application for graph-structured data. This widespread adoption led to the development of specialized GCN hardware acceler...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Efficient Circuit Cutting and Scheduling in a Multi-Node Quantum System with Dynamic EPR PairsZefan Du, Wenrui Zhang, Wenqi Wei, Juntao Chen, Tao Han, Zhiding Liang, Ying Mao2024-12-24下载Despite advancements, current quantum hardware faces significant challenges, including limited qubit counts and high susceptibility to noise, which hinder the execution of large, complex algorithms.
Circuit Folding: Modular and Qubit-Level Workload Management in Quantum-Classical SystemsShuwen Kan, Yanni Li, Hao Wang, Sara Mouradian, Ying Mao2024-12-24下载Circuit cutting is a promising technique that leverages both quantum and classical computational resources, enabling the practical execution of large quantum circuits on noisy intermediate-scale quant...
TimelyLLM: Segmented LLM Serving System for Time-sensitive Robotic ApplicationsNeiwen Ling, Guojun Chen, Lin Zhong2024-12-24下载Large Language Models (LLMs) such as GPT-4 and Llama3 can already comprehend complex commands and process diverse tasks. This advancement facilitates their application in controlling drones and robots...
Double Spending Analysis of Nakamoto Consensus for Time-Varying Mining Rates with Ruin TheoryMustafa Doger, Sennur Ulukus, Nail Akar2024-12-24下载Theoretical guarantees for double spending probabilities for the Nakamoto consensus under the kk-deep confirmation rule have been extensively studied for zero/bounded network delays and fixed mining ...
Pilot-Quantum: A Quantum-HPC Middleware for Resource, Workload and Task ManagementPradeep Mantha, Florian J. Kiwit, Nishant Saurabh, Shantenu Jha, Andre Luckow2024-12-24下载As quantum hardware advances, integrating quantum processing units (QPUs) into HPC environments and managing diverse infrastructure and software stacks becomes increasingly essential.
Hardware-aware Circuit Cutting and Distributed Qubit Mapping for Connected Quantum SystemsZefan Du, Yanni Li, Zijian Mo, Wenqi Wei, Juntao Chen, Rajkumar Buyya, Ying Mao2024-12-24下载Quantum computing offers unparalleled computational capabilities but faces significant challenges, including limited qubit counts, diverse hardware topologies, and dynamic noise/error rates, which hin...
Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge NetworksChangfu Xu, Jianxiong Guo, Wanyu Lin, Haodong Zou, Wentao Fan, Tian Wang, Xiaowen Chu, Jiannong Cao2024-12-24下载Artificial Intelligence Generated Content (AIGC) has gained significant popularity for creating diverse content. Current AIGC models primarily focus on content quality within a centralized framework, ...
KunServe: Parameter-centric Memory Management for Efficient Memory Overloading Handling in LLM ServingRongxin Cheng, Yuxin Lai, Xingda Wei, Rong Chen, Haibo Chen2024-12-24下载Serving LLMs with a cluster of GPUs is common nowadays, where the serving system must meet strict latency SLOs required by applications. However, the stateful nature of LLM serving requires maintainin...
XSema: A Novel Framework for Semantic Extraction of Cross-chain TransactionsZiye Zheng, Jiajing Wu, Dan Lin, Quanzhong Li, Na Ruan2024-12-24下载As the number of blockchain platforms continues to grow, the independence of these networks poses challenges for transferring assets and information across chains.
Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernelsMingcong Song, Xinru Tang, Fengfan Hou, Jing Li, Wei Wei, Yipeng Ma, Runqiu Xiao, Hongjie Si, Dingcheng Jiang, Shouyi Yin, Yang Hu, Guoping Long2024-12-24下载Meeting growing demands for low latency and cost efficiency in production-grade large language model (LLM) serving systems requires integrating advanced optimization techniques.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
TPAoI: Ensuring Fresh Service Status at the Network Edge in Compute-First NetworkingHaosheng He, Jianpeng Qi, Chao Liu, Junyu Dong, Yanwei Yu2024-12-24下载In compute-first networking, maintaining fresh and accurate status information at the network edge is crucial for effective access to remote services.
A Large-Scale IPv6-Based Measurement of the Starlink NetworkBingsen Wang, Xiaohui Zhang, Shuai Wang, Li Chen, Jinwei Zhao, Dan Li, Yong Jiang2024-12-24下载Low Earth Orbit (LEO) satellite networks have attracted considerable attention for their ability to deliver global, low-latency broadband Internet services.
Adapting Large Language Models for Improving TCP Fairness over WiFiShyam Kumar Shrestha, Shiva Raj Pokhrel, Jonathan Kua2024-12-24下载The new transmission control protocol (TCP) relies on Deep Learning (DL) for prediction and optimization, but requires significant manual effort to design deep neural networks (DNNs) and struggles wit...
Energy Efficient Computation Offloading and Virtual Connection Control in Uplink Small Cell NetworksDavoud Yousefi, Hassan Yari, Farzad Osouli, Mohammad Ebrahimi, Somayeh Esmalifalak, Morteza Johari, Abbas Azarnezhad, Fatemeh Sadeghi, Rogayeh Mirzapour2024-12-24下载Nowadays, the use of soft computational techniques in power systems under the umbrella of machine learning is increasing with good reception. In this paper, we first present a deep learning approach t...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Interference-free Operating System: A 6 Years' Experience in Mitigating Cross-Core Interference in LinuxZhaomeng Deng, Ziqi Zhang, Ding Li, Yao Guo, Yunfeng Ye, Yuxin Ren, Ning Jia, Xinwei Hu2024-12-24下载Real-time operating systems employ spatial and temporal isolation to guarantee predictability and schedulability of real-time systems on multi-core processors.

基于 VitePress 构建