Appearance
2024-12-24
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| ReducedLUT: Table Decomposition with "Don't Care" Conditions | Oliver Cassidy, Marta Andronic, Samuel Coward, George A. Constantinides | 2024-12-24 | 下载 | Lookup tables (LUTs) are frequently used to efficiently store arrays of precomputed values for complex mathematical computations. When used in the context of neural networks, these functions exhibit a... |
| GCN-ABFT: Low-Cost Online Error Checking for Graph Convolutional Networks | Christodoulos Peltekis, Giorgos Dimitrakopoulos | 2024-12-24 | 下载 | Graph convolutional networks (GCNs) are popular for building machine-learning application for graph-structured data. This widespread adoption led to the development of specialized GCN hardware acceler... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Efficient Circuit Cutting and Scheduling in a Multi-Node Quantum System with Dynamic EPR Pairs | Zefan Du, Wenrui Zhang, Wenqi Wei, Juntao Chen, Tao Han, Zhiding Liang, Ying Mao | 2024-12-24 | 下载 | Despite advancements, current quantum hardware faces significant challenges, including limited qubit counts and high susceptibility to noise, which hinder the execution of large, complex algorithms. |
| Circuit Folding: Modular and Qubit-Level Workload Management in Quantum-Classical Systems | Shuwen Kan, Yanni Li, Hao Wang, Sara Mouradian, Ying Mao | 2024-12-24 | 下载 | Circuit cutting is a promising technique that leverages both quantum and classical computational resources, enabling the practical execution of large quantum circuits on noisy intermediate-scale quant... |
| TimelyLLM: Segmented LLM Serving System for Time-sensitive Robotic Applications | Neiwen Ling, Guojun Chen, Lin Zhong | 2024-12-24 | 下载 | Large Language Models (LLMs) such as GPT-4 and Llama3 can already comprehend complex commands and process diverse tasks. This advancement facilitates their application in controlling drones and robots... |
| Double Spending Analysis of Nakamoto Consensus for Time-Varying Mining Rates with Ruin Theory | Mustafa Doger, Sennur Ulukus, Nail Akar | 2024-12-24 | 下载 | Theoretical guarantees for double spending probabilities for the Nakamoto consensus under the -deep confirmation rule have been extensively studied for zero/bounded network delays and fixed mining ... |
| Pilot-Quantum: A Quantum-HPC Middleware for Resource, Workload and Task Management | Pradeep Mantha, Florian J. Kiwit, Nishant Saurabh, Shantenu Jha, Andre Luckow | 2024-12-24 | 下载 | As quantum hardware advances, integrating quantum processing units (QPUs) into HPC environments and managing diverse infrastructure and software stacks becomes increasingly essential. |
| Hardware-aware Circuit Cutting and Distributed Qubit Mapping for Connected Quantum Systems | Zefan Du, Yanni Li, Zijian Mo, Wenqi Wei, Juntao Chen, Rajkumar Buyya, Ying Mao | 2024-12-24 | 下载 | Quantum computing offers unparalleled computational capabilities but faces significant challenges, including limited qubit counts, diverse hardware topologies, and dynamic noise/error rates, which hin... |
| Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks | Changfu Xu, Jianxiong Guo, Wanyu Lin, Haodong Zou, Wentao Fan, Tian Wang, Xiaowen Chu, Jiannong Cao | 2024-12-24 | 下载 | Artificial Intelligence Generated Content (AIGC) has gained significant popularity for creating diverse content. Current AIGC models primarily focus on content quality within a centralized framework, ... |
| KunServe: Parameter-centric Memory Management for Efficient Memory Overloading Handling in LLM Serving | Rongxin Cheng, Yuxin Lai, Xingda Wei, Rong Chen, Haibo Chen | 2024-12-24 | 下载 | Serving LLMs with a cluster of GPUs is common nowadays, where the serving system must meet strict latency SLOs required by applications. However, the stateful nature of LLM serving requires maintainin... |
| XSema: A Novel Framework for Semantic Extraction of Cross-chain Transactions | Ziye Zheng, Jiajing Wu, Dan Lin, Quanzhong Li, Na Ruan | 2024-12-24 | 下载 | As the number of blockchain platforms continues to grow, the independence of these networks poses challenges for transferring assets and information across chains. |
| Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels | Mingcong Song, Xinru Tang, Fengfan Hou, Jing Li, Wei Wei, Yipeng Ma, Runqiu Xiao, Hongjie Si, Dingcheng Jiang, Shouyi Yin, Yang Hu, Guoping Long | 2024-12-24 | 下载 | Meeting growing demands for low latency and cost efficiency in production-grade large language model (LLM) serving systems requires integrating advanced optimization techniques. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| TPAoI: Ensuring Fresh Service Status at the Network Edge in Compute-First Networking | Haosheng He, Jianpeng Qi, Chao Liu, Junyu Dong, Yanwei Yu | 2024-12-24 | 下载 | In compute-first networking, maintaining fresh and accurate status information at the network edge is crucial for effective access to remote services. |
| A Large-Scale IPv6-Based Measurement of the Starlink Network | Bingsen Wang, Xiaohui Zhang, Shuai Wang, Li Chen, Jinwei Zhao, Dan Li, Yong Jiang | 2024-12-24 | 下载 | Low Earth Orbit (LEO) satellite networks have attracted considerable attention for their ability to deliver global, low-latency broadband Internet services. |
| Adapting Large Language Models for Improving TCP Fairness over WiFi | Shyam Kumar Shrestha, Shiva Raj Pokhrel, Jonathan Kua | 2024-12-24 | 下载 | The new transmission control protocol (TCP) relies on Deep Learning (DL) for prediction and optimization, but requires significant manual effort to design deep neural networks (DNNs) and struggles wit... |
| Energy Efficient Computation Offloading and Virtual Connection Control in Uplink Small Cell Networks | Davoud Yousefi, Hassan Yari, Farzad Osouli, Mohammad Ebrahimi, Somayeh Esmalifalak, Morteza Johari, Abbas Azarnezhad, Fatemeh Sadeghi, Rogayeh Mirzapour | 2024-12-24 | 下载 | Nowadays, the use of soft computational techniques in power systems under the umbrella of machine learning is increasing with good reception. In this paper, we first present a deep learning approach t... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Interference-free Operating System: A 6 Years' Experience in Mitigating Cross-Core Interference in Linux | Zhaomeng Deng, Ziqi Zhang, Ding Li, Yao Guo, Yunfeng Ye, Yuxin Ren, Ning Jia, Xinwei Hu | 2024-12-24 | 下载 | Real-time operating systems employ spatial and temporal isolation to guarantee predictability and schedulability of real-time systems on multi-core processors. |