2025-02-20
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Micro Blossom: Accelerated Minimum-Weight Perfect Matching Decoding for Quantum Error Correction | Yue Wu, Namitha Liyanage, Lin Zhong | 2025-02-20 | Download | Minimum-Weight Perfect Matching (MWPM) decoding is important to quantum error correction decoding because of its accuracy. However, many believe that it is difficult, if possible at all, to achieve th... |
| Leveraging Error Resilience of Iterative Algorithms for Energy Efficiency: from Concept to Implementation | G. A. Gillani, A. Krapukhin, A. B. J. Kokkeler | 2025-02-20 | Download | Iterative algorithms are widely used in digital signal processing applications. With the case study of radio astronomy calibration processing, this work contributes towards revealing and exploiting th... |
| Parallelizing a modern GPU simulator | Rodrigo Huerta, Antonio González | 2025-02-20 | Download | Simulators are a primary tool in computer architecture research but are extremely computationally intensive. Simulating modern architectures with increased core counts and recent workloads can be chal... |
| DeepRTL: Bridging Verilog Understanding and Generation with a Unified Representation Model | Yi Liu, Changran Xu, Yunhao Zhou, Zeju Li, Qiang Xu | 2025-02-20 | Download | Recent advancements in large language models (LLMs) have shown significant potential for automating hardware description language (HDL) code generation from high-level natural language instructions. |
| μRL: Discovering Transient Execution Vulnerabilities Using Reinforcement Learning | M. Caner Tol, Kemal Derya, Berk Sunar | 2025-02-20 | Download | We propose using reinforcement learning to address the challenges of discovering microarchitectural vulnerabilities, such as Spectre and Meltdown, which exploit subtle interactions in modern processor... |
| NDPage: Efficient Address Translation for Near-Data Processing Architectures via Tailored Page Table | Qingcai Jiang, Buxin Tu, Hong An | 2025-02-20 | Download | Near-Data Processing (NDP) has been a promising architectural paradigm to address the memory wall problem for data-intensive applications. Practical implementation of NDP architectures calls for syste... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention | Shang Yang, Junxian Guo, Haotian Tang, Qinghao Hu, Guangxuan Xiao, Jiaming Tang, Yujun Lin, Zhijian Liu, Yao Lu, Song Han | 2025-02-20 | Download | Large language models (LLMs) have shown remarkable potential in processing long sequences and complex reasoning tasks, yet efficiently serving these models remains challenging due to the quadratic com... |
| MadVoro: Parallel Construction of Voronoi Diagrams in Distributed Memory Systems | Maor Mizrachi, Barak Raveh, Elad Steinberg | 2025-02-20 | Download | Voronoi diagrams are essential geometrical structures with numerous applications, particularly astrophysics-driven finite volume methods. While serial algorithms for constructing these entities are we... |
| Byzantine Game Theory: Sun Tzu's Boxes | Andrei Constantinescu, Roger Wattenhofer | 2025-02-20 | Download | We introduce the Byzantine Selection Problem, living at the intersection of game theory and fault-tolerant distributed computing. Here, an event organizer is presented with a group of agents, and ... |
| Parallelizing a modern GPU simulator | Rodrigo Huerta, Antonio González | 2025-02-20 | Download | Simulators are a primary tool in computer architecture research but are extremely computationally intensive. Simulating modern architectures with increased core counts and recent workloads can be chal... |
| SageServe: Optimizing LLM Serving on Cloud Data Centers with Forecast Aware Auto-Scaling | Shashwat Jaiswal, Kunal Jain, Yogesh Simmhan, Anjaly Parayil, Ankur Mallick, Rujia Wang, Renee St. Amant, Chetan Bansal, Victor Rühle, Anoop Kulkarni, Steve Kofsky, Saravan Rajmohan | 2025-02-20 | Download | Global cloud service providers handle inference workloads for Large Language Models (LLMs) that span latency-sensitive (e.g., chatbots) and insensitive (e.g. |
| madupite: A High-Performance Distributed Solver for Large-Scale Markov Decision Processes | Matilde Gargiani, Robin Sieber, Philip Pawlowsky, Václav Hapla, John Lygeros | 2025-02-20 | Download | This paper introduces madupite, a high-performance distributed solver for large-scale Markov Decision Processes (MDPs). MDPs are widely used to model complex dynamical systems in various fields, inclu... |
| LLM4FaaS: No-Code Application Development using LLMs and FaaS | Minghe Wang, Tobias Pfandzelter, Trever Schirmer, David Bermbach | 2025-02-20 | Download | Large language models (LLMs) show great capabilities in generating code from natural language descriptions, bringing programming power closer to non-technical users. |
| Optimizing the Longhorn Cloud-native Software Defined Storage Engine for High Performance | Konstantinos Kampadais, Antony Chazapis, Angelos Bilas | 2025-02-20 | Download | Longhorn is an open-source, cloud-native software-defined storage (SDS) engine that delivers distributed block storage management in Kubernetes environments. |
| A Parallel Hierarchical Approach for Community Detection on Large-scale Dynamic Networks | Grigoriy Bokov, Aleksandr Konovalov, Anna Uporova, Stanislav Moiseev, Ivan Safonov, Alexander Radionov | 2025-02-20 | Download | In this paper, we propose a novel parallel hierarchical Leiden-based algorithm for dynamic community detection. The algorithm, for a given batch update of edge insertions and deletions, partitions the... |
| It Takes Two to Tango: Serverless Workflow Serving via Bilaterally Engaged Resource Adaptation | Jing Wu, Lin Wang, Quanfeng Deng, Chen Yu, Dong Zhang, Bingheng Yan, Fangming Liu | 2025-02-20 | Download | Serverless platforms typically adopt an early-binding approach for function sizing, requiring developers to specify an immutable size for each function within a workflow beforehand. |
| Blockchain-based Framework for Scalable and Incentivized Federated Learning | Bijun Wu, Oshani Seneviratne | 2025-02-20 | Download | Federated Learning (FL) enables collaborative model training without sharing raw data, preserving privacy while harnessing distributed datasets. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| NeSt-VR: An Adaptive Bitrate Algorithm for Virtual Reality Streaming over Wi-Fi | Miguel Casasnovas, Ferran Maura, Isjtar Vandebroeck, Haryo Sukmawanto, Eric Joris, Boris Bellalta | 2025-02-20 | Download | Real-time interactive Virtual Reality (VR) streaming is a significantly challenging use case for Wi-Fi given its high throughput and low latency requirements, especially considering the constraints im... |
| Online Resource Management for the Uplink of Wideband Hybrid Beamforming System | Yuan Quan, Haseen Rahman, Catherine Rosenberg | 2025-02-20 | Download | This paper studies the radio resource management (RRM) for the uplink (UL) of a cellular system with codebook-based hybrid beamforming. We consider the often neglected but highly practical multi-chann... |
| Tracking and Assigning Jobs to a Markov Machine | Subhankar Banerjee, Sennur Ulukus | 2025-02-20 | Download | We consider a time-slotted communication system with a machine, a cloud server, and a sampler. Job requests from the users are queued on the server to be completed by the machine. |
| A Survey of Internet Censorship and its Measurement: Methodology, Trends, and Challenges | Steffen Wendzel, Simon Volpert, Sebastian Zillien, Julia Lenz, Philip Rünz, Luca Caviglione | 2025-02-20 | Download | Internet censorship limits the access of nodes residing within a specific network environment to the public Internet, and vice versa. During the last decade, techniques for conducting Internet censors... |
| Reinforcement Learning with Graph Attention for Routing and Wavelength Assignment with Lightpath Reuse | Michael Doherty, Alejandra Beghelli | 2025-02-20 | Download | Many works have investigated reinforcement learning (RL) for routing and spectrum assignment on flex-grid networks but only one work to date has examined RL for fixed-grid with flex-rate transponders,... |
| Counter Pools: Counter Representation for Efficient Stream Processing | Ran Ben Basat, Gil Einziger, Bilal Tyah, Shay Vargaftik | 2025-02-20 | Download | Due to the large data volume and number of distinct elements, space is often the bottleneck of many stream processing systems. The data structures used by these systems often consist of counters whose... |
| Optimal Popularity-based Transmission Range Selection for D2D-supported Content Delivery | Loreto Pescosolido, Andrea Passarella, Marco Conti | 2025-02-20 | Download | Considering device-to-device (D2D) wireless links as a virtual extension of 5G (and beyond) cellular networks to deliver popular contents has been proposed as an interesting approach to reduce energy ... |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Taming and Controlling Performance and Energy Trade-offs Automatically in Network Applications | Han Dong, Yara Awad, Sanjay Arora, Orran Krieger, Jonathan Appavoo | 2025-02-20 | Download | In this paper, we demonstrate that a server running a single latency-sensitive application can be treated as a black box to reduce energy consumption while meeting an SLA target. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention | Shang Yang, Junxian Guo, Haotian Tang, Qinghao Hu, Guangxuan Xiao, Jiaming Tang, Yujun Lin, Zhijian Liu, Yao Lu, Song Han | 2025-02-20 | Download | Large language models (LLMs) have shown remarkable potential in processing long sequences and complex reasoning tasks, yet efficiently serving these models remains challenging due to the quadratic com... |
| Parallelizing a modern GPU simulator | Rodrigo Huerta, Antonio González | 2025-02-20 | Download | Simulators are a primary tool in computer architecture research but are extremely computationally intensive. Simulating modern architectures with increased core counts and recent workloads can be chal... |