
2025-02-20

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Micro Blossom: Accelerated Minimum-Weight Perfect Matching Decoding for Quantum Error Correction | Yue Wu, Namitha Liyanage, Lin Zhong | 2025-02-20 | Download | Minimum-Weight Perfect Matching (MWPM) decoding is important to quantum error correction decoding because of its accuracy. However, many believe that it is difficult, if possible at all, to achieve th... |
| Leveraging Error Resilience of Iterative Algorithms for Energy Efficiency: from Concept to Implementation | G. A. Gillani, A. Krapukhin, A. B. J. Kokkeler | 2025-02-20 | Download | Iterative algorithms are widely used in digital signal processing applications. With the case study of radio astronomy calibration processing, this work contributes towards revealing and exploiting th... |
| Parallelizing a modern GPU simulator | Rodrigo Huerta, Antonio González | 2025-02-20 | Download | Simulators are a primary tool in computer architecture research but are extremely computationally intensive. Simulating modern architectures with increased core counts and recent workloads can be chal... |
| DeepRTL: Bridging Verilog Understanding and Generation with a Unified Representation Model | Yi Liu, Changran Xu, Yunhao Zhou, Zeju Li, Qiang Xu | 2025-02-20 | Download | Recent advancements in large language models (LLMs) have shown significant potential for automating hardware description language (HDL) code generation from high-level natural language instructions. |
| μRL: Discovering Transient Execution Vulnerabilities Using Reinforcement Learning | M. Caner Tol, Kemal Derya, Berk Sunar | 2025-02-20 | Download | We propose using reinforcement learning to address the challenges of discovering microarchitectural vulnerabilities, such as Spectre and Meltdown, which exploit subtle interactions in modern processor... |
| NDPage: Efficient Address Translation for Near-Data Processing Architectures via Tailored Page Table | Qingcai Jiang, Buxin Tu, Hong An | 2025-02-20 | Download | Near-Data Processing (NDP) has been a promising architectural paradigm to address the memory wall problem for data-intensive applications. Practical implementation of NDP architectures calls for syste... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention | Shang Yang, Junxian Guo, Haotian Tang, Qinghao Hu, Guangxuan Xiao, Jiaming Tang, Yujun Lin, Zhijian Liu, Yao Lu, Song Han | 2025-02-20 | Download | Large language models (LLMs) have shown remarkable potential in processing long sequences and complex reasoning tasks, yet efficiently serving these models remains challenging due to the quadratic com... |
| MadVoro: Parallel Construction of Voronoi Diagrams in Distributed Memory Systems | Maor Mizrachi, Barak Raveh, Elad Steinberg | 2025-02-20 | Download | Voronoi diagrams are essential geometrical structures with numerous applications, particularly astrophysics-driven finite volume methods. While serial algorithms for constructing these entities are we... |
| Byzantine Game Theory: Sun Tzu's Boxes | Andrei Constantinescu, Roger Wattenhofer | 2025-02-20 | Download | We introduce the Byzantine Selection Problem, living at the intersection of game theory and fault-tolerant distributed computing. Here, an event organizer is presented with a group of n agents, and ... |
| Parallelizing a modern GPU simulator | Rodrigo Huerta, Antonio González | 2025-02-20 | Download | Simulators are a primary tool in computer architecture research but are extremely computationally intensive. Simulating modern architectures with increased core counts and recent workloads can be chal... |
| SageServe: Optimizing LLM Serving on Cloud Data Centers with Forecast Aware Auto-Scaling | Shashwat Jaiswal, Kunal Jain, Yogesh Simmhan, Anjaly Parayil, Ankur Mallick, Rujia Wang, Renee St. Amant, Chetan Bansal, Victor Rühle, Anoop Kulkarni, Steve Kofsky, Saravan Rajmohan | 2025-02-20 | Download | Global cloud service providers handle inference workloads for Large Language Models (LLMs) that span latency-sensitive (e.g., chatbots) and insensitive (e.g. |
| madupite: A High-Performance Distributed Solver for Large-Scale Markov Decision Processes | Matilde Gargiani, Robin Sieber, Philip Pawlowsky, Václav Hapla, John Lygeros | 2025-02-20 | Download | This paper introduces madupite, a high-performance distributed solver for large-scale Markov Decision Processes (MDPs). MDPs are widely used to model complex dynamical systems in various fields, inclu... |
| LLM4FaaS: No-Code Application Development using LLMs and FaaS | Minghe Wang, Tobias Pfandzelter, Trever Schirmer, David Bermbach | 2025-02-20 | Download | Large language models (LLMs) show great capabilities in generating code from natural language descriptions, bringing programming power closer to non-technical users. |
| Optimizing the Longhorn Cloud-native Software Defined Storage Engine for High Performance | Konstantinos Kampadais, Antony Chazapis, Angelos Bilas | 2025-02-20 | Download | Longhorn is an open-source, cloud-native software-defined storage (SDS) engine that delivers distributed block storage management in Kubernetes environments. |
| A Parallel Hierarchical Approach for Community Detection on Large-scale Dynamic Networks | Grigoriy Bokov, Aleksandr Konovalov, Anna Uporova, Stanislav Moiseev, Ivan Safonov, Alexander Radionov | 2025-02-20 | Download | In this paper, we propose a novel parallel hierarchical Leiden-based algorithm for dynamic community detection. The algorithm, for a given batch update of edge insertions and deletions, partitions the... |
| It Takes Two to Tango: Serverless Workflow Serving via Bilaterally Engaged Resource Adaptation | Jing Wu, Lin Wang, Quanfeng Deng, Chen Yu, Dong Zhang, Bingheng Yan, Fangming Liu | 2025-02-20 | Download | Serverless platforms typically adopt an early-binding approach for function sizing, requiring developers to specify an immutable size for each function within a workflow beforehand. |
| Blockchain-based Framework for Scalable and Incentivized Federated Learning | Bijun Wu, Oshani Seneviratne | 2025-02-20 | Download | Federated Learning (FL) enables collaborative model training without sharing raw data, preserving privacy while harnessing distributed datasets. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| NeSt-VR: An Adaptive Bitrate Algorithm for Virtual Reality Streaming over Wi-Fi | Miguel Casasnovas, Ferran Maura, Isjtar Vandebroeck, Haryo Sukmawanto, Eric Joris, Boris Bellalta | 2025-02-20 | Download | Real-time interactive Virtual Reality (VR) streaming is a significantly challenging use case for Wi-Fi given its high throughput and low latency requirements, especially considering the constraints im... |
| Online Resource Management for the Uplink of Wideband Hybrid Beamforming System | Yuan Quan, Haseen Rahman, Catherine Rosenberg | 2025-02-20 | Download | This paper studies the radio resource management (RRM) for the uplink (UL) of a cellular system with codebook-based hybrid beamforming. We consider the often neglected but highly practical multi-chann... |
| Tracking and Assigning Jobs to a Markov Machine | Subhankar Banerjee, Sennur Ulukus | 2025-02-20 | Download | We consider a time-slotted communication system with a machine, a cloud server, and a sampler. Job requests from the users are queued on the server to be completed by the machine. |
| A Survey of Internet Censorship and its Measurement: Methodology, Trends, and Challenges | Steffen Wendzel, Simon Volpert, Sebastian Zillien, Julia Lenz, Philip Rünz, Luca Caviglione | 2025-02-20 | Download | Internet censorship limits the access of nodes residing within a specific network environment to the public Internet, and vice versa. During the last decade, techniques for conducting Internet censors... |
| Reinforcement Learning with Graph Attention for Routing and Wavelength Assignment with Lightpath Reuse | Michael Doherty, Alejandra Beghelli | 2025-02-20 | Download | Many works have investigated reinforcement learning (RL) for routing and spectrum assignment on flex-grid networks but only one work to date has examined RL for fixed-grid with flex-rate transponders,... |
| Counter Pools: Counter Representation for Efficient Stream Processing | Ran Ben Basat, Gil Einziger, Bilal Tyah, Shay Vargaftik | 2025-02-20 | Download | Due to the large data volume and number of distinct elements, space is often the bottleneck of many stream processing systems. The data structures used by these systems often consist of counters whose... |
| Optimal Popularity-based Transmission Range Selection for D2D-supported Content Delivery | Loreto Pescosolido, Andrea Passarella, Marco Conti | 2025-02-20 | Download | Considering device-to-device (D2D) wireless links as a virtual extension of 5G (and beyond) cellular networks to deliver popular contents has been proposed as an interesting approach to reduce energy ... |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Taming and Controlling Performance and Energy Trade-offs Automatically in Network Applications | Han Dong, Yara Awad, Sanjay Arora, Orran Krieger, Jonathan Appavoo | 2025-02-20 | Download | In this paper, we demonstrate that a server running a single latency-sensitive application can be treated as a black box to reduce energy consumption while meeting an SLA target. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention | Shang Yang, Junxian Guo, Haotian Tang, Qinghao Hu, Guangxuan Xiao, Jiaming Tang, Yujun Lin, Zhijian Liu, Yao Lu, Song Han | 2025-02-20 | Download | Large language models (LLMs) have shown remarkable potential in processing long sequences and complex reasoning tasks, yet efficiently serving these models remains challenging due to the quadratic com... |
| Parallelizing a modern GPU simulator | Rodrigo Huerta, Antonio González | 2025-02-20 | Download | Simulators are a primary tool in computer architecture research but are extremely computationally intensive. Simulating modern architectures with increased core counts and recent workloads can be chal... |
