2025-03-11

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
A Comparison of the Cerebras Wafer-Scale Integration Technology with Nvidia GPU-based Systems for Artificial Intelligence	Yudhishthira Kundu, Manroop Kaur, Tripty Wig, Kriti Kumar, Pushpanjali Kumari, Vivek Puri, Manish Arora	2025-03-11	下载	Cerebras' wafer-scale engine (WSE) technology merges multiple dies on a single wafer. It addresses the challenges of memory bandwidth, latency, and scalability, making it suitable for artificial intel...
ResBench: Benchmarking LLM-Generated FPGA Designs with Resource Awareness	Ce Guo, Tong Zhao	2025-03-11	下载	Field-Programmable Gate Arrays (FPGAs) are widely used in modern hardware design, yet writing Hardware Description Language (HDL) code for FPGA implementation remains a complex and time-consuming task...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
SIMT/GPU Data Race Verification using ISCC and Intermediary Code Representations: A Case Study	Andrew Osterhout, Ganesh Gopalakrishnan	2025-03-11	下载	It is often difficult to write code that you can ensure will be executed in the right order when programing for parallel compute tasks. Due to the way that today's parallel compute hardware, primarily...
A Parallel and Highly-Portable HPC Poisson Solver: Preconditioned Bi-CGSTAB with alpaka	Luca Pennati, Måns I. Andersson, Klaus Steiniger, Rene Widera, Tapish Narwal, Michael Bussmann, Stefano Markidis	2025-03-11	下载	This paper presents the design, implementation, and performance analysis of a parallel and GPU-accelerated Poisson solver based on the Preconditioned Bi-Conjugate Gradient Stabilized (Bi-CGSTAB) metho...
Cabinet: Dynamically Weighted Consensus Made Fast	Gengrui Zhang, Shiquan Zhang, Michail Bachras, Yuqiu Zhang, Hans-Arno Jacobsen	2025-03-11	下载	Conventional consensus algorithms, such as Paxos and Raft, encounter inefficiencies when applied to large-scale distributed systems due to the requirement of waiting for replies from a majority of nod...
A Comprehensive Experimentation Framework for Energy-Efficient Design of Cloud-Native Applications	Sebastian Werner, Maria C. Borges, Karl Wolf, Stefan Tai	2025-03-11	下载	Current approaches to designing energy-efficient applications typically rely on measuring individual components using readily available local metrics, like CPU utilization.
A Fair and Lightweight Consensus Algorithm for IoT	Sokratis Vavilis, Harris Niavis, Konstantinos Loupos	2025-03-11	下载	With the rapid growth of hyperconnected devices and decentralized data architectures, safeguarding Internet of Things (IoT) transactions is becoming increasingly challenging.
Accelerating MoE Model Inference with Expert Sharding	Oana Balmau, Anne-Marie Kermarrec, Rafael Pires, André Loureiro Espírito Santo, Martijn de Vos, Milos Vujasinovic	2025-03-11	下载	Mixture of experts (MoE) models achieve state-of-the-art results in language modeling but suffer from inefficient hardware utilization due to imbalanced token routing and communication overhead.
FastCache: Optimizing Multimodal LLM Serving through Lightweight KV-Cache Compression Framework	Jianian Zhu, Hang Wu, Haojie Wang, Yinghui Li, Biao Hou, Ruixuan Li, Jidong Zhai	2025-03-11	下载	Multi-modal Large Language Models (MLLMs) serving systems commonly employ KV-cache compression to reduce memory footprint. However, existing compression methods introduce significant processing overhe...
TokenSim: Enabling Hardware and Software Exploration for Large Language Model Inference Systems	Feiyang Wu, Zhuohang Bian, Guoyang Duan, Tianle Xu, Junchi Wu, Teng Ma, Yongqiang Yao, Ruihao Gong, Youwei Zhuo	2025-03-11	下载	The increasing demand for large language model (LLM) serving has necessitated significant advancements in the optimization and profiling of LLM inference systems.
Efficient Query Verification for Blockchain Superlight Clients Using SNARKs	Stefano De Angelis, Ivan Visconti, Andrea Vitaletti, Marco Zecchini	2025-03-11	下载	Blockchains are among the most powerful technologies to realize decentralized information systems. In order to safely enjoy all guarantees provided by a blockchain, one should maintain a full node, th...
Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference	Pol G. Recasens, Ferran Agullo, Yue Zhu, Chen Wang, Eun Kyung Lee, Olivier Tardieu, Jordi Torres, Josep Ll. Berral	2025-03-11	下载	Large language models have been widely adopted across different tasks, but their auto-regressive generation nature often leads to inefficient resource utilization during inference.
SoK: A cloudy view on trust relationships of CVMs -- How Confidential Virtual Machines are falling short in Public Cloud	Jana Eisoldt, Anna Galanou, Andrey Ruzhanskiy, Nils Küchenmeister, Yewgenij Baburkin, Tianxiang Dai, Ivan Gudymenko, Stefan Köpsell, Rüdiger Kapitza	2025-03-11	下载	Confidential computing in the public cloud intends to safeguard workload privacy while outsourcing infrastructure management to a cloud provider.
Will LLMs Scaling Hit the Wall? Breaking Barriers via Distributed Resources on Massive Edge Devices	Tao Shen, Didi Zhu, Ziyu Zhao, Zexi Li, Chao Wu, Fei Wu	2025-03-11	下载	The remarkable success of foundation models has been driven by scaling laws, demonstrating that model performance improves predictably with increased training data and model size.
Detecting Backdoor Attacks in Federated Learning via Direction Alignment Inspection	Jiahao Xu, Zikai Zhang, Rui Hu	2025-03-11	下载	The distributed nature of training makes Federated Learning (FL) vulnerable to backdoor attacks, where malicious model updates aim to compromise the global model's performance on specific tasks.
MFC 5.0: An exascale many-physics flow solver	Benjamin Wilfong, Henry A. Le Berre, Anand Radhakrishnan, Ansh Gupta, Daniel J. Vickers, Diego Vaca-Revelo, Dimitrios Adam, Haocheng Yu, Hyeoksu Lee, Jose Rodolfo Chreim, Mirelys Carcana Barbosa, Yanjun Zhang, Esteban Cisneros-Garibay, Aswin Gnanaskandan, Mauro Rodriguez, Reuben D. Budiardja, Stephen Abbott, Tim Colonius, Spencer H. Bryngelson	2025-03-11	下载	Many problems of interest in engineering, medicine, and the fundamental sciences rely on high-fidelity flow simulation, making performant computational fluid dynamics solvers a mainstay of the open-so...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
MAREA: A Delay-Aware Multi-time-Scale Radio Resource Orchestrator for 6G O-RAN	Oscar Adamuz-Hinojosa, Lanfranco Zanzi, Vincenzo Sciancalepore, Xavier Costa-Pérez	2025-03-11	下载	The Open Radio Access Network (O-RAN)-compliant solutions often lack crucial details for implementing effective control loops at various time scales.
Hierarchical Multi Agent DRL for Soft Handovers Between Edge Clouds in Open RAN	F. Giarrè, I. A. Meer, M. Masoudi, M. Ozger, C. Cavdar	2025-03-11	下载	Multi-connectivity (MC) for aerial users via a set of ground access points offers the potential for highly reliable communication. Within an open radio access network (O-RAN) architecture, edge clouds...
Efficient Resource Allocation in 5G Massive MIMO-NOMA Networks: Comparative Analysis of SINR-Aware Power Allocation and Spatial Correlation-Based Clustering	Samar Chebbi, Oussama Habachi, Jean-Pierre Cances, Vahid Meghdadi, Essaid Sabir	2025-03-11	下载	With the evolution of 5G networks, optimizing resource allocation has become crucial to meeting the increasing demand for massive connectivity and high throughput.
Integrating Captive Portal Technology into Computer Science Education: A Modular, Hands-On Approach to Infrastructure	Lianting Wang, Marcelo Ponce	2025-03-11	下载	In this paper, we present an educational project aimed to introduce students to the technology behind Captive Portals infrastructures. For doing this, we developed a series of modules to emphasize eac...
Towards Sustainability in 6G and beyond: Challenges and Opportunities of Open RAN	Hamed Ahmadi, Mostafa Rahmani, Swarna Bindu Chetty, Eirini Eleni Tsiropoulou, Huseyin Arslan, Merouane Debbah, Tony Quek	2025-03-11	下载	The transition to 6G is expected to bring significant advancements, including much higher data rates, enhanced reliability and ultra-low latency compared to previous generations.
Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios	Zikang Yuan, Yuechuan Pu, Hongcheng Luo, Fengtian Lang, Cheng Chi, Teng Li, Yingying Shen, Haiyang Sun, Bing Wang, Xin Yang	2025-03-11	下载	Ensuring the safety of autonomous vehicles necessitates comprehensive simulation of multi-sensor data, encompassing inputs from both cameras and LiDAR sensors, across various dynamic driving scenarios...
A systematic literature review of unsupervised learning algorithms for anomalous traffic detection based on flows	Alberto Miguel-Diez, Adrián Campazas-Vega, Claudia Álvarez-Aparicio, Gonzalo Esteban-Costales, Ángel Manuel Guerrero-Higueras	2025-03-11	下载	The constant increase of devices connected to the Internet, and therefore of cyber-attacks, makes it necessary to analyze network traffic in order to recognize malicious activity.
Explainable Autoencoder Design for RSSI-Based Multi-User Beam Probing and Hybrid Precoding	Asmaa Abdallah, Abdulkadir Celik, Ahmed Alkhateeb, Ahmed M. Eltawil	2025-03-11	下载	This paper introduces a novel neural network (NN) structure referred to as an ``Auto-hybrid precoder'' (Auto-HP) and an unsupervised deep learning (DL) approach that jointly designs \ac{mmWave} probin...
Cost-driven prunings for iterative solving of constrained routing problem with SRLG-disjoint protection	P. A. Mosharev, Choon-Meng Lee, Xu Shu, Xiaoshan Zhang, Man-Hong Yung	2025-03-11	下载	The search for the optimal pair of active and protection paths in a network with Shared Risk Link Groups (SRLG) is a challenging but high-value problem in the industry that is inevitable in ensuring r...
LLM4MAC: An LLM-Driven Reinforcement Learning Framework for MAC Protocol Emergence	Renxuan Tan, Rongpeng Li, Zhifeng Zhao	2025-03-11	下载	With the advent of 6G systems, emerging hyper-connected ecosystems necessitate agile and adaptive medium access control (MAC) protocols to contend with network dynamics and diverse service requirement...
Mobility-aware Seamless Service Migration and Resource Allocation in Multi-edge IoV Systems	Zheyi Chen, Sijin Huang, Geyong Min, Zhaolong Ning, Jie Li, Yan Zhang	2025-03-11	下载	Mobile Edge Computing (MEC) offers low-latency and high-bandwidth support for Internet-of-Vehicles (IoV) applications. However, due to high vehicle mobility and finite communication coverage of base s...
Interference Graph Estimation for Resource Allocation in Multi-Cell Multi-Numerology Networks: A Power-Domain Approach	Daqian Ding, Haorui Li, Yibo Pi, Xudong Wang	2025-03-11	下载	The interference graph, depicting the intra- and inter-cell interference channel gains, is indispensable for resource allocation in multi-cell networks.
ALCS: An Adaptive Latency Compensation Scheduler for Multipath TCP in Satellite-Terrestrial Integrated Networks	Lin Wang, Ze Wang, Zeyi Deng, Jingjing Zhang, Yue Gao	2025-03-11	下载	The Satellite-Terrestrial Integrated Network (STIN) enhances end-to-end transmission by simultaneously utilizing terrestrial and satellite networks, offering significant benefits in scenarios like eme...
Accelerating Development in UAV Network Digital Twins with a Flexible Simulation Framework	Md Sharif Hossen, Anil Gurses, Mihail Sichitiu, Ismail Guvenc	2025-03-11	下载	Unmanned aerial vehicles (UAVs) enhance coverage and provide flexible deployment in 5G and next-generation wireless networks. The performance of such wireless networks can be improved by developing ne...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Meta-Reinforcement Learning with Discrete World Models for Adaptive Load Balancing	Cameron Redovian	2025-03-11	下载	We integrate a meta-reinforcement learning algorithm with the DreamerV3 architecture to improve load balancing in operating systems. This approach enables rapid adaptation to dynamic workloads with mi...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Investigating Execution-Aware Language Models for Code Optimization	Federico Di Menna, Luca Traini, Gabriele Bavota, Vittorio Cortellessa	2025-03-11	下载	Code optimization is the process of enhancing code efficiency, while preserving its intended functionality. This process often requires a deep understanding of the code execution behavior at run-time ...