
2025-03-11

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A Comparison of the Cerebras Wafer-Scale Integration Technology with Nvidia GPU-based Systems for Artificial Intelligence | Yudhishthira Kundu, Manroop Kaur, Tripty Wig, Kriti Kumar, Pushpanjali Kumari, Vivek Puri, Manish Arora | 2025-03-11 | Download | Cerebras' wafer-scale engine (WSE) technology merges multiple dies on a single wafer. It addresses the challenges of memory bandwidth, latency, and scalability, making it suitable for artificial intel... |
| ResBench: Benchmarking LLM-Generated FPGA Designs with Resource Awareness | Ce Guo, Tong Zhao | 2025-03-11 | Download | Field-Programmable Gate Arrays (FPGAs) are widely used in modern hardware design, yet writing Hardware Description Language (HDL) code for FPGA implementation remains a complex and time-consuming task... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| SIMT/GPU Data Race Verification using ISCC and Intermediary Code Representations: A Case Study | Andrew Osterhout, Ganesh Gopalakrishnan | 2025-03-11 | Download | It is often difficult to write code that you can ensure will be executed in the right order when programming for parallel compute tasks. Due to the way that today's parallel compute hardware, primarily... |
| A Parallel and Highly-Portable HPC Poisson Solver: Preconditioned Bi-CGSTAB with alpaka | Luca Pennati, Måns I. Andersson, Klaus Steiniger, Rene Widera, Tapish Narwal, Michael Bussmann, Stefano Markidis | 2025-03-11 | Download | This paper presents the design, implementation, and performance analysis of a parallel and GPU-accelerated Poisson solver based on the Preconditioned Bi-Conjugate Gradient Stabilized (Bi-CGSTAB) metho... |
| Cabinet: Dynamically Weighted Consensus Made Fast | Gengrui Zhang, Shiquan Zhang, Michail Bachras, Yuqiu Zhang, Hans-Arno Jacobsen | 2025-03-11 | Download | Conventional consensus algorithms, such as Paxos and Raft, encounter inefficiencies when applied to large-scale distributed systems due to the requirement of waiting for replies from a majority of nod... |
| A Comprehensive Experimentation Framework for Energy-Efficient Design of Cloud-Native Applications | Sebastian Werner, Maria C. Borges, Karl Wolf, Stefan Tai | 2025-03-11 | Download | Current approaches to designing energy-efficient applications typically rely on measuring individual components using readily available local metrics, like CPU utilization. |
| A Fair and Lightweight Consensus Algorithm for IoT | Sokratis Vavilis, Harris Niavis, Konstantinos Loupos | 2025-03-11 | Download | With the rapid growth of hyperconnected devices and decentralized data architectures, safeguarding Internet of Things (IoT) transactions is becoming increasingly challenging. |
| Accelerating MoE Model Inference with Expert Sharding | Oana Balmau, Anne-Marie Kermarrec, Rafael Pires, André Loureiro Espírito Santo, Martijn de Vos, Milos Vujasinovic | 2025-03-11 | Download | Mixture of experts (MoE) models achieve state-of-the-art results in language modeling but suffer from inefficient hardware utilization due to imbalanced token routing and communication overhead. |
| FastCache: Optimizing Multimodal LLM Serving through Lightweight KV-Cache Compression Framework | Jianian Zhu, Hang Wu, Haojie Wang, Yinghui Li, Biao Hou, Ruixuan Li, Jidong Zhai | 2025-03-11 | Download | Multi-modal Large Language Models (MLLMs) serving systems commonly employ KV-cache compression to reduce memory footprint. However, existing compression methods introduce significant processing overhe... |
| TokenSim: Enabling Hardware and Software Exploration for Large Language Model Inference Systems | Feiyang Wu, Zhuohang Bian, Guoyang Duan, Tianle Xu, Junchi Wu, Teng Ma, Yongqiang Yao, Ruihao Gong, Youwei Zhuo | 2025-03-11 | Download | The increasing demand for large language model (LLM) serving has necessitated significant advancements in the optimization and profiling of LLM inference systems. |
| Efficient Query Verification for Blockchain Superlight Clients Using SNARKs | Stefano De Angelis, Ivan Visconti, Andrea Vitaletti, Marco Zecchini | 2025-03-11 | Download | Blockchains are among the most powerful technologies to realize decentralized information systems. In order to safely enjoy all guarantees provided by a blockchain, one should maintain a full node, th... |
| Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference | Pol G. Recasens, Ferran Agullo, Yue Zhu, Chen Wang, Eun Kyung Lee, Olivier Tardieu, Jordi Torres, Josep Ll. Berral | 2025-03-11 | Download | Large language models have been widely adopted across different tasks, but their auto-regressive generation nature often leads to inefficient resource utilization during inference. |
| SoK: A cloudy view on trust relationships of CVMs -- How Confidential Virtual Machines are falling short in Public Cloud | Jana Eisoldt, Anna Galanou, Andrey Ruzhanskiy, Nils Küchenmeister, Yewgenij Baburkin, Tianxiang Dai, Ivan Gudymenko, Stefan Köpsell, Rüdiger Kapitza | 2025-03-11 | Download | Confidential computing in the public cloud intends to safeguard workload privacy while outsourcing infrastructure management to a cloud provider. |
| Will LLMs Scaling Hit the Wall? Breaking Barriers via Distributed Resources on Massive Edge Devices | Tao Shen, Didi Zhu, Ziyu Zhao, Zexi Li, Chao Wu, Fei Wu | 2025-03-11 | Download | The remarkable success of foundation models has been driven by scaling laws, demonstrating that model performance improves predictably with increased training data and model size. |
| Detecting Backdoor Attacks in Federated Learning via Direction Alignment Inspection | Jiahao Xu, Zikai Zhang, Rui Hu | 2025-03-11 | Download | The distributed nature of training makes Federated Learning (FL) vulnerable to backdoor attacks, where malicious model updates aim to compromise the global model's performance on specific tasks. |
| MFC 5.0: An exascale many-physics flow solver | Benjamin Wilfong, Henry A. Le Berre, Anand Radhakrishnan, Ansh Gupta, Daniel J. Vickers, Diego Vaca-Revelo, Dimitrios Adam, Haocheng Yu, Hyeoksu Lee, Jose Rodolfo Chreim, Mirelys Carcana Barbosa, Yanjun Zhang, Esteban Cisneros-Garibay, Aswin Gnanaskandan, Mauro Rodriguez, Reuben D. Budiardja, Stephen Abbott, Tim Colonius, Spencer H. Bryngelson | 2025-03-11 | Download | Many problems of interest in engineering, medicine, and the fundamental sciences rely on high-fidelity flow simulation, making performant computational fluid dynamics solvers a mainstay of the open-so... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| MAREA: A Delay-Aware Multi-time-Scale Radio Resource Orchestrator for 6G O-RAN | Oscar Adamuz-Hinojosa, Lanfranco Zanzi, Vincenzo Sciancalepore, Xavier Costa-Pérez | 2025-03-11 | Download | The Open Radio Access Network (O-RAN)-compliant solutions often lack crucial details for implementing effective control loops at various time scales. |
| Hierarchical Multi Agent DRL for Soft Handovers Between Edge Clouds in Open RAN | F. Giarrè, I. A. Meer, M. Masoudi, M. Ozger, C. Cavdar | 2025-03-11 | Download | Multi-connectivity (MC) for aerial users via a set of ground access points offers the potential for highly reliable communication. Within an open radio access network (O-RAN) architecture, edge clouds... |
| Efficient Resource Allocation in 5G Massive MIMO-NOMA Networks: Comparative Analysis of SINR-Aware Power Allocation and Spatial Correlation-Based Clustering | Samar Chebbi, Oussama Habachi, Jean-Pierre Cances, Vahid Meghdadi, Essaid Sabir | 2025-03-11 | Download | With the evolution of 5G networks, optimizing resource allocation has become crucial to meeting the increasing demand for massive connectivity and high throughput. |
| Integrating Captive Portal Technology into Computer Science Education: A Modular, Hands-On Approach to Infrastructure | Lianting Wang, Marcelo Ponce | 2025-03-11 | Download | In this paper, we present an educational project aimed to introduce students to the technology behind Captive Portals infrastructures. For doing this, we developed a series of modules to emphasize eac... |
| Towards Sustainability in 6G and beyond: Challenges and Opportunities of Open RAN | Hamed Ahmadi, Mostafa Rahmani, Swarna Bindu Chetty, Eirini Eleni Tsiropoulou, Huseyin Arslan, Merouane Debbah, Tony Quek | 2025-03-11 | Download | The transition to 6G is expected to bring significant advancements, including much higher data rates, enhanced reliability and ultra-low latency compared to previous generations. |
| Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios | Zikang Yuan, Yuechuan Pu, Hongcheng Luo, Fengtian Lang, Cheng Chi, Teng Li, Yingying Shen, Haiyang Sun, Bing Wang, Xin Yang | 2025-03-11 | Download | Ensuring the safety of autonomous vehicles necessitates comprehensive simulation of multi-sensor data, encompassing inputs from both cameras and LiDAR sensors, across various dynamic driving scenarios... |
| A systematic literature review of unsupervised learning algorithms for anomalous traffic detection based on flows | Alberto Miguel-Diez, Adrián Campazas-Vega, Claudia Álvarez-Aparicio, Gonzalo Esteban-Costales, Ángel Manuel Guerrero-Higueras | 2025-03-11 | Download | The constant increase of devices connected to the Internet, and therefore of cyber-attacks, makes it necessary to analyze network traffic in order to recognize malicious activity. |
| Explainable Autoencoder Design for RSSI-Based Multi-User Beam Probing and Hybrid Precoding | Asmaa Abdallah, Abdulkadir Celik, Ahmed Alkhateeb, Ahmed M. Eltawil | 2025-03-11 | Download | This paper introduces a novel neural network (NN) structure referred to as an "Auto-hybrid precoder" (Auto-HP) and an unsupervised deep learning (DL) approach that jointly designs mmWave probin... |
| Cost-driven prunings for iterative solving of constrained routing problem with SRLG-disjoint protection | P. A. Mosharev, Choon-Meng Lee, Xu Shu, Xiaoshan Zhang, Man-Hong Yung | 2025-03-11 | Download | The search for the optimal pair of active and protection paths in a network with Shared Risk Link Groups (SRLG) is a challenging but high-value problem in the industry that is inevitable in ensuring r... |
| LLM4MAC: An LLM-Driven Reinforcement Learning Framework for MAC Protocol Emergence | Renxuan Tan, Rongpeng Li, Zhifeng Zhao | 2025-03-11 | Download | With the advent of 6G systems, emerging hyper-connected ecosystems necessitate agile and adaptive medium access control (MAC) protocols to contend with network dynamics and diverse service requirement... |
| Mobility-aware Seamless Service Migration and Resource Allocation in Multi-edge IoV Systems | Zheyi Chen, Sijin Huang, Geyong Min, Zhaolong Ning, Jie Li, Yan Zhang | 2025-03-11 | Download | Mobile Edge Computing (MEC) offers low-latency and high-bandwidth support for Internet-of-Vehicles (IoV) applications. However, due to high vehicle mobility and finite communication coverage of base s... |
| Interference Graph Estimation for Resource Allocation in Multi-Cell Multi-Numerology Networks: A Power-Domain Approach | Daqian Ding, Haorui Li, Yibo Pi, Xudong Wang | 2025-03-11 | Download | The interference graph, depicting the intra- and inter-cell interference channel gains, is indispensable for resource allocation in multi-cell networks. |
| ALCS: An Adaptive Latency Compensation Scheduler for Multipath TCP in Satellite-Terrestrial Integrated Networks | Lin Wang, Ze Wang, Zeyi Deng, Jingjing Zhang, Yue Gao | 2025-03-11 | Download | The Satellite-Terrestrial Integrated Network (STIN) enhances end-to-end transmission by simultaneously utilizing terrestrial and satellite networks, offering significant benefits in scenarios like eme... |
| Accelerating Development in UAV Network Digital Twins with a Flexible Simulation Framework | Md Sharif Hossen, Anil Gurses, Mihail Sichitiu, Ismail Guvenc | 2025-03-11 | Download | Unmanned aerial vehicles (UAVs) enhance coverage and provide flexible deployment in 5G and next-generation wireless networks. The performance of such wireless networks can be improved by developing ne... |

cs.OS - Operating Systems

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Meta-Reinforcement Learning with Discrete World Models for Adaptive Load Balancing | Cameron Redovian | 2025-03-11 | Download | We integrate a meta-reinforcement learning algorithm with the DreamerV3 architecture to improve load balancing in operating systems. This approach enables rapid adaptation to dynamic workloads with mi... |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Investigating Execution-Aware Language Models for Code Optimization | Federico Di Menna, Luca Traini, Gabriele Bavota, Vittorio Cortellessa | 2025-03-11 | Download | Code optimization is the process of enhancing code efficiency, while preserving its intended functionality. This process often requires a deep understanding of the code execution behavior at run-time ... |
