Appearance
2025-03-11
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| A Comparison of the Cerebras Wafer-Scale Integration Technology with Nvidia GPU-based Systems for Artificial Intelligence | Yudhishthira Kundu, Manroop Kaur, Tripty Wig, Kriti Kumar, Pushpanjali Kumari, Vivek Puri, Manish Arora | 2025-03-11 | 下载 | Cerebras' wafer-scale engine (WSE) technology merges multiple dies on a single wafer. It addresses the challenges of memory bandwidth, latency, and scalability, making it suitable for artificial intel... |
| ResBench: Benchmarking LLM-Generated FPGA Designs with Resource Awareness | Ce Guo, Tong Zhao | 2025-03-11 | 下载 | Field-Programmable Gate Arrays (FPGAs) are widely used in modern hardware design, yet writing Hardware Description Language (HDL) code for FPGA implementation remains a complex and time-consuming task... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| SIMT/GPU Data Race Verification using ISCC and Intermediary Code Representations: A Case Study | Andrew Osterhout, Ganesh Gopalakrishnan | 2025-03-11 | 下载 | It is often difficult to write code that you can ensure will be executed in the right order when programing for parallel compute tasks. Due to the way that today's parallel compute hardware, primarily... |
| A Parallel and Highly-Portable HPC Poisson Solver: Preconditioned Bi-CGSTAB with alpaka | Luca Pennati, Måns I. Andersson, Klaus Steiniger, Rene Widera, Tapish Narwal, Michael Bussmann, Stefano Markidis | 2025-03-11 | 下载 | This paper presents the design, implementation, and performance analysis of a parallel and GPU-accelerated Poisson solver based on the Preconditioned Bi-Conjugate Gradient Stabilized (Bi-CGSTAB) metho... |
| Cabinet: Dynamically Weighted Consensus Made Fast | Gengrui Zhang, Shiquan Zhang, Michail Bachras, Yuqiu Zhang, Hans-Arno Jacobsen | 2025-03-11 | 下载 | Conventional consensus algorithms, such as Paxos and Raft, encounter inefficiencies when applied to large-scale distributed systems due to the requirement of waiting for replies from a majority of nod... |
| A Comprehensive Experimentation Framework for Energy-Efficient Design of Cloud-Native Applications | Sebastian Werner, Maria C. Borges, Karl Wolf, Stefan Tai | 2025-03-11 | 下载 | Current approaches to designing energy-efficient applications typically rely on measuring individual components using readily available local metrics, like CPU utilization. |
| A Fair and Lightweight Consensus Algorithm for IoT | Sokratis Vavilis, Harris Niavis, Konstantinos Loupos | 2025-03-11 | 下载 | With the rapid growth of hyperconnected devices and decentralized data architectures, safeguarding Internet of Things (IoT) transactions is becoming increasingly challenging. |
| Accelerating MoE Model Inference with Expert Sharding | Oana Balmau, Anne-Marie Kermarrec, Rafael Pires, André Loureiro Espírito Santo, Martijn de Vos, Milos Vujasinovic | 2025-03-11 | 下载 | Mixture of experts (MoE) models achieve state-of-the-art results in language modeling but suffer from inefficient hardware utilization due to imbalanced token routing and communication overhead. |
| FastCache: Optimizing Multimodal LLM Serving through Lightweight KV-Cache Compression Framework | Jianian Zhu, Hang Wu, Haojie Wang, Yinghui Li, Biao Hou, Ruixuan Li, Jidong Zhai | 2025-03-11 | 下载 | Multi-modal Large Language Models (MLLMs) serving systems commonly employ KV-cache compression to reduce memory footprint. However, existing compression methods introduce significant processing overhe... |
| TokenSim: Enabling Hardware and Software Exploration for Large Language Model Inference Systems | Feiyang Wu, Zhuohang Bian, Guoyang Duan, Tianle Xu, Junchi Wu, Teng Ma, Yongqiang Yao, Ruihao Gong, Youwei Zhuo | 2025-03-11 | 下载 | The increasing demand for large language model (LLM) serving has necessitated significant advancements in the optimization and profiling of LLM inference systems. |
| Efficient Query Verification for Blockchain Superlight Clients Using SNARKs | Stefano De Angelis, Ivan Visconti, Andrea Vitaletti, Marco Zecchini | 2025-03-11 | 下载 | Blockchains are among the most powerful technologies to realize decentralized information systems. In order to safely enjoy all guarantees provided by a blockchain, one should maintain a full node, th... |
| Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference | Pol G. Recasens, Ferran Agullo, Yue Zhu, Chen Wang, Eun Kyung Lee, Olivier Tardieu, Jordi Torres, Josep Ll. Berral | 2025-03-11 | 下载 | Large language models have been widely adopted across different tasks, but their auto-regressive generation nature often leads to inefficient resource utilization during inference. |
| SoK: A cloudy view on trust relationships of CVMs -- How Confidential Virtual Machines are falling short in Public Cloud | Jana Eisoldt, Anna Galanou, Andrey Ruzhanskiy, Nils Küchenmeister, Yewgenij Baburkin, Tianxiang Dai, Ivan Gudymenko, Stefan Köpsell, Rüdiger Kapitza | 2025-03-11 | 下载 | Confidential computing in the public cloud intends to safeguard workload privacy while outsourcing infrastructure management to a cloud provider. |
| Will LLMs Scaling Hit the Wall? Breaking Barriers via Distributed Resources on Massive Edge Devices | Tao Shen, Didi Zhu, Ziyu Zhao, Zexi Li, Chao Wu, Fei Wu | 2025-03-11 | 下载 | The remarkable success of foundation models has been driven by scaling laws, demonstrating that model performance improves predictably with increased training data and model size. |
| Detecting Backdoor Attacks in Federated Learning via Direction Alignment Inspection | Jiahao Xu, Zikai Zhang, Rui Hu | 2025-03-11 | 下载 | The distributed nature of training makes Federated Learning (FL) vulnerable to backdoor attacks, where malicious model updates aim to compromise the global model's performance on specific tasks. |
| MFC 5.0: An exascale many-physics flow solver | Benjamin Wilfong, Henry A. Le Berre, Anand Radhakrishnan, Ansh Gupta, Daniel J. Vickers, Diego Vaca-Revelo, Dimitrios Adam, Haocheng Yu, Hyeoksu Lee, Jose Rodolfo Chreim, Mirelys Carcana Barbosa, Yanjun Zhang, Esteban Cisneros-Garibay, Aswin Gnanaskandan, Mauro Rodriguez, Reuben D. Budiardja, Stephen Abbott, Tim Colonius, Spencer H. Bryngelson | 2025-03-11 | 下载 | Many problems of interest in engineering, medicine, and the fundamental sciences rely on high-fidelity flow simulation, making performant computational fluid dynamics solvers a mainstay of the open-so... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| MAREA: A Delay-Aware Multi-time-Scale Radio Resource Orchestrator for 6G O-RAN | Oscar Adamuz-Hinojosa, Lanfranco Zanzi, Vincenzo Sciancalepore, Xavier Costa-Pérez | 2025-03-11 | 下载 | The Open Radio Access Network (O-RAN)-compliant solutions often lack crucial details for implementing effective control loops at various time scales. |
| Hierarchical Multi Agent DRL for Soft Handovers Between Edge Clouds in Open RAN | F. Giarrè, I. A. Meer, M. Masoudi, M. Ozger, C. Cavdar | 2025-03-11 | 下载 | Multi-connectivity (MC) for aerial users via a set of ground access points offers the potential for highly reliable communication. Within an open radio access network (O-RAN) architecture, edge clouds... |
| Efficient Resource Allocation in 5G Massive MIMO-NOMA Networks: Comparative Analysis of SINR-Aware Power Allocation and Spatial Correlation-Based Clustering | Samar Chebbi, Oussama Habachi, Jean-Pierre Cances, Vahid Meghdadi, Essaid Sabir | 2025-03-11 | 下载 | With the evolution of 5G networks, optimizing resource allocation has become crucial to meeting the increasing demand for massive connectivity and high throughput. |
| Integrating Captive Portal Technology into Computer Science Education: A Modular, Hands-On Approach to Infrastructure | Lianting Wang, Marcelo Ponce | 2025-03-11 | 下载 | In this paper, we present an educational project aimed to introduce students to the technology behind Captive Portals infrastructures. For doing this, we developed a series of modules to emphasize eac... |
| Towards Sustainability in 6G and beyond: Challenges and Opportunities of Open RAN | Hamed Ahmadi, Mostafa Rahmani, Swarna Bindu Chetty, Eirini Eleni Tsiropoulou, Huseyin Arslan, Merouane Debbah, Tony Quek | 2025-03-11 | 下载 | The transition to 6G is expected to bring significant advancements, including much higher data rates, enhanced reliability and ultra-low latency compared to previous generations. |
| Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios | Zikang Yuan, Yuechuan Pu, Hongcheng Luo, Fengtian Lang, Cheng Chi, Teng Li, Yingying Shen, Haiyang Sun, Bing Wang, Xin Yang | 2025-03-11 | 下载 | Ensuring the safety of autonomous vehicles necessitates comprehensive simulation of multi-sensor data, encompassing inputs from both cameras and LiDAR sensors, across various dynamic driving scenarios... |
| A systematic literature review of unsupervised learning algorithms for anomalous traffic detection based on flows | Alberto Miguel-Diez, Adrián Campazas-Vega, Claudia Álvarez-Aparicio, Gonzalo Esteban-Costales, Ángel Manuel Guerrero-Higueras | 2025-03-11 | 下载 | The constant increase of devices connected to the Internet, and therefore of cyber-attacks, makes it necessary to analyze network traffic in order to recognize malicious activity. |
| Explainable Autoencoder Design for RSSI-Based Multi-User Beam Probing and Hybrid Precoding | Asmaa Abdallah, Abdulkadir Celik, Ahmed Alkhateeb, Ahmed M. Eltawil | 2025-03-11 | 下载 | This paper introduces a novel neural network (NN) structure referred to as an ``Auto-hybrid precoder'' (Auto-HP) and an unsupervised deep learning (DL) approach that jointly designs \ac{mmWave} probin... |
| Cost-driven prunings for iterative solving of constrained routing problem with SRLG-disjoint protection | P. A. Mosharev, Choon-Meng Lee, Xu Shu, Xiaoshan Zhang, Man-Hong Yung | 2025-03-11 | 下载 | The search for the optimal pair of active and protection paths in a network with Shared Risk Link Groups (SRLG) is a challenging but high-value problem in the industry that is inevitable in ensuring r... |
| LLM4MAC: An LLM-Driven Reinforcement Learning Framework for MAC Protocol Emergence | Renxuan Tan, Rongpeng Li, Zhifeng Zhao | 2025-03-11 | 下载 | With the advent of 6G systems, emerging hyper-connected ecosystems necessitate agile and adaptive medium access control (MAC) protocols to contend with network dynamics and diverse service requirement... |
| Mobility-aware Seamless Service Migration and Resource Allocation in Multi-edge IoV Systems | Zheyi Chen, Sijin Huang, Geyong Min, Zhaolong Ning, Jie Li, Yan Zhang | 2025-03-11 | 下载 | Mobile Edge Computing (MEC) offers low-latency and high-bandwidth support for Internet-of-Vehicles (IoV) applications. However, due to high vehicle mobility and finite communication coverage of base s... |
| Interference Graph Estimation for Resource Allocation in Multi-Cell Multi-Numerology Networks: A Power-Domain Approach | Daqian Ding, Haorui Li, Yibo Pi, Xudong Wang | 2025-03-11 | 下载 | The interference graph, depicting the intra- and inter-cell interference channel gains, is indispensable for resource allocation in multi-cell networks. |
| ALCS: An Adaptive Latency Compensation Scheduler for Multipath TCP in Satellite-Terrestrial Integrated Networks | Lin Wang, Ze Wang, Zeyi Deng, Jingjing Zhang, Yue Gao | 2025-03-11 | 下载 | The Satellite-Terrestrial Integrated Network (STIN) enhances end-to-end transmission by simultaneously utilizing terrestrial and satellite networks, offering significant benefits in scenarios like eme... |
| Accelerating Development in UAV Network Digital Twins with a Flexible Simulation Framework | Md Sharif Hossen, Anil Gurses, Mihail Sichitiu, Ismail Guvenc | 2025-03-11 | 下载 | Unmanned aerial vehicles (UAVs) enhance coverage and provide flexible deployment in 5G and next-generation wireless networks. The performance of such wireless networks can be improved by developing ne... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Meta-Reinforcement Learning with Discrete World Models for Adaptive Load Balancing | Cameron Redovian | 2025-03-11 | 下载 | We integrate a meta-reinforcement learning algorithm with the DreamerV3 architecture to improve load balancing in operating systems. This approach enables rapid adaptation to dynamic workloads with mi... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Investigating Execution-Aware Language Models for Code Optimization | Federico Di Menna, Luca Traini, Gabriele Bavota, Vittorio Cortellessa | 2025-03-11 | 下载 | Code optimization is the process of enhancing code efficiency, while preserving its intended functionality. This process often requires a deep understanding of the code execution behavior at run-time ... |