2025-02-04

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
LLM-USO: Large Language Model-based Universal Sizing Optimizer	Karthik Somayaji N. S, Peng Li	2025-02-04	Download	The design of analog circuits is a cornerstone of integrated circuit (IC) development, requiring the optimization of complex, interconnected sub-structures such as amplifiers, comparators, and buffers...
FPGA Innovation Research in the Netherlands: Present Landscape and Future Outlook	Nikolaos Alachiotis, Sjoerd van den Belt, Steven van der Vlugt, Reinier van der Walle, Mohsen Safari, Bruno Endres Forlin, Tiziano De Matteis, Zaid Al-Ars, Roel Jordans, António J. Sousa de Almeida, Federico Corradi, Christiaan Baaij, Ana-Lucia Varbanescu	2025-02-04	Download	FPGAs have transformed digital design by enabling versatile and customizable solutions that balance performance and power efficiency, yielding them essential for today's diverse computing challenges.
Random Adaptive Cache Placement Policy	Vrushank Ahire, Pranav Menon, Aniruddh Muley, Abhinandan S. Prasad	2025-02-04	Download	This paper presents a new hybrid cache replacement algorithm that combines random allocation with a modified V-Way cache implementation. Our RAC adapts to complex cache access patterns and optimizes c...
Towards Efficient LUT-based PIM: A Scalable and Low-Power Approach for Modern Workloads	Bahareh Khabbazan, Marc Riera, Antonio González	2025-02-04	Download	Data movement in memory-intensive workloads, such as deep learning, incurs energy costs that are over three orders of magnitude higher than the cost of computation.
Hardware and software build flow with SoCMake	Risto Pejašinović, Alessandro Caratelli, Anvesh Nookala, Benoît Walter Denkinger, Marco Andorno	2025-02-04	Download	The increasing demand for electronics is driving shorter development cycles for application-specific integrated circuits (ASICs). To meet these constraints, hardware designers emphasize reusability an...

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Federated Low-Rank Tensor Estimation for Multimodal Image Reconstruction	Anh Van Nguyen, Diego Klabjan, Minseok Ryu, Kibaek Kim, Zichao Di	2025-02-04	Download	Low-rank tensor estimation offers a powerful approach to addressing high-dimensional data challenges and can substantially improve solutions to ill-posed inverse problems, such as image reconstruction...
Gradient Correction in Federated Learning with Adaptive Optimization	Evan Chen, Shiqiang Wang, Jianing Zhang, Dong-Jun Han, Chaoyue Liu, Christopher Brinton	2025-02-04	Download	In federated learning (FL), model training performance is strongly impacted by data heterogeneity across clients. Client-drift compensation methods have recently emerged as a solution to this issue, i...
Hecate: Unlocking Efficient Sparse Model Training via Fully Sharded Sparse Data Parallelism	Yuhao Qing, Guichao Zhu, Fanxin Li, Lintian Lei, Zekai Sun, Xiuxian Guan, Shixiong Zhao, Xusheng Chen, Dong Huang, Sen Wang, Heming Cui	2025-02-04	Download	Mixture-of-Experts (MoE) has emerged as a promising sparse paradigm for scaling up pre-trained models (PTMs) with remarkable cost-effectiveness.
H-MBR: Hypervisor-level Memory Bandwidth Reservation for Mixed Criticality Systems	Afonso Oliveira, Diogo Costa, Gonçalo Moreira, José Martins, Sandro Pinto	2025-02-04	Download	Recent advancements in fields such as automotive and aerospace have driven a growing demand for robust computational resources. Applications that were once designed for basic MCUs are now deployed on ...
LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models	Tzu-Tao Chang, Shivaram Venkataraman	2025-02-04	Download	Cross-attention is commonly adopted in multimodal large language models (MLLMs) for integrating visual information into the language backbone.
FPGA Innovation Research in the Netherlands: Present Landscape and Future Outlook	Nikolaos Alachiotis, Sjoerd van den Belt, Steven van der Vlugt, Reinier van der Walle, Mohsen Safari, Bruno Endres Forlin, Tiziano De Matteis, Zaid Al-Ars, Roel Jordans, António J. Sousa de Almeida, Federico Corradi, Christiaan Baaij, Ana-Lucia Varbanescu	2025-02-04	Download	FPGAs have transformed digital design by enabling versatile and customizable solutions that balance performance and power efficiency, yielding them essential for today's diverse computing challenges.
An inherently parallel H2-ULV factorization for solving dense linear systems on GPUs	Qianxiang Ma, Rio Yokota	2025-02-04	Download	Hierarchical low-rank approximation of dense matrices can reduce the complexity of their factorization from O(N^3) to O(N). However, the complex structure of such hierarchical matrices makes them diff...
Random Adaptive Cache Placement Policy	Vrushank Ahire, Pranav Menon, Aniruddh Muley, Abhinandan S. Prasad	2025-02-04	Download	This paper presents a new hybrid cache replacement algorithm that combines random allocation with a modified V-Way cache implementation. Our RAC adapts to complex cache access patterns and optimizes c...
Extending Asynchronous Byzantine Agreement with Crusader Agreement	Mose Mizrahi Erbes, Roger Wattenhofer	2025-02-04	Download	In this work, we study multivalued byzantine agreement (BA) in an asynchronous network of $n$ parties where up to $t < \frac{n}{3}$ parties are byzantine.
Comparative Analysis of FPGA and GPU Performance for Machine Learning-Based Track Reconstruction at LHCb	Fotis I. Giasemis, Vladimir Lončar, Bertrand Granado, Vladimir Vava Gligorov	2025-02-04	Download	In high-energy physics, the increasing luminosity and detector granularity at the Large Hadron Collider are driving the need for more efficient data processing solutions.
Broadcast in Almost Mixing Time	Anton Paramonov, Roger Wattenhofer	2025-02-04	Download	We study the problem of broadcasting multiple messages in the CONGEST model. In this problem, a dedicated source node $s$ possesses a set $M$ of messages with every message of size $O(\log n)$ where $...
SMTFL: Secure Model Training to Untrusted Participants in Federated Learning	Zhihui Zhao, Xiaorong Dong, Yimo Ren, Jianhua Wang, Dan Yu, Hongsong Zhu, Yongle Chen	2025-02-04	Download	Federated learning is an essential distributed model training technique. However, threats such as gradient inversion attacks and poisoning attacks pose significant risks to the privacy of training dat...
Ilargi: a GPU Compatible Factorized ML Model Training Framework	Wenbo Sun, Rihan Hai	2025-02-04	Download	The machine learning (ML) training over disparate data sources traditionally involves materialization, which can impose substantial time and space overhead due to data movement and replication.
Evaluating Fault Tolerance and Scalability in Distributed File Systems: A Case Study of GFS, HDFS, and MinIO	Shubham Malhotra, Fnu Yashu, Muhammad Saqib, Dipkumar Mehta, Jagdish Jangid, Sachin Dixit	2025-02-04	Download	Distributed File Systems (DFS) are essential for managing vast datasets across multiple servers, offering benefits in scalability, fault tolerance, and data accessibility.
Optimizing Spot Instance Reliability and Security Using Cloud-Native Data and Tools	Muhammad Saqib, Shubham Malhotra, Dipkumar Mehta, Jagdish Jangid, Fnu Yashu, Sachin Dixit	2025-02-04	Download	This paper presents "Cloudlab", a comprehensive, cloud-native laboratory designed to support network security research and training. Built on Google Cloud and adhering to GitOps methodologies, Clo...
A Multi-Objective Framework for Optimizing GPU-Enabled VM Placement in Cloud Data Centers with Multi-Instance GPU Technology	Ahmad Siavashi, Mahmoud Momtazpour	2025-02-04	Download	The extensive use of GPUs in cloud computing and the growing need for multitenancy have driven the development of innovative solutions for efficient GPU resource management.

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
CReIS: Computation Reuse through Image Similarity in ICN-Based Edge Computing	Atiyeh Javaheri, Ali Bohlooli, Kamal Jamshidi	2025-02-04	Download	At the edge, there is a high level of similarity in computing. One approach that has been proposed to enhance the efficiency of edge computing is computation reuse, which eliminates redundant computat...
Network Digital Twin for 5G-Enabled Mobile Robots	Luis Roda Sanchez, Lanfranco Zanzi, Xi Li, Guillem Gari, Xavier Costa Perez	2025-02-04	Download	The maturity and commercial roll-out of 5G networks and its deployment for private networks makes 5G a key enabler for various vertical industries and applications, including robotics.
Bayesian Optimization for Repeater Protocols	Lorenzo La Corte, Kenneth Goodenough, Ananda G. Maity, Siddhartha Santra, David Elkouss	2025-02-04	Download	Efficiently distributing secret keys over long distances remains a critical challenge in the development of quantum networks. "First-generation" quantum repeater chains distribute entanglement by exec...
Graph Neural Networks for O-RAN Mobility Management: A Link Prediction Approach	Ana Gonzalez Bermudez, Miquel Farreras, Milan Groshev, José Antonio Trujillo, Isabel de la Bandera, Raquel Barco	2025-02-04	Download	Mobility performance has been a key focus in cellular networks up to 5G. To enhance handover (HO) performance, 3GPP introduced Conditional Handover (CHO) and Layer 1/Layer 2 Triggered Mobility (LTM) m...
NFV-Enabled Service Recovery in Space-Air-Ground Integrated Networks: A Matching Game Based Approach	Ziye Jia, Yilu Cao, Lijun He, Guangxia Li, Fuhui Zhou, Qihui Wu, Zhu Han	2025-02-04	Download	To achieve ubiquitous connectivity of the sixth generation communication, the space-air-ground integrated network (SAGIN) is a popular topic. However, the dynamic nodes in SAGIN such as satellites and...
Efficient Laser Frequency Allocation in Packet-Optical Nodes with Coherent Transceivers	Constantine A. Kyriakopoulos	2025-02-04	Download	The introduction of silicon chipsets with the capability of processing incoming optical packet traffic, creates a new generation of packet-optical nodes, the whiteboxes.
Design and Simulation of the Adaptive Continuous Entanglement Generation Protocol	Caitao Zhan, Joaquin Chung, Allen Zang, Alexander Kolar, Rajkumar Kettimuthu	2025-02-04	Download	Generating and distributing remote entangled pairs (EPs) is a primary function of quantum networks, as entanglement is the fundamental resource for key quantum network applications.

cs.OS - Operating Systems

Title	Authors	Published	PDF	Abstract
Cache is King: Smart Page Eviction with eBPF	Tal Zussman, Ioannis Zarkadas, Jeremy Carin, Andrew Cheng, Hubertus Franke, Jonas Pfefferle, Asaf Cidon	2025-02-04	Download	The page cache is a central part of an OS. It reduces repeated accesses to storage by deciding which pages to retain in memory. As a result, the page cache has a significant impact on the performance ...
Random Adaptive Cache Placement Policy	Vrushank Ahire, Pranav Menon, Aniruddh Muley, Abhinandan S. Prasad	2025-02-04	Download	This paper presents a new hybrid cache replacement algorithm that combines random allocation with a modified V-Way cache implementation. Our RAC adapts to complex cache access patterns and optimizes c...

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
Random Adaptive Cache Placement Policy	Vrushank Ahire, Pranav Menon, Aniruddh Muley, Abhinandan S. Prasad	2025-02-04	Download	This paper presents a new hybrid cache replacement algorithm that combines random allocation with a modified V-Way cache implementation. Our RAC adapts to complex cache access patterns and optimizes c...
Evaluating Fault Tolerance and Scalability in Distributed File Systems: A Case Study of GFS, HDFS, and MinIO	Shubham Malhotra, Fnu Yashu, Muhammad Saqib, Dipkumar Mehta, Jagdish Jangid, Sachin Dixit	2025-02-04	Download	Distributed File Systems (DFS) are essential for managing vast datasets across multiple servers, offering benefits in scalability, fault tolerance, and data accessibility.
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing	Wenhao Zheng, Yixiao Chen, Weitong Zhang, Souvik Kundu, Yun Li, Zhengzhong Liu, Eric P. Xing, Hongyi Wang, Huaxiu Yao	2025-02-04	Download	Large language models have achieved remarkable success in various tasks but suffer from high computational costs during inference, limiting their deployment in resource-constrained applications.