Skip to content

2025-02-04

cs.AR - Architecture

标题作者发布日期PDF摘要
LLM-USO: Large Language Model-based Universal Sizing OptimizerKarthik Somayaji N. S, Peng Li2025-02-04下载The design of analog circuits is a cornerstone of integrated circuit (IC) development, requiring the optimization of complex, interconnected sub-structures such as amplifiers, comparators, and buffers...
FPGA Innovation Research in the Netherlands: Present Landscape and Future OutlookNikolaos Alachiotis, Sjoerd van den Belt, Steven van der Vlugt, Reinier van der Walle, Mohsen Safari, Bruno Endres Forlin, Tiziano De Matteis, Zaid Al-Ars, Roel Jordans, António J. Sousa de Almeida, Federico Corradi, Christiaan Baaij, Ana-Lucia Varbanescu2025-02-04下载FPGAs have transformed digital design by enabling versatile and customizable solutions that balance performance and power efficiency, yielding them essential for today's diverse computing challenges.
Random Adaptive Cache Placement PolicyVrushank Ahire, Pranav Menon, Aniruddh Muley, Abhinandan S. Prasad2025-02-04下载This paper presents a new hybrid cache replacement algorithm that combines random allocation with a modified V-Way cache implementation. Our RAC adapts to complex cache access patterns and optimizes c...
Towards Efficient LUT-based PIM: A Scalable and Low-Power Approach for Modern WorkloadsBahareh Khabbazan, Marc Riera, Antonio González2025-02-04下载Data movement in memory-intensive workloads, such as deep learning, incurs energy costs that are over three orders of magnitude higher than the cost of computation.
Hardware and software build flow with SoCMakeRisto Pejašinović, Alessandro Caratelli, Anvesh Nookala, Benoît Walter Denkinger, Marco Andorno2025-02-04下载The increasing demand for electronics is driving shorter development cycles for application-specific integrated circuits (ASICs). To meet these constraints, hardware designers emphasize reusability an...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Federated Low-Rank Tensor Estimation for Multimodal Image ReconstructionAnh Van Nguyen, Diego Klabjan, Minseok Ryu, Kibaek Kim, Zichao Di2025-02-04下载Low-rank tensor estimation offers a powerful approach to addressing high-dimensional data challenges and can substantially improve solutions to ill-posed inverse problems, such as image reconstruction...
Gradient Correction in Federated Learning with Adaptive OptimizationEvan Chen, Shiqiang Wang, Jianing Zhang, Dong-Jun Han, Chaoyue Liu, Christopher Brinton2025-02-04下载In federated learning (FL), model training performance is strongly impacted by data heterogeneity across clients. Client-drift compensation methods have recently emerged as a solution to this issue, i...
Hecate: Unlocking Efficient Sparse Model Training via Fully Sharded Sparse Data ParallelismYuhao Qing, Guichao Zhu, Fanxin Li, Lintian Lei, Zekai Sun, Xiuxian Guan, Shixiong Zhao, Xusheng Chen, Dong Huang, Sen Wang, Heming Cui2025-02-04下载Mixture-of-Experts (MoE) has emerged as a promising sparse paradigm for scaling up pre-trained models (PTMs) with remarkable cost-effectiveness.
H-MBR: Hypervisor-level Memory Bandwidth Reservation for Mixed Criticality SystemsAfonso Oliveira, Diogo Costa, Gonçalo Moreira, José Martins, Sandro Pinto2025-02-04下载Recent advancements in fields such as automotive and aerospace have driven a growing demand for robust computational resources. Applications that were once designed for basic MCUs are now deployed on ...
LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language ModelsTzu-Tao Chang, Shivaram Venkataraman2025-02-04下载Cross-attention is commonly adopted in multimodal large language models (MLLMs) for integrating visual information into the language backbone.
FPGA Innovation Research in the Netherlands: Present Landscape and Future OutlookNikolaos Alachiotis, Sjoerd van den Belt, Steven van der Vlugt, Reinier van der Walle, Mohsen Safari, Bruno Endres Forlin, Tiziano De Matteis, Zaid Al-Ars, Roel Jordans, António J. Sousa de Almeida, Federico Corradi, Christiaan Baaij, Ana-Lucia Varbanescu2025-02-04下载FPGAs have transformed digital design by enabling versatile and customizable solutions that balance performance and power efficiency, yielding them essential for today's diverse computing challenges.
An inherently parallel H2-ULV factorization for solving dense linear systems on GPUsQianxiang Ma, Rio Yokota2025-02-04下载Hierarchical low-rank approximation of dense matrices can reduce the complexity of their factorization from O(N^3) to O(N). However, the complex structure of such hierarchical matrices makes them diff...
Random Adaptive Cache Placement PolicyVrushank Ahire, Pranav Menon, Aniruddh Muley, Abhinandan S. Prasad2025-02-04下载This paper presents a new hybrid cache replacement algorithm that combines random allocation with a modified V-Way cache implementation. Our RAC adapts to complex cache access patterns and optimizes c...
Extending Asynchronous Byzantine Agreement with Crusader AgreementMose Mizrahi Erbes, Roger Wattenhofer2025-02-04下载In this work, we study multivalued byzantine agreement (BA) in an asynchronous network of nn parties where up to t<n3t < \frac{n}{3} parties are byzantine.
Comparative Analysis of FPGA and GPU Performance for Machine Learning-Based Track Reconstruction at LHCbFotis I. Giasemis, Vladimir Lončar, Bertrand Granado, Vladimir Vava Gligorov2025-02-04下载In high-energy physics, the increasing luminosity and detector granularity at the Large Hadron Collider are driving the need for more efficient data processing solutions.
Broadcast in Almost Mixing TimeAnton Paramonov, Roger Wattenhofer2025-02-04下载We study the problem of broadcasting multiple messages in the CONGEST model. In this problem, a dedicated source node ss possesses a set MM of messages with every message of size O(logn)O(\log n) where $...
SMTFL: Secure Model Training to Untrusted Participants in Federated LearningZhihui Zhao, Xiaorong Dong, Yimo Ren, Jianhua Wang, Dan Yu, Hongsong Zhu, Yongle Chen2025-02-04下载Federated learning is an essential distributed model training technique. However, threats such as gradient inversion attacks and poisoning attacks pose significant risks to the privacy of training dat...
Ilargi: a GPU Compatible Factorized ML Model Training FrameworkWenbo Sun, Rihan Hai2025-02-04下载The machine learning (ML) training over disparate data sources traditionally involves materialization, which can impose substantial time and space overhead due to data movement and replication.
Evaluating Fault Tolerance and Scalability in Distributed File Systems: A Case Study of GFS, HDFS, and MinIOShubham Malhotra, Fnu Yashu, Muhammad Saqib, Dipkumar Mehta, Jagdish Jangid, Sachin Dixit2025-02-04下载Distributed File Systems (DFS) are essential for managing vast datasets across multiple servers, offering benefits in scalability, fault tolerance, and data accessibility.
Optimizing Spot Instance Reliability and Security Using Cloud-Native Data and ToolsMuhammad Saqib, Shubham Malhotra, Dipkumar Mehta, Jagdish Jangid, Fnu Yashu, Sachin Dixit2025-02-04下载This paper represents "Cloudlab", a comprehensive, cloud - native laboratory designed to support network security research and training. Built on Google Cloud and adhering to GitOps methodologies, Clo...
A Multi-Objective Framework for Optimizing GPU-Enabled VM Placement in Cloud Data Centers with Multi-Instance GPU TechnologyAhmad Siavashi, Mahmoud Momtazpour2025-02-04下载The extensive use of GPUs in cloud computing and the growing need for multitenancy have driven the development of innovative solutions for efficient GPU resource management.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
CReIS: Computation Reuse through Image Similarity in ICN-Based Edge ComputingAtiyeh Javaheri, Ali Bohlooli, Kamal Jamshidi2025-02-04下载At the edge, there is a high level of similarity in computing. One approach that has been proposed to enhance the efficiency of edge computing is computation reuse, which eliminates redundant computat...
Network Digital Twin for 5G-Enabled Mobile RobotsLuis Roda Sanchez, Lanfranco Zanzi, Xi Li, Guillem Gari, Xavier Costa Perez2025-02-04下载The maturity and commercial roll-out of 5G networks and its deployment for private networks makes 5G a key enabler for various vertical industries and applications, including robotics.
Bayesian Optimization for Repeater ProtocolsLorenzo La Corte, Kenneth Goodenough, Ananda G. Maity, Siddhartha Santra, David Elkouss2025-02-04下载Efficiently distributing secret keys over long distances remains a critical challenge in the development of quantum networks. "First-generation" quantum repeater chains distribute entanglement by exec...
Graph Neural Networks for O-RAN Mobility Management: A Link Prediction ApproachAna Gonzalez Bermudez, Miquel Farreras, Milan Groshev, José Antonio Trujillo, Isabel de la Bandera, Raquel Barco2025-02-04下载Mobility performance has been a key focus in cellular networks up to 5G. To enhance handover (HO) performance, 3GPP introduced Conditional Handover (CHO) and Layer 1/Layer 2 Triggered Mobility (LTM) m...
NFV-Enabled Service Recovery in Space-Air-Ground Integrated Networks: A Matching Game Based ApproachZiye Jia, Yilu Cao, Lijun He, Guangxia Li, Fuhui Zhou, Qihui Wu, Zhu Han2025-02-04下载To achieve ubiquitous connectivity of the sixth generation communication, the space-air-ground integrated network (SAGIN) is a popular topic. However, the dynamic nodes in SAGIN such as satellites and...
Efficient Laser Frequency Allocation in Packet-Optical Nodes with Coherent TransceiversConstantine A. Kyriakopoulos2025-02-04下载The introduction of silicon chipsets with the capability of processing incoming optical packet traffic, creates a new generation of packet-optical nodes, the whiteboxes.
Design and Simulation of the Adaptive Continuous Entanglement Generation ProtocolCaitao Zhan, Joaquin Chung, Allen Zang, Alexander Kolar, Rajkumar Kettimuthu2025-02-04下载Generating and distributing remote entangled pairs (EPs) is a primary function of quantum networks, as entanglement is the fundamental resource for key quantum network applications.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Cache is King: Smart Page Eviction with eBPFTal Zussman, Ioannis Zarkadas, Jeremy Carin, Andrew Cheng, Hubertus Franke, Jonas Pfefferle, Asaf Cidon2025-02-04下载The page cache is a central part of an OS. It reduces repeated accesses to storage by deciding which pages to retain in memory. As a result, the page cache has a significant impact on the performance ...
Random Adaptive Cache Placement PolicyVrushank Ahire, Pranav Menon, Aniruddh Muley, Abhinandan S. Prasad2025-02-04下载This paper presents a new hybrid cache replacement algorithm that combines random allocation with a modified V-Way cache implementation. Our RAC adapts to complex cache access patterns and optimizes c...

cs.PF - Performance

标题作者发布日期PDF摘要
Random Adaptive Cache Placement PolicyVrushank Ahire, Pranav Menon, Aniruddh Muley, Abhinandan S. Prasad2025-02-04下载This paper presents a new hybrid cache replacement algorithm that combines random allocation with a modified V-Way cache implementation. Our RAC adapts to complex cache access patterns and optimizes c...
Evaluating Fault Tolerance and Scalability in Distributed File Systems: A Case Study of GFS, HDFS, and MinIOShubham Malhotra, Fnu Yashu, Muhammad Saqib, Dipkumar Mehta, Jagdish Jangid, Sachin Dixit2025-02-04下载Distributed File Systems (DFS) are essential for managing vast datasets across multiple servers, offering benefits in scalability, fault tolerance, and data accessibility.
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level RoutingWenhao Zheng, Yixiao Chen, Weitong Zhang, Souvik Kundu, Yun Li, Zhengzhong Liu, Eric P. Xing, Hongyi Wang, Huaxiu Yao2025-02-04下载Large language models have achieved remarkable success in various tasks but suffer from high computational costs during inference, limiting their deployment in resource-constrained applications.

基于 VitePress 构建