Appearance
2025-02-04
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| LLM-USO: Large Language Model-based Universal Sizing Optimizer | Karthik Somayaji N. S, Peng Li | 2025-02-04 | 下载 | The design of analog circuits is a cornerstone of integrated circuit (IC) development, requiring the optimization of complex, interconnected sub-structures such as amplifiers, comparators, and buffers... |
| FPGA Innovation Research in the Netherlands: Present Landscape and Future Outlook | Nikolaos Alachiotis, Sjoerd van den Belt, Steven van der Vlugt, Reinier van der Walle, Mohsen Safari, Bruno Endres Forlin, Tiziano De Matteis, Zaid Al-Ars, Roel Jordans, António J. Sousa de Almeida, Federico Corradi, Christiaan Baaij, Ana-Lucia Varbanescu | 2025-02-04 | 下载 | FPGAs have transformed digital design by enabling versatile and customizable solutions that balance performance and power efficiency, yielding them essential for today's diverse computing challenges. |
| Random Adaptive Cache Placement Policy | Vrushank Ahire, Pranav Menon, Aniruddh Muley, Abhinandan S. Prasad | 2025-02-04 | 下载 | This paper presents a new hybrid cache replacement algorithm that combines random allocation with a modified V-Way cache implementation. Our RAC adapts to complex cache access patterns and optimizes c... |
| Towards Efficient LUT-based PIM: A Scalable and Low-Power Approach for Modern Workloads | Bahareh Khabbazan, Marc Riera, Antonio González | 2025-02-04 | 下载 | Data movement in memory-intensive workloads, such as deep learning, incurs energy costs that are over three orders of magnitude higher than the cost of computation. |
| Hardware and software build flow with SoCMake | Risto Pejašinović, Alessandro Caratelli, Anvesh Nookala, Benoît Walter Denkinger, Marco Andorno | 2025-02-04 | 下载 | The increasing demand for electronics is driving shorter development cycles for application-specific integrated circuits (ASICs). To meet these constraints, hardware designers emphasize reusability an... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Federated Low-Rank Tensor Estimation for Multimodal Image Reconstruction | Anh Van Nguyen, Diego Klabjan, Minseok Ryu, Kibaek Kim, Zichao Di | 2025-02-04 | 下载 | Low-rank tensor estimation offers a powerful approach to addressing high-dimensional data challenges and can substantially improve solutions to ill-posed inverse problems, such as image reconstruction... |
| Gradient Correction in Federated Learning with Adaptive Optimization | Evan Chen, Shiqiang Wang, Jianing Zhang, Dong-Jun Han, Chaoyue Liu, Christopher Brinton | 2025-02-04 | 下载 | In federated learning (FL), model training performance is strongly impacted by data heterogeneity across clients. Client-drift compensation methods have recently emerged as a solution to this issue, i... |
| Hecate: Unlocking Efficient Sparse Model Training via Fully Sharded Sparse Data Parallelism | Yuhao Qing, Guichao Zhu, Fanxin Li, Lintian Lei, Zekai Sun, Xiuxian Guan, Shixiong Zhao, Xusheng Chen, Dong Huang, Sen Wang, Heming Cui | 2025-02-04 | 下载 | Mixture-of-Experts (MoE) has emerged as a promising sparse paradigm for scaling up pre-trained models (PTMs) with remarkable cost-effectiveness. |
| H-MBR: Hypervisor-level Memory Bandwidth Reservation for Mixed Criticality Systems | Afonso Oliveira, Diogo Costa, Gonçalo Moreira, José Martins, Sandro Pinto | 2025-02-04 | 下载 | Recent advancements in fields such as automotive and aerospace have driven a growing demand for robust computational resources. Applications that were once designed for basic MCUs are now deployed on ... |
| LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models | Tzu-Tao Chang, Shivaram Venkataraman | 2025-02-04 | 下载 | Cross-attention is commonly adopted in multimodal large language models (MLLMs) for integrating visual information into the language backbone. |
| FPGA Innovation Research in the Netherlands: Present Landscape and Future Outlook | Nikolaos Alachiotis, Sjoerd van den Belt, Steven van der Vlugt, Reinier van der Walle, Mohsen Safari, Bruno Endres Forlin, Tiziano De Matteis, Zaid Al-Ars, Roel Jordans, António J. Sousa de Almeida, Federico Corradi, Christiaan Baaij, Ana-Lucia Varbanescu | 2025-02-04 | 下载 | FPGAs have transformed digital design by enabling versatile and customizable solutions that balance performance and power efficiency, yielding them essential for today's diverse computing challenges. |
| An inherently parallel H2-ULV factorization for solving dense linear systems on GPUs | Qianxiang Ma, Rio Yokota | 2025-02-04 | 下载 | Hierarchical low-rank approximation of dense matrices can reduce the complexity of their factorization from O(N^3) to O(N). However, the complex structure of such hierarchical matrices makes them diff... |
| Random Adaptive Cache Placement Policy | Vrushank Ahire, Pranav Menon, Aniruddh Muley, Abhinandan S. Prasad | 2025-02-04 | 下载 | This paper presents a new hybrid cache replacement algorithm that combines random allocation with a modified V-Way cache implementation. Our RAC adapts to complex cache access patterns and optimizes c... |
| Extending Asynchronous Byzantine Agreement with Crusader Agreement | Mose Mizrahi Erbes, Roger Wattenhofer | 2025-02-04 | 下载 | In this work, we study multivalued byzantine agreement (BA) in an asynchronous network of parties where up to parties are byzantine. |
| Comparative Analysis of FPGA and GPU Performance for Machine Learning-Based Track Reconstruction at LHCb | Fotis I. Giasemis, Vladimir Lončar, Bertrand Granado, Vladimir Vava Gligorov | 2025-02-04 | 下载 | In high-energy physics, the increasing luminosity and detector granularity at the Large Hadron Collider are driving the need for more efficient data processing solutions. |
| Broadcast in Almost Mixing Time | Anton Paramonov, Roger Wattenhofer | 2025-02-04 | 下载 | We study the problem of broadcasting multiple messages in the CONGEST model. In this problem, a dedicated source node possesses a set of messages with every message of size where $... |
| SMTFL: Secure Model Training to Untrusted Participants in Federated Learning | Zhihui Zhao, Xiaorong Dong, Yimo Ren, Jianhua Wang, Dan Yu, Hongsong Zhu, Yongle Chen | 2025-02-04 | 下载 | Federated learning is an essential distributed model training technique. However, threats such as gradient inversion attacks and poisoning attacks pose significant risks to the privacy of training dat... |
| Ilargi: a GPU Compatible Factorized ML Model Training Framework | Wenbo Sun, Rihan Hai | 2025-02-04 | 下载 | The machine learning (ML) training over disparate data sources traditionally involves materialization, which can impose substantial time and space overhead due to data movement and replication. |
| Evaluating Fault Tolerance and Scalability in Distributed File Systems: A Case Study of GFS, HDFS, and MinIO | Shubham Malhotra, Fnu Yashu, Muhammad Saqib, Dipkumar Mehta, Jagdish Jangid, Sachin Dixit | 2025-02-04 | 下载 | Distributed File Systems (DFS) are essential for managing vast datasets across multiple servers, offering benefits in scalability, fault tolerance, and data accessibility. |
| Optimizing Spot Instance Reliability and Security Using Cloud-Native Data and Tools | Muhammad Saqib, Shubham Malhotra, Dipkumar Mehta, Jagdish Jangid, Fnu Yashu, Sachin Dixit | 2025-02-04 | 下载 | This paper represents "Cloudlab", a comprehensive, cloud - native laboratory designed to support network security research and training. Built on Google Cloud and adhering to GitOps methodologies, Clo... |
| A Multi-Objective Framework for Optimizing GPU-Enabled VM Placement in Cloud Data Centers with Multi-Instance GPU Technology | Ahmad Siavashi, Mahmoud Momtazpour | 2025-02-04 | 下载 | The extensive use of GPUs in cloud computing and the growing need for multitenancy have driven the development of innovative solutions for efficient GPU resource management. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| CReIS: Computation Reuse through Image Similarity in ICN-Based Edge Computing | Atiyeh Javaheri, Ali Bohlooli, Kamal Jamshidi | 2025-02-04 | 下载 | At the edge, there is a high level of similarity in computing. One approach that has been proposed to enhance the efficiency of edge computing is computation reuse, which eliminates redundant computat... |
| Network Digital Twin for 5G-Enabled Mobile Robots | Luis Roda Sanchez, Lanfranco Zanzi, Xi Li, Guillem Gari, Xavier Costa Perez | 2025-02-04 | 下载 | The maturity and commercial roll-out of 5G networks and its deployment for private networks makes 5G a key enabler for various vertical industries and applications, including robotics. |
| Bayesian Optimization for Repeater Protocols | Lorenzo La Corte, Kenneth Goodenough, Ananda G. Maity, Siddhartha Santra, David Elkouss | 2025-02-04 | 下载 | Efficiently distributing secret keys over long distances remains a critical challenge in the development of quantum networks. "First-generation" quantum repeater chains distribute entanglement by exec... |
| Graph Neural Networks for O-RAN Mobility Management: A Link Prediction Approach | Ana Gonzalez Bermudez, Miquel Farreras, Milan Groshev, José Antonio Trujillo, Isabel de la Bandera, Raquel Barco | 2025-02-04 | 下载 | Mobility performance has been a key focus in cellular networks up to 5G. To enhance handover (HO) performance, 3GPP introduced Conditional Handover (CHO) and Layer 1/Layer 2 Triggered Mobility (LTM) m... |
| NFV-Enabled Service Recovery in Space-Air-Ground Integrated Networks: A Matching Game Based Approach | Ziye Jia, Yilu Cao, Lijun He, Guangxia Li, Fuhui Zhou, Qihui Wu, Zhu Han | 2025-02-04 | 下载 | To achieve ubiquitous connectivity of the sixth generation communication, the space-air-ground integrated network (SAGIN) is a popular topic. However, the dynamic nodes in SAGIN such as satellites and... |
| Efficient Laser Frequency Allocation in Packet-Optical Nodes with Coherent Transceivers | Constantine A. Kyriakopoulos | 2025-02-04 | 下载 | The introduction of silicon chipsets with the capability of processing incoming optical packet traffic, creates a new generation of packet-optical nodes, the whiteboxes. |
| Design and Simulation of the Adaptive Continuous Entanglement Generation Protocol | Caitao Zhan, Joaquin Chung, Allen Zang, Alexander Kolar, Rajkumar Kettimuthu | 2025-02-04 | 下载 | Generating and distributing remote entangled pairs (EPs) is a primary function of quantum networks, as entanglement is the fundamental resource for key quantum network applications. |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Cache is King: Smart Page Eviction with eBPF | Tal Zussman, Ioannis Zarkadas, Jeremy Carin, Andrew Cheng, Hubertus Franke, Jonas Pfefferle, Asaf Cidon | 2025-02-04 | 下载 | The page cache is a central part of an OS. It reduces repeated accesses to storage by deciding which pages to retain in memory. As a result, the page cache has a significant impact on the performance ... |
| Random Adaptive Cache Placement Policy | Vrushank Ahire, Pranav Menon, Aniruddh Muley, Abhinandan S. Prasad | 2025-02-04 | 下载 | This paper presents a new hybrid cache replacement algorithm that combines random allocation with a modified V-Way cache implementation. Our RAC adapts to complex cache access patterns and optimizes c... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Random Adaptive Cache Placement Policy | Vrushank Ahire, Pranav Menon, Aniruddh Muley, Abhinandan S. Prasad | 2025-02-04 | 下载 | This paper presents a new hybrid cache replacement algorithm that combines random allocation with a modified V-Way cache implementation. Our RAC adapts to complex cache access patterns and optimizes c... |
| Evaluating Fault Tolerance and Scalability in Distributed File Systems: A Case Study of GFS, HDFS, and MinIO | Shubham Malhotra, Fnu Yashu, Muhammad Saqib, Dipkumar Mehta, Jagdish Jangid, Sachin Dixit | 2025-02-04 | 下载 | Distributed File Systems (DFS) are essential for managing vast datasets across multiple servers, offering benefits in scalability, fault tolerance, and data accessibility. |
| CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing | Wenhao Zheng, Yixiao Chen, Weitong Zhang, Souvik Kundu, Yun Li, Zhengzhong Liu, Eric P. Xing, Hongyi Wang, Huaxiu Yao | 2025-02-04 | 下载 | Large language models have achieved remarkable success in various tasks but suffer from high computational costs during inference, limiting their deployment in resource-constrained applications. |