Appearance
2024-02-29
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Quantum Hardware Roofline: Evaluating the Impact of Gate Expressivity on Quantum Processor Design | Justin Kalloor, Mathias Weiden, Ed Younis, John Kubiatowicz, Bert De Jong, Costin Iancu | 2024-02-29 | 下载 | The design space of current quantum computers is expansive with no obvious winning solution. This leaves practitioners with a clear question: "What is the optimal system configuration to run an algori... |
| On Robustness and Generalization of ML-Based Congestion Predictors to Valid and Imperceptible Perturbations | Chester Holtz, Yucheng Wang, Chung-Kuan Cheng, Bill Lin | 2024-02-29 | 下载 | There is substantial interest in the use of machine learning (ML)-based techniques throughout the electronic computer-aided design (CAD) flow, particularly methods based on deep learning. |
| Commercial Evaluation of Zero-Skipping MAC Design for Bit Sparsity Exploitation in DL Inference | Harideep Nair, Prabhu Vellaisamy, Tsung-Han Lin, Perry Wang, Shawn Blanton, John Paul Shen | 2024-02-29 | 下载 | General Matrix Multiply (GEMM) units, consisting of multiply-accumulate (MAC) arrays, perform bulk of the computation in deep learning (DL). Recent work has proposed a novel MAC design, Bit-Pragmatic ... |
| NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions | Marta Andronic, George A. Constantinides | 2024-02-29 | 下载 | Field-Programmable Gate Array (FPGA) accelerators have proven successful in handling latency- and resource-critical deep neural network (DNN) inference tasks. |
| MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data Processing | Geraldo F. Oliveira, Ataberk Olgun, Abdullah Giray Yağlıkçı, F. Nisa Bostancı, Juan Gómez-Luna, Saugata Ghose, Onur Mutlu | 2024-02-29 | 下载 | Processing-using-DRAM (PUD) is a processing-in-memory (PIM) approach that uses a DRAM array's massive internal parallelism to execute very-wide data-parallel operations, in a single-instruction multip... |
| CoMeT: Count-Min-Sketch-based Row Tracking to Mitigate RowHammer at Low Cost | F. Nisa Bostanci, Ismail Emir Yuksel, Ataberk Olgun, Konstantinos Kanellopoulos, Yahya Can Tugrul, A. Giray Yaglikci, Mohammad Sadrosadati, Onur Mutlu | 2024-02-29 | 下载 | We propose a new RowHammer mitigation mechanism, CoMeT, that prevents RowHammer bitflips with low area, performance, and energy costs in DRAM-based systems at very low RowHammer thresholds. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Arrow Matrix Decomposition: A Novel Approach for Communication-Efficient Sparse Matrix Multiplication | Lukas Gianinazzi, Alexandros Nikolaos Ziogas, Langwen Huang, Piotr Luczynski, Saleh Ashkboos, Florian Scheidl, Armon Carigiet, Chio Ge, Nabil Abubaker, Maciej Besta, Tal Ben-Nun, Torsten Hoefler | 2024-02-29 | 下载 | We propose a novel approach to iterated sparse matrix dense matrix multiplication, a fundamental computational kernel in scientific computing and graph neural network training. |
| DeepOps & SLURM: Your GPU Cluster Guide | Arindam Majee | 2024-02-29 | 下载 | In the ever evolving landscape of deep learning, unlocking the potential of cutting-edge models demands computational resources that surpass the capabilities of individual machines. |
| MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data Processing | Geraldo F. Oliveira, Ataberk Olgun, Abdullah Giray Yağlıkçı, F. Nisa Bostancı, Juan Gómez-Luna, Saugata Ghose, Onur Mutlu | 2024-02-29 | 下载 | Processing-using-DRAM (PUD) is a processing-in-memory (PIM) approach that uses a DRAM array's massive internal parallelism to execute very-wide data-parallel operations, in a single-instruction multip... |
| Global and Local Prompts Cooperation via Optimal Transport for Federated Learning | Hongxia Li, Wei Huang, Jingya Wang, Ye Shi | 2024-02-29 | 下载 | Prompt learning in pretrained visual-language models has shown remarkable flexibility across various downstream tasks. Leveraging its inherent lightweight nature, recent research attempted to integrat... |
| FlexLLM: Token-Level Co-Serving of LLM Inference and Finetuning with SLO Guarantees | Gabriele Oliaro, Xupeng Miao, Xinhao Cheng, Vineeth Kada, Mengdi Wu, Ruohan Gao, Yingyi Huang, Remi Delacourt, April Yang, Yingcheng Wang, Colin Unger, Zhihao Jia | 2024-02-29 | 下载 | Finetuning large language models (LLMs) is essential for task adaptation, yet today's serving stacks isolate inference and finetuning on separate GPU clusters -- wasting resources and under-utilizing ... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Structural Resilience and Connectivity of the IPv6 Internet: An AS-level Topology Examination | Bin Yuan, Tianbo Song | 2024-02-29 | 下载 | The study utilizes a comprehensive dataset informed by IPv6 routing information to provide statistics, degree distribution, joint degree distribution, and clustering analysis of the IPv6 Internet's st... |
| Intelligent Monitoring Framework for Cloud Services: A Data-Driven Approach | Pooja Srinivas, Fiza Husain, Anjaly Parayil, Ayush Choure, Chetan Bansal, Saravan Rajmohan | 2024-02-29 | 下载 | Cloud service owners need to continuously monitor their services to ensure high availability and reliability. Gaps in monitoring can lead to delay in incident detection and significant negative custom... |
| Vision-Radio Experimental Infrastructure Architecture Towards 6G | Filipe B. Teixeira, Manuel Ricardo, André Coelho, Hélder P. Oliveira, Paula Viana, Nuno Paulino, Helder Fontes, Paulo Marques, Rui Campos, Luis M. Pessoa | 2024-02-29 | 下载 | Telecommunications and computer vision have evolved separately so far. Yet, with the shift to sub-terahertz (sub-THz) and terahertz (THz) radio communications, there is an opportunity to explore compu... |
| Unveiling Internet Censorship: Analysing the Impact of Nation States' Content Control Efforts on Internet Architecture and Routing Patterns | Joshua Levett, Vassilios Vassilakis, Poonam Yadav | 2024-02-29 | 下载 | Heightened interest from nation states to perform content censorship make it evermore critical to identify the impact of censorship efforts on the Internet. |
| Attacks Against Mobility Prediction in 5G Networks | Syafiq Al Atiiq, Yachao Yuan, Christian Gehrmann, Jakob Sternby, Luis Barriga | 2024-02-29 | 下载 | The generation of mobile networks introduces a new Network Function (NF) that was not present in previous generations, namely the Network Data Analytics Function (NWDAF). |
| Energy-Efficient UAV Swarm Assisted MEC with Dynamic Clustering and Scheduling | Jialiuyuan Li, Jiayuan Chen, Changyan Yi, Tong Zhang, Kun Zhu, Jun Cai | 2024-02-29 | 下载 | In this paper, the energy-efficient unmanned aerial vehicle (UAV) swarm assisted mobile edge computing (MEC) with dynamic clustering and scheduling is studied. |
| Edge Computing Enabled Real-Time Video Analysis via Adaptive Spatial-Temporal Semantic Filtering | Xiang Chen, Wenjie Zhu, Jiayuan Chen, Tong Zhang, Changyan Yi, Jun Cai | 2024-02-29 | 下载 | This paper proposes a novel edge computing enabled real-time video analysis system for intelligent visual devices. The proposed system consists of a tracking-assisted object detection module (TAODM) a... |
| X-ResQ: Reverse Annealing for Quantum MIMO Detection with Flexible Parallelism | Minsung Kim, Abhishek Kumar Singh, Davide Venturelli, John Kaewell, Kyle Jamieson | 2024-02-29 | 下载 | Quantum Annealing (QA)-accelerated MIMO detection is an emerging research approach in the context of NextG wireless networks. The opportunity is to enable large MIMO systems and thus improve wireless ... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Quantum Hardware Roofline: Evaluating the Impact of Gate Expressivity on Quantum Processor Design | Justin Kalloor, Mathias Weiden, Ed Younis, John Kubiatowicz, Bert De Jong, Costin Iancu | 2024-02-29 | 下载 | The design space of current quantum computers is expansive with no obvious winning solution. This leaves practitioners with a clear question: "What is the optimal system configuration to run an algori... |
| Towards Assessing Spread in Sets of Software Architecture Designs | Vittorio Cortellessa, J. Andres Diaz-Pace, Daniele Di Pompeo, Michele Tucci | 2024-02-29 | 下载 | Several approaches have recently used automated techniques to generate architecture design alternatives by means of optimization techniques. These approaches aim at improving an initial architecture w... |