2024-02-29

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Quantum Hardware Roofline: Evaluating the Impact of Gate Expressivity on Quantum Processor Design	Justin Kalloor, Mathias Weiden, Ed Younis, John Kubiatowicz, Bert De Jong, Costin Iancu	2024-02-29	下载	The design space of current quantum computers is expansive with no obvious winning solution. This leaves practitioners with a clear question: "What is the optimal system configuration to run an algori...
On Robustness and Generalization of ML-Based Congestion Predictors to Valid and Imperceptible Perturbations	Chester Holtz, Yucheng Wang, Chung-Kuan Cheng, Bill Lin	2024-02-29	下载	There is substantial interest in the use of machine learning (ML)-based techniques throughout the electronic computer-aided design (CAD) flow, particularly methods based on deep learning.
Commercial Evaluation of Zero-Skipping MAC Design for Bit Sparsity Exploitation in DL Inference	Harideep Nair, Prabhu Vellaisamy, Tsung-Han Lin, Perry Wang, Shawn Blanton, John Paul Shen	2024-02-29	下载	General Matrix Multiply (GEMM) units, consisting of multiply-accumulate (MAC) arrays, perform bulk of the computation in deep learning (DL). Recent work has proposed a novel MAC design, Bit-Pragmatic ...
NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions	Marta Andronic, George A. Constantinides	2024-02-29	下载	Field-Programmable Gate Array (FPGA) accelerators have proven successful in handling latency- and resource-critical deep neural network (DNN) inference tasks.
MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data Processing	Geraldo F. Oliveira, Ataberk Olgun, Abdullah Giray Yağlıkçı, F. Nisa Bostancı, Juan Gómez-Luna, Saugata Ghose, Onur Mutlu	2024-02-29	下载	Processing-using-DRAM (PUD) is a processing-in-memory (PIM) approach that uses a DRAM array's massive internal parallelism to execute very-wide data-parallel operations, in a single-instruction multip...
CoMeT: Count-Min-Sketch-based Row Tracking to Mitigate RowHammer at Low Cost	F. Nisa Bostanci, Ismail Emir Yuksel, Ataberk Olgun, Konstantinos Kanellopoulos, Yahya Can Tugrul, A. Giray Yaglikci, Mohammad Sadrosadati, Onur Mutlu	2024-02-29	下载	We propose a new RowHammer mitigation mechanism, CoMeT, that prevents RowHammer bitflips with low area, performance, and energy costs in DRAM-based systems at very low RowHammer thresholds.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Arrow Matrix Decomposition: A Novel Approach for Communication-Efficient Sparse Matrix Multiplication	Lukas Gianinazzi, Alexandros Nikolaos Ziogas, Langwen Huang, Piotr Luczynski, Saleh Ashkboos, Florian Scheidl, Armon Carigiet, Chio Ge, Nabil Abubaker, Maciej Besta, Tal Ben-Nun, Torsten Hoefler	2024-02-29	下载	We propose a novel approach to iterated sparse matrix dense matrix multiplication, a fundamental computational kernel in scientific computing and graph neural network training.
DeepOps & SLURM: Your GPU Cluster Guide	Arindam Majee	2024-02-29	下载	In the ever evolving landscape of deep learning, unlocking the potential of cutting-edge models demands computational resources that surpass the capabilities of individual machines.
MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data Processing	Geraldo F. Oliveira, Ataberk Olgun, Abdullah Giray Yağlıkçı, F. Nisa Bostancı, Juan Gómez-Luna, Saugata Ghose, Onur Mutlu	2024-02-29	下载	Processing-using-DRAM (PUD) is a processing-in-memory (PIM) approach that uses a DRAM array's massive internal parallelism to execute very-wide data-parallel operations, in a single-instruction multip...
Global and Local Prompts Cooperation via Optimal Transport for Federated Learning	Hongxia Li, Wei Huang, Jingya Wang, Ye Shi	2024-02-29	下载	Prompt learning in pretrained visual-language models has shown remarkable flexibility across various downstream tasks. Leveraging its inherent lightweight nature, recent research attempted to integrat...
FlexLLM: Token-Level Co-Serving of LLM Inference and Finetuning with SLO Guarantees	Gabriele Oliaro, Xupeng Miao, Xinhao Cheng, Vineeth Kada, Mengdi Wu, Ruohan Gao, Yingyi Huang, Remi Delacourt, April Yang, Yingcheng Wang, Colin Unger, Zhihao Jia	2024-02-29	下载	Finetuning large language models (LLMs) is essential for task adaptation, yet today's serving stacks isolate inference and finetuning on separate GPU clusters -- wasting resources and under-utilizing ...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Structural Resilience and Connectivity of the IPv6 Internet: An AS-level Topology Examination	Bin Yuan, Tianbo Song	2024-02-29	下载	The study utilizes a comprehensive dataset informed by IPv6 routing information to provide statistics, degree distribution, joint degree distribution, and clustering analysis of the IPv6 Internet's st...
Intelligent Monitoring Framework for Cloud Services: A Data-Driven Approach	Pooja Srinivas, Fiza Husain, Anjaly Parayil, Ayush Choure, Chetan Bansal, Saravan Rajmohan	2024-02-29	下载	Cloud service owners need to continuously monitor their services to ensure high availability and reliability. Gaps in monitoring can lead to delay in incident detection and significant negative custom...
Vision-Radio Experimental Infrastructure Architecture Towards 6G	Filipe B. Teixeira, Manuel Ricardo, André Coelho, Hélder P. Oliveira, Paula Viana, Nuno Paulino, Helder Fontes, Paulo Marques, Rui Campos, Luis M. Pessoa	2024-02-29	下载	Telecommunications and computer vision have evolved separately so far. Yet, with the shift to sub-terahertz (sub-THz) and terahertz (THz) radio communications, there is an opportunity to explore compu...
Unveiling Internet Censorship: Analysing the Impact of Nation States' Content Control Efforts on Internet Architecture and Routing Patterns	Joshua Levett, Vassilios Vassilakis, Poonam Yadav	2024-02-29	下载	Heightened interest from nation states to perform content censorship make it evermore critical to identify the impact of censorship efforts on the Internet.
Attacks Against Mobility Prediction in 5G Networks	Syafiq Al Atiiq, Yachao Yuan, Christian Gehrmann, Jakob Sternby, Luis Barriga	2024-02-29	下载	The $5^{th}$ generation of mobile networks introduces a new Network Function (NF) that was not present in previous generations, namely the Network Data Analytics Function (NWDAF).
Energy-Efficient UAV Swarm Assisted MEC with Dynamic Clustering and Scheduling	Jialiuyuan Li, Jiayuan Chen, Changyan Yi, Tong Zhang, Kun Zhu, Jun Cai	2024-02-29	下载	In this paper, the energy-efficient unmanned aerial vehicle (UAV) swarm assisted mobile edge computing (MEC) with dynamic clustering and scheduling is studied.
Edge Computing Enabled Real-Time Video Analysis via Adaptive Spatial-Temporal Semantic Filtering	Xiang Chen, Wenjie Zhu, Jiayuan Chen, Tong Zhang, Changyan Yi, Jun Cai	2024-02-29	下载	This paper proposes a novel edge computing enabled real-time video analysis system for intelligent visual devices. The proposed system consists of a tracking-assisted object detection module (TAODM) a...
X-ResQ: Reverse Annealing for Quantum MIMO Detection with Flexible Parallelism	Minsung Kim, Abhishek Kumar Singh, Davide Venturelli, John Kaewell, Kyle Jamieson	2024-02-29	下载	Quantum Annealing (QA)-accelerated MIMO detection is an emerging research approach in the context of NextG wireless networks. The opportunity is to enable large MIMO systems and thus improve wireless ...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Quantum Hardware Roofline: Evaluating the Impact of Gate Expressivity on Quantum Processor Design	Justin Kalloor, Mathias Weiden, Ed Younis, John Kubiatowicz, Bert De Jong, Costin Iancu	2024-02-29	下载	The design space of current quantum computers is expansive with no obvious winning solution. This leaves practitioners with a clear question: "What is the optimal system configuration to run an algori...
Towards Assessing Spread in Sets of Software Architecture Designs	Vittorio Cortellessa, J. Andres Diaz-Pace, Daniele Di Pompeo, Michele Tucci	2024-02-29	下载	Several approaches have recently used automated techniques to generate architecture design alternatives by means of optimization techniques. These approaches aim at improving an initial architecture w...