
2024-02-29

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Quantum Hardware Roofline: Evaluating the Impact of Gate Expressivity on Quantum Processor Design | Justin Kalloor, Mathias Weiden, Ed Younis, John Kubiatowicz, Bert De Jong, Costin Iancu | 2024-02-29 | Download | The design space of current quantum computers is expansive with no obvious winning solution. This leaves practitioners with a clear question: "What is the optimal system configuration to run an algori... |
| On Robustness and Generalization of ML-Based Congestion Predictors to Valid and Imperceptible Perturbations | Chester Holtz, Yucheng Wang, Chung-Kuan Cheng, Bill Lin | 2024-02-29 | Download | There is substantial interest in the use of machine learning (ML)-based techniques throughout the electronic computer-aided design (CAD) flow, particularly methods based on deep learning. |
| Commercial Evaluation of Zero-Skipping MAC Design for Bit Sparsity Exploitation in DL Inference | Harideep Nair, Prabhu Vellaisamy, Tsung-Han Lin, Perry Wang, Shawn Blanton, John Paul Shen | 2024-02-29 | Download | General Matrix Multiply (GEMM) units, consisting of multiply-accumulate (MAC) arrays, perform bulk of the computation in deep learning (DL). Recent work has proposed a novel MAC design, Bit-Pragmatic ... |
| NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions | Marta Andronic, George A. Constantinides | 2024-02-29 | Download | Field-Programmable Gate Array (FPGA) accelerators have proven successful in handling latency- and resource-critical deep neural network (DNN) inference tasks. |
| MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data Processing | Geraldo F. Oliveira, Ataberk Olgun, Abdullah Giray Yağlıkçı, F. Nisa Bostancı, Juan Gómez-Luna, Saugata Ghose, Onur Mutlu | 2024-02-29 | Download | Processing-using-DRAM (PUD) is a processing-in-memory (PIM) approach that uses a DRAM array's massive internal parallelism to execute very-wide data-parallel operations, in a single-instruction multip... |
| CoMeT: Count-Min-Sketch-based Row Tracking to Mitigate RowHammer at Low Cost | F. Nisa Bostanci, Ismail Emir Yuksel, Ataberk Olgun, Konstantinos Kanellopoulos, Yahya Can Tugrul, A. Giray Yaglikci, Mohammad Sadrosadati, Onur Mutlu | 2024-02-29 | Download | We propose a new RowHammer mitigation mechanism, CoMeT, that prevents RowHammer bitflips with low area, performance, and energy costs in DRAM-based systems at very low RowHammer thresholds. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Arrow Matrix Decomposition: A Novel Approach for Communication-Efficient Sparse Matrix Multiplication | Lukas Gianinazzi, Alexandros Nikolaos Ziogas, Langwen Huang, Piotr Luczynski, Saleh Ashkboos, Florian Scheidl, Armon Carigiet, Chio Ge, Nabil Abubaker, Maciej Besta, Tal Ben-Nun, Torsten Hoefler | 2024-02-29 | Download | We propose a novel approach to iterated sparse matrix dense matrix multiplication, a fundamental computational kernel in scientific computing and graph neural network training. |
| DeepOps & SLURM: Your GPU Cluster Guide | Arindam Majee | 2024-02-29 | Download | In the ever evolving landscape of deep learning, unlocking the potential of cutting-edge models demands computational resources that surpass the capabilities of individual machines. |
| MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data Processing | Geraldo F. Oliveira, Ataberk Olgun, Abdullah Giray Yağlıkçı, F. Nisa Bostancı, Juan Gómez-Luna, Saugata Ghose, Onur Mutlu | 2024-02-29 | Download | Processing-using-DRAM (PUD) is a processing-in-memory (PIM) approach that uses a DRAM array's massive internal parallelism to execute very-wide data-parallel operations, in a single-instruction multip... |
| Global and Local Prompts Cooperation via Optimal Transport for Federated Learning | Hongxia Li, Wei Huang, Jingya Wang, Ye Shi | 2024-02-29 | Download | Prompt learning in pretrained visual-language models has shown remarkable flexibility across various downstream tasks. Leveraging its inherent lightweight nature, recent research attempted to integrat... |
| FlexLLM: Token-Level Co-Serving of LLM Inference and Finetuning with SLO Guarantees | Gabriele Oliaro, Xupeng Miao, Xinhao Cheng, Vineeth Kada, Mengdi Wu, Ruohan Gao, Yingyi Huang, Remi Delacourt, April Yang, Yingcheng Wang, Colin Unger, Zhihao Jia | 2024-02-29 | Download | Finetuning large language models (LLMs) is essential for task adaptation, yet today's serving stacks isolate inference and finetuning on separate GPU clusters -- wasting resources and under-utilizing ... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Structural Resilience and Connectivity of the IPv6 Internet: An AS-level Topology Examination | Bin Yuan, Tianbo Song | 2024-02-29 | Download | The study utilizes a comprehensive dataset informed by IPv6 routing information to provide statistics, degree distribution, joint degree distribution, and clustering analysis of the IPv6 Internet's st... |
| Intelligent Monitoring Framework for Cloud Services: A Data-Driven Approach | Pooja Srinivas, Fiza Husain, Anjaly Parayil, Ayush Choure, Chetan Bansal, Saravan Rajmohan | 2024-02-29 | Download | Cloud service owners need to continuously monitor their services to ensure high availability and reliability. Gaps in monitoring can lead to delay in incident detection and significant negative custom... |
| Vision-Radio Experimental Infrastructure Architecture Towards 6G | Filipe B. Teixeira, Manuel Ricardo, André Coelho, Hélder P. Oliveira, Paula Viana, Nuno Paulino, Helder Fontes, Paulo Marques, Rui Campos, Luis M. Pessoa | 2024-02-29 | Download | Telecommunications and computer vision have evolved separately so far. Yet, with the shift to sub-terahertz (sub-THz) and terahertz (THz) radio communications, there is an opportunity to explore compu... |
| Unveiling Internet Censorship: Analysing the Impact of Nation States' Content Control Efforts on Internet Architecture and Routing Patterns | Joshua Levett, Vassilios Vassilakis, Poonam Yadav | 2024-02-29 | Download | Heightened interest from nation states to perform content censorship make it evermore critical to identify the impact of censorship efforts on the Internet. |
| Attacks Against Mobility Prediction in 5G Networks | Syafiq Al Atiiq, Yachao Yuan, Christian Gehrmann, Jakob Sternby, Luis Barriga | 2024-02-29 | Download | The $5^{th}$ generation of mobile networks introduces a new Network Function (NF) that was not present in previous generations, namely the Network Data Analytics Function (NWDAF). |
| Energy-Efficient UAV Swarm Assisted MEC with Dynamic Clustering and Scheduling | Jialiuyuan Li, Jiayuan Chen, Changyan Yi, Tong Zhang, Kun Zhu, Jun Cai | 2024-02-29 | Download | In this paper, the energy-efficient unmanned aerial vehicle (UAV) swarm assisted mobile edge computing (MEC) with dynamic clustering and scheduling is studied. |
| Edge Computing Enabled Real-Time Video Analysis via Adaptive Spatial-Temporal Semantic Filtering | Xiang Chen, Wenjie Zhu, Jiayuan Chen, Tong Zhang, Changyan Yi, Jun Cai | 2024-02-29 | Download | This paper proposes a novel edge computing enabled real-time video analysis system for intelligent visual devices. The proposed system consists of a tracking-assisted object detection module (TAODM) a... |
| X-ResQ: Reverse Annealing for Quantum MIMO Detection with Flexible Parallelism | Minsung Kim, Abhishek Kumar Singh, Davide Venturelli, John Kaewell, Kyle Jamieson | 2024-02-29 | Download | Quantum Annealing (QA)-accelerated MIMO detection is an emerging research approach in the context of NextG wireless networks. The opportunity is to enable large MIMO systems and thus improve wireless ... |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Quantum Hardware Roofline: Evaluating the Impact of Gate Expressivity on Quantum Processor Design | Justin Kalloor, Mathias Weiden, Ed Younis, John Kubiatowicz, Bert De Jong, Costin Iancu | 2024-02-29 | Download | The design space of current quantum computers is expansive with no obvious winning solution. This leaves practitioners with a clear question: "What is the optimal system configuration to run an algori... |
| Towards Assessing Spread in Sets of Software Architecture Designs | Vittorio Cortellessa, J. Andres Diaz-Pace, Daniele Di Pompeo, Michele Tucci | 2024-02-29 | Download | Several approaches have recently used automated techniques to generate architecture design alternatives by means of optimization techniques. These approaches aim at improving an initial architecture w... |
