Appearance
2024-04-30
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Selective Parallel Loading of Large-Scale Compressed Graphs with ParaGrapher | Mohsen Koohi Esfahani, Marco D'Antonio, Syed Ibtisam Tauhidi, Thai Son Mai, Hans Vandierendonck | 2024-04-30 | 下载 | Comprehensive evaluation is one of the basis of experimental science. In High-Performance Graph Processing, a thorough evaluation of contributions becomes more achievable by supporting common input fo... |
| EvGNN: An Event-driven Graph Neural Network Accelerator for Edge Vision | Yufeng Yang, Adrian Kneip, Charlotte Frenkel | 2024-04-30 | 下载 | Edge vision systems combining sensing and embedded processing promise low-latency, decentralized, and energy-efficient solutions that forgo reliance on the cloud. |
| Sensorized Soft Skin for Dexterous Robotic Hands | Jana Egli, Benedek Forrai, Thomas Buchner, Jiangtao Su, Xiaodong Chen, Robert K. Katzschmann | 2024-04-30 | 下载 | Conventional industrial robots often use two-fingered grippers or suction cups to manipulate objects or interact with the world. Because of their simplified design, they are unable to reproduce the de... |
| Low-overhead General-purpose Near-Data Processing in CXL Memory Expanders | Hyungkyu Ham, Jeongmin Hong, Geonwoo Park, Yunseon Shin, Okkyun Woo, Wonhyuk Yang, Jinhoon Bae, Eunhyeok Park, Hyojin Sung, Euicheol Lim, Gwangsun Kim | 2024-04-30 | 下载 | Emerging Compute Express Link (CXL) enables cost-efficient memory expansion beyond the local DRAM of processors. While its CXLmem protocol provides minimal latency overhead through an optimized pro... |
| PEFSL: A deployment Pipeline for Embedded Few-Shot Learning on a FPGA SoC | Lucas Grativol Ribeiro, Lubin Gauthier, Mathieu Leonardon, Jérémy Morlier, Antoine Lavrard-Meyer, Guillaume Muller, Virginie Fresse, Matthieu Arzel | 2024-04-30 | 下载 | This paper tackles the challenges of implementing few-shot learning on embedded systems, specifically FPGA SoCs, a vital approach for adapting to diverse classification tasks, especially when the cost... |
| Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUs | Fareed Qararyah, Muhammad Waqar Azhar, Mohammad Ali Maleki, Pedro Trancoso | 2024-04-30 | 下载 | Depthwise and pointwise convolutions have fewer parameters and perform fewer operations than standard convolutions. As a result, they have become increasingly used in various compact DNNs, including c... |
| Logistic Map Pseudo Random Number Generator in FPGA | Mateo Jalen Andrew Calderon, Lee Jun Lei Lucas, Syarifuddin Azhar Bin Rosli, Stephanie See Hui Ying, Jarell Lim En Yu, Maoyang Xiang, T. Hui Teo | 2024-04-30 | 下载 | This project develops a pseudo-random number generator (PRNG) using the logistic map, implemented in Verilog HDL on an FPGA and processes its output through a Central Limit Theorem (CLT) function to a... |
| Thermal Performance of a Liquid-cooling Assisted Thin Wickless Vapor Chamber | Arani Mukhopadhyay, Anish Pal, Mohamad Jafari Gukeh, Constantine M. Megaridis | 2024-04-30 | 下载 | The ever-increasing need for power consumption in electronic devices, coupled with the requirement for thinner size, calls for the development of efficient heat spreading components. |
| Evaluation of Thermal Performance of a Wick-free Vapor Chamber in Power Electronics Cooling | Arani Mukhopadhyay, Anish Pal, Congbo Bao, Mohamad Jafari Gukeh, Sudip K. Mazumder, Constantine M. Megaridis | 2024-04-30 | 下载 | Efficient thermal management in high-power electronics cooling can be achieved using phase-change heat transfer devices, such as vapor chambers. |
| MACO: Exploring GEMM Acceleration on a Loosely-Coupled Multi-core Processor | Bingcai Sui, Junzhong Shen, Caixia Sun, Junhui Wang, Zhong Zheng, Wei Guo | 2024-04-30 | 下载 | General-purpose processor vendors have integrated customized accelerator in their products due to the widespread use of General Matrix-Matrix Multiplication (GEMM) kernels. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| DiaQ: Efficient State-Vector Quantum Simulation | Srikar Chundury, Jiajia Li, In-Saeng Suh, Frank Mueller | 2024-04-30 | 下载 | In the current era of Noisy Intermediate Scale Quantum (NISQ) computing, efficient digital simulation of quantum systems holds significant importance for quantum algorithm development, verification an... |
| CurvFed: Curvature-Aligned Federated Learning for Fairness without Demographics | Harshit Sharma, Shaily Roy, Asif Salekin | 2024-04-30 | 下载 | Modern human sensing applications often rely on data distributed across users and devices, where privacy concerns prevent centralized training. |
| Automated, Reliable, and Efficient Continental-Scale Replication of 7.3 Petabytes of Climate Simulation Data: A Case Study | Lukasz Lacinski, Lee Liming, Steven Turoscy, Cameron Harr, Kyle Chard, Eli Dart, Paul Durack, Sasha Ames, Forrest M. Hoffman, Ian T. Foster | 2024-04-30 | 下载 | We report on our experiences replicating 7.3 petabytes (PB) of Earth System Grid Federation (ESGF) climate simulation data from Lawrence Livermore National Laboratory (LLNL) in California to Argonne N... |
| SpComm3D: A Framework for Enabling Sparse Communication in 3D Sparse Kernels | Nabil Abubaker, Torsten Hoefler | 2024-04-30 | 下载 | Existing 3D algorithms for distributed-memory sparse kernels suffer from limited scalability due to reliance on bulk sparsity-agnostic communication. |
| DF Louvain: Fast Incrementally Expanding Approach for Community Detection on Dynamic Graphs | Subhajit Sahu | 2024-04-30 | 下载 | Community detection is the problem of recognizing natural divisions in networks. A relevant challenge in this problem is to find communities on rapidly evolving graphs. |
| Quantum Cloud Computing: Trends and Challenges | Muhammed Golec, Emir Sahin Hatay, Mustafa Golec, Murat Uyar, Merve Golec, Sukhpal Singh Gill | 2024-04-30 | 下载 | Quantum computing (QC) is a new paradigm that will revolutionize various areas of computing, especially cloud computing. QC, still in its infancy, is a costly technology capable of operating in highly... |
| Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping | Chenyu Jiang, Ye Tian, Zhen Jia, Shuai Zheng, Chuan Wu, Yida Wang | 2024-04-30 | 下载 | The Mixture-of-Expert (MoE) technique plays a crucial role in expanding the size of DNN model parameters. However, it faces the challenge of extended all-to-all communication latency during the traini... |
| Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUs | Fareed Qararyah, Muhammad Waqar Azhar, Mohammad Ali Maleki, Pedro Trancoso | 2024-04-30 | 下载 | Depthwise and pointwise convolutions have fewer parameters and perform fewer operations than standard convolutions. As a result, they have become increasingly used in various compact DNNs, including c... |
| Pilot Contamination in Massive MIMO Systems: Challenges and Future Prospects | Muhammad Kamran Saeed, Ashfaq Khokhar, Shakil Ahmed | 2024-04-30 | 下载 | Massive multiple input multiple output (M-MIMO) technology plays a pivotal role in fifth-generation (5G) and beyond communication systems, offering a wide range of benefits, from increased spectral ef... |
| AdaOper: Energy-efficient and Responsive Concurrent DNN Inference on Mobile Devices | Zheng Lin, Bin Guo, Sicong Liu, Wentao Zhou, Yasan Ding, Yu Zhang, Zhiwen Yu | 2024-04-30 | 下载 | Deep neural network (DNN) has driven extensive applications in mobile technology. However, for long-running mobile apps like voice assistants or video applications on smartphones, energy efficiency is... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Optimized Non-Primary Channel Access Design in IEEE 802.11bn | Dongyu Wei, Liu Cao, Lyutianyang Zhang, Xiangyu Gao, Hao Yin | 2024-04-30 | 下载 | The IEEE 802.11 standards, culminating in IEEE 802.11be (Wi-Fi 7), have significantly expanded bandwidth capacities from 20 MHz to 320 MHz, marking a crucial evolution in wireless access technology. |
| Optimized Distribution of Entanglement Graph States in Quantum Networks | Xiaojie Fan, Caitao Zhan, Himanshu Gupta, C. R. Ramakrishnan | 2024-04-30 | 下载 | Building large-scale quantum computers, essential to demonstrating quantum advantage, is a key challenge. Quantum Networks (QNs) can help address this challenge by enabling the construction of large, ... |
| Context-Aware Mobile Network Performance Prediction Using Network & Remote Sensing Data | Ali Shibli, Tahar Zanouda | 2024-04-30 | 下载 | Accurate estimation of Network Performance is crucial for several tasks in telecom networks. Telecom networks regularly serve a vast number of radio nodes. |
| Scale-Robust Timely Asynchronous Decentralized Learning | Purbesh Mitra, Sennur Ulukus | 2024-04-30 | 下载 | We consider an asynchronous decentralized learning system, which consists of a network of connected devices trying to learn a machine learning model without any centralized parameter server. |
| Harnessing Federated Generative Learning for Green and Sustainable Internet of Things | Yuanhang Qi, M. Shamim Hossain | 2024-04-30 | 下载 | The rapid proliferation of devices in the Internet of Things (IoT) has ushered in a transformative era of data-driven connectivity across various domains. |
| Recommenadation aided Caching using Combinatorial Multi-armed Bandits | Pavamana K J, Chandramani Kishore Singh | 2024-04-30 | 下载 | We study content caching with recommendations in a wireless network where the users are connected through a base station equipped with a finite-capacity cache. |
| ColosSUMO: Evaluating Cooperative Driving Applications with Colosseum | Gabriele Gemmi, Pedram Johari, Paolo Casari, Michele Polese, Tommaso Melodia, Michele Segata | 2024-04-30 | 下载 | The quest for safer and more efficient transportation through cooperative, connected and automated mobility (CCAM) calls for realistic performance analysis tools, especially with respect to wireless c... |
| Radio Resource Management Design for RSMA: Optimization of Beamforming, User Admission, and Discrete/Continuous Rates with Imperfect SIC | L. F. Abanto-Leon, A. Krishnamoorthy, A. Garcia-Saavedra, G. H. Sim, R. Schober, M. Hollick | 2024-04-30 | 下载 | This paper investigates the radio resource management (RRM) design for multiuser rate-splitting multiple access (RSMA), accounting for various characteristics of practical wireless systems, such as th... |
| Reducing Communication Overhead in the IoT-Edge-Cloud Continuum: A Survey on Protocols and Data Reduction Strategies | Dora Kreković, Petar Krivić, Ivana Podnar Žarko, Mario Kušek, Danh Le-Phuoc | 2024-04-30 | 下载 | The adoption of the Internet of Things (IoT) deployments has led to a sharp increase in network traffic as a vast number of IoT devices communicate with each other and IoT services through the IoT-edg... |
| AutoNet: Automatic Reachability Policy Management in Public Cloud Networks | German Sviridov, Zheng Tao Shen, Jorge Cardoso | 2024-04-30 | 下载 | Virtual Private Cloud (VPC) is the main network abstraction technology used in public cloud systems. VPCs are composed of a set of network services that permit the definition of complex network reacha... |
| Alternative paths computation for congestion mitigation in segment-routing networks | Sébastien Martin, Youcef Magnouche, Paolo Medagliani, Jérémie Leguay | 2024-04-30 | 下载 | In backbone networks, it is fundamental to quickly protect traffic against any unexpected event, such as failures or congestions, which may impact Quality of Service (QoS). |
| Pilot Contamination in Massive MIMO Systems: Challenges and Future Prospects | Muhammad Kamran Saeed, Ashfaq Khokhar, Shakil Ahmed | 2024-04-30 | 下载 | Massive multiple input multiple output (M-MIMO) technology plays a pivotal role in fifth-generation (5G) and beyond communication systems, offering a wide range of benefits, from increased spectral ef... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| VeriFence: Lightweight and Precise Spectre Defenses for Untrusted Linux Kernel Extensions | Luis Gerhorst, Henriette Herzog, Peter Wägemann, Maximilian Ott, Rüdiger Kapitza, Timo Hönig | 2024-04-30 | 下载 | High-performance IO demands low-overhead communication between user- and kernel space. This demand can no longer be fulfilled by traditional system calls. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Selective Parallel Loading of Large-Scale Compressed Graphs with ParaGrapher | Mohsen Koohi Esfahani, Marco D'Antonio, Syed Ibtisam Tauhidi, Thai Son Mai, Hans Vandierendonck | 2024-04-30 | 下载 | Comprehensive evaluation is one of the basis of experimental science. In High-Performance Graph Processing, a thorough evaluation of contributions becomes more achievable by supporting common input fo... |
| Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUs | Fareed Qararyah, Muhammad Waqar Azhar, Mohammad Ali Maleki, Pedro Trancoso | 2024-04-30 | 下载 | Depthwise and pointwise convolutions have fewer parameters and perform fewer operations than standard convolutions. As a result, they have become increasingly used in various compact DNNs, including c... |