2024-04-30

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Selective Parallel Loading of Large-Scale Compressed Graphs with ParaGrapher	Mohsen Koohi Esfahani, Marco D'Antonio, Syed Ibtisam Tauhidi, Thai Son Mai, Hans Vandierendonck	2024-04-30	下载	Comprehensive evaluation is one of the basis of experimental science. In High-Performance Graph Processing, a thorough evaluation of contributions becomes more achievable by supporting common input fo...
EvGNN: An Event-driven Graph Neural Network Accelerator for Edge Vision	Yufeng Yang, Adrian Kneip, Charlotte Frenkel	2024-04-30	下载	Edge vision systems combining sensing and embedded processing promise low-latency, decentralized, and energy-efficient solutions that forgo reliance on the cloud.
Sensorized Soft Skin for Dexterous Robotic Hands	Jana Egli, Benedek Forrai, Thomas Buchner, Jiangtao Su, Xiaodong Chen, Robert K. Katzschmann	2024-04-30	下载	Conventional industrial robots often use two-fingered grippers or suction cups to manipulate objects or interact with the world. Because of their simplified design, they are unable to reproduce the de...
Low-overhead General-purpose Near-Data Processing in CXL Memory Expanders	Hyungkyu Ham, Jeongmin Hong, Geonwoo Park, Yunseon Shin, Okkyun Woo, Wonhyuk Yang, Jinhoon Bae, Eunhyeok Park, Hyojin Sung, Euicheol Lim, Gwangsun Kim	2024-04-30	下载	Emerging Compute Express Link (CXL) enables cost-efficient memory expansion beyond the local DRAM of processors. While its CXL $.$ mem protocol provides minimal latency overhead through an optimized pro...
PEFSL: A deployment Pipeline for Embedded Few-Shot Learning on a FPGA SoC	Lucas Grativol Ribeiro, Lubin Gauthier, Mathieu Leonardon, Jérémy Morlier, Antoine Lavrard-Meyer, Guillaume Muller, Virginie Fresse, Matthieu Arzel	2024-04-30	下载	This paper tackles the challenges of implementing few-shot learning on embedded systems, specifically FPGA SoCs, a vital approach for adapting to diverse classification tasks, especially when the cost...
Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUs	Fareed Qararyah, Muhammad Waqar Azhar, Mohammad Ali Maleki, Pedro Trancoso	2024-04-30	下载	Depthwise and pointwise convolutions have fewer parameters and perform fewer operations than standard convolutions. As a result, they have become increasingly used in various compact DNNs, including c...
Logistic Map Pseudo Random Number Generator in FPGA	Mateo Jalen Andrew Calderon, Lee Jun Lei Lucas, Syarifuddin Azhar Bin Rosli, Stephanie See Hui Ying, Jarell Lim En Yu, Maoyang Xiang, T. Hui Teo	2024-04-30	下载	This project develops a pseudo-random number generator (PRNG) using the logistic map, implemented in Verilog HDL on an FPGA and processes its output through a Central Limit Theorem (CLT) function to a...
Thermal Performance of a Liquid-cooling Assisted Thin Wickless Vapor Chamber	Arani Mukhopadhyay, Anish Pal, Mohamad Jafari Gukeh, Constantine M. Megaridis	2024-04-30	下载	The ever-increasing need for power consumption in electronic devices, coupled with the requirement for thinner size, calls for the development of efficient heat spreading components.
Evaluation of Thermal Performance of a Wick-free Vapor Chamber in Power Electronics Cooling	Arani Mukhopadhyay, Anish Pal, Congbo Bao, Mohamad Jafari Gukeh, Sudip K. Mazumder, Constantine M. Megaridis	2024-04-30	下载	Efficient thermal management in high-power electronics cooling can be achieved using phase-change heat transfer devices, such as vapor chambers.
MACO: Exploring GEMM Acceleration on a Loosely-Coupled Multi-core Processor	Bingcai Sui, Junzhong Shen, Caixia Sun, Junhui Wang, Zhong Zheng, Wei Guo	2024-04-30	下载	General-purpose processor vendors have integrated customized accelerator in their products due to the widespread use of General Matrix-Matrix Multiplication (GEMM) kernels.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
DiaQ: Efficient State-Vector Quantum Simulation	Srikar Chundury, Jiajia Li, In-Saeng Suh, Frank Mueller	2024-04-30	下载	In the current era of Noisy Intermediate Scale Quantum (NISQ) computing, efficient digital simulation of quantum systems holds significant importance for quantum algorithm development, verification an...
CurvFed: Curvature-Aligned Federated Learning for Fairness without Demographics	Harshit Sharma, Shaily Roy, Asif Salekin	2024-04-30	下载	Modern human sensing applications often rely on data distributed across users and devices, where privacy concerns prevent centralized training.
Automated, Reliable, and Efficient Continental-Scale Replication of 7.3 Petabytes of Climate Simulation Data: A Case Study	Lukasz Lacinski, Lee Liming, Steven Turoscy, Cameron Harr, Kyle Chard, Eli Dart, Paul Durack, Sasha Ames, Forrest M. Hoffman, Ian T. Foster	2024-04-30	下载	We report on our experiences replicating 7.3 petabytes (PB) of Earth System Grid Federation (ESGF) climate simulation data from Lawrence Livermore National Laboratory (LLNL) in California to Argonne N...
SpComm3D: A Framework for Enabling Sparse Communication in 3D Sparse Kernels	Nabil Abubaker, Torsten Hoefler	2024-04-30	下载	Existing 3D algorithms for distributed-memory sparse kernels suffer from limited scalability due to reliance on bulk sparsity-agnostic communication.
DF Louvain: Fast Incrementally Expanding Approach for Community Detection on Dynamic Graphs	Subhajit Sahu	2024-04-30	下载	Community detection is the problem of recognizing natural divisions in networks. A relevant challenge in this problem is to find communities on rapidly evolving graphs.
Quantum Cloud Computing: Trends and Challenges	Muhammed Golec, Emir Sahin Hatay, Mustafa Golec, Murat Uyar, Merve Golec, Sukhpal Singh Gill	2024-04-30	下载	Quantum computing (QC) is a new paradigm that will revolutionize various areas of computing, especially cloud computing. QC, still in its infancy, is a costly technology capable of operating in highly...
Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping	Chenyu Jiang, Ye Tian, Zhen Jia, Shuai Zheng, Chuan Wu, Yida Wang	2024-04-30	下载	The Mixture-of-Expert (MoE) technique plays a crucial role in expanding the size of DNN model parameters. However, it faces the challenge of extended all-to-all communication latency during the traini...
Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUs	Fareed Qararyah, Muhammad Waqar Azhar, Mohammad Ali Maleki, Pedro Trancoso	2024-04-30	下载	Depthwise and pointwise convolutions have fewer parameters and perform fewer operations than standard convolutions. As a result, they have become increasingly used in various compact DNNs, including c...
Pilot Contamination in Massive MIMO Systems: Challenges and Future Prospects	Muhammad Kamran Saeed, Ashfaq Khokhar, Shakil Ahmed	2024-04-30	下载	Massive multiple input multiple output (M-MIMO) technology plays a pivotal role in fifth-generation (5G) and beyond communication systems, offering a wide range of benefits, from increased spectral ef...
AdaOper: Energy-efficient and Responsive Concurrent DNN Inference on Mobile Devices	Zheng Lin, Bin Guo, Sicong Liu, Wentao Zhou, Yasan Ding, Yu Zhang, Zhiwen Yu	2024-04-30	下载	Deep neural network (DNN) has driven extensive applications in mobile technology. However, for long-running mobile apps like voice assistants or video applications on smartphones, energy efficiency is...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Optimized Non-Primary Channel Access Design in IEEE 802.11bn	Dongyu Wei, Liu Cao, Lyutianyang Zhang, Xiangyu Gao, Hao Yin	2024-04-30	下载	The IEEE 802.11 standards, culminating in IEEE 802.11be (Wi-Fi 7), have significantly expanded bandwidth capacities from 20 MHz to 320 MHz, marking a crucial evolution in wireless access technology.
Optimized Distribution of Entanglement Graph States in Quantum Networks	Xiaojie Fan, Caitao Zhan, Himanshu Gupta, C. R. Ramakrishnan	2024-04-30	下载	Building large-scale quantum computers, essential to demonstrating quantum advantage, is a key challenge. Quantum Networks (QNs) can help address this challenge by enabling the construction of large, ...
Context-Aware Mobile Network Performance Prediction Using Network & Remote Sensing Data	Ali Shibli, Tahar Zanouda	2024-04-30	下载	Accurate estimation of Network Performance is crucial for several tasks in telecom networks. Telecom networks regularly serve a vast number of radio nodes.
Scale-Robust Timely Asynchronous Decentralized Learning	Purbesh Mitra, Sennur Ulukus	2024-04-30	下载	We consider an asynchronous decentralized learning system, which consists of a network of connected devices trying to learn a machine learning model without any centralized parameter server.
Harnessing Federated Generative Learning for Green and Sustainable Internet of Things	Yuanhang Qi, M. Shamim Hossain	2024-04-30	下载	The rapid proliferation of devices in the Internet of Things (IoT) has ushered in a transformative era of data-driven connectivity across various domains.
Recommenadation aided Caching using Combinatorial Multi-armed Bandits	Pavamana K J, Chandramani Kishore Singh	2024-04-30	下载	We study content caching with recommendations in a wireless network where the users are connected through a base station equipped with a finite-capacity cache.
ColosSUMO: Evaluating Cooperative Driving Applications with Colosseum	Gabriele Gemmi, Pedram Johari, Paolo Casari, Michele Polese, Tommaso Melodia, Michele Segata	2024-04-30	下载	The quest for safer and more efficient transportation through cooperative, connected and automated mobility (CCAM) calls for realistic performance analysis tools, especially with respect to wireless c...
Radio Resource Management Design for RSMA: Optimization of Beamforming, User Admission, and Discrete/Continuous Rates with Imperfect SIC	L. F. Abanto-Leon, A. Krishnamoorthy, A. Garcia-Saavedra, G. H. Sim, R. Schober, M. Hollick	2024-04-30	下载	This paper investigates the radio resource management (RRM) design for multiuser rate-splitting multiple access (RSMA), accounting for various characteristics of practical wireless systems, such as th...
Reducing Communication Overhead in the IoT-Edge-Cloud Continuum: A Survey on Protocols and Data Reduction Strategies	Dora Kreković, Petar Krivić, Ivana Podnar Žarko, Mario Kušek, Danh Le-Phuoc	2024-04-30	下载	The adoption of the Internet of Things (IoT) deployments has led to a sharp increase in network traffic as a vast number of IoT devices communicate with each other and IoT services through the IoT-edg...
AutoNet: Automatic Reachability Policy Management in Public Cloud Networks	German Sviridov, Zheng Tao Shen, Jorge Cardoso	2024-04-30	下载	Virtual Private Cloud (VPC) is the main network abstraction technology used in public cloud systems. VPCs are composed of a set of network services that permit the definition of complex network reacha...
Alternative paths computation for congestion mitigation in segment-routing networks	Sébastien Martin, Youcef Magnouche, Paolo Medagliani, Jérémie Leguay	2024-04-30	下载	In backbone networks, it is fundamental to quickly protect traffic against any unexpected event, such as failures or congestions, which may impact Quality of Service (QoS).
Pilot Contamination in Massive MIMO Systems: Challenges and Future Prospects	Muhammad Kamran Saeed, Ashfaq Khokhar, Shakil Ahmed	2024-04-30	下载	Massive multiple input multiple output (M-MIMO) technology plays a pivotal role in fifth-generation (5G) and beyond communication systems, offering a wide range of benefits, from increased spectral ef...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
VeriFence: Lightweight and Precise Spectre Defenses for Untrusted Linux Kernel Extensions	Luis Gerhorst, Henriette Herzog, Peter Wägemann, Maximilian Ott, Rüdiger Kapitza, Timo Hönig	2024-04-30	下载	High-performance IO demands low-overhead communication between user- and kernel space. This demand can no longer be fulfilled by traditional system calls.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Selective Parallel Loading of Large-Scale Compressed Graphs with ParaGrapher	Mohsen Koohi Esfahani, Marco D'Antonio, Syed Ibtisam Tauhidi, Thai Son Mai, Hans Vandierendonck	2024-04-30	下载	Comprehensive evaluation is one of the basis of experimental science. In High-Performance Graph Processing, a thorough evaluation of contributions becomes more achievable by supporting common input fo...
Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUs	Fareed Qararyah, Muhammad Waqar Azhar, Mohammad Ali Maleki, Pedro Trancoso	2024-04-30	下载	Depthwise and pointwise convolutions have fewer parameters and perform fewer operations than standard convolutions. As a result, they have become increasingly used in various compact DNNs, including c...