Skip to content

2024-04-30

cs.AR - Architecture

标题作者发布日期PDF摘要
Selective Parallel Loading of Large-Scale Compressed Graphs with ParaGrapherMohsen Koohi Esfahani, Marco D'Antonio, Syed Ibtisam Tauhidi, Thai Son Mai, Hans Vandierendonck2024-04-30下载Comprehensive evaluation is one of the basis of experimental science. In High-Performance Graph Processing, a thorough evaluation of contributions becomes more achievable by supporting common input fo...
EvGNN: An Event-driven Graph Neural Network Accelerator for Edge VisionYufeng Yang, Adrian Kneip, Charlotte Frenkel2024-04-30下载Edge vision systems combining sensing and embedded processing promise low-latency, decentralized, and energy-efficient solutions that forgo reliance on the cloud.
Sensorized Soft Skin for Dexterous Robotic HandsJana Egli, Benedek Forrai, Thomas Buchner, Jiangtao Su, Xiaodong Chen, Robert K. Katzschmann2024-04-30下载Conventional industrial robots often use two-fingered grippers or suction cups to manipulate objects or interact with the world. Because of their simplified design, they are unable to reproduce the de...
Low-overhead General-purpose Near-Data Processing in CXL Memory ExpandersHyungkyu Ham, Jeongmin Hong, Geonwoo Park, Yunseon Shin, Okkyun Woo, Wonhyuk Yang, Jinhoon Bae, Eunhyeok Park, Hyojin Sung, Euicheol Lim, Gwangsun Kim2024-04-30下载Emerging Compute Express Link (CXL) enables cost-efficient memory expansion beyond the local DRAM of processors. While its CXL..mem protocol provides minimal latency overhead through an optimized pro...
PEFSL: A deployment Pipeline for Embedded Few-Shot Learning on a FPGA SoCLucas Grativol Ribeiro, Lubin Gauthier, Mathieu Leonardon, Jérémy Morlier, Antoine Lavrard-Meyer, Guillaume Muller, Virginie Fresse, Matthieu Arzel2024-04-30下载This paper tackles the challenges of implementing few-shot learning on embedded systems, specifically FPGA SoCs, a vital approach for adapting to diverse classification tasks, especially when the cost...
Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUsFareed Qararyah, Muhammad Waqar Azhar, Mohammad Ali Maleki, Pedro Trancoso2024-04-30下载Depthwise and pointwise convolutions have fewer parameters and perform fewer operations than standard convolutions. As a result, they have become increasingly used in various compact DNNs, including c...
Logistic Map Pseudo Random Number Generator in FPGAMateo Jalen Andrew Calderon, Lee Jun Lei Lucas, Syarifuddin Azhar Bin Rosli, Stephanie See Hui Ying, Jarell Lim En Yu, Maoyang Xiang, T. Hui Teo2024-04-30下载This project develops a pseudo-random number generator (PRNG) using the logistic map, implemented in Verilog HDL on an FPGA and processes its output through a Central Limit Theorem (CLT) function to a...
Thermal Performance of a Liquid-cooling Assisted Thin Wickless Vapor ChamberArani Mukhopadhyay, Anish Pal, Mohamad Jafari Gukeh, Constantine M. Megaridis2024-04-30下载The ever-increasing need for power consumption in electronic devices, coupled with the requirement for thinner size, calls for the development of efficient heat spreading components.
Evaluation of Thermal Performance of a Wick-free Vapor Chamber in Power Electronics CoolingArani Mukhopadhyay, Anish Pal, Congbo Bao, Mohamad Jafari Gukeh, Sudip K. Mazumder, Constantine M. Megaridis2024-04-30下载Efficient thermal management in high-power electronics cooling can be achieved using phase-change heat transfer devices, such as vapor chambers.
MACO: Exploring GEMM Acceleration on a Loosely-Coupled Multi-core ProcessorBingcai Sui, Junzhong Shen, Caixia Sun, Junhui Wang, Zhong Zheng, Wei Guo2024-04-30下载General-purpose processor vendors have integrated customized accelerator in their products due to the widespread use of General Matrix-Matrix Multiplication (GEMM) kernels.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
DiaQ: Efficient State-Vector Quantum SimulationSrikar Chundury, Jiajia Li, In-Saeng Suh, Frank Mueller2024-04-30下载In the current era of Noisy Intermediate Scale Quantum (NISQ) computing, efficient digital simulation of quantum systems holds significant importance for quantum algorithm development, verification an...
CurvFed: Curvature-Aligned Federated Learning for Fairness without DemographicsHarshit Sharma, Shaily Roy, Asif Salekin2024-04-30下载Modern human sensing applications often rely on data distributed across users and devices, where privacy concerns prevent centralized training.
Automated, Reliable, and Efficient Continental-Scale Replication of 7.3 Petabytes of Climate Simulation Data: A Case StudyLukasz Lacinski, Lee Liming, Steven Turoscy, Cameron Harr, Kyle Chard, Eli Dart, Paul Durack, Sasha Ames, Forrest M. Hoffman, Ian T. Foster2024-04-30下载We report on our experiences replicating 7.3 petabytes (PB) of Earth System Grid Federation (ESGF) climate simulation data from Lawrence Livermore National Laboratory (LLNL) in California to Argonne N...
SpComm3D: A Framework for Enabling Sparse Communication in 3D Sparse KernelsNabil Abubaker, Torsten Hoefler2024-04-30下载Existing 3D algorithms for distributed-memory sparse kernels suffer from limited scalability due to reliance on bulk sparsity-agnostic communication.
DF Louvain: Fast Incrementally Expanding Approach for Community Detection on Dynamic GraphsSubhajit Sahu2024-04-30下载Community detection is the problem of recognizing natural divisions in networks. A relevant challenge in this problem is to find communities on rapidly evolving graphs.
Quantum Cloud Computing: Trends and ChallengesMuhammed Golec, Emir Sahin Hatay, Mustafa Golec, Murat Uyar, Merve Golec, Sukhpal Singh Gill2024-04-30下载Quantum computing (QC) is a new paradigm that will revolutionize various areas of computing, especially cloud computing. QC, still in its infancy, is a costly technology capable of operating in highly...
Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication OverlappingChenyu Jiang, Ye Tian, Zhen Jia, Shuai Zheng, Chuan Wu, Yida Wang2024-04-30下载The Mixture-of-Expert (MoE) technique plays a crucial role in expanding the size of DNN model parameters. However, it faces the challenge of extended all-to-all communication latency during the traini...
Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUsFareed Qararyah, Muhammad Waqar Azhar, Mohammad Ali Maleki, Pedro Trancoso2024-04-30下载Depthwise and pointwise convolutions have fewer parameters and perform fewer operations than standard convolutions. As a result, they have become increasingly used in various compact DNNs, including c...
Pilot Contamination in Massive MIMO Systems: Challenges and Future ProspectsMuhammad Kamran Saeed, Ashfaq Khokhar, Shakil Ahmed2024-04-30下载Massive multiple input multiple output (M-MIMO) technology plays a pivotal role in fifth-generation (5G) and beyond communication systems, offering a wide range of benefits, from increased spectral ef...
AdaOper: Energy-efficient and Responsive Concurrent DNN Inference on Mobile DevicesZheng Lin, Bin Guo, Sicong Liu, Wentao Zhou, Yasan Ding, Yu Zhang, Zhiwen Yu2024-04-30下载Deep neural network (DNN) has driven extensive applications in mobile technology. However, for long-running mobile apps like voice assistants or video applications on smartphones, energy efficiency is...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Optimized Non-Primary Channel Access Design in IEEE 802.11bnDongyu Wei, Liu Cao, Lyutianyang Zhang, Xiangyu Gao, Hao Yin2024-04-30下载The IEEE 802.11 standards, culminating in IEEE 802.11be (Wi-Fi 7), have significantly expanded bandwidth capacities from 20 MHz to 320 MHz, marking a crucial evolution in wireless access technology.
Optimized Distribution of Entanglement Graph States in Quantum NetworksXiaojie Fan, Caitao Zhan, Himanshu Gupta, C. R. Ramakrishnan2024-04-30下载Building large-scale quantum computers, essential to demonstrating quantum advantage, is a key challenge. Quantum Networks (QNs) can help address this challenge by enabling the construction of large, ...
Context-Aware Mobile Network Performance Prediction Using Network & Remote Sensing DataAli Shibli, Tahar Zanouda2024-04-30下载Accurate estimation of Network Performance is crucial for several tasks in telecom networks. Telecom networks regularly serve a vast number of radio nodes.
Scale-Robust Timely Asynchronous Decentralized LearningPurbesh Mitra, Sennur Ulukus2024-04-30下载We consider an asynchronous decentralized learning system, which consists of a network of connected devices trying to learn a machine learning model without any centralized parameter server.
Harnessing Federated Generative Learning for Green and Sustainable Internet of ThingsYuanhang Qi, M. Shamim Hossain2024-04-30下载The rapid proliferation of devices in the Internet of Things (IoT) has ushered in a transformative era of data-driven connectivity across various domains.
Recommenadation aided Caching using Combinatorial Multi-armed BanditsPavamana K J, Chandramani Kishore Singh2024-04-30下载We study content caching with recommendations in a wireless network where the users are connected through a base station equipped with a finite-capacity cache.
ColosSUMO: Evaluating Cooperative Driving Applications with ColosseumGabriele Gemmi, Pedram Johari, Paolo Casari, Michele Polese, Tommaso Melodia, Michele Segata2024-04-30下载The quest for safer and more efficient transportation through cooperative, connected and automated mobility (CCAM) calls for realistic performance analysis tools, especially with respect to wireless c...
Radio Resource Management Design for RSMA: Optimization of Beamforming, User Admission, and Discrete/Continuous Rates with Imperfect SICL. F. Abanto-Leon, A. Krishnamoorthy, A. Garcia-Saavedra, G. H. Sim, R. Schober, M. Hollick2024-04-30下载This paper investigates the radio resource management (RRM) design for multiuser rate-splitting multiple access (RSMA), accounting for various characteristics of practical wireless systems, such as th...
Reducing Communication Overhead in the IoT-Edge-Cloud Continuum: A Survey on Protocols and Data Reduction StrategiesDora Kreković, Petar Krivić, Ivana Podnar Žarko, Mario Kušek, Danh Le-Phuoc2024-04-30下载The adoption of the Internet of Things (IoT) deployments has led to a sharp increase in network traffic as a vast number of IoT devices communicate with each other and IoT services through the IoT-edg...
AutoNet: Automatic Reachability Policy Management in Public Cloud NetworksGerman Sviridov, Zheng Tao Shen, Jorge Cardoso2024-04-30下载Virtual Private Cloud (VPC) is the main network abstraction technology used in public cloud systems. VPCs are composed of a set of network services that permit the definition of complex network reacha...
Alternative paths computation for congestion mitigation in segment-routing networksSébastien Martin, Youcef Magnouche, Paolo Medagliani, Jérémie Leguay2024-04-30下载In backbone networks, it is fundamental to quickly protect traffic against any unexpected event, such as failures or congestions, which may impact Quality of Service (QoS).
Pilot Contamination in Massive MIMO Systems: Challenges and Future ProspectsMuhammad Kamran Saeed, Ashfaq Khokhar, Shakil Ahmed2024-04-30下载Massive multiple input multiple output (M-MIMO) technology plays a pivotal role in fifth-generation (5G) and beyond communication systems, offering a wide range of benefits, from increased spectral ef...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
VeriFence: Lightweight and Precise Spectre Defenses for Untrusted Linux Kernel ExtensionsLuis Gerhorst, Henriette Herzog, Peter Wägemann, Maximilian Ott, Rüdiger Kapitza, Timo Hönig2024-04-30下载High-performance IO demands low-overhead communication between user- and kernel space. This demand can no longer be fulfilled by traditional system calls.

cs.PF - Performance

标题作者发布日期PDF摘要
Selective Parallel Loading of Large-Scale Compressed Graphs with ParaGrapherMohsen Koohi Esfahani, Marco D'Antonio, Syed Ibtisam Tauhidi, Thai Son Mai, Hans Vandierendonck2024-04-30下载Comprehensive evaluation is one of the basis of experimental science. In High-Performance Graph Processing, a thorough evaluation of contributions becomes more achievable by supporting common input fo...
Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUsFareed Qararyah, Muhammad Waqar Azhar, Mohammad Ali Maleki, Pedro Trancoso2024-04-30下载Depthwise and pointwise convolutions have fewer parameters and perform fewer operations than standard convolutions. As a result, they have become increasingly used in various compact DNNs, including c...

基于 VitePress 构建