Skip to content

2024-09-27

cs.AR - Architecture

标题作者发布日期PDF摘要
Voxel-CIM: An Efficient Compute-in-Memory Accelerator for Voxel-based Point Cloud Neural NetworksXipeng Lin, Shanshi Huang, Hongwu Jiang2024-09-27下载The 3D point cloud perception has emerged as a fundamental role for a wide range of applications. In particular, with the rapid development of neural networks, the voxel-based networks attract great a...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Accelerating stencils on the Tenstorrent Grayskull RISC-V acceleratorNick Brown, Ryan Barton2024-09-27下载The RISC-V Instruction Set Architecture (ISA) has enjoyed phenomenal growth in recent years, however it still to gain popularity in HPC. Whilst adopting RISC-V CPU solutions in HPC might be some way o...
Fully integrating the Flang Fortran compiler with standard MLIRNick Brown2024-09-27下载Fortran is the lingua franca of HPC code development and as such it is crucial that we as a community have open source Fortran compilers capable of generating high performance executables.
Hierarchical Federated ADMMSeyed Mohammad Azimi-Abarghouyi, Nicola Bastianello, Karl H. Johansson, Viktoria Fodor2024-09-27下载In this paper, we depart from the widely-used gradient descent-based hierarchical federated learning (FL) algorithms to develop a novel hierarchical FL framework based on the alternating direction met...
Hello SME! Generating Fast Matrix Multiplication Kernels Using the Scalable Matrix ExtensionStefan Remke, Alexander Breuer2024-09-27下载Modern central processing units (CPUs) feature single-instruction, multiple-data pipelines to accelerate compute-intensive floating-point and fixed-point workloads.
TensorSocket: Shared Data Loading for Deep Learning TrainingTies Robroek, Neil Kim Nielsen, Pınar Tözün2024-09-27下载Training deep learning models is a repetitive and resource-intensive process. Data scientists often train several models before landing on a set of parameters (e.g.
Exploring DAOS Interfaces and PerformanceNicolau Manubens, Johann Lombardi, Simon D. Smart, Emanuele Danovaro, Tiago Quintino, Dean Hildebrand, Adrian Jackson2024-09-27下载Distributed Asynchronous Object Store (DAOS) is a novel software-defined object store leveraging Non-Volatile Memory (NVM) devices, designed for high performance.
Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge IntegrationMahdi Morafah, Vyacheslav Kungurtsev, Hojin Chang, Chen Chen, Bill Lin2024-09-27下载Federated Learning has emerged as a promising paradigm for collaborative machine learning, while preserving user data privacy. Despite its potential, standard FL lacks support for diverse heterogeneou...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
SAMBA: Scalable Approximate Forwarding For NDN Implicit FIB AggregationAmir Esmaeili, Abderrahmen Mtibaa2024-09-27下载The Internet landscape has witnessed a significant shift toward Information Centric Networking (ICN) due to the exponential growth of data-driven applications.
Towards Energy- and Cost-Efficient 6G NetworksTommy Azzino, Aria HasanzadeZonuzy, Jianghong Luo, Navid Abedini, Tao Luo2024-09-27下载As the world enters the journey toward the 6th generation (6G) of wireless technology, the promises of ultra-high data rates, unprecedented low latency, and a massive surge in connected devices requir...
Trust, But Verify, Operator-Reported GeolocationKatherine Izhikevich, Ben Du, Sumanth Rao, Alisha Ukani, Liz Izhikevich2024-09-27下载Geolocation plays a critical role in understanding the Internet. In this work, we provide an in-depth analysis of operator-misreported geolocation.
Adversarial Challenges in Network Intrusion Detection Systems: Research Insights and Future ProspectsSabrine Ennaji, Fabio De Gaspari, Dorjan Hitaj, Alicia Kbidi, Luigi V. Mancini2024-09-27下载Machine learning has brought significant advances in cybersecurity, particularly in the development of Intrusion Detection Systems (IDS). These improvements are mainly attributed to the ability of mac...
Enhancing Spectrum Efficiency in 6G Satellite Networks: A GAIL-Powered Policy Learning via Asynchronous Federated Inverse Reinforcement LearningSheikh Salman Hassan, Yu Min Park, Yan Kyaw Tun, Walid Saad, Zhu Han, Choong Seon Hong2024-09-27下载In this paper, a novel generative adversarial imitation learning (GAIL)-powered policy learning approach is proposed for optimizing beamforming, spectrum allocation, and remote user equipment (RUE) as...
Online and Utility-Power Efficient Task Scheduling in Homogeneous Fog NetworksFatemeh Ebadi, Vahid Shah-Mansouri2024-09-27下载Fog computing is of particular interest to Internet of Things (IoT), where inexpensive simple devices can offload their computation tasks to nearby Fog Nodes.
Towards Event-Triggered NMPC for Efficient 6G Communications: Experimental Results and Open ProblemsJens Püttschneider, Julian Golembiewski, Niklas A. Wagner, Christian Wietfeld, Timm Faulwasser2024-09-27下载Networked control systems enable real-time control and coordination of distributed systems, leveraging the low latency, high reliability, and massive connectivity offered by 5G and future 6G networks.

cs.PF - Performance

标题作者发布日期PDF摘要
ZERNIPAX: A Fast and Accurate Zernike Polynomial Calculator in PythonYigit Gunsur Elmacioglu, Rory Conlin, Daniel W. Dudt, Dario Panici, Egemen Kolemen2024-09-27下载Zernike polynomials serve as an orthogonal basis on the unit disc, and have proven to be effective in optics simulations, astrophysics, and more recently in plasma simulations.
Cluster-BPI: Efficient Fine-Grain Blind Power Identification for Defending against Hardware Thermal Trojans in Multicore SoCsMohamed R. Elshamy, Mehdi Elahi, Ahmad Patooghy, Abdel-Hameed A. Badawy2024-09-27下载Modern multicore System-on-Chips (SoCs) feature hardware monitoring mechanisms that measure total power consumption. However, these aggregate measurements are often insufficient for fine-grained therm...
Toward Greener Matrix Operations by Lossless Compressed FormatsFrancesco Tosoni, Philip Bille, Valerio Brunacci, Alessio De Angelis, Paolo Ferragina, Giovanni Manzini2024-09-27下载Sparse matrix-vector multiplication (SpMV) is a fundamental operation in machine learning, scientific computing, and graph algorithms. In this paper, we investigate the space, time, and energy efficie...
Balanced Splitting: A Framework for Achieving Zero-wait in the Multiserver-job ModelJonatha Anselmi, Josu Doncel2024-09-27下载We present a new framework for designing nonpreemptive and job-size oblivious scheduling policies in the multiserver-job queueing model. The main requirement is to identify a static and balanced sub-p...

基于 VitePress 构建