2024-09-27
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Voxel-CIM: An Efficient Compute-in-Memory Accelerator for Voxel-based Point Cloud Neural Networks | Xipeng Lin, Shanshi Huang, Hongwu Jiang | 2024-09-27 | Download | The 3D point cloud perception has emerged as a fundamental role for a wide range of applications. In particular, with the rapid development of neural networks, the voxel-based networks attract great a... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Accelerating stencils on the Tenstorrent Grayskull RISC-V accelerator | Nick Brown, Ryan Barton | 2024-09-27 | Download | The RISC-V Instruction Set Architecture (ISA) has enjoyed phenomenal growth in recent years; however, it has yet to gain popularity in HPC. Whilst adopting RISC-V CPU solutions in HPC might be some way o... |
| Fully integrating the Flang Fortran compiler with standard MLIR | Nick Brown | 2024-09-27 | Download | Fortran is the lingua franca of HPC code development and as such it is crucial that we as a community have open source Fortran compilers capable of generating high performance executables. |
| Hierarchical Federated ADMM | Seyed Mohammad Azimi-Abarghouyi, Nicola Bastianello, Karl H. Johansson, Viktoria Fodor | 2024-09-27 | Download | In this paper, we depart from the widely-used gradient descent-based hierarchical federated learning (FL) algorithms to develop a novel hierarchical FL framework based on the alternating direction met... |
| Hello SME! Generating Fast Matrix Multiplication Kernels Using the Scalable Matrix Extension | Stefan Remke, Alexander Breuer | 2024-09-27 | Download | Modern central processing units (CPUs) feature single-instruction, multiple-data pipelines to accelerate compute-intensive floating-point and fixed-point workloads. |
| TensorSocket: Shared Data Loading for Deep Learning Training | Ties Robroek, Neil Kim Nielsen, Pınar Tözün | 2024-09-27 | Download | Training deep learning models is a repetitive and resource-intensive process. Data scientists often train several models before landing on a set of parameters (e.g. |
| Exploring DAOS Interfaces and Performance | Nicolau Manubens, Johann Lombardi, Simon D. Smart, Emanuele Danovaro, Tiago Quintino, Dean Hildebrand, Adrian Jackson | 2024-09-27 | Download | Distributed Asynchronous Object Store (DAOS) is a novel software-defined object store leveraging Non-Volatile Memory (NVM) devices, designed for high performance. |
| Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration | Mahdi Morafah, Vyacheslav Kungurtsev, Hojin Chang, Chen Chen, Bill Lin | 2024-09-27 | Download | Federated Learning has emerged as a promising paradigm for collaborative machine learning, while preserving user data privacy. Despite its potential, standard FL lacks support for diverse heterogeneou... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| SAMBA: Scalable Approximate Forwarding For NDN Implicit FIB Aggregation | Amir Esmaeili, Abderrahmen Mtibaa | 2024-09-27 | Download | The Internet landscape has witnessed a significant shift toward Information Centric Networking (ICN) due to the exponential growth of data-driven applications. |
| Towards Energy- and Cost-Efficient 6G Networks | Tommy Azzino, Aria HasanzadeZonuzy, Jianghong Luo, Navid Abedini, Tao Luo | 2024-09-27 | Download | As the world enters the journey toward the 6th generation (6G) of wireless technology, the promises of ultra-high data rates, unprecedented low latency, and a massive surge in connected devices requir... |
| Trust, But Verify, Operator-Reported Geolocation | Katherine Izhikevich, Ben Du, Sumanth Rao, Alisha Ukani, Liz Izhikevich | 2024-09-27 | Download | Geolocation plays a critical role in understanding the Internet. In this work, we provide an in-depth analysis of operator-misreported geolocation. |
| Adversarial Challenges in Network Intrusion Detection Systems: Research Insights and Future Prospects | Sabrine Ennaji, Fabio De Gaspari, Dorjan Hitaj, Alicia Kbidi, Luigi V. Mancini | 2024-09-27 | Download | Machine learning has brought significant advances in cybersecurity, particularly in the development of Intrusion Detection Systems (IDS). These improvements are mainly attributed to the ability of mac... |
| Enhancing Spectrum Efficiency in 6G Satellite Networks: A GAIL-Powered Policy Learning via Asynchronous Federated Inverse Reinforcement Learning | Sheikh Salman Hassan, Yu Min Park, Yan Kyaw Tun, Walid Saad, Zhu Han, Choong Seon Hong | 2024-09-27 | Download | In this paper, a novel generative adversarial imitation learning (GAIL)-powered policy learning approach is proposed for optimizing beamforming, spectrum allocation, and remote user equipment (RUE) as... |
| Online and Utility-Power Efficient Task Scheduling in Homogeneous Fog Networks | Fatemeh Ebadi, Vahid Shah-Mansouri | 2024-09-27 | Download | Fog computing is of particular interest to Internet of Things (IoT), where inexpensive simple devices can offload their computation tasks to nearby Fog Nodes. |
| Towards Event-Triggered NMPC for Efficient 6G Communications: Experimental Results and Open Problems | Jens Püttschneider, Julian Golembiewski, Niklas A. Wagner, Christian Wietfeld, Timm Faulwasser | 2024-09-27 | Download | Networked control systems enable real-time control and coordination of distributed systems, leveraging the low latency, high reliability, and massive connectivity offered by 5G and future 6G networks. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| ZERNIPAX: A Fast and Accurate Zernike Polynomial Calculator in Python | Yigit Gunsur Elmacioglu, Rory Conlin, Daniel W. Dudt, Dario Panici, Egemen Kolemen | 2024-09-27 | Download | Zernike polynomials serve as an orthogonal basis on the unit disc, and have proven to be effective in optics simulations, astrophysics, and more recently in plasma simulations. |
| Cluster-BPI: Efficient Fine-Grain Blind Power Identification for Defending against Hardware Thermal Trojans in Multicore SoCs | Mohamed R. Elshamy, Mehdi Elahi, Ahmad Patooghy, Abdel-Hameed A. Badawy | 2024-09-27 | Download | Modern multicore System-on-Chips (SoCs) feature hardware monitoring mechanisms that measure total power consumption. However, these aggregate measurements are often insufficient for fine-grained therm... |
| Toward Greener Matrix Operations by Lossless Compressed Formats | Francesco Tosoni, Philip Bille, Valerio Brunacci, Alessio De Angelis, Paolo Ferragina, Giovanni Manzini | 2024-09-27 | Download | Sparse matrix-vector multiplication (SpMV) is a fundamental operation in machine learning, scientific computing, and graph algorithms. In this paper, we investigate the space, time, and energy efficie... |
| Balanced Splitting: A Framework for Achieving Zero-wait in the Multiserver-job Model | Jonatha Anselmi, Josu Doncel | 2024-09-27 | Download | We present a new framework for designing nonpreemptive and job-size oblivious scheduling policies in the multiserver-job queueing model. The main requirement is to identify a static and balanced sub-p... |