Appearance
2024-09-17
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| eBPF-mm: Userspace-guided memory management in Linux with eBPF | Konstantinos Mores, Stratos Psomadakis, Georgios Goumas | 2024-09-17 | 下载 | We leverage eBPF in order to implement custom policies in the Linux memory subsystem. Inspired by CBMM, we create a mechanism that provides the kernel with hints regarding the benefit of promoting a p... |
| IBM Quantum Computers: Evolution, Performance, and Future Directions | M. AbuGhanem | 2024-09-17 | 下载 | Quantum computers represent a transformative frontier in computational technology, promising exponential speedups beyond classical computing limits. |
| FSL-HDnn: A 5.7 TOPS/W End-to-end Few-shot Learning Classifier Accelerator with Feature Extraction and Hyperdimensional Computing | Haichao Yang, Chang Eun Song, Weihong Xu, Behnam Khaleghi, Uday Mallappa, Monil Shah, Keming Fan, Mingu Kang, Tajana Rosing | 2024-09-17 | 下载 | This paper introduces FSL-HDnn, an energy-efficient accelerator that implements the end-to-end pipeline of feature extraction, classification, and on-chip few-shot learning (FSL) through gradient-free... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| CountChain: A Decentralized Oracle Network for Counting Systems | Behkish Nassirzadeh, Stefanos Leonardos, Albert Heinle, Anwar Hasan, Vijay Ganesh | 2024-09-17 | 下载 | Blockchain integration in industries like online advertising is hindered by its connectivity limitations to off-chain data. These industries heavily rely on precise counting systems for collecting and... |
| Advances in APPFL: A Comprehensive and Extensible Federated Learning Framework | Zilinghan Li, Shilan He, Ze Yang, Minseok Ryu, Kibaek Kim, Ravi Madduri | 2024-09-17 | 下载 | Federated learning (FL) is a distributed machine learning paradigm enabling collaborative model training while preserving data privacy. In today's landscape, where most data is proprietary, confidenti... |
| Temporal Load Imbalance on Ondes3D Seismic Simulator for Different Multicore Architectures | Ana Luisa Veroneze Solórzano, Philippe Olivier Alexandre Navaux, Lucas Mello Schnorr | 2024-09-17 | 下载 | The variety of today's multicore architectures motivates researchers to explore parallel scientific applications on different platforms. Load imbalance is one performance issue that can prejudice para... |
| Communication Lower Bounds and Optimal Algorithms for Symmetric Matrix Computations | Hussam Al Daas, Grey Ballard, Laura Grigori, Suraj Kumar, Kathryn Rouse, Mathieu Verite | 2024-09-17 | 下载 | In this article, we focus on the communication costs of three symmetric matrix computations: i) multiplying a matrix with its transpose, known as a symmetric rank-k update (SYRK) ii) adding the result... |
| Federated Learning with Integrated Sensing, Communication, and Computation: Frameworks and Performance Analysis | Yipeng Liang, Qimei Chen, Hao Jiang | 2024-09-17 | 下载 | With the emergence of integrated sensing, communication, and computation (ISCC) in the upcoming 6G era, federated learning with ISCC (FL-ISCC), integrating sample collection, local training, and param... |
| Energy Efficiency Support for Software Defined Networks: a Serverless Computing Approach | Fatemeh Banaie, Karim Djemame, Abdulaziz Alhindi, Vasilios Kelefouras | 2024-09-17 | 下载 | Automatic network management strategies have become paramount for meeting the needs of innovative real-time and data-intensive applications, such as in the Internet of Things. |
| A Reinforcement Learning Environment for Automatic Code Optimization in the MLIR Compiler | Mohammed Tirichine, Nassim Ameur, Nazim Bendib, Iheb Nassim Aouadj, Bouchama Djad, Rafik Bouloudene, Riyadh Baghdadi | 2024-09-17 | 下载 | Code optimization is a crucial task that aims to enhance code performance. However, this process is often tedious and complex, highlighting the necessity for automatic code optimization techniques. |
| Delay Analysis of EIP-4844 | Pourya Soltani, Farid Ashtiani | 2024-09-17 | 下载 | Proto-Danksharding, proposed in Ethereum Improvement Proposal 4844 (EIP-4844), aims to incrementally improve the scalability of the Ethereum blockchain by introducing a new type of transaction known a... |
| Ladon: High-Performance Multi-BFT Consensus via Dynamic Global Ordering (Extended Version) | Hanzheng Lyu, Shaokang Xie, Jianyu Niu, Chen Feng, Yinqian Zhang, Ivan Beschastnikh | 2024-09-17 | 下载 | Multi-BFT consensus runs multiple leader-based consensus instances in parallel, circumventing the leader bottleneck of a single instance. However, it contains an Achilles' heel: the need to globally o... |
| Skip TLB flushes for reused pages within mmap's | Frederic Schimmelpfennig, André Brinkmann, Hossein Asadi, Reza Salkhordeh | 2024-09-17 | 下载 | Memory access efficiency is significantly enhanced by caching recent address translations in the CPUs' Translation Lookaside Buffers (TLBs). However, since the operating system is not aware of which c... |
| Dynamic DAG-Application Scheduling for Multi-Tier Edge Computing in Heterogeneous Networks | Xiang Li, Mustafa Abdallah, Yuan-Yao Lou, Mung Chiang, Kwang Taik Kim, Saurabh Bagchi | 2024-09-17 | 下载 | Edge computing is deemed a promising technique to execute latency-sensitive applications by offloading computation-intensive tasks to edge servers. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Micro-orchestration of RAN functions accelerated in FPGA SoC devices | Nikolaos Bartzoudis, José Rubio Fernández, David López-Bueno, Godfrey Kibalya, Angelos Antonopoulos | 2024-09-17 | 下载 | This work provides a vision on how to tackle the underutilization of compute resources in FPGA SoC devices used across 5G and edge computing infrastructures. |
| Designing Reliable Virtualized Radio Access Networks | Ufuk Usubütün, André Gomes, Shankaranarayanan Puzhavakath Narayanan, Matti Hiltunen, Shivendra Panwar | 2024-09-17 | 下载 | As virtualization of Radio Access Networks (RAN) gains momentum, understanding the impact of hardware and software disaggregation on resiliency becomes critical to meet the high availability requireme... |
| LoRa Communication for Agriculture 4.0: Opportunities, Challenges, and Future Directions | Lameya Aldhaheri, Noor Alshehhi, Irfana Ilyas Jameela Manzil, Ruhul Amin Khalil, Shumaila Javaid, Nasir Saeed, Mohamed-Slim Alouini | 2024-09-17 | 下载 | The emerging field of smart agriculture leverages the Internet of Things (IoT) to revolutionize farming practices. This paper investigates the transformative potential of Long Range (LoRa) technology ... |
| AutoFlow: An Autoencoder-based Approach for IP Flow Record Compression with Minimal Impact on Traffic Classification | Adrian Pekar | 2024-09-17 | 下载 | Network monitoring generates massive volumes of IP flow records, posing significant challenges for storage and analysis. This paper presents a novel deep learning-based approach to compressing these r... |
| Trends, Advancements and Challenges in Intelligent Optimization in Satellite Communication | Philippe Krajsic, Viola Suess, Zehong Cao, Ryszard Kowalczyk, Bogdan Franczyk | 2024-09-17 | 下载 | Efficient satellite communications play an enormously important role in all of our daily lives. This includes the transmission of data for communication purposes, the operation of IoT applications or ... |
| Dynamic DAG-Application Scheduling for Multi-Tier Edge Computing in Heterogeneous Networks | Xiang Li, Mustafa Abdallah, Yuan-Yao Lou, Mung Chiang, Kwang Taik Kim, Saurabh Bagchi | 2024-09-17 | 下载 | Edge computing is deemed a promising technique to execute latency-sensitive applications by offloading computation-intensive tasks to edge servers. |
| Fast and Post-Quantum Authentication for Real-time Next Generation Networks with Bloom Filter | Kiarash Sedghighadikolaei, Attila A Yavuz | 2024-09-17 | 下载 | Large-scale next-generation networked systems like smart grids and vehicular networks facilitate extensive automation and autonomy through real-time communication of sensitive messages. |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Analysis of Synchronization Mechanisms in Operating Systems | Oluwatoyin Kode, Temitope Oyemade | 2024-09-17 | 下载 | This research analyzed the performance and consistency of four synchronization mechanisms-reentrant locks, semaphores, synchronized methods, and synchronized blocks-across three operating systems: mac... |
| eBPF-mm: Userspace-guided memory management in Linux with eBPF | Konstantinos Mores, Stratos Psomadakis, Georgios Goumas | 2024-09-17 | 下载 | We leverage eBPF in order to implement custom policies in the Linux memory subsystem. Inspired by CBMM, we create a mechanism that provides the kernel with hints regarding the benefit of promoting a p... |
| Skip TLB flushes for reused pages within mmap's | Frederic Schimmelpfennig, André Brinkmann, Hossein Asadi, Reza Salkhordeh | 2024-09-17 | 下载 | Memory access efficiency is significantly enhanced by caching recent address translations in the CPUs' Translation Lookaside Buffers (TLBs). However, since the operating system is not aware of which c... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Temporal Load Imbalance on Ondes3D Seismic Simulator for Different Multicore Architectures | Ana Luisa Veroneze Solórzano, Philippe Olivier Alexandre Navaux, Lucas Mello Schnorr | 2024-09-17 | 下载 | The variety of today's multicore architectures motivates researchers to explore parallel scientific applications on different platforms. Load imbalance is one performance issue that can prejudice para... |
| Can Graph Reordering Speed Up Graph Neural Network Training? An Experimental Study | Nikolai Merkel, Pierre Toussing, Ruben Mayer, Hans-Arno Jacobsen | 2024-09-17 | 下载 | Graph neural networks (GNNs) are a type of neural network capable of learning on graph-structured data. However, training GNNs on large-scale graphs is challenging due to iterative aggregations of hig... |