Appearance
2024-11-21
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Swift: A Multi-FPGA Framework for Scaling Up Accelerated Graph Analytics | Oluwole Jaiyeoba, Abdullah T. Mughrabi, Morteza Baradaran, Beenish Gul, Kevin Skadron | 2024-11-21 | 下载 | Graph analytics are vital in fields such as social networks, biomedical research, and graph neural networks (GNNs). However, traditional CPUs and GPUs struggle with the memory bottlenecks caused by la... |
| Masala-CHAI: A Large-Scale SPICE Netlist Dataset for Analog Circuits by Harnessing AI | Jitendra Bhandari, Vineet Bhat, Yuheng He, Hamed Rahmani, Siddharth Garg, Ramesh Karri | 2024-11-21 | 下载 | Masala-CHAI is a fully automated framework leveraging large language models (LLMs) to generate Simulation Programs with Integrated Circuit Emphasis (SPICE) netlists. |
| CKTSO: High-Performance Parallel Sparse Linear Solver for General Circuit Simulations | Xiaoming Chen | 2024-11-21 | 下载 | This paper introduces CKTSO (abbreviation of "circuit solver"), a novel sparse linear solver specially designed for the simulation program with integrated circuit emphasis (SPICE). |
| Experimental comparison of graph-based approximate nearest neighbor search algorithms on edge devices | Ali Ganbarov, Jicheng Yuan, Anh Le-Tuan, Manfred Hauswirth, Danh Le-Phuoc | 2024-11-21 | 下载 | In this paper, we present an experimental comparison of various graph-based approximate nearest neighbor (ANN) search algorithms deployed on edge devices for real-time nearest neighbor search applicat... |
| RISC-V Word-Size Modular Instructions for Residue Number Systems | Laurent-Stéphane Didier, Jean-Marc Robert | 2024-11-21 | 下载 | Residue Number Systems (RNS) are parallel number systems that allow the computation on large numbers. They are used in high performance digital signal processing devices and cryptographic applications... |
| Dissecting Conditional Branch Predictors of Apple Firestorm and Qualcomm Oryon for Software Optimization and Architectural Analysis | Jiajie Chen, Peng Qu, Youhui Zhang | 2024-11-21 | 下载 | Branch predictor (BP) is a critical component of modern processors, and its accurate modeling is essential for compilers and applications. However, processor vendors have disclosed limited details abo... |
| Schemato -- An LLM for Netlist-to-Schematic Conversion | Ryoga Matsuo, Stefan Uhlich, Arun Venkitaraman, Andrea Bonetti, Chia-Yu Hsieh, Ali Momeni, Lukas Mauch, Augusto Capone, Eisaku Ohbuchi, Lorenzo Servadei | 2024-11-21 | 下载 | Machine learning models are advancing circuit design, particularly in analog circuits. They typically generate netlists that lack human interpretability. |
| GraCo -- A Graph Composer for Integrated Circuits | Stefan Uhlich, Andrea Bonetti, Arun Venkitaraman, Ali Momeni, Ryoga Matsuo, Chia-Yu Hsieh, Eisaku Ohbuchi, Lorenzo Servadei | 2024-11-21 | 下载 | Designing integrated circuits involves substantial complexity, posing challenges in revealing its potential applications - from custom digital cells to analog circuits. |
| EDA-Aware RTL Generation with Large Language Models | Mubashir ul Islam, Humza Sami, Pierre-Emmanuel Gaillardon, Valerio Tenace | 2024-11-21 | 下载 | Large Language Models (LLMs) have become increasingly popular for generating RTL code. However, producing error-free RTL code in a zero-shot setting remains highly challenging for even state-of-the-ar... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| LLOR: Automated Repair of OpenMP Programs | Utpal Bora, Saurabh Joshi, Gautam Muduganti, Ramakrishna Upadrasta | 2024-11-21 | 下载 | In this paper, we present a technique for repairing data race errors in parallel programs written in C/C++ and Fortran using the OpenMP API. Our technique can also remove barriers that are deemed unne... |
| Aggregating Funnels for Faster Fetch&Add and Queues | Younghun Roh, Yuanhao Wei, Eric Ruppert, Panagiota Fatourou, Siddhartha Jayanti, Julian Shun | 2024-11-21 | 下载 | Many concurrent algorithms require processes to perform fetch-and-add operations on a single memory location, which can be a hot spot of contention. |
| Open Challenges in the Formal Verification of Autonomous Driving | Paolo Burgio, Angelo Ferrando, Marco Villani | 2024-11-21 | 下载 | In the realm of autonomous driving, the development and integration of highly complex and heterogeneous systems are standard practice. Modern vehicles are not monolithic systems; instead, they are com... |
| Towards Adaptive Asynchronous Federated Learning for Human Activity Recognition | Rastko Gajanin, Anastasiya Danilenka, Andrea Morichetta, Stefan Nastic | 2024-11-21 | 下载 | In this work, we tackle the problem of performing multi-label classification in the case of extremely heterogeneous data and with decentralized Machine Learning. |
| Experimental comparison of graph-based approximate nearest neighbor search algorithms on edge devices | Ali Ganbarov, Jicheng Yuan, Anh Le-Tuan, Manfred Hauswirth, Danh Le-Phuoc | 2024-11-21 | 下载 | In this paper, we present an experimental comparison of various graph-based approximate nearest neighbor (ANN) search algorithms deployed on edge devices for real-time nearest neighbor search applicat... |
| RISC-V Word-Size Modular Instructions for Residue Number Systems | Laurent-Stéphane Didier, Jean-Marc Robert | 2024-11-21 | 下载 | Residue Number Systems (RNS) are parallel number systems that allow the computation on large numbers. They are used in high performance digital signal processing devices and cryptographic applications... |
| FedRAV: Hierarchically Federated Region-Learning for Traffic Object Classification of Autonomous Vehicles | Yijun Zhai, Pengzhan Zhou, Yuepeng He, Fang Qu, Zhida Qin, Xianlong Jiao, Guiyan Liu, Songtao Guo | 2024-11-21 | 下载 | The emerging federated learning enables distributed autonomous vehicles to train equipped deep learning models collaboratively without exposing their raw data, providing great potential for utilizing ... |
| Split Federated Learning Over Heterogeneous Edge Devices: Algorithm and Optimization | Yunrui Sun, Gang Hu, Yinglei Teng, Dunbo Cai | 2024-11-21 | 下载 | Split Learning (SL) is a promising collaborative machine learning approach, enabling resource-constrained devices to train models without sharing raw data, while reducing computational load and preser... |
| Asynchronous Federated Learning Using Outdated Local Updates Over TDMA Channel | Jaeyoung Song, Jun-Pyo Hong | 2024-11-21 | 下载 | In this paper, we consider asynchronous federated learning (FL) over time-division multiple access (TDMA)-based communication networks. Considering TDMA for transmitting local updates can introduce ... |
| InstCache: A Predictive Cache for LLM Serving | Longwei Zou, Yan Liu, Jiamu Kang, Tingfeng Liu, Jiangang Kong, Yangdong Deng | 2024-11-21 | 下载 | The revolutionary capabilities of Large Language Models (LLMs) are attracting rapidly growing popularity and leading to soaring user requests to inference serving systems. |
| DCSim: Computing and Networking Integration based Container Scheduling Simulator for Data Centers | Jinlong Hu, Zhizhe Rao, Xingchen Liu, Lihao Deng, Shoubin Dong | 2024-11-21 | 下载 | The increasing prevalence of cloud-native technologies, particularly containers, has led to the widespread adoption of containerized deployments in data centers. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Initial Evidence of Elevated Reconnaissance Attacks Against Nodes in P2P Overlay Networks | Scott Seidenberger, Anindya Maiti | 2024-11-21 | 下载 | We hypothesize that peer-to-peer (P2P) overlay network nodes can be attractive to attackers due to their visibility, sustained uptime, and resource potential. |
| Performance Analysis of Traditional and Network Coded Transmission in Infrastructure-less Multi-hop Wireless Networks | Muhammad Ali, Alister Burr | 2024-11-21 | 下载 | Infrastructure-less Multi-hop Wireless Networks are the backbone for mission critical communications such as in disaster and battlefield scenarios. |
| CAIP: Detecting Router Misconfigurations with Context-Aware Iterative Prompting of LLMs | Xi Jiang, Aaron Gember-Jacobson, Nick Feamster | 2024-11-21 | 下载 | Model checkers and consistency checkers detect critical errors in router configurations, but these tools require significant manual effort to develop and maintain. |
| Q-CSM: Q-Learning-based Cognitive Service Management in Heterogeneous IoT Networks | Kubra Duran, Mehmet Ozdem, Kerem Gursu, Berk Canberk | 2024-11-21 | 下载 | The dramatic increase in the number of smart services and their diversity poses a significant challenge in Internet of Things (IoT) networks: heterogeneity. |
| Explainable Multi-Agent Reinforcement Learning for Extended Reality Codec Adaptation | Pedro Enrique Iturria-Rivera, Raimundas Gaigalas, Medhat Elsayed, Majid Bavand, Yigit Ozcan, Melike Erol-Kantarci | 2024-11-21 | 下载 | Extended Reality (XR) services are set to transform applications over 5th and 6th generation wireless networks, delivering immersive experiences. |
| Generative AI-enabled Digital Twins for 6G-enhanced Smart Cities | Kubra Duran, Lal Verda Cakir, Mehmet Ozdem, Kerem Gursu, Berk Canberk | 2024-11-21 | 下载 | 6G networks are envisioned to enable a wide range of applications, such as autonomous vehicles and smart cities. However, this rapid expansion of network topologies makes the management of 6G wireless... |
| A Multi-Layer Blockchain Simulator and Performance Evaluation of Social Internet of Vehicles with Multi-Connectivity Management | Yi-Ting Sun, Hsin-Chieh Lee, Yun-Chen Yu, Ting-Feng Wu, Ibrahim Althamary, Chih-Wei Huang | 2024-11-21 | 下载 | The evolution of vehicle-to-everything (V2X) communication brings significant challenges, such as data integrity and vulnerabilities stemming from centralized management. |
| Towards Smart Fronthauling Management: Experimental Insights from a 5G Testbed | Marcello Morini, Eugenio Moro, Ilario Filippini, Danilo De Donno, Salvatore Moscato, Antonio Capone | 2024-11-21 | 下载 | The fronthaul connection is a key component of Centralized RAN (C-RAN) architectures, consistently required to handle high capacity demands. However, this critical feature is at risk when the transpor... |
| Unconsidered Installations: Discovering IoT Deployments in the IPv6 Internet | Markus Dahlmanns, Felix Heidenreich, Johannes Lohmöller, Jan Pennekamp, Klaus Wehrle, Martin Henze | 2024-11-21 | 下载 | Internet-wide studies provide extremely valuable insight into how operators manage their Internet of Things (IoT) deployments in reality and often reveal grievances, e.g., significant security issues. |
| FastRAG: Retrieval Augmented Generation for Semi-structured Data | Amar Abane, Anis Bekri, Abdella Battou, Saddek Bensalem | 2024-11-21 | 下载 | Efficiently processing and interpreting network data is critical for the operation of increasingly complex networks. Recent advances in Large Language Models (LLM) and Retrieval-Augmented Generation (... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Experimental comparison of graph-based approximate nearest neighbor search algorithms on edge devices | Ali Ganbarov, Jicheng Yuan, Anh Le-Tuan, Manfred Hauswirth, Danh Le-Phuoc | 2024-11-21 | 下载 | In this paper, we present an experimental comparison of various graph-based approximate nearest neighbor (ANN) search algorithms deployed on edge devices for real-time nearest neighbor search applicat... |
| Static Reuse Profile Estimation for Array Applications | Abdur Razzak, Atanu Barai, Nandakishore Santhi, Abdel-Hameed A. Badawy | 2024-11-21 | 下载 | Reuse distance analysis is a widely recognized method for application characterization that illustrates cache locality. Although there are various techniques to calculate the reuse profile from dynami... |