
2024-11-21

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Swift: A Multi-FPGA Framework for Scaling Up Accelerated Graph Analytics | Oluwole Jaiyeoba, Abdullah T. Mughrabi, Morteza Baradaran, Beenish Gul, Kevin Skadron | 2024-11-21 | Download | Graph analytics are vital in fields such as social networks, biomedical research, and graph neural networks (GNNs). However, traditional CPUs and GPUs struggle with the memory bottlenecks caused by la... |
| Masala-CHAI: A Large-Scale SPICE Netlist Dataset for Analog Circuits by Harnessing AI | Jitendra Bhandari, Vineet Bhat, Yuheng He, Hamed Rahmani, Siddharth Garg, Ramesh Karri | 2024-11-21 | Download | Masala-CHAI is a fully automated framework leveraging large language models (LLMs) to generate Simulation Programs with Integrated Circuit Emphasis (SPICE) netlists. |
| CKTSO: High-Performance Parallel Sparse Linear Solver for General Circuit Simulations | Xiaoming Chen | 2024-11-21 | Download | This paper introduces CKTSO (abbreviation of "circuit solver"), a novel sparse linear solver specially designed for the simulation program with integrated circuit emphasis (SPICE). |
| Experimental comparison of graph-based approximate nearest neighbor search algorithms on edge devices | Ali Ganbarov, Jicheng Yuan, Anh Le-Tuan, Manfred Hauswirth, Danh Le-Phuoc | 2024-11-21 | Download | In this paper, we present an experimental comparison of various graph-based approximate nearest neighbor (ANN) search algorithms deployed on edge devices for real-time nearest neighbor search applicat... |
| RISC-V Word-Size Modular Instructions for Residue Number Systems | Laurent-Stéphane Didier, Jean-Marc Robert | 2024-11-21 | Download | Residue Number Systems (RNS) are parallel number systems that allow the computation on large numbers. They are used in high performance digital signal processing devices and cryptographic applications... |
| Dissecting Conditional Branch Predictors of Apple Firestorm and Qualcomm Oryon for Software Optimization and Architectural Analysis | Jiajie Chen, Peng Qu, Youhui Zhang | 2024-11-21 | Download | Branch predictor (BP) is a critical component of modern processors, and its accurate modeling is essential for compilers and applications. However, processor vendors have disclosed limited details abo... |
| Schemato -- An LLM for Netlist-to-Schematic Conversion | Ryoga Matsuo, Stefan Uhlich, Arun Venkitaraman, Andrea Bonetti, Chia-Yu Hsieh, Ali Momeni, Lukas Mauch, Augusto Capone, Eisaku Ohbuchi, Lorenzo Servadei | 2024-11-21 | Download | Machine learning models are advancing circuit design, particularly in analog circuits. They typically generate netlists that lack human interpretability. |
| GraCo -- A Graph Composer for Integrated Circuits | Stefan Uhlich, Andrea Bonetti, Arun Venkitaraman, Ali Momeni, Ryoga Matsuo, Chia-Yu Hsieh, Eisaku Ohbuchi, Lorenzo Servadei | 2024-11-21 | Download | Designing integrated circuits involves substantial complexity, posing challenges in revealing its potential applications - from custom digital cells to analog circuits. |
| EDA-Aware RTL Generation with Large Language Models | Mubashir ul Islam, Humza Sami, Pierre-Emmanuel Gaillardon, Valerio Tenace | 2024-11-21 | Download | Large Language Models (LLMs) have become increasingly popular for generating RTL code. However, producing error-free RTL code in a zero-shot setting remains highly challenging for even state-of-the-ar... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| LLOR: Automated Repair of OpenMP Programs | Utpal Bora, Saurabh Joshi, Gautam Muduganti, Ramakrishna Upadrasta | 2024-11-21 | Download | In this paper, we present a technique for repairing data race errors in parallel programs written in C/C++ and Fortran using the OpenMP API. Our technique can also remove barriers that are deemed unne... |
| Aggregating Funnels for Faster Fetch&Add and Queues | Younghun Roh, Yuanhao Wei, Eric Ruppert, Panagiota Fatourou, Siddhartha Jayanti, Julian Shun | 2024-11-21 | Download | Many concurrent algorithms require processes to perform fetch-and-add operations on a single memory location, which can be a hot spot of contention. |
| Open Challenges in the Formal Verification of Autonomous Driving | Paolo Burgio, Angelo Ferrando, Marco Villani | 2024-11-21 | Download | In the realm of autonomous driving, the development and integration of highly complex and heterogeneous systems are standard practice. Modern vehicles are not monolithic systems; instead, they are com... |
| Towards Adaptive Asynchronous Federated Learning for Human Activity Recognition | Rastko Gajanin, Anastasiya Danilenka, Andrea Morichetta, Stefan Nastic | 2024-11-21 | Download | In this work, we tackle the problem of performing multi-label classification in the case of extremely heterogeneous data and with decentralized Machine Learning. |
| Experimental comparison of graph-based approximate nearest neighbor search algorithms on edge devices | Ali Ganbarov, Jicheng Yuan, Anh Le-Tuan, Manfred Hauswirth, Danh Le-Phuoc | 2024-11-21 | Download | In this paper, we present an experimental comparison of various graph-based approximate nearest neighbor (ANN) search algorithms deployed on edge devices for real-time nearest neighbor search applicat... |
| RISC-V Word-Size Modular Instructions for Residue Number Systems | Laurent-Stéphane Didier, Jean-Marc Robert | 2024-11-21 | Download | Residue Number Systems (RNS) are parallel number systems that allow the computation on large numbers. They are used in high performance digital signal processing devices and cryptographic applications... |
| FedRAV: Hierarchically Federated Region-Learning for Traffic Object Classification of Autonomous Vehicles | Yijun Zhai, Pengzhan Zhou, Yuepeng He, Fang Qu, Zhida Qin, Xianlong Jiao, Guiyan Liu, Songtao Guo | 2024-11-21 | Download | The emerging federated learning enables distributed autonomous vehicles to train equipped deep learning models collaboratively without exposing their raw data, providing great potential for utilizing ... |
| Split Federated Learning Over Heterogeneous Edge Devices: Algorithm and Optimization | Yunrui Sun, Gang Hu, Yinglei Teng, Dunbo Cai | 2024-11-21 | Download | Split Learning (SL) is a promising collaborative machine learning approach, enabling resource-constrained devices to train models without sharing raw data, while reducing computational load and preser... |
| Asynchronous Federated Learning Using Outdated Local Updates Over TDMA Channel | Jaeyoung Song, Jun-Pyo Hong | 2024-11-21 | Download | In this paper, we consider asynchronous federated learning (FL) over time-division multiple access (TDMA)-based communication networks. Considering TDMA for transmitting local updates can introduce ... |
| InstCache: A Predictive Cache for LLM Serving | Longwei Zou, Yan Liu, Jiamu Kang, Tingfeng Liu, Jiangang Kong, Yangdong Deng | 2024-11-21 | Download | The revolutionary capabilities of Large Language Models (LLMs) are attracting rapidly growing popularity and leading to soaring user requests to inference serving systems. |
| DCSim: Computing and Networking Integration based Container Scheduling Simulator for Data Centers | Jinlong Hu, Zhizhe Rao, Xingchen Liu, Lihao Deng, Shoubin Dong | 2024-11-21 | Download | The increasing prevalence of cloud-native technologies, particularly containers, has led to the widespread adoption of containerized deployments in data centers. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Initial Evidence of Elevated Reconnaissance Attacks Against Nodes in P2P Overlay Networks | Scott Seidenberger, Anindya Maiti | 2024-11-21 | Download | We hypothesize that peer-to-peer (P2P) overlay network nodes can be attractive to attackers due to their visibility, sustained uptime, and resource potential. |
| Performance Analysis of Traditional and Network Coded Transmission in Infrastructure-less Multi-hop Wireless Networks | Muhammad Ali, Alister Burr | 2024-11-21 | Download | Infrastructure-less Multi-hop Wireless Networks are the backbone for mission critical communications such as in disaster and battlefield scenarios. |
| CAIP: Detecting Router Misconfigurations with Context-Aware Iterative Prompting of LLMs | Xi Jiang, Aaron Gember-Jacobson, Nick Feamster | 2024-11-21 | Download | Model checkers and consistency checkers detect critical errors in router configurations, but these tools require significant manual effort to develop and maintain. |
| Q-CSM: Q-Learning-based Cognitive Service Management in Heterogeneous IoT Networks | Kubra Duran, Mehmet Ozdem, Kerem Gursu, Berk Canberk | 2024-11-21 | Download | The dramatic increase in the number of smart services and their diversity poses a significant challenge in Internet of Things (IoT) networks: heterogeneity. |
| Explainable Multi-Agent Reinforcement Learning for Extended Reality Codec Adaptation | Pedro Enrique Iturria-Rivera, Raimundas Gaigalas, Medhat Elsayed, Majid Bavand, Yigit Ozcan, Melike Erol-Kantarci | 2024-11-21 | Download | Extended Reality (XR) services are set to transform applications over 5th and 6th generation wireless networks, delivering immersive experiences. |
| Generative AI-enabled Digital Twins for 6G-enhanced Smart Cities | Kubra Duran, Lal Verda Cakir, Mehmet Ozdem, Kerem Gursu, Berk Canberk | 2024-11-21 | Download | 6G networks are envisioned to enable a wide range of applications, such as autonomous vehicles and smart cities. However, this rapid expansion of network topologies makes the management of 6G wireless... |
| A Multi-Layer Blockchain Simulator and Performance Evaluation of Social Internet of Vehicles with Multi-Connectivity Management | Yi-Ting Sun, Hsin-Chieh Lee, Yun-Chen Yu, Ting-Feng Wu, Ibrahim Althamary, Chih-Wei Huang | 2024-11-21 | Download | The evolution of vehicle-to-everything (V2X) communication brings significant challenges, such as data integrity and vulnerabilities stemming from centralized management. |
| Towards Smart Fronthauling Management: Experimental Insights from a 5G Testbed | Marcello Morini, Eugenio Moro, Ilario Filippini, Danilo De Donno, Salvatore Moscato, Antonio Capone | 2024-11-21 | Download | The fronthaul connection is a key component of Centralized RAN (C-RAN) architectures, consistently required to handle high capacity demands. However, this critical feature is at risk when the transpor... |
| Unconsidered Installations: Discovering IoT Deployments in the IPv6 Internet | Markus Dahlmanns, Felix Heidenreich, Johannes Lohmöller, Jan Pennekamp, Klaus Wehrle, Martin Henze | 2024-11-21 | Download | Internet-wide studies provide extremely valuable insight into how operators manage their Internet of Things (IoT) deployments in reality and often reveal grievances, e.g., significant security issues. |
| FastRAG: Retrieval Augmented Generation for Semi-structured Data | Amar Abane, Anis Bekri, Abdella Battou, Saddek Bensalem | 2024-11-21 | Download | Efficiently processing and interpreting network data is critical for the operation of increasingly complex networks. Recent advances in Large Language Models (LLM) and Retrieval-Augmented Generation (... |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Experimental comparison of graph-based approximate nearest neighbor search algorithms on edge devices | Ali Ganbarov, Jicheng Yuan, Anh Le-Tuan, Manfred Hauswirth, Danh Le-Phuoc | 2024-11-21 | Download | In this paper, we present an experimental comparison of various graph-based approximate nearest neighbor (ANN) search algorithms deployed on edge devices for real-time nearest neighbor search applicat... |
| Static Reuse Profile Estimation for Array Applications | Abdur Razzak, Atanu Barai, Nandakishore Santhi, Abdel-Hameed A. Badawy | 2024-11-21 | Download | Reuse distance analysis is a widely recognized method for application characterization that illustrates cache locality. Although there are various techniques to calculate the reuse profile from dynami... |
