Appearance
2024-11-05
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMs | Manar Abdelatty, Jingxiao Ma, Sherief Reda | 2024-11-05 | 下载 | Large Language Models (LLMs) have been applied to various hardware design tasks, including Verilog code generation, EDA tool scripting, and RTL bug fixing. |
| DP-HLS: A High-Level Synthesis Framework for Accelerating Dynamic Programming Algorithms in Bioinformatics | Yingqi Cao, Anshu Gupta, Jason Liang, Yatish Turakhia | 2024-11-05 | 下载 | Dynamic programming (DP) based algorithms are essential yet compute-intensive parts of numerous bioinformatics pipelines, which typically involve populating a 2-D scoring matrix based on a recursive f... |
| Kernel Approximation using Analog In-Memory Computing | Julian Büchel, Giacomo Camposampiero, Athanasios Vasilopoulos, Corey Lammie, Manuel Le Gallo, Abbas Rahimi, Abu Sebastian | 2024-11-05 | 下载 | Kernel functions are vital ingredients of several machine learning algorithms, but often incur significant memory and computational costs. We introduce an approach to kernel approximation in machine l... |
| Hardware for converting floating-point to the microscaling (MX) format | Danila Gorodecky, Leonel Sousa | 2024-11-05 | 下载 | This paper proposes hardware converters for the microscaling format (MX-format), a reduced representation of floating-point numbers. We present an algorithm and a memory-free hardware model for conver... |
| SpiDR: A Reconfigurable Digital Compute-in-Memory Spiking Neural Network Accelerator for Event-based Perception | Deepika Sharma, Shubham Negi, Trishit Dutta, Amogh Agrawal, Kaushik Roy | 2024-11-05 | 下载 | Spiking Neural Networks (SNNs), with their inherent recurrence, offer an efficient method for processing the asynchronous temporal data generated by Dynamic Vision Sensors (DVS), making them well-suit... |
| The Hitchhiker's Guide to Programming and Optimizing Cache Coherent Heterogeneous Systems: CXL, NVLink-C2C, and AMD Infinity Fabric | Zixuan Wang, Suyash Mahar, Luyi Li, Jangseon Park, Jinpyo Kim, Theodore Michailidis, Yue Pan, Mingyao Shen, Tajana Rosing, Dean Tullsen, Steven Swanson, Jishen Zhao | 2024-11-05 | 下载 | We present a thorough analysis of the use of modern heterogeneous systems interconnected by various cachecoherent links, including CXL, NVLink-C2C, and Infinity Fabric. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Shared Memory-Aware Latency-Sensitive Message Aggregation for Fine-Grained Communication | Kavitha Chandrasekar, Laxmikant Kale | 2024-11-05 | 下载 | Message aggregation is often used with a goal to reduce communication cost in HPC applications. The difference in the order of overhead of sending a message and cost of per byte transferred motivates ... |
| AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution | Zhiqiang Xie, Hao Kang, Ying Sheng, Tushar Krishna, Kayvon Fatahalian, Christos Kozyrakis | 2024-11-05 | 下载 | With more advanced natural language understanding and reasoning capabilities, large language model (LLM)-powered agents are increasingly developed in simulated environments to perform complex tasks, i... |
| DP-HLS: A High-Level Synthesis Framework for Accelerating Dynamic Programming Algorithms in Bioinformatics | Yingqi Cao, Anshu Gupta, Jason Liang, Yatish Turakhia | 2024-11-05 | 下载 | Dynamic programming (DP) based algorithms are essential yet compute-intensive parts of numerous bioinformatics pipelines, which typically involve populating a 2-D scoring matrix based on a recursive f... |
| An Open API Architecture to Discover the Trustworthy Explanation of Cloud AI Services | Zerui Wang, Yan Liu, Jun Huang | 2024-11-05 | 下载 | This article presents the design of an open-API-based explainable AI (XAI) service to provide feature contribution explanations for cloud AI services. |
| Distributed Quantum Advantage for Local Problems | Alkida Balliu, Sebastian Brandt, Xavier Coiteux-Roy, Francesco d'Amore, Massimo Equi, François Le Gall, Henrik Lievonen, Augusto Modanese, Dennis Olivetti, Marc-Olivier Renou, Jukka Suomela, Lucas Tendick, Isadora Veeren | 2024-11-05 | 下载 | We present the first local problem that shows a super-constant separation between the classical randomized LOCAL model of distributed computing and its quantum counterpart. |
| LOGSAFE: Logic-Guided Verification for Trustworthy Federated Time-Series Learning | Dung Thuy Nguyen, Ziyan An, Taylor T. Johnson, Meiyi Ma, Kevin Leach | 2024-11-05 | 下载 | This paper introduces LOGSAFE, a defense mechanism for federated learning in time series settings, particularly within cyber-physical systems. |
| Instant Resonance: Dual Strategy Enhances the Data Consensus Success Rate of Blockchain Threshold Signature Oracles | Youquan Xian, Xueying Zeng, Chunpei Li, Dongcheng Li, Peng Wang, Peng Liu, Xianxian Li | 2024-11-05 | 下载 | With the rapid development of Decentralized Finance (DeFi) and Real-World Assets (RWA), the importance of blockchain oracles in real-time data acquisition has become increasingly prominent. |
| Photon: Federated LLM Pre-Training | Lorenzo Sani, Alex Iacob, Zeyu Cao, Royson Lee, Bill Marino, Yan Gao, Dongqi Cai, Zexi Li, Wanru Zhao, Xinchi Qiu, Nicholas D. Lane | 2024-11-05 | 下载 | Scaling large language models (LLMs) demands extensive data and computing resources, which are traditionally constrained to data centers by the high-bandwidth requirements of distributed training. |
| iAnomaly: A Toolkit for Generating Performance Anomaly Datasets in Edge-Cloud Integrated Computing Environments | Duneesha Fernando, Maria A. Rodriguez, Rajkumar Buyya | 2024-11-05 | 下载 | Microservice architectures are increasingly used to modularize IoT applications and deploy them in distributed and heterogeneous edge computing environments. |
| CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration | Hongpeng Jin, Yanzhao Wu | 2024-11-05 | 下载 | Large Language Models (LLMs) exhibit remarkable human-like predictive capabilities. However, it is challenging to deploy LLMs to provide efficient and adaptive inference services at the edge. |
| The Hitchhiker's Guide to Programming and Optimizing Cache Coherent Heterogeneous Systems: CXL, NVLink-C2C, and AMD Infinity Fabric | Zixuan Wang, Suyash Mahar, Luyi Li, Jangseon Park, Jinpyo Kim, Theodore Michailidis, Yue Pan, Mingyao Shen, Tajana Rosing, Dean Tullsen, Steven Swanson, Jishen Zhao | 2024-11-05 | 下载 | We present a thorough analysis of the use of modern heterogeneous systems interconnected by various cachecoherent links, including CXL, NVLink-C2C, and Infinity Fabric. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Model-based Deep Learning for QoS-Aware Rate-Splitting Multiple Access Wireless Systems | Hanwen Zhang, Mingzhe Chen, Alireza Vahid, Feng Ye, Haijian Sun | 2024-11-05 | 下载 | Next generation communications demand for better spectrum management, lower latency, and guaranteed quality-of-service (QoS). Recently, Artificial intelligence (AI) has been widely introduced to advan... |
| TwiNet: Connecting Real World Networks to their Digital Twins Through a Live Bidirectional Link | Clifton Paul Robinson, Andrea Lacava, Pedram Johari, Francesca Cuomo, Tommaso Melodia | 2024-11-05 | 下载 | The wireless spectrum's increasing complexity poses challenges and opportunities, highlighting the necessity for real-time solutions and robust data processing capabilities. |
| GeMID: Generalizable Models for IoT Device Identification | Kahraman Kostas, Rabia Yasa Kostas, Mike Just, Michael A. Lones | 2024-11-05 | 下载 | With the proliferation of devices on the Internet of Things (IoT), ensuring their security has become paramount. Device identification (DI), which distinguishes IoT devices based on their traffic patt... |
| An Open API Architecture to Discover the Trustworthy Explanation of Cloud AI Services | Zerui Wang, Yan Liu, Jun Huang | 2024-11-05 | 下载 | This article presents the design of an open-API-based explainable AI (XAI) service to provide feature contribution explanations for cloud AI services. |
| On the Detection of Non-Cooperative RISs: Scan B-Testing via Deep Support Vector Data Description | George Stamatelis, Panagiotis Gavriilidis, Aymen Fakhreddine, George C. Alexandropoulos | 2024-11-05 | 下载 | In this paper, we study the problem of promptly detecting the presence of non-cooperative activity from one or more Reconfigurable Intelligent Surfaces (RISs) with unknown characteristics lying in the... |
| Statistical Analysis to Support CSI-Based Sensing Methods | Elena Tonini | 2024-11-05 | 下载 | Building upon the foundational work of the Bachelor's Degree Thesis titled "Analysis and Characterization of Wi-Fi Channel State Information'', this thesis significantly advances the research by condu... |
| UNet: A Generic and Reliable Multi-UAV Communication and Networking Architecture for Heterogeneous Applications | Sanku Kumar Roy, Mohamed Samshad, Ketan Rajawat | 2024-11-05 | 下载 | The rapid growth of UAV applications necessitates a robust communication and networking architecture capable of addressing the diverse requirements of various applications concurrently, rather than re... |
| Blockchain-Based Multi-Path Mobile Access Point Selection for Secure 5G VANETs | Zhiou Zhang, Weian Guo, Li Li, Dongyang Li | 2024-11-05 | 下载 | This letter presents a blockchain-based multi-path mobile access point (MAP) selection strategy for secure 5G vehicular ad-hoc networks (VANETs). |
| Rozproszone Wykrywanie Zajętości Widma Oparte na Uczeniu Federacyjnym | Łukasz Kułacz, Adrian Kliks | 2024-11-05 | 下载 | Spectrum occupancy detection is a key enabler for dynamic spectrum access, where machine learning algorithms are successfully utilized for detection improvement. |
| Personal Data Protection in AI-Native 6G Systems | Keivan Navaie | 2024-11-05 | 下载 | As 6G evolves into an AI-native technology, the integration of artificial intelligence (AI) and Generative AI into cellular communication systems presents unparalleled opportunities for enhancing conn... |
| Enhanced Real-Time Threat Detection in 5G Networks: A Self-Attention RNN Autoencoder Approach for Spectral Intrusion Analysis | Mohammadreza Kouchaki, Minglong Zhang, Aly S. Abdalla, Guangchen Lan, Christopher G. Brinton, Vuk Marojevic | 2024-11-05 | 下载 | In the rapidly evolving landscape of 5G technology, safeguarding Radio Frequency (RF) environments against sophisticated intrusions is paramount, especially in dynamic spectrum access and management. |
| NinjaDoH: A Censorship-Resistant Moving Target DoH Server Using Hyperscalers and IPNS | Scott Seidenberger, Marc Beret, Raveen Wijewickrama, Murtuza Jadliwala, Anindya Maiti | 2024-11-05 | 下载 | We introduce NinjaDoH, a novel DNS over HTTPS (DoH) protocol that leverages the InterPlanetary Name System (IPNS), along with public cloud infrastructure, to create a censorship-resistant moving targe... |
| Energy Efficient and Balanced Task Assignment Strategy for Multi-UAV Patrol Inspection System in Mobile Edge Computing Network | Kuan Jia, Dingcheng Yang, Yapeng Wang, Tianyun Shui, Chenji Liu | 2024-11-05 | 下载 | This paper considers a patrol inspection scenario where multiple unmanned aerial vehicles (UAVs) are adopted to traverse multiple predetermined cruise points for data collection. |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| The Hitchhiker's Guide to Programming and Optimizing Cache Coherent Heterogeneous Systems: CXL, NVLink-C2C, and AMD Infinity Fabric | Zixuan Wang, Suyash Mahar, Luyi Li, Jangseon Park, Jinpyo Kim, Theodore Michailidis, Yue Pan, Mingyao Shen, Tajana Rosing, Dean Tullsen, Steven Swanson, Jishen Zhao | 2024-11-05 | 下载 | We present a thorough analysis of the use of modern heterogeneous systems interconnected by various cachecoherent links, including CXL, NVLink-C2C, and Infinity Fabric. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| P-MOSS: Scheduling Main-Memory Indexes Over NUMA Servers Using Next Token Prediction | Yeasir Rayhan, Walid G. Aref | 2024-11-05 | 下载 | Ever since the Dennard scaling broke down in the early 2000s and the frequency of the CPUs stalled, vendors have started to increase the core count in each CPU chip at the expense of introducing heter... |
| The Hitchhiker's Guide to Programming and Optimizing Cache Coherent Heterogeneous Systems: CXL, NVLink-C2C, and AMD Infinity Fabric | Zixuan Wang, Suyash Mahar, Luyi Li, Jangseon Park, Jinpyo Kim, Theodore Michailidis, Yue Pan, Mingyao Shen, Tajana Rosing, Dean Tullsen, Steven Swanson, Jishen Zhao | 2024-11-05 | 下载 | We present a thorough analysis of the use of modern heterogeneous systems interconnected by various cachecoherent links, including CXL, NVLink-C2C, and Infinity Fabric. |
| DeepContext: A Context-aware, Cross-platform, and Cross-framework Tool for Performance Profiling and Analysis of Deep Learning Workloads | Qidong Zhao, Hao Wu, Yuming Hao, Zilingfeng Ye, Jiajia Li, Xu Liu, Keren Zhou | 2024-11-05 | 下载 | Effective performance profiling and analysis are essential for optimizing training and inference of deep learning models, especially given the growing complexity of heterogeneous computing environment... |