Skip to content

2024-11-05

cs.AR - Architecture

标题作者发布日期PDF摘要
MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMsManar Abdelatty, Jingxiao Ma, Sherief Reda2024-11-05下载Large Language Models (LLMs) have been applied to various hardware design tasks, including Verilog code generation, EDA tool scripting, and RTL bug fixing.
DP-HLS: A High-Level Synthesis Framework for Accelerating Dynamic Programming Algorithms in BioinformaticsYingqi Cao, Anshu Gupta, Jason Liang, Yatish Turakhia2024-11-05下载Dynamic programming (DP) based algorithms are essential yet compute-intensive parts of numerous bioinformatics pipelines, which typically involve populating a 2-D scoring matrix based on a recursive f...
Kernel Approximation using Analog In-Memory ComputingJulian Büchel, Giacomo Camposampiero, Athanasios Vasilopoulos, Corey Lammie, Manuel Le Gallo, Abbas Rahimi, Abu Sebastian2024-11-05下载Kernel functions are vital ingredients of several machine learning algorithms, but often incur significant memory and computational costs. We introduce an approach to kernel approximation in machine l...
Hardware for converting floating-point to the microscaling (MX) formatDanila Gorodecky, Leonel Sousa2024-11-05下载This paper proposes hardware converters for the microscaling format (MX-format), a reduced representation of floating-point numbers. We present an algorithm and a memory-free hardware model for conver...
SpiDR: A Reconfigurable Digital Compute-in-Memory Spiking Neural Network Accelerator for Event-based PerceptionDeepika Sharma, Shubham Negi, Trishit Dutta, Amogh Agrawal, Kaushik Roy2024-11-05下载Spiking Neural Networks (SNNs), with their inherent recurrence, offer an efficient method for processing the asynchronous temporal data generated by Dynamic Vision Sensors (DVS), making them well-suit...
The Hitchhiker's Guide to Programming and Optimizing Cache Coherent Heterogeneous Systems: CXL, NVLink-C2C, and AMD Infinity FabricZixuan Wang, Suyash Mahar, Luyi Li, Jangseon Park, Jinpyo Kim, Theodore Michailidis, Yue Pan, Mingyao Shen, Tajana Rosing, Dean Tullsen, Steven Swanson, Jishen Zhao2024-11-05下载We present a thorough analysis of the use of modern heterogeneous systems interconnected by various cachecoherent links, including CXL, NVLink-C2C, and Infinity Fabric.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Shared Memory-Aware Latency-Sensitive Message Aggregation for Fine-Grained CommunicationKavitha Chandrasekar, Laxmikant Kale2024-11-05下载Message aggregation is often used with a goal to reduce communication cost in HPC applications. The difference in the order of overhead of sending a message and cost of per byte transferred motivates ...
AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order ExecutionZhiqiang Xie, Hao Kang, Ying Sheng, Tushar Krishna, Kayvon Fatahalian, Christos Kozyrakis2024-11-05下载With more advanced natural language understanding and reasoning capabilities, large language model (LLM)-powered agents are increasingly developed in simulated environments to perform complex tasks, i...
DP-HLS: A High-Level Synthesis Framework for Accelerating Dynamic Programming Algorithms in BioinformaticsYingqi Cao, Anshu Gupta, Jason Liang, Yatish Turakhia2024-11-05下载Dynamic programming (DP) based algorithms are essential yet compute-intensive parts of numerous bioinformatics pipelines, which typically involve populating a 2-D scoring matrix based on a recursive f...
An Open API Architecture to Discover the Trustworthy Explanation of Cloud AI ServicesZerui Wang, Yan Liu, Jun Huang2024-11-05下载This article presents the design of an open-API-based explainable AI (XAI) service to provide feature contribution explanations for cloud AI services.
Distributed Quantum Advantage for Local ProblemsAlkida Balliu, Sebastian Brandt, Xavier Coiteux-Roy, Francesco d'Amore, Massimo Equi, François Le Gall, Henrik Lievonen, Augusto Modanese, Dennis Olivetti, Marc-Olivier Renou, Jukka Suomela, Lucas Tendick, Isadora Veeren2024-11-05下载We present the first local problem that shows a super-constant separation between the classical randomized LOCAL model of distributed computing and its quantum counterpart.
LOGSAFE: Logic-Guided Verification for Trustworthy Federated Time-Series LearningDung Thuy Nguyen, Ziyan An, Taylor T. Johnson, Meiyi Ma, Kevin Leach2024-11-05下载This paper introduces LOGSAFE, a defense mechanism for federated learning in time series settings, particularly within cyber-physical systems.
Instant Resonance: Dual Strategy Enhances the Data Consensus Success Rate of Blockchain Threshold Signature OraclesYouquan Xian, Xueying Zeng, Chunpei Li, Dongcheng Li, Peng Wang, Peng Liu, Xianxian Li2024-11-05下载With the rapid development of Decentralized Finance (DeFi) and Real-World Assets (RWA), the importance of blockchain oracles in real-time data acquisition has become increasingly prominent.
Photon: Federated LLM Pre-TrainingLorenzo Sani, Alex Iacob, Zeyu Cao, Royson Lee, Bill Marino, Yan Gao, Dongqi Cai, Zexi Li, Wanru Zhao, Xinchi Qiu, Nicholas D. Lane2024-11-05下载Scaling large language models (LLMs) demands extensive data and computing resources, which are traditionally constrained to data centers by the high-bandwidth requirements of distributed training.
iAnomaly: A Toolkit for Generating Performance Anomaly Datasets in Edge-Cloud Integrated Computing EnvironmentsDuneesha Fernando, Maria A. Rodriguez, Rajkumar Buyya2024-11-05下载Microservice architectures are increasingly used to modularize IoT applications and deploy them in distributed and heterogeneous edge computing environments.
CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge CollaborationHongpeng Jin, Yanzhao Wu2024-11-05下载Large Language Models (LLMs) exhibit remarkable human-like predictive capabilities. However, it is challenging to deploy LLMs to provide efficient and adaptive inference services at the edge.
The Hitchhiker's Guide to Programming and Optimizing Cache Coherent Heterogeneous Systems: CXL, NVLink-C2C, and AMD Infinity FabricZixuan Wang, Suyash Mahar, Luyi Li, Jangseon Park, Jinpyo Kim, Theodore Michailidis, Yue Pan, Mingyao Shen, Tajana Rosing, Dean Tullsen, Steven Swanson, Jishen Zhao2024-11-05下载We present a thorough analysis of the use of modern heterogeneous systems interconnected by various cachecoherent links, including CXL, NVLink-C2C, and Infinity Fabric.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Model-based Deep Learning for QoS-Aware Rate-Splitting Multiple Access Wireless SystemsHanwen Zhang, Mingzhe Chen, Alireza Vahid, Feng Ye, Haijian Sun2024-11-05下载Next generation communications demand for better spectrum management, lower latency, and guaranteed quality-of-service (QoS). Recently, Artificial intelligence (AI) has been widely introduced to advan...
TwiNet: Connecting Real World Networks to their Digital Twins Through a Live Bidirectional LinkClifton Paul Robinson, Andrea Lacava, Pedram Johari, Francesca Cuomo, Tommaso Melodia2024-11-05下载The wireless spectrum's increasing complexity poses challenges and opportunities, highlighting the necessity for real-time solutions and robust data processing capabilities.
GeMID: Generalizable Models for IoT Device IdentificationKahraman Kostas, Rabia Yasa Kostas, Mike Just, Michael A. Lones2024-11-05下载With the proliferation of devices on the Internet of Things (IoT), ensuring their security has become paramount. Device identification (DI), which distinguishes IoT devices based on their traffic patt...
An Open API Architecture to Discover the Trustworthy Explanation of Cloud AI ServicesZerui Wang, Yan Liu, Jun Huang2024-11-05下载This article presents the design of an open-API-based explainable AI (XAI) service to provide feature contribution explanations for cloud AI services.
On the Detection of Non-Cooperative RISs: Scan B-Testing via Deep Support Vector Data DescriptionGeorge Stamatelis, Panagiotis Gavriilidis, Aymen Fakhreddine, George C. Alexandropoulos2024-11-05下载In this paper, we study the problem of promptly detecting the presence of non-cooperative activity from one or more Reconfigurable Intelligent Surfaces (RISs) with unknown characteristics lying in the...
Statistical Analysis to Support CSI-Based Sensing MethodsElena Tonini2024-11-05下载Building upon the foundational work of the Bachelor's Degree Thesis titled "Analysis and Characterization of Wi-Fi Channel State Information'', this thesis significantly advances the research by condu...
UNet: A Generic and Reliable Multi-UAV Communication and Networking Architecture for Heterogeneous ApplicationsSanku Kumar Roy, Mohamed Samshad, Ketan Rajawat2024-11-05下载The rapid growth of UAV applications necessitates a robust communication and networking architecture capable of addressing the diverse requirements of various applications concurrently, rather than re...
Blockchain-Based Multi-Path Mobile Access Point Selection for Secure 5G VANETsZhiou Zhang, Weian Guo, Li Li, Dongyang Li2024-11-05下载This letter presents a blockchain-based multi-path mobile access point (MAP) selection strategy for secure 5G vehicular ad-hoc networks (VANETs).
Rozproszone Wykrywanie Zajętości Widma Oparte na Uczeniu FederacyjnymŁukasz Kułacz, Adrian Kliks2024-11-05下载Spectrum occupancy detection is a key enabler for dynamic spectrum access, where machine learning algorithms are successfully utilized for detection improvement.
Personal Data Protection in AI-Native 6G SystemsKeivan Navaie2024-11-05下载As 6G evolves into an AI-native technology, the integration of artificial intelligence (AI) and Generative AI into cellular communication systems presents unparalleled opportunities for enhancing conn...
Enhanced Real-Time Threat Detection in 5G Networks: A Self-Attention RNN Autoencoder Approach for Spectral Intrusion AnalysisMohammadreza Kouchaki, Minglong Zhang, Aly S. Abdalla, Guangchen Lan, Christopher G. Brinton, Vuk Marojevic2024-11-05下载In the rapidly evolving landscape of 5G technology, safeguarding Radio Frequency (RF) environments against sophisticated intrusions is paramount, especially in dynamic spectrum access and management.
NinjaDoH: A Censorship-Resistant Moving Target DoH Server Using Hyperscalers and IPNSScott Seidenberger, Marc Beret, Raveen Wijewickrama, Murtuza Jadliwala, Anindya Maiti2024-11-05下载We introduce NinjaDoH, a novel DNS over HTTPS (DoH) protocol that leverages the InterPlanetary Name System (IPNS), along with public cloud infrastructure, to create a censorship-resistant moving targe...
Energy Efficient and Balanced Task Assignment Strategy for Multi-UAV Patrol Inspection System in Mobile Edge Computing NetworkKuan Jia, Dingcheng Yang, Yapeng Wang, Tianyun Shui, Chenji Liu2024-11-05下载This paper considers a patrol inspection scenario where multiple unmanned aerial vehicles (UAVs) are adopted to traverse multiple predetermined cruise points for data collection.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
The Hitchhiker's Guide to Programming and Optimizing Cache Coherent Heterogeneous Systems: CXL, NVLink-C2C, and AMD Infinity FabricZixuan Wang, Suyash Mahar, Luyi Li, Jangseon Park, Jinpyo Kim, Theodore Michailidis, Yue Pan, Mingyao Shen, Tajana Rosing, Dean Tullsen, Steven Swanson, Jishen Zhao2024-11-05下载We present a thorough analysis of the use of modern heterogeneous systems interconnected by various cachecoherent links, including CXL, NVLink-C2C, and Infinity Fabric.

cs.PF - Performance

标题作者发布日期PDF摘要
P-MOSS: Scheduling Main-Memory Indexes Over NUMA Servers Using Next Token PredictionYeasir Rayhan, Walid G. Aref2024-11-05下载Ever since the Dennard scaling broke down in the early 2000s and the frequency of the CPUs stalled, vendors have started to increase the core count in each CPU chip at the expense of introducing heter...
The Hitchhiker's Guide to Programming and Optimizing Cache Coherent Heterogeneous Systems: CXL, NVLink-C2C, and AMD Infinity FabricZixuan Wang, Suyash Mahar, Luyi Li, Jangseon Park, Jinpyo Kim, Theodore Michailidis, Yue Pan, Mingyao Shen, Tajana Rosing, Dean Tullsen, Steven Swanson, Jishen Zhao2024-11-05下载We present a thorough analysis of the use of modern heterogeneous systems interconnected by various cachecoherent links, including CXL, NVLink-C2C, and Infinity Fabric.
DeepContext: A Context-aware, Cross-platform, and Cross-framework Tool for Performance Profiling and Analysis of Deep Learning WorkloadsQidong Zhao, Hao Wu, Yuming Hao, Zilingfeng Ye, Jiajia Li, Xu Liu, Keren Zhou2024-11-05下载Effective performance profiling and analysis are essential for optimizing training and inference of deep learning models, especially given the growing complexity of heterogeneous computing environment...

基于 VitePress 构建