Skip to content

2024-11-01

cs.AR - Architecture

标题作者发布日期PDF摘要
Automatically Improving LLM-based Verilog Generation using EDA Tool FeedbackJason Blocklove, Shailja Thakur, Benjamin Tan, Hammond Pearce, Siddharth Garg, Ramesh Karri2024-11-01下载Traditionally, digital hardware designs are written in the Verilog hardware description language (HDL) and debugged manually by engineers. This can be time-consuming and error-prone for complex design...
Multilayer Dataflow: Orchestrate Butterfly Sparsity to Accelerate Attention ComputationHaibin Wu, Wenming Li, Kai Yan, Zhihua Fan, Peiyang Wu, Yuqun Liu, Yanhuan Liu, Ziqing Qiang, Meng Wu, Kunming Liu, Xiaochun Ye, Dongrui Fan2024-11-01下载Recent neural networks (NNs) with self-attention exhibit competitiveness across different AI domains, but the essential attention mechanism brings massive computation and memory demands.
DeepSeq2: Enhanced Sequential Circuit Learning with Disentangled RepresentationsSadaf Khan, Zhengyuan Shi, Ziyang Zheng, Min Li, Qiang Xu2024-11-01下载Circuit representation learning is increasingly pivotal in Electronic Design Automation (EDA), serving various downstream tasks with enhanced model efficiency and accuracy.
Pandora's Box in Your SSD: The Untold Dangers of NVMeRick Wertenbroek, Alberto Dassatti2024-11-01下载Modern operating systems manage and abstract hardware resources, to ensure efficient execution of user workloads. The operating system must securely interface with often untrusted user code while rely...
Inference-to-complete: A High-performance and Programmable Data-plane Co-processor for Neural-network-driven Traffic AnalysisDong Wen, Zhongpei Liu, Tong Yang, Tao Li, Tianyun Li, Chenglong Li, Jie Li, Zhigang Sun2024-11-01下载Neural-networks-driven intelligent data-plane (NN-driven IDP) is becoming an emerging topic for excellent accuracy and high performance. Meanwhile we argue that NN-driven IDP should satisfy three desi...
LUTMUL: Exceed Conventional FPGA Roofline Limit by LUT-based Efficient Multiplication for Neural Network InferenceYanyue Xie, Zhengang Li, Dana Diaconu, Suranga Handagala, Miriam Leeser, Xue Lin2024-11-01下载For FPGA-based neural network accelerators, digital signal processing (DSP) blocks have traditionally been the cornerstone for handling multiplications.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
When Speculation Spills Secrets: Side Channels via Speculative Decoding In LLMsJiankun Wei, Abdulrahman Abdulrazzag, Tianchen Zhang, Adel Muursepp, Gururaj Saileshwar2024-11-01下载Deployed large language models (LLMs) often rely on speculative decoding, a technique that generates and verifies multiple candidate tokens in parallel, to improve throughput and latency.
Cephalo: Harnessing Heterogeneous GPU Clusters for Training Transformer ModelsRunsheng Benson Guo, Utkarsh Anand, Arthur Chen, Khuzaima Daudjee2024-11-01下载Training transformer models requires substantial GPU compute and memory resources. In homogeneous clusters, distributed strategies allocate resources evenly, but this approach is inefficient for heter...
Identify Backdoored Model in Federated Learning via Individual UnlearningJiahao Xu, Zikai Zhang, Rui Hu2024-11-01下载Backdoor attacks present a significant threat to the robustness of Federated Learning (FL) due to their stealth and effectiveness. They maintain both the main task of the FL system and the backdoor ta...
LCP: Enhancing Scientific Data Management with Lossy Compression for ParticlesLongtao Zhang, Ruoyu Li, Congrong Ren, Sheng Di, Jinyang Liu, Jiajun Huang, Robert Underwood, Pascal Grosset, Dingwen Tao, Xin Liang, Hanqi Guo, Franck Capello, Kai Zhao2024-11-01下载Many scientific applications opt for particles instead of meshes as their basic primitives to model complex systems composed of billions of discrete entities.
Private, Augmentation-Robust and Task-Agnostic Data Valuation Approach for Data MarketplaceTayyebeh Jahani-Nezhad, Parsa Moradi, Mohammad Ali Maddah-Ali, Giuseppe Caire2024-11-01下载Evaluating datasets in data marketplaces, where the buyer aim to purchase valuable data, is a critical challenge. In this paper, we introduce an innovative task-agnostic data valuation method called P...
Transforming Agriculture: Exploring Diverse Practices and Technological InnovationsRamakant Kumar2024-11-01下载Agriculture is a vital sector that significantly contributes to the economy and food security, particularly in regions like Varanasi, India. This paper explores various types of agriculture practiced ...
Federated Voxel Scene Graph for Intracranial HemorrhageAntoine P. Sanner, Jonathan Stieber, Nils F. Grauhan, Suam Kim, Marc A. Brockmann, Ahmed E. Othman, Anirban Mukhopadhyay2024-11-01下载Intracranial Hemorrhage is a potentially lethal condition whose manifestation is vastly diverse and shifts across clinical centers worldwide. Deep-learning-based solutions are starting to model comple...
3-Slot-Finality Protocol for EthereumFrancesco D'Amato, Roberto Saltini, Thanh-Hai Tran, Luca Zanolini2024-11-01下载Gasper, the consensus protocol currently employed by Ethereum, typically requires 64 to 95 slots -- the units of time during which a new chain extending the previous one by one block is proposed and v...
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model PerformanceJaskirat Singh, Bram Adams, Ahmed E. Hassan2024-11-01下载To help MLOps engineers decide which operator to use in which deployment scenario, this study aims to empirically assess the accuracy vs latency trade-off of white-box (training-based) and black-box o...
SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compileRuisi Zhang, Tianyu Liu, Will Feng, Andrew Gu, Sanket Purandare, Wanchao Liang, Francisco Massa2024-11-01下载Distributed training of large models consumes enormous computation resources and requires substantial engineering efforts to compose various training techniques.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Effective ML Model Versioning in Edge NetworksFin Gentzen, Mounir Bensalem, Admela Jukan2024-11-01下载Machine learning (ML) models, data and software need to be regularly updated whenever essential version updates are released and feasible for integration.
Radiance Field Delta Video Compression in Edge-Enabled Vehicular MetaverseMatúš Dopiriak, Eugen Šlapak, Juraj Gazda, Devendra Singh Gurjar, Mohammad Abdullah Al Faruque, Marco Levorato2024-11-01下载Connected and autonomous vehicles (CAVs) offload computationally intensive tasks to multi-access edge computing (MEC) servers via vehicle-to-infrastructure (V2I) communication, enabling applications w...
AI-based traffic analysis in digital twin networksSarah Al-Shareeda, Khayal Huseynov, Lal Verda Cakir, Craig Thomson, Mehmet Ozdem, Berk Canberk2024-11-01下载In today's networked world, Digital Twin Networks (DTNs) are revolutionizing how we understand and optimize physical networks. These networks, also known as 'Digital Twin Networks (DTNs)' or 'Networks...
Wireless Federated Learning over UAV-enabled Integrated Sensing and CommunicationShaba Shaon, Tien Nguyen, Lina Mohjazi, Aryan Kaushik, Dinh C. Nguyen2024-11-01下载This paper studies a new latency optimization problem in unmanned aerial vehicles (UAVs)-enabled federated learning (FL) with integrated sensing and communication.
Tactical Edge IoT in Defense and National SecurityPaula Fraga-Lamas, Tiago M. Fernandez-Carames2024-11-01下载The deployment of Internet of Things (IoT) systems in Defense and National Security faces some limitations that can be addressed with Edge Computing approaches.
IoT Architectures for Indoor Radon Management: A Prospective AnalysisOscar Blanco-Novoa, Paulo Barros, Paula Fraga-Lamas, Sergio Ivan Lopes, Tiago M. Fernandez-Carames2024-11-01下载The demand for real-time Indoor Air Quality (IAQ) management has increased recently, since low-cost and modern sensors such as Particulate Matter (PM), Volatile Organic Compounds (VOCs), Carbon Monoxi...
Synergistic Interplay of Large Language Model and Digital Twin for Autonomous Optical Networks: Field DemonstrationsYuchen Song, Yao Zhang, Anni Zhou, Yan Shi, Shikui Shen, Xiongyan Tang, Jin Li, Min Zhang, Danshi Wang2024-11-01下载The development of large language models (LLM) has revolutionized various fields and is anticipated to drive the advancement of autonomous systems.
Diffusion Models as Network Optimizers: Explorations and AnalysisRuihuai Liang, Bo Yang, Pengyu Chen, Xianjin Li, Yifan Xue, Zhiwen Yu, Xuelin Cao, Yan Zhang, Mérouane Debbah, H. Vincent Poor, Chau Yuen2024-11-01下载Network optimization is a fundamental challenge in the Internet of Things (IoT) network, often characterized by complex features that make it difficult to solve these problems.
Inference-to-complete: A High-performance and Programmable Data-plane Co-processor for Neural-network-driven Traffic AnalysisDong Wen, Zhongpei Liu, Tong Yang, Tao Li, Tianyun Li, Chenglong Li, Jie Li, Zhigang Sun2024-11-01下载Neural-networks-driven intelligent data-plane (NN-driven IDP) is becoming an emerging topic for excellent accuracy and high performance. Meanwhile we argue that NN-driven IDP should satisfy three desi...
Distributed Computation Offloading for Energy Provision Minimization in WP-MEC Networks with Multiple HAPsXiaoying Liu, Anping Chen, Kechen Zheng, Kaikai Chi, Bin Yang, Tarik Taleb2024-11-01下载This paper investigates a wireless powered mobile edge computing (WP-MEC) network with multiple hybrid access points (HAPs) in a dynamic environment, where wireless devices (WDs) harvest energy from r...
Diffusion-based Auction Mechanism for Efficient Resource Management in 6G-enabled Vehicular MetaversesJiawen Kang, Yongju Tong, Yue Zhong, Junlong Chen, Minrui Xu, Dusit Niyato, Runrong Deng, Shiwen Mao2024-11-01下载The rise of 6G-enable Vehicular Metaverses is transforming the automotive industry by integrating immersive, real-time vehicular services through ultra-low latency and high bandwidth connectivity.
Task-oriented Age of Information for Remote Monitoring SystemsShuying Gan, Xijun Wang, Chao Xu, Xiang Chen2024-11-01下载The emergence of intelligent applications has fostered the development of a task-oriented communication paradigm, where a comprehensive, universal, and practical metric is crucial for unleashing the p...
Quantum Entanglement Path Selection and Qubit Allocation via Adversarial Group Neural BanditsYin Huang, Lei Wang, Jie Xu2024-11-01下载Quantum Data Networks (QDNs) have emerged as a promising framework in the field of information processing and transmission, harnessing the principles of quantum mechanics.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Enhancing Adaptive Mixed-Criticality Scheduling with Deep Reinforcement LearningBruno Mendes, Pedro F. Souto, Pedro C. Diniz2024-11-01下载Adaptive Mixed-Criticality (AMC) is a fixed-priority preemptive scheduling algorithm for mixed-criticality hard real-time systems. It dominates many other scheduling algorithms for mixed-criticality s...

cs.PF - Performance

标题作者发布日期PDF摘要
Diversity in Network-Friendly RecommendationsEvangelia Tzimpimpaki, Thrasyvoulos Spyropoulos2024-11-01下载In recent years, the Internet has been dominated by content-rich platforms, employing recommendation systems to provide users with more appealing content (e.g., videos in YouTube, movies in Netflix).
Inducing Semi-Structured Sparsity by Masking for Efficient Model Inference in Convolutional NetworksDavid A. Danhofer2024-11-01下载The crucial role of convolutional models, both as standalone vision models and backbones in foundation models, necessitates effective acceleration techniques.

基于 VitePress 构建