Skip to content

2024-10-15

cs.AR - Architecture

标题作者发布日期PDF摘要
FVEval: Understanding Language Model Capabilities in Formal Verification of Digital HardwareMinwoo Kang, Mingjie Liu, Ghaith Bany Hamad, Syed Suhaib, Haoxing Ren2024-10-15下载The remarkable reasoning and code generation capabilities of large language models (LLMs) have spurred significant interest in applying LLMs to enable task automation in digital chip design.
MLPerf Power: Benchmarking the Energy Efficiency of Machine Learning Systems from Microwatts to Megawatts for Sustainable AIArya Tschand, Arun Tejusve Raghunath Rajan, Sachin Idgunji, Anirban Ghosh, Jeremy Holleman, Csaba Kiraly, Pawan Ambalkar, Ritika Borkar, Ramesh Chukka, Trevor Cockrell, Oliver Curtis, Grigori Fursin, Miro Hodak, Hiwot Kassa, Anton Lokhmotov, Dejan Miskovic, Yuechao Pan, Manu Prasad Manmathan, Liz Raymond, Tom St. John, Arjun Suresh, Rowan Taubitz, Sean Zhan, Scott Wasson, David Kanter, Vijay Janapa Reddi2024-10-15下载Rapid adoption of machine learning (ML) technologies has led to a surge in power consumption across diverse systems, from tiny IoT devices to massive datacenter clusters.
DPD-NeuralEngine: A 22-nm 6.6-TOPS/W/mm2^2 Recurrent Neural Network Accelerator for Wideband Power Amplifier Digital Pre-DistortionAng Li, Haolin Wu, Yizhuo Wu, Qinyu Chen, Leo C. N. de Vreede, Chang Gao2024-10-15下载The increasing adoption of Deep Neural Network (DNN)-based Digital Pre-distortion (DPD) in modern communication systems necessitates efficient hardware implementations.
Taming Performance Variability caused by Client-Side Hardware ConfigurationGeorgia Antoniou, Haris Volos, Yiannakis Sazeides2024-10-15下载Many online services running in datacenters are implemented using a microservice software architecture characterized by strict latency requirements.
Sorted Weight Sectioning for Energy-Efficient Unstructured Sparse DNNs on Compute-in-Memory CrossbarsMatheus Farias, H. T. Kung2024-10-15下载We introduce \textit{sorted weight sectioning} (SWS): a weight allocation algorithm that places sorted deep neural network (DNN) weight sections on bit-sliced compute-in-memory (CIM) crossbars to re...
Theoretical Analysis of the Efficient-Memory Matrix Storage Method for Quantum Emulation Accelerators with Gate Fusion on FPGAsTran Xuan Hieu Le, Hoai Luan Pham, Tuan Hai Vu, Vu Trung Duong Le, Nakashima Yasuhiko2024-10-15下载Quantum emulators play an important role in the development and testing of quantum algorithms, especially given the limitations of the current FTQC era.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Juggernaut: Efficient Crypto-Agnostic Byzantine AgreementDaniel Collins, Yuval Efron, Jovan Komatovic2024-10-15下载It is well known that a trusted setup allows one to solve the Byzantine agreement problem in the presence of t<n/2t<n/2 corruptions, bypassing the setup-free t<n/3t<n/3 barrier.
Optimal Checkpoint Interval with Availability as an Objective FunctionNirmal Raj Saxena, Saurabh Hukerikar, Mikolaj Blaz, Swapna Raj2024-10-15下载We present a simplified derivation of the optimal checkpoint interval in Young_1974 [1]. The optimal checkpoint interval derivation in [1] is based on minimizing the total lost time as an objective-fu...
Consider an Applications-First Approach for PDCMichelle Strout2024-10-15下载I propose an applications-first approach for adjusting how parallel and distributed computing concepts are incorporated into curricula. By focusing on practical applications that leverage parallelism ...
Accelerating Python Applications with Dask and ProxyStoreJ. Gregory Pauloski, Klaudiusz Rydzy, Valerie Hayot-Sasson, Ian Foster, Kyle Chard2024-10-15下载Applications are increasingly written as dynamic workflows underpinned by an execution framework that manages asynchronous computations across distributed hardware.
MLPerf Power: Benchmarking the Energy Efficiency of Machine Learning Systems from Microwatts to Megawatts for Sustainable AIArya Tschand, Arun Tejusve Raghunath Rajan, Sachin Idgunji, Anirban Ghosh, Jeremy Holleman, Csaba Kiraly, Pawan Ambalkar, Ritika Borkar, Ramesh Chukka, Trevor Cockrell, Oliver Curtis, Grigori Fursin, Miro Hodak, Hiwot Kassa, Anton Lokhmotov, Dejan Miskovic, Yuechao Pan, Manu Prasad Manmathan, Liz Raymond, Tom St. John, Arjun Suresh, Rowan Taubitz, Sean Zhan, Scott Wasson, David Kanter, Vijay Janapa Reddi2024-10-15下载Rapid adoption of machine learning (ML) technologies has led to a surge in power consumption across diverse systems, from tiny IoT devices to massive datacenter clusters.
Cilium and VDM -- Towards Formal Analysis of Cilium PoliciesTomas Kulik, Jalil Boudjadar2024-10-15下载Industrial control systems are becoming more distributed and interconnected to allow for interaction with modern computing infrastructures. Furthermore, the amount of data generated by these systems i...
From promise to practice: realizing high-performance decentralized trainingZesen Wang, Jiaojiao Zhang, Xuyang Wu, Mikael Johansson2024-10-15下载Decentralized training of deep neural networks has attracted significant attention for its theoretically superior scalability over synchronous data-parallel methods like All-Reduce.
Age-of-Gradient Updates for Federated Learning over Random Access ChannelsYu Heng Wu, Houman Asgari, Stefano Rini, Andrea Munari2024-10-15下载This paper studies the problem of federated training of a deep neural network (DNN) over a random access channel (RACH) such as in computer networks, wireless networks, and cellular systems.
Min-Max Gathering on Infinite GridAbhinav Chakraborty, Pritam Goswami, Satakshi Ghosh2024-10-15下载Gathering is a fundamental coordination problem in swarm robotics, where the objective is to bring robots together at a point not known to them at the beginning.
ATTNChecker: Highly-Optimized Fault Tolerant Attention for Large Language Model TrainingYuhang Liang, Xinyi Li, Jie Ren, Ang Li, Bo Fang, Jieyang Chen2024-10-15下载Large Language Models (LLMs) have demonstrated remarkable performance in various natural language processing tasks. However, the training of these models is computationally intensive and susceptible t...
Federated Learning framework for LoRaWAN-enabled IIoT communication: A case studyOscar Torres Sanchez, Guilherme Borges, Duarte Raposo, André Rodrigues, Fernando Boavida, Jorge Sá Silva2024-10-15下载The development of intelligent Industrial Internet of Things (IIoT) systems promises to revolutionize operational and maintenance practices, driving improvements in operational efficiency.
Neuromorphic Programming: Emerging Directions for Brain-Inspired HardwareSteven Abreu, Jens E. Pedersen2024-10-15下载The value of brain-inspired neuromorphic computers critically depends on our ability to program them for relevant tasks. Currently, neuromorphic hardware often relies on machine learning methods adapt...
Trust-free Personalized Decentralized LearningYawen Li, Yan Li, Junping Du, Yingxia Shao, Meiyu Liang, Guanhua Ye2024-10-15下载Personalized collaborative learning in federated settings faces a critical trade-off between customization and participant trust. Existing approaches typically rely on centralized coordinators or trus...
Representation Similarity: A Better Guidance of DNN Layer Sharing for Edge Computing without TrainingBryan Bo Cao, Abhinav Sharma, Manavjeet Singh, Anshul Gandhi, Samir Das, Shubham Jain2024-10-15下载Edge computing has emerged as an alternative to reduce transmission and processing delay and preserve privacy of the video streams. However, the ever-increasing complexity of Deep Neural Networks (DNN...
Isambard-AI: a leadership class supercomputer optimised specifically for Artificial IntelligenceSimon McIntosh-Smith, Sadaf R Alam, Christopher Woods2024-10-15下载Isambard-AI is a new, leadership-class supercomputer, designed to support AI-related research. Based on the HPE Cray EX4000 system, and housed in a new, energy efficient Modular Data Centre in Bristol...
Asynchronous 3-Majority Dynamics with Many OpinionsColin Cooper, Frederik Mallmann-Trenn, Tomasz Radzik, Nobutaka Shimizu, Takeharu Shiraga2024-10-15下载We consider 3-Majority, a probabilistic consensus dynamics on a complete graph with nn vertices, each vertex starting with one of kk initial opinions.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Enhancing IoT Communication and Localization via Smarter AntennaTianxiang Li, Haofan Lu, Omid Abari2024-10-15下载The convergence of sensing and communication functionalities is poised to become a pivotal feature of the sixth-generation (6G) wireless networks.
Data-Driven Cellular Network Selector for Vehicle TeleoperationsBarak Gahtan, Reuven Cohen, Alex M. Bronstein, Eli Shapira2024-10-15下载Remote control of robotic systems, also known as teleoperation, is crucial for the development of autonomous vehicle (AV) technology. It allows a remote operator to view live video from AVs and, in so...
Telco-DPR: A Hybrid Dataset for Evaluating Retrieval Models of 3GPP Technical SpecificationsThaina Saraiva, Marco Sousa, Pedro Vieira, António Rodrigues2024-10-15下载This paper proposes a Question-Answering (QA) system for the telecom domain using 3rd Generation Partnership Project (3GPP) technical documents.
Federated Learning framework for LoRaWAN-enabled IIoT communication: A case studyOscar Torres Sanchez, Guilherme Borges, Duarte Raposo, André Rodrigues, Fernando Boavida, Jorge Sá Silva2024-10-15下载The development of intelligent Industrial Internet of Things (IIoT) systems promises to revolutionize operational and maintenance practices, driving improvements in operational efficiency.
Demo: Testing AI-driven MAC Learning in Autonomic NetworksLeonard Paeleke, Navid Keshtiarast, Paul Seehofer, Roland Bless, Holger Karl, Marina Petrova, Martina Zitterbart2024-10-15下载6G networks will be highly dynamic, re-configurable, and resilient. To enable and support such features, employing AI has been suggested. Integrating AIin networks will likely require distributed AI d...
Optimizing Version Innovation Age for Monitoring Markovian Source in Energy-Harvesting SystemsMehrdad Salimnejad, Anthony Ephremides, Marios Kountouris, Nikolaos Pappas2024-10-15下载We study the real-time remote tracking of a two-state Markov process by an energy harvesting source. The source decides whether to transmit over an unreliable channel based on the state.
Energy Efficient Transmission Parameters Selection Method Using Reinforcement Learning in Distributed LoRa NetworksRyotai Airiyoshi, Mikio Hasegawa, Tomoaki Ohtsuki, Aohan Li2024-10-15下载With the increase in demand for Internet of Things (IoT) applications, the number of IoT devices has drastically grown, making spectrum resources seriously insufficient.
Enhancing Management of Large-Scale Optical Networks through RFID Technology IntegrationXiaoying Zheng, Xingqi Xuan, Shilie Zheng, Xiaonan Hui, Xianmin Zhang2024-10-15下载Managing large-scale optical distribution networks is a daunting task. This paper introduces a novel solution using radio frequency identification (RFID) technology to transform the procedure we monit...
Exploring Content Concealment in EmailLucas Betts, Robert Biddle, Danielle Lottridge, Giovanni Russello2024-10-15下载The never-ending barrage of malicious emails, such as spam and phishing, is of constant concern for users, who rely on countermeasures such as email filters to keep the intended recipient safe.

cs.PF - Performance

标题作者发布日期PDF摘要
A Zoned Storage Optimized Flash Cache on ZNS SSDsChongzhuo Yang, Chang Guo, Ming Zhao, Zhichao Cao2024-10-15下载Zoned Namespace SSDs (ZNS) are introduced recently to mitigate the block interface penalties of flash-based SSDs. It is a good opportunity for flash cache to address cache throughput and write amplifi...

基于 VitePress 构建