
2024-02-09

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Towards Fair and Firm Real-Time Scheduling in DNN Multi-Tenant Multi-Accelerator Systems via Reinforcement Learning | Enrico Russo, Francesco Giulio Blanco, Maurizio Palesi, Giuseppe Ascia, Davide Patti, Vincenzo Catania | 2024-02-09 | Download | This paper addresses the critical challenge of managing Quality of Service (QoS) in cloud services, focusing on the nuances of individual tenant expectations and varying Service Level Indicators (SLIs... |
| PULSE: Parametric Hardware Units for Low-power Sparsity-Aware Convolution Engine | Ilkin Aliyev, Tosiron Adegbija | 2024-02-09 | Download | Spiking Neural Networks (SNNs) have become popular for their more bio-realistic behavior than Artificial Neural Networks (ANNs). However, effectively leveraging the intrinsic, unstructured sparsity of... |
| Algorithm-hardware co-design for Energy-Efficient A/D conversion in ReRAM-based accelerators | Chenguang Zhang, Zhihang Yuan, Xingchen Li, Guangyu Sun | 2024-02-09 | Download | Deep neural networks are widely deployed in many fields. Due to the in-situ computation (known as processing in memory) capacity of the Resistive Random Access Memory (ReRAM) crossbar, ReRAM-based acc... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics | Liangyu Zhao, Saeed Maleki, Yuanhong Wang, Zezhou Wang, Ziyue Yang, Hossein Pourreza, Arvind Krishnamurthy | 2024-02-09 | Download | As modern DNN models grow ever larger, collective communications between the accelerators (allreduce, etc.) emerge as a significant performance bottleneck. |
| Experiences Porting Distributed Applications to Asynchronous Tasks: A Multidimensional FFT Case-study | Alexander Strack, Christopher Taylor, Patrick Diehl, Dirk Pflüger | 2024-02-09 | Download | Parallel algorithms relying on synchronous parallelization libraries often experience adverse performance due to global synchronization barriers. |
| Population Protocols for Exact Plurality Consensus -- How a small chance of failure helps to eliminate insignificant opinions | Gregor Bankhamer, Petra Berenbrink, Felix Biermeier, Robert Elsässer, Hamed Hosseinpour, Dominik Kaaser, Peter Kling | 2024-02-09 | Download | We consider the *exact plurality consensus* problem for *population protocols*. Here, $n$ anonymous agents start each with one of $k$ opinions. |
| pSTL-Bench: A Micro-Benchmark Suite for Assessing Scalability of C++ Parallel STL Implementations | Ruben Laso, Diego Krupitza, Sascha Hunold | 2024-02-09 | Download | Since the advent of parallel algorithms in the C++17 Standard Template Library (STL), the STL has become a viable framework for creating performance-portable applications. |
| Energy efficiency optimization of task-parallel codes on asymmetric architectures | Luis Costero, Francisco D. Igual, Katzalin Olcoz, Francisco Tirado | 2024-02-09 | Download | We present a family of policies that, integrated within a runtime task scheduler (Nanox), pursue the goal of improving the energy efficiency of task-parallel executions with no intervention from the p... |
| Towards Fair and Firm Real-Time Scheduling in DNN Multi-Tenant Multi-Accelerator Systems via Reinforcement Learning | Enrico Russo, Francesco Giulio Blanco, Maurizio Palesi, Giuseppe Ascia, Davide Patti, Vincenzo Catania | 2024-02-09 | Download | This paper addresses the critical challenge of managing Quality of Service (QoS) in cloud services, focusing on the nuances of individual tenant expectations and varying Service Level Indicators (SLIs... |
| SuperBench: Improving Cloud AI Infrastructure Reliability with Proactive Validation | Yifan Xiong, Yuting Jiang, Ziyue Yang, Lei Qu, Guoshuai Zhao, Shuguang Liu, Dong Zhong, Boris Pinzur, Jie Zhang, Yang Wang, Jithin Jose, Hossein Pourreza, Jeff Baxter, Kushal Datta, Prabhat Ram, Luke Melton, Joe Chau, Peng Cheng, Yongqiang Xiong, Lidong Zhou | 2024-02-09 | Download | Reliability in cloud AI infrastructure is crucial for cloud service providers, prompting the widespread use of hardware redundancies. However, these redundancies can inadvertently lead to hidden degra... |
| Decentralized Proactive Model Offloading and Resource Allocation for Split and Federated Learning | Binbin Huang, Hailiang Zhao, Lingbin Wang, Wenzhuo Qian, Yuyu Yin, Shuiguang Deng | 2024-02-09 | Download | In the resource-constrained IoT-edge computing environment, Split Federated (SplitFed) learning is implemented to enhance training efficiency. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics | Liangyu Zhao, Saeed Maleki, Yuanhong Wang, Zezhou Wang, Ziyue Yang, Hossein Pourreza, Arvind Krishnamurthy | 2024-02-09 | Download | As modern DNN models grow ever larger, collective communications between the accelerators (allreduce, etc.) emerge as a significant performance bottleneck. |
| HoneyDOC: An Efficient Honeypot Architecture Enabling All-Round Design | Wenjun Fan, Zhihui Du, Max Smith-Creasey, David Fernández | 2024-02-09 | Download | Honeypots are designed to trap the attacker with the purpose of investigating its malicious behavior. Owing to the increasing variety and sophistication of cyber attacks, how to capture high-quality a... |
| Toward Building a Semantic Network Inventory for Model-Driven Telemetry | I. D. Martínez-Casanueva, D. González-Sanchez, L. Bellido, D. Fernández, D. R. López | 2024-02-09 | Download | Network telemetry based on data models is expected to become the standard mechanism for collecting operational data from network devices efficiently. |
| DASH Adaptation Algorithm Based on Adaptive Forgetting Factor Estimation | M. Aguayo, L. Bellido, C. M. Lentisco, E. Pastor | 2024-02-09 | Download | The wide adoption of multimedia service capable mobile devices, the availability of better networks with higher bandwidths, and the availability of platforms offering digital content has led to an inc... |
| Towards a Wireless Physical-Layer Foundation Model: Challenges and Strategies | Jaron Fontaine, Adnan Shahid, Eli De Poorter | 2024-02-09 | Download | Artificial intelligence (AI) plays an important role in the dynamic landscape of wireless communications, solving challenges unattainable by traditional approaches. |
| On the Feasibility of Battery-Less LoRaWAN Communications using Energy Harvesting | Carmen Delgado, José María Sanz, Jeroen Famaey | 2024-02-09 | Download | From the outset, batteries have been the main power source for the Internet of Things (IoT). However, replacing and disposing of billions of dead batteries per year is costly in terms of maintenance a... |
| On Optimal Resource Allocation in Virtual Sensor Networks | Carmen Delgado, José Ramón Gállego, María Canales, Jorge Ortín, Sonda Bousnina, Matteo Cesana | 2024-02-09 | Download | Sensor network virtualization is a promising paradigm to move away from highly customized, application-specific wireless sensor networks deployment by opening up to the possibility of dynamically assig... |
| An experimental study: RF Fingerprinting of Bluetooth devices | Artis Rušiņš, Krišjānis Nesenbergs, Deniss Tiščenko, Pēteris Paikens | 2024-02-09 | Download | This paper presents an experimental study on radio frequency (RF) fingerprinting of Bluetooth Classic devices. Our research aims to provide a practical evaluation of the possibilities for RF fingerpri... |
| Resource Allocation for Channel Estimation in Reconfigurable Intelligent Surface-Aided Multi-Cell Networks | Yining Xu, Sheng Zhou | 2024-02-09 | Download | Reconfigurable intelligent surface (RIS) is a promising solution to deal with the blockage-sensitivity of millimeter wave band and reduce the high energy consumption caused by network densification. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| pSTL-Bench: A Micro-Benchmark Suite for Assessing Scalability of C++ Parallel STL Implementations | Ruben Laso, Diego Krupitza, Sascha Hunold | 2024-02-09 | Download | Since the advent of parallel algorithms in the C++17 Standard Template Library (STL), the STL has become a viable framework for creating performance-portable applications. |
