2024-02-09

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
Towards Fair and Firm Real-Time Scheduling in DNN Multi-Tenant Multi-Accelerator Systems via Reinforcement Learning	Enrico Russo, Francesco Giulio Blanco, Maurizio Palesi, Giuseppe Ascia, Davide Patti, Vincenzo Catania	2024-02-09	Download	This paper addresses the critical challenge of managing Quality of Service (QoS) in cloud services, focusing on the nuances of individual tenant expectations and varying Service Level Indicators (SLIs...
PULSE: Parametric Hardware Units for Low-power Sparsity-Aware Convolution Engine	Ilkin Aliyev, Tosiron Adegbija	2024-02-09	Download	Spiking Neural Networks (SNNs) have become popular for their more bio-realistic behavior than Artificial Neural Networks (ANNs). However, effectively leveraging the intrinsic, unstructured sparsity of...
Algorithm-hardware co-design for Energy-Efficient A/D conversion in ReRAM-based accelerators	Chenguang Zhang, Zhihang Yuan, Xingchen Li, Guangyu Sun	2024-02-09	Download	Deep neural networks are widely deployed in many fields. Due to the in-situ computation (known as processing in memory) capacity of the Resistive Random Access Memory (ReRAM) crossbar, ReRAM-based acc...

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics	Liangyu Zhao, Saeed Maleki, Yuanhong Wang, Zezhou Wang, Ziyue Yang, Hossein Pourreza, Arvind Krishnamurthy	2024-02-09	Download	As modern DNN models grow ever larger, collective communications between the accelerators (allreduce, etc.) emerge as a significant performance bottleneck.
Experiences Porting Distributed Applications to Asynchronous Tasks: A Multidimensional FFT Case-study	Alexander Strack, Christopher Taylor, Patrick Diehl, Dirk Pflüger	2024-02-09	Download	Parallel algorithms relying on synchronous parallelization libraries often experience adverse performance due to global synchronization barriers.
Population Protocols for Exact Plurality Consensus -- How a small chance of failure helps to eliminate insignificant opinions	Gregor Bankhamer, Petra Berenbrink, Felix Biermeier, Robert Elsässer, Hamed Hosseinpour, Dominik Kaaser, Peter Kling	2024-02-09	Download	We consider the exact plurality consensus problem for population protocols. Here, $n$ anonymous agents start each with one of $k$ opinions.
pSTL-Bench: A Micro-Benchmark Suite for Assessing Scalability of C++ Parallel STL Implementations	Ruben Laso, Diego Krupitza, Sascha Hunold	2024-02-09	Download	Since the advent of parallel algorithms in the C++17 Standard Template Library (STL), the STL has become a viable framework for creating performance-portable applications.
Energy efficiency optimization of task-parallel codes on asymmetric architectures	Luis Costero, Francisco D. Igual, Katzalin Olcoz, Francisco Tirado	2024-02-09	Download	We present a family of policies that, integrated within a runtime task scheduler (Nanox), pursue the goal of improving the energy efficiency of task-parallel executions with no intervention from the p...
Towards Fair and Firm Real-Time Scheduling in DNN Multi-Tenant Multi-Accelerator Systems via Reinforcement Learning	Enrico Russo, Francesco Giulio Blanco, Maurizio Palesi, Giuseppe Ascia, Davide Patti, Vincenzo Catania	2024-02-09	Download	This paper addresses the critical challenge of managing Quality of Service (QoS) in cloud services, focusing on the nuances of individual tenant expectations and varying Service Level Indicators (SLIs...
SuperBench: Improving Cloud AI Infrastructure Reliability with Proactive Validation	Yifan Xiong, Yuting Jiang, Ziyue Yang, Lei Qu, Guoshuai Zhao, Shuguang Liu, Dong Zhong, Boris Pinzur, Jie Zhang, Yang Wang, Jithin Jose, Hossein Pourreza, Jeff Baxter, Kushal Datta, Prabhat Ram, Luke Melton, Joe Chau, Peng Cheng, Yongqiang Xiong, Lidong Zhou	2024-02-09	Download	Reliability in cloud AI infrastructure is crucial for cloud service providers, prompting the widespread use of hardware redundancies. However, these redundancies can inadvertently lead to hidden degra...
Decentralized Proactive Model Offloading and Resource Allocation for Split and Federated Learning	Binbin Huang, Hailiang Zhao, Lingbin Wang, Wenzhuo Qian, Yuyu Yin, Shuiguang Deng	2024-02-09	Download	In the resource-constrained IoT-edge computing environment, Split Federated (SplitFed) learning is implemented to enhance training efficiency.

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics	Liangyu Zhao, Saeed Maleki, Yuanhong Wang, Zezhou Wang, Ziyue Yang, Hossein Pourreza, Arvind Krishnamurthy	2024-02-09	Download	As modern DNN models grow ever larger, collective communications between the accelerators (allreduce, etc.) emerge as a significant performance bottleneck.
HoneyDOC: An Efficient Honeypot Architecture Enabling All-Round Design	Wenjun Fan, Zhihui Du, Max Smith-Creasey, David Fernández	2024-02-09	Download	Honeypots are designed to trap the attacker with the purpose of investigating its malicious behavior. Owing to the increasing variety and sophistication of cyber attacks, how to capture high-quality a...
Toward Building a Semantic Network Inventory for Model-Driven Telemetry	I. D. Martínez-Casanueva, D. González-Sanchez, L. Bellido, D. Fernández, D. R. López	2024-02-09	Download	Network telemetry based on data models is expected to become the standard mechanism for collecting operational data from network devices efficiently.
DASH Adaptation Algorithm Based on Adaptive Forgetting Factor Estimation	M. Aguayo, L. Bellido, C. M. Lentisco, E. Pastor	2024-02-09	Download	The wide adoption of multimedia service capable mobile devices, the availability of better networks with higher bandwidths, and the availability of platforms offering digital content has led to an inc...
Towards a Wireless Physical-Layer Foundation Model: Challenges and Strategies	Jaron Fontaine, Adnan Shahid, Eli De Poorter	2024-02-09	Download	Artificial intelligence (AI) plays an important role in the dynamic landscape of wireless communications, solving challenges unattainable by traditional approaches.
On the Feasibility of Battery-Less LoRaWAN Communications using Energy Harvesting	Carmen Delgado, José María Sanz, Jeroen Famaey	2024-02-09	Download	From the outset, batteries have been the main power source for the Internet of Things (IoT). However, replacing and disposing of billions of dead batteries per year is costly in terms of maintenance a...
On Optimal Resource Allocation in Virtual Sensor Networks	Carmen Delgado, José Ramón Gállego, María Canales, Jorge Ortín, Sonda Bousnina, Matteo Cesana	2024-02-09	Download	Sensor network virtualization is a promising paradigm to move away from highly customized, application-specific wireless sensor networks deployment by opening up to the possibility of dynamically assig...
An experimental study: RF Fingerprinting of Bluetooth devices	Artis Rušiņš, Krišjānis Nesenbergs, Deniss Tiščenko, Pēteris Paikens	2024-02-09	Download	This paper presents an experimental study on radio frequency (RF) fingerprinting of Bluetooth Classic devices. Our research aims to provide a practical evaluation of the possibilities for RF fingerpri...
Resource Allocation for Channel Estimation in Reconfigurable Intelligent Surface-Aided Multi-Cell Networks	Yining Xu, Sheng Zhou	2024-02-09	Download	Reconfigurable intelligent surface (RIS) is a promising solution to deal with the blockage-sensitivity of millimeter wave band and reduce the high energy consumption caused by network densification.

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
pSTL-Bench: A Micro-Benchmark Suite for Assessing Scalability of C++ Parallel STL Implementations	Ruben Laso, Diego Krupitza, Sascha Hunold	2024-02-09	Download	Since the advent of parallel algorithms in the C++17 Standard Template Library (STL), the STL has become a viable framework for creating performance-portable applications.