2024-02-09
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Towards Fair and Firm Real-Time Scheduling in DNN Multi-Tenant Multi-Accelerator Systems via Reinforcement Learning | Enrico Russo, Francesco Giulio Blanco, Maurizio Palesi, Giuseppe Ascia, Davide Patti, Vincenzo Catania | 2024-02-09 | Download | This paper addresses the critical challenge of managing Quality of Service (QoS) in cloud services, focusing on the nuances of individual tenant expectations and varying Service Level Indicators (SLIs... |
| PULSE: Parametric Hardware Units for Low-power Sparsity-Aware Convolution Engine | Ilkin Aliyev, Tosiron Adegbija | 2024-02-09 | Download | Spiking Neural Networks (SNNs) have become popular for their more bio-realistic behavior than Artificial Neural Networks (ANNs). However, effectively leveraging the intrinsic, unstructured sparsity of... |
| Algorithm-hardware co-design for Energy-Efficient A/D conversion in ReRAM-based accelerators | Chenguang Zhang, Zhihang Yuan, Xingchen Li, Guangyu Sun | 2024-02-09 | Download | Deep neural networks are widely deployed in many fields. Due to the in-situ computation (known as processing in memory) capacity of the Resistive Random Access Memory (ReRAM) crossbar, ReRAM-based acc... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics | Liangyu Zhao, Saeed Maleki, Yuanhong Wang, Zezhou Wang, Ziyue Yang, Hossein Pourreza, Arvind Krishnamurthy | 2024-02-09 | Download | As modern DNN models grow ever larger, collective communications between the accelerators (allreduce, etc.) emerge as a significant performance bottleneck. |
| Experiences Porting Distributed Applications to Asynchronous Tasks: A Multidimensional FFT Case-study | Alexander Strack, Christopher Taylor, Patrick Diehl, Dirk Pflüger | 2024-02-09 | Download | Parallel algorithms relying on synchronous parallelization libraries often experience adverse performance due to global synchronization barriers. |
| Population Protocols for Exact Plurality Consensus -- How a small chance of failure helps to eliminate insignificant opinions | Gregor Bankhamer, Petra Berenbrink, Felix Biermeier, Robert Elsässer, Hamed Hosseinpour, Dominik Kaaser, Peter Kling | 2024-02-09 | Download | We consider the *exact plurality consensus* problem for *population protocols*. Here, anonymous agents start each with one of $k$ opinions. |
| pSTL-Bench: A Micro-Benchmark Suite for Assessing Scalability of C++ Parallel STL Implementations | Ruben Laso, Diego Krupitza, Sascha Hunold | 2024-02-09 | Download | Since the advent of parallel algorithms in the C++17 Standard Template Library (STL), the STL has become a viable framework for creating performance-portable applications. |
| Energy efficiency optimization of task-parallel codes on asymmetric architectures | Luis Costero, Francisco D. Igual, Katzalin Olcoz, Francisco Tirado | 2024-02-09 | Download | We present a family of policies that, integrated within a runtime task scheduler (Nanox), pursue the goal of improving the energy efficiency of task-parallel executions with no intervention from the p... |
| Towards Fair and Firm Real-Time Scheduling in DNN Multi-Tenant Multi-Accelerator Systems via Reinforcement Learning | Enrico Russo, Francesco Giulio Blanco, Maurizio Palesi, Giuseppe Ascia, Davide Patti, Vincenzo Catania | 2024-02-09 | Download | This paper addresses the critical challenge of managing Quality of Service (QoS) in cloud services, focusing on the nuances of individual tenant expectations and varying Service Level Indicators (SLIs... |
| SuperBench: Improving Cloud AI Infrastructure Reliability with Proactive Validation | Yifan Xiong, Yuting Jiang, Ziyue Yang, Lei Qu, Guoshuai Zhao, Shuguang Liu, Dong Zhong, Boris Pinzur, Jie Zhang, Yang Wang, Jithin Jose, Hossein Pourreza, Jeff Baxter, Kushal Datta, Prabhat Ram, Luke Melton, Joe Chau, Peng Cheng, Yongqiang Xiong, Lidong Zhou | 2024-02-09 | Download | Reliability in cloud AI infrastructure is crucial for cloud service providers, prompting the widespread use of hardware redundancies. However, these redundancies can inadvertently lead to hidden degra... |
| Decentralized Proactive Model Offloading and Resource Allocation for Split and Federated Learning | Binbin Huang, Hailiang Zhao, Lingbin Wang, Wenzhuo Qian, Yuyu Yin, Shuiguang Deng | 2024-02-09 | Download | In the resource-constrained IoT-edge computing environment, Split Federated (SplitFed) learning is implemented to enhance training efficiency. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics | Liangyu Zhao, Saeed Maleki, Yuanhong Wang, Zezhou Wang, Ziyue Yang, Hossein Pourreza, Arvind Krishnamurthy | 2024-02-09 | Download | As modern DNN models grow ever larger, collective communications between the accelerators (allreduce, etc.) emerge as a significant performance bottleneck. |
| HoneyDOC: An Efficient Honeypot Architecture Enabling All-Round Design | Wenjun Fan, Zhihui Du, Max Smith-Creasey, David Fernández | 2024-02-09 | Download | Honeypots are designed to trap the attacker with the purpose of investigating its malicious behavior. Owing to the increasing variety and sophistication of cyber attacks, how to capture high-quality a... |
| Toward Building a Semantic Network Inventory for Model-Driven Telemetry | I. D. Martínez-Casanueva, D. González-Sanchez, L. Bellido, D. Fernández, D. R. López | 2024-02-09 | Download | Network telemetry based on data models is expected to become the standard mechanism for collecting operational data from network devices efficiently. |
| DASH Adaptation Algorithm Based on Adaptive Forgetting Factor Estimation | M. Aguayo, L. Bellido, C. M. Lentisco, E. Pastor | 2024-02-09 | Download | The wide adoption of multimedia service capable mobile devices, the availability of better networks with higher bandwidths, and the availability of platforms offering digital content has led to an inc... |
| Towards a Wireless Physical-Layer Foundation Model: Challenges and Strategies | Jaron Fontaine, Adnan Shahid, Eli De Poorter | 2024-02-09 | Download | Artificial intelligence (AI) plays an important role in the dynamic landscape of wireless communications, solving challenges unattainable by traditional approaches. |
| On the Feasibility of Battery-Less LoRaWAN Communications using Energy Harvesting | Carmen Delgado, José María Sanz, Jeroen Famaey | 2024-02-09 | Download | From the outset, batteries have been the main power source for the Internet of Things (IoT). However, replacing and disposing of billions of dead batteries per year is costly in terms of maintenance a... |
| On Optimal Resource Allocation in Virtual Sensor Networks | Carmen Delgado, José Ramón Gállego, María Canales, Jorge Ortín, Sonda Bousnina, Matteo Cesana | 2024-02-09 | Download | Sensor network virtualization is a promising paradigm to move away from highly customized, application-specific wireless sensor networks deployment by opening up to the possibility of dynamically assig... |
| An experimental study: RF Fingerprinting of Bluetooth devices | Artis Rušiņš, Krišjānis Nesenbergs, Deniss Tiščenko, Pēteris Paikens | 2024-02-09 | Download | This paper presents an experimental study on radio frequency (RF) fingerprinting of Bluetooth Classic devices. Our research aims to provide a practical evaluation of the possibilities for RF fingerpri... |
| Resource Allocation for Channel Estimation in Reconfigurable Intelligent Surface-Aided Multi-Cell Networks | Yining Xu, Sheng Zhou | 2024-02-09 | Download | Reconfigurable intelligent surface (RIS) is a promising solution to deal with the blockage-sensitivity of millimeter wave band and reduce the high energy consumption caused by network densification. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| pSTL-Bench: A Micro-Benchmark Suite for Assessing Scalability of C++ Parallel STL Implementations | Ruben Laso, Diego Krupitza, Sascha Hunold | 2024-02-09 | Download | Since the advent of parallel algorithms in the C++17 Standard Template Library (STL), the STL has become a viable framework for creating performance-portable applications. |