Appearance
2024-02-28
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Accelerating Computer Architecture Simulation through Machine Learning | Wajid Ali, Ayaz Akram | 2024-02-28 | 下载 | This paper presents our approach to accelerate computer architecture simulation by leveraging machine learning techniques. Traditional computer architecture simulations are time-consuming, making it c... |
| Functionally-Complete Boolean Logic in Real DRAM Chips: Experimental Characterization and Analysis | Ismail Emir Yuksel, Yahya Can Tugrul, Ataberk Olgun, F. Nisa Bostanci, A. Giray Yaglikci, Geraldo F. Oliveira, Haocong Luo, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu | 2024-02-28 | 下载 | Processing-using-DRAM (PuD) is an emerging paradigm that leverages the analog operational properties of DRAM circuitry to enable massively parallel in-DRAM computation. |
| Spatial Variation-Aware Read Disturbance Defenses: Experimental Analysis of Real DRAM Chips and Implications on Future Solutions | Abdullah Giray Yağlıkçı, Yahya Can Tuğrul, Geraldo F. Oliveira, İsmail Emir Yüksel, Ataberk Olgun, Haocong Luo, Onur Mutlu | 2024-02-28 | 下载 | Read disturbance in modern DRAM chips is a widespread phenomenon and is reliably used for breaking memory isolation, a fundamental building block for building robust systems. |
| Energy-Aware Heterogeneous Federated Learning via Approximate DNN Accelerators | Kilian Pfeiffer, Konstantinos Balaskas, Kostas Siozios, Jörg Henkel | 2024-02-28 | 下载 | In Federated Learning (FL), devices that participate in the training usually have heterogeneous resources, i.e., energy availability. In current deployments of FL, devices that do not fulfill certain ... |
| PIMSYN: Synthesizing Processing-in-memory CNN Accelerators | Wanqian Li, Xiaotian Sun, Xinyu Wang, Lei Wang, Yinhe Han, Xiaoming Chen | 2024-02-28 | 下载 | Processing-in-memory architectures have been regarded as a promising solution for CNN acceleration. Existing PIM accelerator designs rely heavily on the experience of experts and require significant m... |
| PIMSIM-NN: An ISA-based Simulation Framework for Processing-in-Memory Accelerators | Xinyu Wang, Xiaotian Sun, Yinhe Han, Xiaoming Chen | 2024-02-28 | 下载 | Processing-in-memory (PIM) has shown extraordinary potential in accelerating neural networks. To evaluate the performance of PIM accelerators, we present an ISA-based simulation framework including a ... |
| A Hierarchical Dataflow-Driven Heterogeneous Architecture for Wireless Baseband Processing | Limin Jiang, Yi Shi, Yintao Liu, Qingyu Deng, Siyi Xu, Yihao Shen, Fangfang Ye, Shan Cao, Zhiyuan Jiang | 2024-02-28 | 下载 | Wireless baseband processing (WBP) is a key element of wireless communications, with a series of signal processing modules to improve data throughput and counter channel fading. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Functionally-Complete Boolean Logic in Real DRAM Chips: Experimental Characterization and Analysis | Ismail Emir Yuksel, Yahya Can Tugrul, Ataberk Olgun, F. Nisa Bostanci, A. Giray Yaglikci, Geraldo F. Oliveira, Haocong Luo, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu | 2024-02-28 | 下载 | Processing-using-DRAM (PuD) is an emerging paradigm that leverages the analog operational properties of DRAM circuitry to enable massively parallel in-DRAM computation. |
| Libfork: portable continuation-stealing with stackless coroutines | Conor John Williams, James Elliott | 2024-02-28 | 下载 | Fully-strict fork-join parallelism is a powerful model for shared-memory programming due to its optimal time scaling and strong bounds on memory scaling. |
| MaRDIFlow: A CSE workflow framework for abstracting meta-data from FAIR computational experiments | Pavan L. Veluvali, Jan Heiland, Peter Benner | 2024-02-28 | 下载 | Numerical algorithms and computational tools are instrumental in navigating and addressing complex simulation and data processing tasks. The exponential growth of metadata and parameter-driven simulat... |
| TrustRate: A Decentralized Platform for Hijack-Resistant Anonymous Reviews | Rohit Dwivedula, Sriram Sridhar, Sambhav Satija, Muthian Sivathanu, Nishanth Chandran, Divya Gupta, Satya Lokam | 2024-02-28 | 下载 | Reviews and ratings by users form a central component in several widely used products today (e.g., product reviews, ratings of online content, etc. |
| Play like a Vertex: A Stackelberg Game Approach for Streaming Graph Partitioning | Zezhong Ding, Yongan Xiang, Shangyou Wang, Xike Xie, S. Kevin Zhou | 2024-02-28 | 下载 | In the realm of distributed systems tasked with managing and processing large-scale graph-structured data, optimizing graph partitioning stands as a pivotal challenge. |
| Impact of network topology on the performance of Decentralized Federated Learning | Luigi Palmieri, Chiara Boldrini, Lorenzo Valerio, Andrea Passarella, Marco Conti | 2024-02-28 | 下载 | Fully decentralized learning is gaining momentum for training AI models at the Internet's edge, addressing infrastructure challenges and privacy concerns. |
| The Logarithmic Random Bidding for the Parallel Roulette Wheel Selection with Precise Probabilities | Koji Nakano | 2024-02-28 | 下载 | The roulette wheel selection is a critical process in heuristic algorithms, enabling the probabilistic choice of items based on assigned fitness values. |
| Communication Efficient ConFederated Learning: An Event-Triggered SAGA Approach | Bin Wang, Jun Fang, Hongbin Li, Yonina C. Eldar | 2024-02-28 | 下载 | Federated learning (FL) is a machine learning paradigm that targets model training without gathering the local data dispersed over various data sources. |
| The Design and Implementation of a High-Performance Log-Structured RAID System for ZNS SSDs | Jinhong Li, Yiyang Geng, Qiuping Wang, Shujie Han, Patrick P. C. Lee | 2024-02-28 | 下载 | Zoned Namespace (ZNS) defines a new abstraction for host software to flexibly manage storage in flash-based SSDs as append-only zones. It also provides a Zone Append primitive to further boost the wri... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| A Quick Framework for Evaluating Worst Robustness of Complex Networks | Wenjun Jiang, Peiyan Li, Tianlong Fan, Ting Li, Chuan-fu Zhang, Tao Zhang, Zong-fu Luo | 2024-02-28 | 下载 | Robustness is pivotal for comprehending, designing, optimizing, and rehabilitating networks, with simulation attacks being the prevailing evaluation method. |
| HyperFedNet: Communication-Efficient Personalized Federated Learning Via Hypernetwork | Xingyun Chen, Yan Huang, Zhenzhen Xie, Junjie Pang | 2024-02-28 | 下载 | In response to the challenges posed by non-independent and identically distributed (non-IID) data and the escalating threat of privacy attacks in Federated Learning (FL), we introduce HyperFedNet (HFN... |
| The Fusion of Deep Reinforcement Learning and Edge Computing for Real-time Monitoring and Control Optimization in IoT Environments | Jingyu Xu, Weixiang Wan, Linying Pan, Wenjian Sun, Yuxiang Liu | 2024-02-28 | 下载 | In response to the demand for real-time performance and control quality in industrial Internet of Things (IoT) environments, this paper proposes an optimization control system based on deep reinforcem... |
| Utilization of Reconfigurable Intelligent Surfaces with Context Information: Use Cases | Łukasz Kułacz | 2024-02-28 | 下载 | In terms of complex radio environments especially in dense urban areas, a very interesting topic is considered - the utilization of reconfigurable intelligent surfaces. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Priority Sampling of Large Language Models for Compilers | Dejan Grubisic, Chris Cummins, Volker Seeker, Hugh Leather | 2024-02-28 | 下载 | Large language models show great potential in generating and optimizing code. Widely used sampling methods such as Nucleus Sampling increase the diversity of generation but often produce repeated samp... |