Skip to content

2024-02-28

cs.AR - Architecture

标题作者发布日期PDF摘要
Accelerating Computer Architecture Simulation through Machine LearningWajid Ali, Ayaz Akram2024-02-28下载This paper presents our approach to accelerate computer architecture simulation by leveraging machine learning techniques. Traditional computer architecture simulations are time-consuming, making it c...
Functionally-Complete Boolean Logic in Real DRAM Chips: Experimental Characterization and AnalysisIsmail Emir Yuksel, Yahya Can Tugrul, Ataberk Olgun, F. Nisa Bostanci, A. Giray Yaglikci, Geraldo F. Oliveira, Haocong Luo, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu2024-02-28下载Processing-using-DRAM (PuD) is an emerging paradigm that leverages the analog operational properties of DRAM circuitry to enable massively parallel in-DRAM computation.
Spatial Variation-Aware Read Disturbance Defenses: Experimental Analysis of Real DRAM Chips and Implications on Future SolutionsAbdullah Giray Yağlıkçı, Yahya Can Tuğrul, Geraldo F. Oliveira, İsmail Emir Yüksel, Ataberk Olgun, Haocong Luo, Onur Mutlu2024-02-28下载Read disturbance in modern DRAM chips is a widespread phenomenon and is reliably used for breaking memory isolation, a fundamental building block for building robust systems.
Energy-Aware Heterogeneous Federated Learning via Approximate DNN AcceleratorsKilian Pfeiffer, Konstantinos Balaskas, Kostas Siozios, Jörg Henkel2024-02-28下载In Federated Learning (FL), devices that participate in the training usually have heterogeneous resources, i.e., energy availability. In current deployments of FL, devices that do not fulfill certain ...
PIMSYN: Synthesizing Processing-in-memory CNN AcceleratorsWanqian Li, Xiaotian Sun, Xinyu Wang, Lei Wang, Yinhe Han, Xiaoming Chen2024-02-28下载Processing-in-memory architectures have been regarded as a promising solution for CNN acceleration. Existing PIM accelerator designs rely heavily on the experience of experts and require significant m...
PIMSIM-NN: An ISA-based Simulation Framework for Processing-in-Memory AcceleratorsXinyu Wang, Xiaotian Sun, Yinhe Han, Xiaoming Chen2024-02-28下载Processing-in-memory (PIM) has shown extraordinary potential in accelerating neural networks. To evaluate the performance of PIM accelerators, we present an ISA-based simulation framework including a ...
A Hierarchical Dataflow-Driven Heterogeneous Architecture for Wireless Baseband ProcessingLimin Jiang, Yi Shi, Yintao Liu, Qingyu Deng, Siyi Xu, Yihao Shen, Fangfang Ye, Shan Cao, Zhiyuan Jiang2024-02-28下载Wireless baseband processing (WBP) is a key element of wireless communications, with a series of signal processing modules to improve data throughput and counter channel fading.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Functionally-Complete Boolean Logic in Real DRAM Chips: Experimental Characterization and AnalysisIsmail Emir Yuksel, Yahya Can Tugrul, Ataberk Olgun, F. Nisa Bostanci, A. Giray Yaglikci, Geraldo F. Oliveira, Haocong Luo, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu2024-02-28下载Processing-using-DRAM (PuD) is an emerging paradigm that leverages the analog operational properties of DRAM circuitry to enable massively parallel in-DRAM computation.
Libfork: portable continuation-stealing with stackless coroutinesConor John Williams, James Elliott2024-02-28下载Fully-strict fork-join parallelism is a powerful model for shared-memory programming due to its optimal time scaling and strong bounds on memory scaling.
MaRDIFlow: A CSE workflow framework for abstracting meta-data from FAIR computational experimentsPavan L. Veluvali, Jan Heiland, Peter Benner2024-02-28下载Numerical algorithms and computational tools are instrumental in navigating and addressing complex simulation and data processing tasks. The exponential growth of metadata and parameter-driven simulat...
TrustRate: A Decentralized Platform for Hijack-Resistant Anonymous ReviewsRohit Dwivedula, Sriram Sridhar, Sambhav Satija, Muthian Sivathanu, Nishanth Chandran, Divya Gupta, Satya Lokam2024-02-28下载Reviews and ratings by users form a central component in several widely used products today (e.g., product reviews, ratings of online content, etc.
Play like a Vertex: A Stackelberg Game Approach for Streaming Graph PartitioningZezhong Ding, Yongan Xiang, Shangyou Wang, Xike Xie, S. Kevin Zhou2024-02-28下载In the realm of distributed systems tasked with managing and processing large-scale graph-structured data, optimizing graph partitioning stands as a pivotal challenge.
Impact of network topology on the performance of Decentralized Federated LearningLuigi Palmieri, Chiara Boldrini, Lorenzo Valerio, Andrea Passarella, Marco Conti2024-02-28下载Fully decentralized learning is gaining momentum for training AI models at the Internet's edge, addressing infrastructure challenges and privacy concerns.
The Logarithmic Random Bidding for the Parallel Roulette Wheel Selection with Precise ProbabilitiesKoji Nakano2024-02-28下载The roulette wheel selection is a critical process in heuristic algorithms, enabling the probabilistic choice of items based on assigned fitness values.
Communication Efficient ConFederated Learning: An Event-Triggered SAGA ApproachBin Wang, Jun Fang, Hongbin Li, Yonina C. Eldar2024-02-28下载Federated learning (FL) is a machine learning paradigm that targets model training without gathering the local data dispersed over various data sources.
The Design and Implementation of a High-Performance Log-Structured RAID System for ZNS SSDsJinhong Li, Yiyang Geng, Qiuping Wang, Shujie Han, Patrick P. C. Lee2024-02-28下载Zoned Namespace (ZNS) defines a new abstraction for host software to flexibly manage storage in flash-based SSDs as append-only zones. It also provides a Zone Append primitive to further boost the wri...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
A Quick Framework for Evaluating Worst Robustness of Complex NetworksWenjun Jiang, Peiyan Li, Tianlong Fan, Ting Li, Chuan-fu Zhang, Tao Zhang, Zong-fu Luo2024-02-28下载Robustness is pivotal for comprehending, designing, optimizing, and rehabilitating networks, with simulation attacks being the prevailing evaluation method.
HyperFedNet: Communication-Efficient Personalized Federated Learning Via HypernetworkXingyun Chen, Yan Huang, Zhenzhen Xie, Junjie Pang2024-02-28下载In response to the challenges posed by non-independent and identically distributed (non-IID) data and the escalating threat of privacy attacks in Federated Learning (FL), we introduce HyperFedNet (HFN...
The Fusion of Deep Reinforcement Learning and Edge Computing for Real-time Monitoring and Control Optimization in IoT EnvironmentsJingyu Xu, Weixiang Wan, Linying Pan, Wenjian Sun, Yuxiang Liu2024-02-28下载In response to the demand for real-time performance and control quality in industrial Internet of Things (IoT) environments, this paper proposes an optimization control system based on deep reinforcem...
Utilization of Reconfigurable Intelligent Surfaces with Context Information: Use CasesŁukasz Kułacz2024-02-28下载In terms of complex radio environments especially in dense urban areas, a very interesting topic is considered - the utilization of reconfigurable intelligent surfaces.

cs.PF - Performance

标题作者发布日期PDF摘要
Priority Sampling of Large Language Models for CompilersDejan Grubisic, Chris Cummins, Volker Seeker, Hugh Leather2024-02-28下载Large language models show great potential in generating and optimizing code. Widely used sampling methods such as Nucleus Sampling increase the diversity of generation but often produce repeated samp...

基于 VitePress 构建