Skip to content

2024-05-06

cs.AR - Architecture

标题作者发布日期PDF摘要
DeltaKWS: A 65nm 36nJ/Decision Bio-inspired Temporal-Sparsity-Aware Digital Keyword Spotting IC with 0.6V Near-Threshold SRAMQinyu Chen, Kwantae Kim, Chang Gao, Sheng Zhou, Taekwang Jang, Tobi Delbruck, Shih-Chii Liu2024-05-06下载This paper introduces DeltaKWS, to the best of our knowledge, the first ΔRNN-enabled fine-grained temporal sparsity-aware KWS IC for voice-controlled devices.
Basilisk: Achieving Competitive Performance with Open EDA Tools on an Open-Source Linux-Capable RISC-V SoCPhillippe Sauter, Thomas Benz, Paul Scheffler, Zerun Jiang, Beat Muheim, Frank K. Gürkaynak, Luca Benini2024-05-06下载We introduce Basilisk, an optimized application-specific integrated circuit (ASIC) implementation and design flow building on the end-to-end open-source Iguana system-on-chip (SoC).
Pinching Tactile Display: A Cloth that Changes Tactile Sensation by Electrostatic AdsorptionTakekazu Kitagishi, Hirotaka Hiraki, Hiromi Nakamura, Yoshio Ishiguro, Jun Rekimoto2024-05-06下载Haptic displays play an important role in enhancing the sense of presence in VR and telepresence. Displaying the tactile properties of fabrics has potential in the fashion industry, but there are diff...
SparrowSNN: A Hardware/software Co-design for Energy Efficient ECG ClassificationZhanglu Yan, Zhenyu Bai, Tulika Mitra, Weng-Fai Wong2024-05-06下载Heart disease is one of the leading causes of death worldwide. Given its high risk and often asymptomatic nature, real-time continuous monitoring is essential.
Towards Efficient Design Verification -- Constrained Random Verification using PyUVMDeepak Narayan Gadde, Suruchi Kumari, Aman Kumar2024-05-06下载Python, as a multi-paradigm language known for its ease of integration with other languages, has gained significant attention among verification engineers recently.
Effective Design Verification -- Constrained Random with Python and CocotbDeepak Narayan Gadde, Suruchi Kumari, Aman Kumar2024-05-06下载Being the most widely used language across the world due to its simplicity and with 35 keywords (v3.7), Python attracts both hardware and software engineers.
PCG: Mitigating Conflict-based Cache Side-channel Attacks with PrefetchingFang Jiang, Fei Tong, Hongyu Wang, Xiaoyu Cheng, Zhe Zhou, Ming Ling, Yuxing Mao2024-05-06下载To defend against conflict-based cache side-channel attacks, cache partitioning or remapping techniques were proposed to prevent set conflicts between different security domains or obfuscate the locat...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Deterministic Expander Routing: Faster and More VersatileYi-Jun Chang, Shang-En Huang, Hsin-Hao Su2024-05-06下载We consider the expander routing problem formulated by Ghaffari, Kuhn, and Su (PODC 2017), where the goal is to route all the tokens to their destinations given that each vertex is the source and the ...
On the Convergence of Malleability and the HPC PowerStack: Exploiting Dynamism in Over-Provisioned and Power-Constrained HPC SystemsEishi Arima, Isaías A. Comprés, Martin Schulz2024-05-06下载Recent High-Performance Computing (HPC) systems are facing important challenges, such as massive power consumption, while at the same time significantly under-utilized system resources.
Optimizing Hardware Resource Partitioning and Job Allocations on Modern GPUs under Power CapsEishi Arima, Minjoon Kang, Issa Saba, Josef Weidendorfer, Carsten Trinitis, Martin Schulz2024-05-06下载CPU-GPU heterogeneous systems are now commonly used in HPC (High-Performance Computing). However, improving the utilization and energy-efficiency of such systems is still one of the most critical issu...
Orchestrated Co-scheduling, Resource Partitioning, and Power Capping on CPU-GPU Heterogeneous Systems via Machine LearningIssa Saba, Eishi Arima, Dai Liu, Martin Schulz2024-05-06下载CPU-GPU heterogeneous architectures are now commonly used in a wide variety of computing systems from mobile devices to supercomputers. Maximizing the throughput for multi-programmed workloads on such...
Content-Oblivious Leader Election on RingsFabian Frei, Ran Gelles, Ahmed Ghazy, Alexandre Nolin2024-05-06下载In content-oblivious computation, n nodes wish to compute a given task over an asynchronous network that suffers from an extremely harsh type of noise, which corrupts the content of all messages acros...
Trackable Island-model Genetic Algorithms at Wafer ScaleMatthew Andres Moreno, Connor Yang, Emily Dolson, Luis Zaman2024-05-06下载Emerging ML/AI hardware accelerators, like the 850,000 processor Cerebras Wafer-Scale Engine (WSE), hold great promise to scale up the capabilities of evolutionary computation.
Understanding Read-Write Wait-Free Coverings in the Fully-Anonymous Shared-Memory ModelGiuliano Losa, Eli Gafni2024-05-06下载In the fully-anonymous (shared-memory) model, inspired by a biological setting, processors have no identifiers and memory locations are anonymous.
Majority consensus thresholds in competitive Lotka--Volterra populationsMatthias Függer, Thomas Nowak, Joel Rybicki2024-05-06下载One of the key challenges in synthetic biology is devising robust signaling primitives for engineered microbial consortia. In such systems, a fundamental signal amplification problem is the majority c...
Floating Point Compression of Hierarchical Matrix Formats and its Impact on Matrix-Vector MultiplicationRonald Kriemann2024-05-06下载Matrix-vector multiplication forms the basis of many iterative solution algorithms and as such is an important algorithm also for hierarchical matrices which are used to represent dense data in an opt...
EdgeMiner: Distributed Process Mining at the Data SourcesJulia Andersen, Patrick Rathje, Christian Imenkamp, Agnes Koschmider, Olaf Landsiedel2024-05-06下载Process mining is moving beyond mining traditional event logs and nowadays includes, for example, data sourced from sensors in the Internet of Things (IoT).
Embedded Distributed Inference of Deep Neural Networks: A Systematic ReviewFederico Nicolás Peccia, Oliver Bringmann2024-05-06下载Embedded distributed inference of Neural Networks has emerged as a promising approach for deploying machine-learning models on resource-constrained devices in an efficient and scalable manner.
DarkFed: A Data-Free Backdoor Attack in Federated LearningMinghui Li, Wei Wan, Yuxuan Ning, Shengshan Hu, Lulu Xue, Leo Yu Zhang, Yichen Wang2024-05-06下载Federated learning (FL) has been demonstrated to be susceptible to backdoor attacks. However, existing academic studies on FL backdoor attacks rely on a high proportion of real clients with main task-...
Characterizing the Dilemma of Performance and Index Size in Billion-Scale Vector Search and Breaking It with Second-Tier MemoryRongxin Cheng, Yifan Peng, Xingda Wei, Hongrui Xie, Rong Chen, Sijie Shen, Haibo Chen2024-05-06下载Vector searches on large-scale datasets are critical to modern online services like web search and RAG, which necessity storing the datasets and their index on the secondary storage like SSD.
An Auto-tuning Method for Run-time Data Transformation for Sparse Matrix-Vector MultiplicationTakahiro Katagiri, Masahiko Sato2024-05-06下载In this paper, we research the run-time sparse matrix data transformation from Compressed Row Storage (CRS) to Coordinate (COO) storage and an ELL (ELLPACK/ITPACK) format with OpenMP parallelization f...
OMP-Engineer: Bridging Syntax Analysis and In-Context Learning for Efficient Automated OpenMP ParallelizationWeidong Wang, Haoran Zhu2024-05-06下载In advancing parallel programming, particularly with OpenMP, the shift towards NLP-based methods marks a significant innovation beyond traditional S2S tools like Autopar and Cetus.
Impact of EIP-4844 on Ethereum: Consensus Security, Ethereum Usage, Rollup Transaction Dynamics, and Blob Gas Fee MarketsSeongwan Park, Bosul Mun, Seungyun Lee, Woojin Jeong, Jaewook Lee, Hyeonsang Eom, Huisu Jang2024-05-06下载On March 13, 2024, Ethereum implemented EIP-4844, designed to enhance its role as a data availability layer. While this upgrade reduces data posting costs for rollups, it also raises concerns about it...
Collaborative Satellite Computing through Adaptive DNN Task Splitting and OffloadingShifeng Peng, Xuefeng Hou, Zhishu Shen, Qiushi Zheng, Jiong Jin, Atsushi Tagami, Jingling Yuan2024-05-06下载Satellite computing has emerged as a promising technology for next-generation wireless networks. This innovative technology provides data processing capabilities, which facilitates the widespread impl...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Resource Optimization in UAV-assisted IoT Networks: The Role of Generative AISana Sharif, Sherali Zeadally, Waleed Ejaz2024-05-06下载We investigate how generative Artificial Intelligence (AI) can be used to optimize resources in Unmanned Aerial Vehicle (UAV)-assisted Internet of Things (IoT) networks.
A Novel Cross-band CSI Prediction Scheme for Multi-band Fingerprint based LocalizationYuan Ruihao, Huang Kaixuan, Zhang Shunqing2024-05-06下载Because of the advantages of computation complexity compared with traditional localization algorithms, fingerprint based localization is getting increasing demand.
State-Aware Timeliness in Energy Harvesting IoT Systems Monitoring a Markovian SourceErfan Delfani, George J. Stamatakis, Nikolaos Pappas2024-05-06下载In this study, we investigate the optimal transmission policies within an energy harvesting status update system, where the demand for status updates depends on the state of the source.
A Comprehensive Tutorial and Survey of O-RAN: Exploring Slicing-aware Architecture, Deployment Options, Use Cases, and ChallengesKhurshid Alam, Mohammad Asif Habibi, Matthias Tammen, Dennis Krummacker, Walid Saad, Marco Di Renzo, Tommaso Melodia, Xavier Costa-Pérez, Mérouane Debbah, Ashutosh Dutta, Hans D. Schotten2024-05-06下载Open-radio access network (O-RAN) seeks to establish the principles of openness, programmability, automation, intelligence, and hardware-software disaggregation with interoperable and standard-complia...
ReinWiFi: Application-Layer QoS Optimization of WiFi Networks with Reinforcement LearningQianren Li, Bojie Lv, Yuncong Hong, Rui Wang2024-05-06下载The enhanced distributed channel access (EDCA) mechanism is used in current wireless fidelity (WiFi) networks to support priority requirements of heterogeneous applications.
Snake Learning: A Communication- and Computation-Efficient Distributed Learning Framework for 6GXiaoxue Yu, Xingfu Yi, Rongpeng Li, Fei Wang, Chenghui Peng, Zhifeng Zhao, Honggang Zhang2024-05-06下载In the evolution towards 6G, integrating Artificial Intelligence (AI) with advanced network infrastructure emerges as a pivotal strategy for enhancing network intelligence and resource utilization.
An Overview of Intelligent Meta-surfaces for 6G and Beyond: Opportunities, Trends, and ChallengesMayur Katwe, Aryan Kaushik, Lina Mohjazi, Mohammad Abualhayja'a, Davide Dardari, Keshav Singh, Muhammad Ali Imran, M. Majid Butt, Octavia A. Dobre2024-05-06下载With the impending arrival of the sixth generation (6G) of wireless communication technology, the telecommunications landscape is poised for another revolutionary transformation.
Coordinating Cooperative Perception in Urban Air Mobility for Enhanced Environmental AwarenessTimo Häckel, Luca von Roenn, Nemo Juchmann, Alexander Fay, Rinie Akkermans, Tim Tiedemann, Thomas C. Schmidt2024-05-06下载The trend for Urban Air Mobility (UAM) is growing with prospective air taxis, parcel deliverers, and medical and industrial services. Safe and efficient UAM operation relies on timely communication an...
Automatic Retrieval-augmented Generation of 6G Network Specifications for Use CasesYun Tang, Weisi Guo2024-05-06下载6G Open Radio Access Networks (O-RAN) promises to open data interfaces to enable plug-and-play service Apps, many of which are consumer and business-facing.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
sqlelf: a SQL-centric Approach to ELF AnalysisFarid Zakaria, Zheyuan Chen, Andrew Quinn, Thomas R. W. Scogland2024-05-06下载The exploration and understanding of Executable and Linkable Format (ELF) objects underpin various critical activities in computer systems, from debugging to reverse engineering.

cs.PF - Performance

标题作者发布日期PDF摘要
Accurate and Fast Approximate Graph Pattern Mining at ScaleAnna Arpaci-Dusseau, Zixiang Zhou, Xuhao Chen2024-05-06下载Approximate graph pattern mining (A-GPM) is an important data analysis tool for many graph-based applications. There exist sampling-based A-GPM systems to provide automation and generalization over a ...
An Auto-tuning Method for Run-time Data Transformation for Sparse Matrix-Vector MultiplicationTakahiro Katagiri, Masahiko Sato2024-05-06下载In this paper, we research the run-time sparse matrix data transformation from Compressed Row Storage (CRS) to Coordinate (COO) storage and an ELL (ELLPACK/ITPACK) format with OpenMP parallelization f...

基于 VitePress 构建