Appearance
2025-01-22
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Fast-Locking and High-Resolution Mixed-Mode DLL with Binary Search and Clock Failure Detection for Wide Frequency Ranges in 3-nm FinFET CMOS | Nicolás Wainstein, Eran Avitay, Eugene Avner | 2025-01-22 | 下载 | This paper presents a mixed-mode delay-locked loop (MM-DLL) with binary search (BS) locking, designed to cover a broad frequency range from 533 MHz to 4.26 GHz. |
| Learning in Log-Domain: Subthreshold Analog AI Accelerator Based on Stochastic Gradient Descent | Momen K Tageldeen, Yacine Belgaid, Vivek Mohan, Zhou Wang, Emmanuel M Drakakis | 2025-01-22 | 下载 | The rapid proliferation of AI models, coupled with growing demand for edge deployment, necessitates the development of AI hardware that is both high-performance and energy-efficient. |
| Analyzing and Exploiting Branch Mispredictions in Microcode | Nicholas Mosier, Hamed Nemati, John C. Mitchell, Caroline Trippel | 2025-01-22 | 下载 | We present uSpectre, a new class of transient execution attacks that exploit microcode branch mispredictions to transiently leak sensitive data. |
| Efficient Implementation of LinearUCB through Algorithmic Improvements and Vector Computing Acceleration for Embedded Learning Systems | Marco Angioli, Marcello Barbirotta, Abdallah Cheikh, Antonio Mastrandrea, Francesco Menichelli, Mauro Olivieri | 2025-01-22 | 下载 | As the Internet of Things expands, embedding Artificial Intelligence algorithms in resource-constrained devices has become increasingly important to enable real-time, autonomous decision-making withou... |
| Late Breaking Result: FPGA-Based Emulation and Fault Injection for CNN Inference Accelerators | Filip Masar, Vojtech Mrazek, Lukas Sekanina | 2025-01-22 | 下载 | A new field programmable gate array (FPGA)-based emulation platform is proposed to accelerate fault tolerance analysis of inference accelerators of convolutional neural networks (CNN). |
| VRank: Enhancing Verilog Code Generation from Large Language Models via Self-Consistency | Zhuorui Zhao, Ruidi Qiu, Ing-Chao Lin, Grace Li Zhang, Bing Li, Ulf Schlichtmann | 2025-01-22 | 下载 | Large Language Models (LLMs) have demonstrated promising capabilities in generating Verilog code from module specifications. To improve the quality of such generated Verilog codes, previous methods re... |
| HEPPO-GAE: Hardware-Efficient Proximal Policy Optimization with Generalized Advantage Estimation | Hazem Taha, Ameer M. S. Abdelhadi | 2025-01-22 | 下载 | This paper introduces HEPPO-GAE, an FPGA-based accelerator designed to optimize the Generalized Advantage Estimation (GAE) stage in Proximal Policy Optimization (PPO). |
| Current Opinions on Memristor-Accelerated Machine Learning Hardware | Mingrui Jiang, Yichun Xu, Zefan Li, Can Li | 2025-01-22 | 下载 | The unprecedented advancement of artificial intelligence has placed immense demands on computing hardware, but traditional silicon-based semiconductor technologies are approaching their physical and e... |
| SoMa: Identifying, Exploring, and Understanding the DRAM Communication Scheduling Space for DNN Accelerators | Jingwei Cai, Xuan Wang, Mingyu Gao, Sen Peng, Zijian Zhu, Yuchen Wei, Zuotong Wu, Kaisheng Ma | 2025-01-22 | 下载 | Modern Deep Neural Network (DNN) accelerators are equipped with increasingly larger on-chip buffers to provide more opportunities to alleviate the increasingly severe DRAM bandwidth pressure. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| A Selective Homomorphic Encryption Approach for Faster Privacy-Preserving Federated Learning | Abdulkadir Korkmaz, Praveen Rao | 2025-01-22 | 下载 | Federated learning (FL) has come forward as a critical approach for privacy-preserving machine learning in healthcare, allowing collaborative model training across decentralized medical datasets witho... |
| μOpTime: Statically Reducing the Execution Time of Microbenchmark Suites Using Stability Metrics | Nils Japke, Martin Grambow, Christoph Laaber, David Bermbach | 2025-01-22 | 下载 | Performance regressions have a tremendous impact on the quality of software. One way to catch regressions before they reach production is executing performance tests before deployment, e.g. |
| Practical quantum federated learning and its experimental demonstration | Zhi-Ping Liu, Xiao-Yu Cao, Hao-Wen Liu, Xiao-Ran Sun, Yu Bao, Yu-Shuo Lu, Hua-Lei Yin, Zeng-Bing Chen | 2025-01-22 | 下载 | Federated learning is essential for decentralized, privacy-preserving model training in the data-driven era. Quantum-enhanced federated learning leverages quantum resources to address privacy and scal... |
| Workflow as a Service Broker in Cloud Environment: A Systematic Literature Review | Saeid Abrishami, Faridreza Momtaz Zandi, Alireza Nourbakhsh | 2025-01-22 | 下载 | Cloud computing has emerged as a promising platform for running scientific workflows across various domains. Scientists can take advantage of different cloud service models, such as serverful or serve... |
| Knowledge-Driven Federated Graph Learning on Model Heterogeneity | Zhengyu Wu, Guang Zeng, Huilin Lai, Daohan Su, Jishuo Jia, Yinlin Zhu, Xunkai Li, Rong-Hua Li, Guoren Wang, Chenghu Zhou | 2025-01-22 | 下载 | Federated graph learning (FGL) has emerged as a promising paradigm for collaborative graph representation learning, enabling multiple parties to jointly train models while preserving data privacy. |
| Fray: An Efficient General-Purpose Concurrency Testing Platform for the JVM (Extended Version) | Ao Li, Byeongjee Kang, Vasudev Vikram, Isabella Laybourn, Samvid Dharanikota, Shrey Tiwari, Rohan Padhye | 2025-01-22 | 下载 | Concurrency bugs are hard to discover and reproduce. Prior work has developed sophisticated algorithms to search for concurrency bugs, such as partial order sampling (POS); however, fundamental limita... |
| FedGrAINS: Personalized SubGraph Federated Learning with Adaptive Neighbor Sampling | Emir Ceyani, Han Xie, Baturalp Buyukates, Carl Yang, Salman Avestimehr | 2025-01-22 | 下载 | Graphs are crucial for modeling relational and biological data. As datasets grow larger in real-world scenarios, the risk of exposing sensitive information increases, making privacy-preserving trainin... |
| D-LoRa: a Distributed Parameter Adaptation Scheme for LoRa Network | Ruiqi Wang, Tongyu Song, Jing Ren, Xiong Wang, Shizhong Xu, Sheng Wang | 2025-01-22 | 下载 | The deployment of LoRa networks necessitates joint performance optimization, including packet delivery rate, energy efficiency, and throughput. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Joint Task Offloading and User Scheduling in 5G MEC under Jamming Attacks | Mohammadreza Amini, Burak Kantarci, Claude D'Amours, Melike Erol-Kantarci | 2025-01-22 | 下载 | In this paper, we propose a novel joint task offloading and user scheduling (JTO-US) framework for 5G mobile edge computing (MEC) systems under security threats from jamming attacks. |
| Which Sensor to Observe? Timely Tracking of a Joint Markov Source with Model Predictive Control | Ismail Cosandal, Sennur Ulukus, Nail Akar | 2025-01-22 | 下载 | In this paper, we investigate the problem of remote estimation of a discrete-time joint Markov process using multiple sensors. Each sensor observes a different component of the joint Markov process, a... |
| UAV-assisted Internet of Vehicles: A Framework Empowered by Reinforcement Learning and Blockchain | Ahmed Alagha, Maha Kadadha, Rabeb Mizouni, Shakti Singh, Jamal Bentahar, Hadi Otrok | 2025-01-22 | 下载 | This paper addresses the challenges of selecting relay nodes and coordinating among them in UAV-assisted Internet-of-Vehicles (IoV). The selection of UAV relay nodes in IoV employs mechanisms executed... |
| Information Degradation and Misinformation in Gossip Networks | Thomas Jacob Maranzatto, Arunabh Srivastava, Sennur Ulukus | 2025-01-22 | 下载 | We study networks of gossiping users where a source observing a process sends updates to an underlying graph. Nodes in the graph update their neighbors randomly and nodes always accept packets that ha... |
| GPUs, CPUs, and... NICs: Rethinking the Network's Role in Serving Complex AI Pipelines | Mike Wong, Ulysses Butler, Emma Farkash, Praveen Tammana, Anirudh Sivaraman, Ravi Netravali | 2025-01-22 | 下载 | The increasing prominence of AI necessitates the deployment of inference platforms for efficient and effective management of AI pipelines and compute resources. |
| GWEn -- An Open-Source Wireless Physical-Layer Evaluation Platform | Alexander Heinrich, Florentin Putz, Sören Krollmann, Bastian Loss, Waqar Ahmed, Matthias Hollick | 2025-01-22 | 下载 | Wireless physical layer assessment, such as measuring antenna radiation patterns, is complex and cost-intensive. Researchers often require a stationary setup with antennas surrounding the device under... |
| Scalability Analysis of 5G-TSN Applications in Indoor Factory Settings | Kouros Zanbouri, Md. Noor-A-Rahim, Dirk Pesch | 2025-01-22 | 下载 | While technologies such as Time-Sensitive Networking (TSN) improve deterministic behaviour, real-time functionality, and robustness of Ethernet, future industrial networks aim to be increasingly wirel... |
| A transformer-based deep q learning approach for dynamic load balancing in software-defined networks | Evans Tetteh Owusu, Kwame Agyemang-Prempeh Agyekum, Marinah Benneh, Pius Ayorna, Justice Owusu Agyemang, George Nii Martey Colley, James Dzisi Gazde | 2025-01-22 | 下载 | This study proposes a novel approach for dynamic load balancing in Software-Defined Networks (SDNs) using a Transformer-based Deep Q-Network (DQN). |
| Comparative Performance Evaluation of 5G-TSN Applications in Indoor Factory Environments | Kouros Zanbouri, Md. Noor-A-Rahim, Dirk Pesch | 2025-01-22 | 下载 | While Time-Sensitive Networking (TSN) enhances the determinism, real-time capabilities, and reliability of Ethernet, future industrial networks will not only use wired but increasingly wireless commun... |
| Cost Optimization for Serverless Edge Computing with Budget Constraints using Deep Reinforcement Learning | Chen Chen, Peiyuan Guan, Ziru Chen, Amir Taherkordi, Fen Hou, Lin X. Cai | 2025-01-22 | 下载 | Serverless computing adopts a pay-as-you-go billing model where applications are executed in stateless and shortlived containers triggered by events, resulting in a reduction of monetary costs and res... |
| Making Temporal Betweenness Computation Faster and Restless | Filippo Brunelli, Pierluigi Crescenzi, Laurent Viennot | 2025-01-22 | 下载 | Buß et al [KDD 2020] recently proved that the problem of computing the betweenness of all nodes of a temporal graph is computationally hard in the case of foremost and fastest paths, while it is solva... |
| A Multi-Stakeholder Perspective on Self-Managing Networks | Patrick Weber, Artur Sterz, Bernd Freisleben, Oliver Hinz | 2025-01-22 | 下载 | Modern telecommunication networks face an increasing complexity due to the rapidly growing number of networked devices and rising amounts of data. |
| PPO-Based Vehicle Control for Ramp Merging Scheme Assisted by Enhanced C-V2X | Qiong Wu, Maoxin Ji, Pingyi Fan, Kezhi Wang, Nan Cheng, Wen Chen, Khaled B. Letaief | 2025-01-22 | 下载 | On-ramp merging presents a critical challenge in autonomous driving, as vehicles from merging lanes need to dynamically adjust their positions and speeds while monitoring traffic on the main road to p... |
| D-LoRa: a Distributed Parameter Adaptation Scheme for LoRa Network | Ruiqi Wang, Tongyu Song, Jing Ren, Xiong Wang, Shizhong Xu, Sheng Wang | 2025-01-22 | 下载 | The deployment of LoRa networks necessitates joint performance optimization, including packet delivery rate, energy efficiency, and throughput. |
| Mechanism Design for Blockchain Order Books against Selfish Miners | Yunshu Liu, Lingjie Duan | 2025-01-22 | 下载 | In blockchain-based order book systems, buyers and sellers trade assets, while it is miners to match them and include their transactions in the blockchain. |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| GPUs, CPUs, and... NICs: Rethinking the Network's Role in Serving Complex AI Pipelines | Mike Wong, Ulysses Butler, Emma Farkash, Praveen Tammana, Anirudh Sivaraman, Ravi Netravali | 2025-01-22 | 下载 | The increasing prominence of AI necessitates the deployment of inference platforms for efficient and effective management of AI pipelines and compute resources. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Deciphering boundary layer dynamics in high-Rayleigh-number convection using 3360 GPUs and a high-scaling in-situ workflow | Mathis Bode, Damian Alvarez, Paul Fischer, Christos E. Frouzakis, Jens Henrik Göbbert, Joseph A. Insley, Yu-Hsiang Lan, Victor A. Mateevitsi, Misun Min, Michael E. Papka, Silvio Rizzi, Roshan J. Samuel, Jörg Schumacher | 2025-01-22 | 下载 | Turbulent heat and momentum transfer processes due to thermal convection cover many scales and are of great importance for several natural and technical flows. |
| Need for Speed: A Comprehensive Benchmark of JPEG Decoders in Python | Vladimir Iglovikov | 2025-01-22 | 下载 | Image loading represents a critical bottleneck in modern machine learning pipelines, particularly in computer vision tasks where JPEG remains the dominant format. |