2025-01-22

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
Fast-Locking and High-Resolution Mixed-Mode DLL with Binary Search and Clock Failure Detection for Wide Frequency Ranges in 3-nm FinFET CMOS	Nicolás Wainstein, Eran Avitay, Eugene Avner	2025-01-22	Download	This paper presents a mixed-mode delay-locked loop (MM-DLL) with binary search (BS) locking, designed to cover a broad frequency range from 533 MHz to 4.26 GHz.
Learning in Log-Domain: Subthreshold Analog AI Accelerator Based on Stochastic Gradient Descent	Momen K Tageldeen, Yacine Belgaid, Vivek Mohan, Zhou Wang, Emmanuel M Drakakis	2025-01-22	Download	The rapid proliferation of AI models, coupled with growing demand for edge deployment, necessitates the development of AI hardware that is both high-performance and energy-efficient.
Analyzing and Exploiting Branch Mispredictions in Microcode	Nicholas Mosier, Hamed Nemati, John C. Mitchell, Caroline Trippel	2025-01-22	Download	We present uSpectre, a new class of transient execution attacks that exploit microcode branch mispredictions to transiently leak sensitive data.
Efficient Implementation of LinearUCB through Algorithmic Improvements and Vector Computing Acceleration for Embedded Learning Systems	Marco Angioli, Marcello Barbirotta, Abdallah Cheikh, Antonio Mastrandrea, Francesco Menichelli, Mauro Olivieri	2025-01-22	Download	As the Internet of Things expands, embedding Artificial Intelligence algorithms in resource-constrained devices has become increasingly important to enable real-time, autonomous decision-making withou...
Late Breaking Result: FPGA-Based Emulation and Fault Injection for CNN Inference Accelerators	Filip Masar, Vojtech Mrazek, Lukas Sekanina	2025-01-22	Download	A new field programmable gate array (FPGA)-based emulation platform is proposed to accelerate fault tolerance analysis of inference accelerators of convolutional neural networks (CNN).
VRank: Enhancing Verilog Code Generation from Large Language Models via Self-Consistency	Zhuorui Zhao, Ruidi Qiu, Ing-Chao Lin, Grace Li Zhang, Bing Li, Ulf Schlichtmann	2025-01-22	Download	Large Language Models (LLMs) have demonstrated promising capabilities in generating Verilog code from module specifications. To improve the quality of such generated Verilog codes, previous methods re...
HEPPO-GAE: Hardware-Efficient Proximal Policy Optimization with Generalized Advantage Estimation	Hazem Taha, Ameer M. S. Abdelhadi	2025-01-22	Download	This paper introduces HEPPO-GAE, an FPGA-based accelerator designed to optimize the Generalized Advantage Estimation (GAE) stage in Proximal Policy Optimization (PPO).
Current Opinions on Memristor-Accelerated Machine Learning Hardware	Mingrui Jiang, Yichun Xu, Zefan Li, Can Li	2025-01-22	Download	The unprecedented advancement of artificial intelligence has placed immense demands on computing hardware, but traditional silicon-based semiconductor technologies are approaching their physical and e...
SoMa: Identifying, Exploring, and Understanding the DRAM Communication Scheduling Space for DNN Accelerators	Jingwei Cai, Xuan Wang, Mingyu Gao, Sen Peng, Zijian Zhu, Yuchen Wei, Zuotong Wu, Kaisheng Ma	2025-01-22	Download	Modern Deep Neural Network (DNN) accelerators are equipped with increasingly larger on-chip buffers to provide more opportunities to alleviate the increasingly severe DRAM bandwidth pressure.

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
A Selective Homomorphic Encryption Approach for Faster Privacy-Preserving Federated Learning	Abdulkadir Korkmaz, Praveen Rao	2025-01-22	Download	Federated learning (FL) has come forward as a critical approach for privacy-preserving machine learning in healthcare, allowing collaborative model training across decentralized medical datasets witho...
μOpTime: Statically Reducing the Execution Time of Microbenchmark Suites Using Stability Metrics	Nils Japke, Martin Grambow, Christoph Laaber, David Bermbach	2025-01-22	Download	Performance regressions have a tremendous impact on the quality of software. One way to catch regressions before they reach production is executing performance tests before deployment, e.g.
Practical quantum federated learning and its experimental demonstration	Zhi-Ping Liu, Xiao-Yu Cao, Hao-Wen Liu, Xiao-Ran Sun, Yu Bao, Yu-Shuo Lu, Hua-Lei Yin, Zeng-Bing Chen	2025-01-22	Download	Federated learning is essential for decentralized, privacy-preserving model training in the data-driven era. Quantum-enhanced federated learning leverages quantum resources to address privacy and scal...
Workflow as a Service Broker in Cloud Environment: A Systematic Literature Review	Saeid Abrishami, Faridreza Momtaz Zandi, Alireza Nourbakhsh	2025-01-22	Download	Cloud computing has emerged as a promising platform for running scientific workflows across various domains. Scientists can take advantage of different cloud service models, such as serverful or serve...
Knowledge-Driven Federated Graph Learning on Model Heterogeneity	Zhengyu Wu, Guang Zeng, Huilin Lai, Daohan Su, Jishuo Jia, Yinlin Zhu, Xunkai Li, Rong-Hua Li, Guoren Wang, Chenghu Zhou	2025-01-22	Download	Federated graph learning (FGL) has emerged as a promising paradigm for collaborative graph representation learning, enabling multiple parties to jointly train models while preserving data privacy.
Fray: An Efficient General-Purpose Concurrency Testing Platform for the JVM (Extended Version)	Ao Li, Byeongjee Kang, Vasudev Vikram, Isabella Laybourn, Samvid Dharanikota, Shrey Tiwari, Rohan Padhye	2025-01-22	Download	Concurrency bugs are hard to discover and reproduce. Prior work has developed sophisticated algorithms to search for concurrency bugs, such as partial order sampling (POS); however, fundamental limita...
FedGrAINS: Personalized SubGraph Federated Learning with Adaptive Neighbor Sampling	Emir Ceyani, Han Xie, Baturalp Buyukates, Carl Yang, Salman Avestimehr	2025-01-22	Download	Graphs are crucial for modeling relational and biological data. As datasets grow larger in real-world scenarios, the risk of exposing sensitive information increases, making privacy-preserving trainin...
D-LoRa: a Distributed Parameter Adaptation Scheme for LoRa Network	Ruiqi Wang, Tongyu Song, Jing Ren, Xiong Wang, Shizhong Xu, Sheng Wang	2025-01-22	Download	The deployment of LoRa networks necessitates joint performance optimization, including packet delivery rate, energy efficiency, and throughput.

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Joint Task Offloading and User Scheduling in 5G MEC under Jamming Attacks	Mohammadreza Amini, Burak Kantarci, Claude D'Amours, Melike Erol-Kantarci	2025-01-22	Download	In this paper, we propose a novel joint task offloading and user scheduling (JTO-US) framework for 5G mobile edge computing (MEC) systems under security threats from jamming attacks.
Which Sensor to Observe? Timely Tracking of a Joint Markov Source with Model Predictive Control	Ismail Cosandal, Sennur Ulukus, Nail Akar	2025-01-22	Download	In this paper, we investigate the problem of remote estimation of a discrete-time joint Markov process using multiple sensors. Each sensor observes a different component of the joint Markov process, a...
UAV-assisted Internet of Vehicles: A Framework Empowered by Reinforcement Learning and Blockchain	Ahmed Alagha, Maha Kadadha, Rabeb Mizouni, Shakti Singh, Jamal Bentahar, Hadi Otrok	2025-01-22	Download	This paper addresses the challenges of selecting relay nodes and coordinating among them in UAV-assisted Internet-of-Vehicles (IoV). The selection of UAV relay nodes in IoV employs mechanisms executed...
Information Degradation and Misinformation in Gossip Networks	Thomas Jacob Maranzatto, Arunabh Srivastava, Sennur Ulukus	2025-01-22	Download	We study networks of gossiping users where a source observing a process sends updates to an underlying graph. Nodes in the graph update their neighbors randomly and nodes always accept packets that ha...
GPUs, CPUs, and... NICs: Rethinking the Network's Role in Serving Complex AI Pipelines	Mike Wong, Ulysses Butler, Emma Farkash, Praveen Tammana, Anirudh Sivaraman, Ravi Netravali	2025-01-22	Download	The increasing prominence of AI necessitates the deployment of inference platforms for efficient and effective management of AI pipelines and compute resources.
GWEn -- An Open-Source Wireless Physical-Layer Evaluation Platform	Alexander Heinrich, Florentin Putz, Sören Krollmann, Bastian Loss, Waqar Ahmed, Matthias Hollick	2025-01-22	Download	Wireless physical layer assessment, such as measuring antenna radiation patterns, is complex and cost-intensive. Researchers often require a stationary setup with antennas surrounding the device under...
Scalability Analysis of 5G-TSN Applications in Indoor Factory Settings	Kouros Zanbouri, Md. Noor-A-Rahim, Dirk Pesch	2025-01-22	Download	While technologies such as Time-Sensitive Networking (TSN) improve deterministic behaviour, real-time functionality, and robustness of Ethernet, future industrial networks aim to be increasingly wirel...
A transformer-based deep q learning approach for dynamic load balancing in software-defined networks	Evans Tetteh Owusu, Kwame Agyemang-Prempeh Agyekum, Marinah Benneh, Pius Ayorna, Justice Owusu Agyemang, George Nii Martey Colley, James Dzisi Gazde	2025-01-22	Download	This study proposes a novel approach for dynamic load balancing in Software-Defined Networks (SDNs) using a Transformer-based Deep Q-Network (DQN).
Comparative Performance Evaluation of 5G-TSN Applications in Indoor Factory Environments	Kouros Zanbouri, Md. Noor-A-Rahim, Dirk Pesch	2025-01-22	Download	While Time-Sensitive Networking (TSN) enhances the determinism, real-time capabilities, and reliability of Ethernet, future industrial networks will not only use wired but increasingly wireless commun...
Cost Optimization for Serverless Edge Computing with Budget Constraints using Deep Reinforcement Learning	Chen Chen, Peiyuan Guan, Ziru Chen, Amir Taherkordi, Fen Hou, Lin X. Cai	2025-01-22	Download	Serverless computing adopts a pay-as-you-go billing model where applications are executed in stateless and short-lived containers triggered by events, resulting in a reduction of monetary costs and res...
Making Temporal Betweenness Computation Faster and Restless	Filippo Brunelli, Pierluigi Crescenzi, Laurent Viennot	2025-01-22	Download	Buß et al [KDD 2020] recently proved that the problem of computing the betweenness of all nodes of a temporal graph is computationally hard in the case of foremost and fastest paths, while it is solva...
A Multi-Stakeholder Perspective on Self-Managing Networks	Patrick Weber, Artur Sterz, Bernd Freisleben, Oliver Hinz	2025-01-22	Download	Modern telecommunication networks face an increasing complexity due to the rapidly growing number of networked devices and rising amounts of data.
PPO-Based Vehicle Control for Ramp Merging Scheme Assisted by Enhanced C-V2X	Qiong Wu, Maoxin Ji, Pingyi Fan, Kezhi Wang, Nan Cheng, Wen Chen, Khaled B. Letaief	2025-01-22	Download	On-ramp merging presents a critical challenge in autonomous driving, as vehicles from merging lanes need to dynamically adjust their positions and speeds while monitoring traffic on the main road to p...
D-LoRa: a Distributed Parameter Adaptation Scheme for LoRa Network	Ruiqi Wang, Tongyu Song, Jing Ren, Xiong Wang, Shizhong Xu, Sheng Wang	2025-01-22	Download	The deployment of LoRa networks necessitates joint performance optimization, including packet delivery rate, energy efficiency, and throughput.
Mechanism Design for Blockchain Order Books against Selfish Miners	Yunshu Liu, Lingjie Duan	2025-01-22	Download	In blockchain-based order book systems, buyers and sellers trade assets, while miners match them and include their transactions in the blockchain.

cs.OS - Operating Systems

Title	Authors	Published	PDF	Abstract
GPUs, CPUs, and... NICs: Rethinking the Network's Role in Serving Complex AI Pipelines	Mike Wong, Ulysses Butler, Emma Farkash, Praveen Tammana, Anirudh Sivaraman, Ravi Netravali	2025-01-22	Download	The increasing prominence of AI necessitates the deployment of inference platforms for efficient and effective management of AI pipelines and compute resources.

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
Deciphering boundary layer dynamics in high-Rayleigh-number convection using 3360 GPUs and a high-scaling in-situ workflow	Mathis Bode, Damian Alvarez, Paul Fischer, Christos E. Frouzakis, Jens Henrik Göbbert, Joseph A. Insley, Yu-Hsiang Lan, Victor A. Mateevitsi, Misun Min, Michael E. Papka, Silvio Rizzi, Roshan J. Samuel, Jörg Schumacher	2025-01-22	Download	Turbulent heat and momentum transfer processes due to thermal convection cover many scales and are of great importance for several natural and technical flows.
Need for Speed: A Comprehensive Benchmark of JPEG Decoders in Python	Vladimir Iglovikov	2025-01-22	Download	Image loading represents a critical bottleneck in modern machine learning pipelines, particularly in computer vision tasks where JPEG remains the dominant format.