2025-01-22

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Fast-Locking and High-Resolution Mixed-Mode DLL with Binary Search and Clock Failure Detection for Wide Frequency Ranges in 3-nm FinFET CMOS | Nicolás Wainstein, Eran Avitay, Eugene Avner | 2025-01-22 | Download | This paper presents a mixed-mode delay-locked loop (MM-DLL) with binary search (BS) locking, designed to cover a broad frequency range from 533 MHz to 4.26 GHz. |
| Learning in Log-Domain: Subthreshold Analog AI Accelerator Based on Stochastic Gradient Descent | Momen K Tageldeen, Yacine Belgaid, Vivek Mohan, Zhou Wang, Emmanuel M Drakakis | 2025-01-22 | Download | The rapid proliferation of AI models, coupled with growing demand for edge deployment, necessitates the development of AI hardware that is both high-performance and energy-efficient. |
| Analyzing and Exploiting Branch Mispredictions in Microcode | Nicholas Mosier, Hamed Nemati, John C. Mitchell, Caroline Trippel | 2025-01-22 | Download | We present uSpectre, a new class of transient execution attacks that exploit microcode branch mispredictions to transiently leak sensitive data. |
| Efficient Implementation of LinearUCB through Algorithmic Improvements and Vector Computing Acceleration for Embedded Learning Systems | Marco Angioli, Marcello Barbirotta, Abdallah Cheikh, Antonio Mastrandrea, Francesco Menichelli, Mauro Olivieri | 2025-01-22 | Download | As the Internet of Things expands, embedding Artificial Intelligence algorithms in resource-constrained devices has become increasingly important to enable real-time, autonomous decision-making withou... |
| Late Breaking Result: FPGA-Based Emulation and Fault Injection for CNN Inference Accelerators | Filip Masar, Vojtech Mrazek, Lukas Sekanina | 2025-01-22 | Download | A new field programmable gate array (FPGA)-based emulation platform is proposed to accelerate fault tolerance analysis of inference accelerators of convolutional neural networks (CNN). |
| VRank: Enhancing Verilog Code Generation from Large Language Models via Self-Consistency | Zhuorui Zhao, Ruidi Qiu, Ing-Chao Lin, Grace Li Zhang, Bing Li, Ulf Schlichtmann | 2025-01-22 | Download | Large Language Models (LLMs) have demonstrated promising capabilities in generating Verilog code from module specifications. To improve the quality of such generated Verilog codes, previous methods re... |
| HEPPO-GAE: Hardware-Efficient Proximal Policy Optimization with Generalized Advantage Estimation | Hazem Taha, Ameer M. S. Abdelhadi | 2025-01-22 | Download | This paper introduces HEPPO-GAE, an FPGA-based accelerator designed to optimize the Generalized Advantage Estimation (GAE) stage in Proximal Policy Optimization (PPO). |
| Current Opinions on Memristor-Accelerated Machine Learning Hardware | Mingrui Jiang, Yichun Xu, Zefan Li, Can Li | 2025-01-22 | Download | The unprecedented advancement of artificial intelligence has placed immense demands on computing hardware, but traditional silicon-based semiconductor technologies are approaching their physical and e... |
| SoMa: Identifying, Exploring, and Understanding the DRAM Communication Scheduling Space for DNN Accelerators | Jingwei Cai, Xuan Wang, Mingyu Gao, Sen Peng, Zijian Zhu, Yuchen Wei, Zuotong Wu, Kaisheng Ma | 2025-01-22 | Download | Modern Deep Neural Network (DNN) accelerators are equipped with increasingly larger on-chip buffers to provide more opportunities to alleviate the increasingly severe DRAM bandwidth pressure. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A Selective Homomorphic Encryption Approach for Faster Privacy-Preserving Federated Learning | Abdulkadir Korkmaz, Praveen Rao | 2025-01-22 | Download | Federated learning (FL) has come forward as a critical approach for privacy-preserving machine learning in healthcare, allowing collaborative model training across decentralized medical datasets witho... |
| μOpTime: Statically Reducing the Execution Time of Microbenchmark Suites Using Stability Metrics | Nils Japke, Martin Grambow, Christoph Laaber, David Bermbach | 2025-01-22 | Download | Performance regressions have a tremendous impact on the quality of software. One way to catch regressions before they reach production is executing performance tests before deployment, e.g. |
| Practical quantum federated learning and its experimental demonstration | Zhi-Ping Liu, Xiao-Yu Cao, Hao-Wen Liu, Xiao-Ran Sun, Yu Bao, Yu-Shuo Lu, Hua-Lei Yin, Zeng-Bing Chen | 2025-01-22 | Download | Federated learning is essential for decentralized, privacy-preserving model training in the data-driven era. Quantum-enhanced federated learning leverages quantum resources to address privacy and scal... |
| Workflow as a Service Broker in Cloud Environment: A Systematic Literature Review | Saeid Abrishami, Faridreza Momtaz Zandi, Alireza Nourbakhsh | 2025-01-22 | Download | Cloud computing has emerged as a promising platform for running scientific workflows across various domains. Scientists can take advantage of different cloud service models, such as serverful or serve... |
| Knowledge-Driven Federated Graph Learning on Model Heterogeneity | Zhengyu Wu, Guang Zeng, Huilin Lai, Daohan Su, Jishuo Jia, Yinlin Zhu, Xunkai Li, Rong-Hua Li, Guoren Wang, Chenghu Zhou | 2025-01-22 | Download | Federated graph learning (FGL) has emerged as a promising paradigm for collaborative graph representation learning, enabling multiple parties to jointly train models while preserving data privacy. |
| Fray: An Efficient General-Purpose Concurrency Testing Platform for the JVM (Extended Version) | Ao Li, Byeongjee Kang, Vasudev Vikram, Isabella Laybourn, Samvid Dharanikota, Shrey Tiwari, Rohan Padhye | 2025-01-22 | Download | Concurrency bugs are hard to discover and reproduce. Prior work has developed sophisticated algorithms to search for concurrency bugs, such as partial order sampling (POS); however, fundamental limita... |
| FedGrAINS: Personalized SubGraph Federated Learning with Adaptive Neighbor Sampling | Emir Ceyani, Han Xie, Baturalp Buyukates, Carl Yang, Salman Avestimehr | 2025-01-22 | Download | Graphs are crucial for modeling relational and biological data. As datasets grow larger in real-world scenarios, the risk of exposing sensitive information increases, making privacy-preserving trainin... |
| D-LoRa: a Distributed Parameter Adaptation Scheme for LoRa Network | Ruiqi Wang, Tongyu Song, Jing Ren, Xiong Wang, Shizhong Xu, Sheng Wang | 2025-01-22 | Download | The deployment of LoRa networks necessitates joint performance optimization, including packet delivery rate, energy efficiency, and throughput. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Joint Task Offloading and User Scheduling in 5G MEC under Jamming Attacks | Mohammadreza Amini, Burak Kantarci, Claude D'Amours, Melike Erol-Kantarci | 2025-01-22 | Download | In this paper, we propose a novel joint task offloading and user scheduling (JTO-US) framework for 5G mobile edge computing (MEC) systems under security threats from jamming attacks. |
| Which Sensor to Observe? Timely Tracking of a Joint Markov Source with Model Predictive Control | Ismail Cosandal, Sennur Ulukus, Nail Akar | 2025-01-22 | Download | In this paper, we investigate the problem of remote estimation of a discrete-time joint Markov process using multiple sensors. Each sensor observes a different component of the joint Markov process, a... |
| UAV-assisted Internet of Vehicles: A Framework Empowered by Reinforcement Learning and Blockchain | Ahmed Alagha, Maha Kadadha, Rabeb Mizouni, Shakti Singh, Jamal Bentahar, Hadi Otrok | 2025-01-22 | Download | This paper addresses the challenges of selecting relay nodes and coordinating among them in UAV-assisted Internet-of-Vehicles (IoV). The selection of UAV relay nodes in IoV employs mechanisms executed... |
| Information Degradation and Misinformation in Gossip Networks | Thomas Jacob Maranzatto, Arunabh Srivastava, Sennur Ulukus | 2025-01-22 | Download | We study networks of gossiping users where a source observing a process sends updates to an underlying graph. Nodes in the graph update their neighbors randomly and nodes always accept packets that ha... |
| GPUs, CPUs, and... NICs: Rethinking the Network's Role in Serving Complex AI Pipelines | Mike Wong, Ulysses Butler, Emma Farkash, Praveen Tammana, Anirudh Sivaraman, Ravi Netravali | 2025-01-22 | Download | The increasing prominence of AI necessitates the deployment of inference platforms for efficient and effective management of AI pipelines and compute resources. |
| GWEn -- An Open-Source Wireless Physical-Layer Evaluation Platform | Alexander Heinrich, Florentin Putz, Sören Krollmann, Bastian Loss, Waqar Ahmed, Matthias Hollick | 2025-01-22 | Download | Wireless physical layer assessment, such as measuring antenna radiation patterns, is complex and cost-intensive. Researchers often require a stationary setup with antennas surrounding the device under... |
| Scalability Analysis of 5G-TSN Applications in Indoor Factory Settings | Kouros Zanbouri, Md. Noor-A-Rahim, Dirk Pesch | 2025-01-22 | Download | While technologies such as Time-Sensitive Networking (TSN) improve deterministic behaviour, real-time functionality, and robustness of Ethernet, future industrial networks aim to be increasingly wirel... |
| A transformer-based deep q learning approach for dynamic load balancing in software-defined networks | Evans Tetteh Owusu, Kwame Agyemang-Prempeh Agyekum, Marinah Benneh, Pius Ayorna, Justice Owusu Agyemang, George Nii Martey Colley, James Dzisi Gazde | 2025-01-22 | Download | This study proposes a novel approach for dynamic load balancing in Software-Defined Networks (SDNs) using a Transformer-based Deep Q-Network (DQN). |
| Comparative Performance Evaluation of 5G-TSN Applications in Indoor Factory Environments | Kouros Zanbouri, Md. Noor-A-Rahim, Dirk Pesch | 2025-01-22 | Download | While Time-Sensitive Networking (TSN) enhances the determinism, real-time capabilities, and reliability of Ethernet, future industrial networks will not only use wired but increasingly wireless commun... |
| Cost Optimization for Serverless Edge Computing with Budget Constraints using Deep Reinforcement Learning | Chen Chen, Peiyuan Guan, Ziru Chen, Amir Taherkordi, Fen Hou, Lin X. Cai | 2025-01-22 | Download | Serverless computing adopts a pay-as-you-go billing model where applications are executed in stateless and shortlived containers triggered by events, resulting in a reduction of monetary costs and res... |
| Making Temporal Betweenness Computation Faster and Restless | Filippo Brunelli, Pierluigi Crescenzi, Laurent Viennot | 2025-01-22 | Download | Buß et al [KDD 2020] recently proved that the problem of computing the betweenness of all nodes of a temporal graph is computationally hard in the case of foremost and fastest paths, while it is solva... |
| A Multi-Stakeholder Perspective on Self-Managing Networks | Patrick Weber, Artur Sterz, Bernd Freisleben, Oliver Hinz | 2025-01-22 | Download | Modern telecommunication networks face an increasing complexity due to the rapidly growing number of networked devices and rising amounts of data. |
| PPO-Based Vehicle Control for Ramp Merging Scheme Assisted by Enhanced C-V2X | Qiong Wu, Maoxin Ji, Pingyi Fan, Kezhi Wang, Nan Cheng, Wen Chen, Khaled B. Letaief | 2025-01-22 | Download | On-ramp merging presents a critical challenge in autonomous driving, as vehicles from merging lanes need to dynamically adjust their positions and speeds while monitoring traffic on the main road to p... |
| D-LoRa: a Distributed Parameter Adaptation Scheme for LoRa Network | Ruiqi Wang, Tongyu Song, Jing Ren, Xiong Wang, Shizhong Xu, Sheng Wang | 2025-01-22 | Download | The deployment of LoRa networks necessitates joint performance optimization, including packet delivery rate, energy efficiency, and throughput. |
| Mechanism Design for Blockchain Order Books against Selfish Miners | Yunshu Liu, Lingjie Duan | 2025-01-22 | Download | In blockchain-based order book systems, buyers and sellers trade assets, while it is miners to match them and include their transactions in the blockchain. |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| GPUs, CPUs, and... NICs: Rethinking the Network's Role in Serving Complex AI Pipelines | Mike Wong, Ulysses Butler, Emma Farkash, Praveen Tammana, Anirudh Sivaraman, Ravi Netravali | 2025-01-22 | Download | The increasing prominence of AI necessitates the deployment of inference platforms for efficient and effective management of AI pipelines and compute resources. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Deciphering boundary layer dynamics in high-Rayleigh-number convection using 3360 GPUs and a high-scaling in-situ workflow | Mathis Bode, Damian Alvarez, Paul Fischer, Christos E. Frouzakis, Jens Henrik Göbbert, Joseph A. Insley, Yu-Hsiang Lan, Victor A. Mateevitsi, Misun Min, Michael E. Papka, Silvio Rizzi, Roshan J. Samuel, Jörg Schumacher | 2025-01-22 | Download | Turbulent heat and momentum transfer processes due to thermal convection cover many scales and are of great importance for several natural and technical flows. |
| Need for Speed: A Comprehensive Benchmark of JPEG Decoders in Python | Vladimir Iglovikov | 2025-01-22 | Download | Image loading represents a critical bottleneck in modern machine learning pipelines, particularly in computer vision tasks where JPEG remains the dominant format. |