2025-11-28

cs.AR - Hardware Architecture

Title	Authors	Published	PDF	Abstract
Variable Point: A Number Format for Area- and Energy-Efficient Multiplication of High-Dynamic-Range Numbers	Seyed Hadi Mirfarshbafan, Nicolas Filliol, Oscar Castañeda, Christoph Studer	2025-11-28	Download	Fixed-point number representation is commonly employed in digital VLSI designs that have stringent hardware efficiency constraints. However, fixed-point numbers cover a relatively small dynamic range ...
Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic Forward Pass and Code Generation	Bernhard Klein, Falk Selker, Hendrik Borras, Sophie Steger, Franz Pernkopf, Holger Fröning	2025-11-28	Download	Machine learning models perform well across domains such as diagnostics, weather forecasting, NLP, and autonomous driving, but their limited uncertainty handling restricts use in safety-critical setti...
Ternary-Input Binary-Weight CNN Accelerator Design for Miniature Object Classification System with Query-Driven Spatial DVS	Yuyang Li, Swasthik Muloor, Jack Laudati, Nickolas Dematteis, Yidam Park, Hana Kim, Nathan Chang, Inhee Lee	2025-11-28	Download	Miniature imaging systems are essential for space-constrained applications but are limited by memory and power constraints. While machine learning can reduce data size by extracting key features, its ...
GAVINA: flexible aggressive undervolting for bit-serial mixed-precision DNN acceleration	Jordi Fornt, Pau Fontova-Musté, Adrian Gras, Omar Lahyani, Martí Caro, Jaume Abella, Francesc Moll, Josep Altet	2025-11-28	Download	Voltage overscaling, or undervolting, is an enticing approximate technique in the context of energy-efficient Deep Neural Network (DNN) acceleration, given the quadratic relationship between power and...
Cohet: A CXL-Driven Coherent Heterogeneous Computing Framework with Hardware-Calibrated Full-System Simulation	Yanjing Wang, Lizhou Wu, Sunfeng Gao, Yibo Tang, Junhui Luo, Zicong Wang, Yang Ou, Dezun Dong, Nong Xiao, Mingche Lai	2025-11-28	Download	Conventional heterogeneous computing systems built on PCIe interconnects suffer from inefficient fine-grained host-device interactions and complex programming models.
The Immutable Tensor Architecture: A Pure Dataflow Approach for Secure, Energy-Efficient AI Inference	Fang Li	2025-11-28	Download	The deployment of Large Language Models (LLMs) on consumer edge devices is throttled by the "Memory Wall" -- the prohibitive bandwidth and energy cost of fetching gigabytes of model weights from DRAM ...
STELLAR: Structure-guided LLM Assertion Retrieval and Generation for Formal Verification	Saeid Rajabi, Chengmo Yang, Satwik Patnaik	2025-11-28	Download	Formal Verification (FV) relies on high-quality SystemVerilog Assertions (SVAs), but the manual writing process is slow and error-prone. Existing LLM-based approaches either generate assertions from s...

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
A Parallel and Distributed Rust Library for Core Decomposition on Large Graphs	Davide Rucci, Sebastian Parfeniuc, Matteo Mordacchini, Emanuele Carlini, Alfredo Cuzzocrea, Patrizio Dazzi	2025-11-28	Download	In this paper, we investigate the parallelization of $k$-core decomposition, a method used in graph analysis to identify cohesive substructures and assess node centrality.
Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic Forward Pass and Code Generation	Bernhard Klein, Falk Selker, Hendrik Borras, Sophie Steger, Franz Pernkopf, Holger Fröning	2025-11-28	Download	Machine learning models perform well across domains such as diagnostics, weather forecasting, NLP, and autonomous driving, but their limited uncertainty handling restricts use in safety-critical setti...
Beyond 2-Edge-Connectivity: Algorithms and Impossibility for Content-Oblivious Leader Election	Yi-Jun Chang, Lyuting Chen, Haoran Zhou	2025-11-28	Download	The content-oblivious model, introduced by Censor-Hillel, Cohen, Gelles, and Sel (PODC 2022; Distributed Computing 2023), captures an extremely weak form of communication where nodes can only send asy...
Closing the Generalization Gap in Parameter-efficient Federated Edge Learning	Xinnong Du, Zhonghao Lyu, Xiaowen Cao, Chunyang Wen, Shuguang Cui, Jie Xu	2025-11-28	Download	Federated edge learning (FEEL) provides a promising foundation for edge artificial intelligence (AI) by enabling collaborative model training while preserving data privacy.
RetryGuard: Preventing Self-Inflicted Retry Storms in Cloud Microservices Applications	Jhonatan Tavori, Anat Bremler-Barr, Hanoch Levy, Ofek Lavi	2025-11-28	Download	Modern cloud applications are built on independent, diverse microservices, offering scalability, flexibility, and usage-based billing. However, the structural design of these varied services, along wi...
Fixed-Priority and EDF Schedules for ROS2 Graphs on Uniprocessor	Oren Bell, Harun Teper, Mario Günzel, Chris Gill, Jian-Jia Chen	2025-11-28	Download	This paper addresses limitations of current scheduling methods in the Robot Operating System 2 (ROS2), focusing on scheduling tasks beyond simple chains and analyzing arbitrary Directed Acyclic Graphs (...
Communication-Computation Pipeline Parallel Split Learning over Wireless Edge Networks	Chenyu Liu, Zhaoyang Zhang, Zirui Chen, Zhaohui Yang	2025-11-28	Download	Split learning (SL) offloads main computing tasks from multiple resource-constrained user equipments (UEs) to the base station (BS), while preserving local data privacy.
Areon: Latency-Friendly and Resilient Multi-Proposer Consensus	Álvaro Castro-Castilla, Marcin Pawlowski, Hong-Sheng Zhou	2025-11-28	Download	We present Areon, a family of latency-friendly, stake-weighted, multi-proposer proof-of-stake consensus protocols. By allowing multiple proposers per slot and organizing blocks into a directed acyclic...
Serving Heterogeneous LoRA Adapters in Distributed LLM Inference Systems	Shashwat Jaiswal, Shrikara Arun, Anjaly Parayil, Ankur Mallick, Spyros Mastorakis, Alind Khare, Chloi Alverti, Renee St Amant, Chetan Bansal, Victor Rühle, Josep Torrellas	2025-11-28	Download	Low-Rank Adaptation (LoRA) has become the de facto method for parameter-efficient fine-tuning of large language models (LLMs), enabling rapid adaptation to diverse domains.

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Analysis of the operation of a TSN switch and other devices using executable QR codes	Stefano Scanzio, Pietro Chiavassa, Gianluca Cena	2025-11-28	Download	Executable QR codes, also known as sQRy, are a technology aimed at inserting executable programs in a QR code. Through a concrete example, in this paper, we demonstrate their usage in the context of i...
On the Prediction of Wi-Fi Performance through Deep Learning	Gabriele Formis, Amanda Ericson, Stefan Forsstrom, Kyi Thar, Gianluca Cena, Stefano Scanzio	2025-11-28	Download	Ensuring reliable and predictable communications is one of the main goals in modern industrial systems that rely on Wi-Fi networks, especially in scenarios where continuity of operation and low latenc...
Mesh Augmentation of LoRaWAN-based IoT Networks	Ram Ramanathan, Dmitrii Dugaev, Liang Tan, Warren Ramanathan	2025-11-28	Download	LoRaWAN is a leading standard and technology for low-power, long-range Internet-of-Things (IoT) communications. However, its single-hop architecture results in limited effective range and excessive po...
Quantum Private Distributed Matrix Multiplication With Degree Tables	Mohamed Nomeir, Alptug Aytekin, Lei Hu, Sennur Ulukus	2025-11-28	Download	In this paper, we explore how quantum resources can be used to increase the rate of private distributed matrix multiplication (PDMM). In PDMM, a user who has two high-dimensional matrices, $A$ and $B$ ...
Joint Resource Allocation to Transparently Integrate 5G TDD Uplink with Time-Aware TSN	Laura Becker, Yash Deshpande, Wolfgang Kellerer	2025-11-28	Download	To enable mobility in industrial communication systems, the seamless integration of 5G with Time-Sensitive Networking (TSN) is a promising approach.
Performance Evaluation of Multi-Armed Bandit Algorithms for Wi-Fi Channel Access	Miguel Casasnovas, Francesc Wilhelmi, Richard Combes, Maksymilian Wojnar, Katarzyna Kosek-Szott, Szymon Szott, Anders Jonsson, Luis Esteve, Boris Bellalta	2025-11-28	Download	The adoption of dynamic, self-learning solutions for real-time wireless network optimization has recently gained significant attention due to the limited adaptability of existing protocols.
RetryGuard: Preventing Self-Inflicted Retry Storms in Cloud Microservices Applications	Jhonatan Tavori, Anat Bremler-Barr, Hanoch Levy, Ofek Lavi	2025-11-28	Download	Modern cloud applications are built on independent, diverse microservices, offering scalability, flexibility, and usage-based billing. However, the structural design of these varied services, along wi...
IoTEdu: Access Control, Detection, and Automatic Incident Response in Academic IoT Networks	Joner Assolin, Diego Kreutz, Leandro Bertholdo	2025-11-28	Download	The growing presence of IoT devices in academic environments has increased operational complexity and exposed security weaknesses, especially in academic institutions without unified policies for regi...
Efficient Asynchronous Federated Evaluation with Strategy Similarity Awareness for Intent-Based Networking in Industrial Internet of Things	Shaowen Qin, Jianfeng Zeng, Haodong Guo, Xiaohuan Li, Jiawen Kang, Qian Chen, Dusit Niyato	2025-11-28	Download	Intent-Based Networking (IBN) offers a promising paradigm for intelligent and automated network control in Industrial Internet of Things (IIoT) environments by translating high-level user intents into...

cs.OS - Operating Systems

Title	Authors	Published	PDF	Abstract
Fixed-Priority and EDF Schedules for ROS2 Graphs on Uniprocessor	Oren Bell, Harun Teper, Mario Günzel, Chris Gill, Jian-Jia Chen	2025-11-28	Download	This paper addresses limitations of current scheduling methods in the Robot Operating System 2 (ROS2), focusing on scheduling tasks beyond simple chains and analyzing arbitrary Directed Acyclic Graphs (...