Skip to content

2025-11-28

cs.AR - Architecture

标题作者发布日期PDF摘要
Variable Point: A Number Format for Area- and Energy-Efficient Multiplication of High-Dynamic-Range NumbersSeyed Hadi Mirfarshbafan, Nicolas Filliol, Oscar Castañeda, Christoph Studer2025-11-28下载Fixed-point number representation is commonly employed in digital VLSI designs that have stringent hardware efficiency constraints. However, fixed-point numbers cover a relatively small dynamic range ...
Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic Forward Pass and Code GenerationBernhard Klein, Falk Selker, Hendrik Borras, Sophie Steger, Franz Pernkopf, Holger Fröning2025-11-28下载Machine learning models perform well across domains such as diagnostics, weather forecasting, NLP, and autonomous driving, but their limited uncertainty handling restricts use in safety-critical setti...
Ternary-Input Binary-Weight CNN Accelerator Design for Miniature Object Classification System with Query-Driven Spatial DVSYuyang Li, Swasthik Muloor, Jack Laudati, Nickolas Dematteis, Yidam Park, Hana Kim, Nathan Chang, Inhee Lee2025-11-28下载Miniature imaging systems are essential for space-constrained applications but are limited by memory and power constraints. While machine learning can reduce data size by extracting key features, its ...
GAVINA: flexible aggressive undervolting for bit-serial mixed-precision DNN accelerationJordi Fornt, Pau Fontova-Musté, Adrian Gras, Omar Lahyani, Martí Caro, Jaume Abella, Francesc Moll, Josep Altet2025-11-28下载Voltage overscaling, or undervolting, is an enticing approximate technique in the context of energy-efficient Deep Neural Network (DNN) acceleration, given the quadratic relationship between power and...
Cohet: A CXL-Driven Coherent Heterogeneous Computing Framework with Hardware-Calibrated Full-System SimulationYanjing Wang, Lizhou Wu, Sunfeng Gao, Yibo Tang, Junhui Luo, Zicong Wang, Yang Ou, Dezun Dong, Nong Xiao, Mingche Lai2025-11-28下载Conventional heterogeneous computing systems built on PCIe interconnects suffer from inefficient fine-grained host-device interactions and complex programming models.
The Immutable Tensor Architecture: A Pure Dataflow Approach for Secure, Energy-Efficient AI InferenceFang Li2025-11-28下载The deployment of Large Language Models (LLMs) on consumer edge devices is throttled by the "Memory Wall" -- the prohibitive bandwidth and energy cost of fetching gigabytes of model weights from DRAM ...
STELLAR: Structure-guided LLM Assertion Retrieval and Generation for Formal VerificationSaeid Rajabi, Chengmo Yang, Satwik Patnaik2025-11-28下载Formal Verification (FV) relies on high-quality SystemVerilog Assertions (SVAs), but the manual writing process is slow and error-prone. Existing LLM-based approaches either generate assertions from s...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
A Parallel and Distributed Rust Library for Core Decomposition on Large GraphsDavide Rucci, Sebastian Parfeniuc, Matteo Mordacchini, Emanuele Carlini, Alfredo Cuzzocrea, Patrizio Dazzi2025-11-28下载In this paper, we investigate the parallelization of kk-core decomposition, a method used in graph analysis to identify cohesive substructures and assess node centrality.
Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic Forward Pass and Code GenerationBernhard Klein, Falk Selker, Hendrik Borras, Sophie Steger, Franz Pernkopf, Holger Fröning2025-11-28下载Machine learning models perform well across domains such as diagnostics, weather forecasting, NLP, and autonomous driving, but their limited uncertainty handling restricts use in safety-critical setti...
Beyond 2-Edge-Connectivity: Algorithms and Impossibility for Content-Oblivious Leader ElectionYi-Jun Chang, Lyuting Chen, Haoran Zhou2025-11-28下载The content-oblivious model, introduced by Censor-Hillel, Cohen, Gelles, and Sel (PODC 2022; Distributed Computing 2023), captures an extremely weak form of communication where nodes can only send asy...
Closing the Generalization Gap in Parameter-efficient Federated Edge LearningXinnong Du, Zhonghao Lyu, Xiaowen Cao, Chunyang Wen, Shuguang Cui, Jie Xu2025-11-28下载Federated edge learning (FEEL) provides a promising foundation for edge artificial intelligence (AI) by enabling collaborative model training while preserving data privacy.
RetryGuard: Preventing Self-Inflicted Retry Storms in Cloud Microservices ApplicationsJhonatan Tavori, Anat Bremler-Barr, Hanoch Levy, Ofek Lavi2025-11-28下载Modern cloud applications are built on independent, diverse microservices, offering scalability, flexibility, and usage-based billing. However, the structural design of these varied services, along wi...
Fixed-Priority and EDF Schedules for ROS2 Graphs on UniprocessorOren Bell, Harun Teper, Mario Günzel, Chris Gill, Jian-Jia Chen2025-11-28下载This paper addresses limitations of current scheduling methods in the Robot Operating System (ROS)2, focusing on scheduling tasks beyond simple chains and analyzing arbitrary Directed Acyclic Graphs (...
Communication-Computation Pipeline Parallel Split Learning over Wireless Edge NetworksChenyu Liu, Zhaoyang Zhang, Zirui Chen, Zhaohui Yang2025-11-28下载Split learning (SL) offloads main computing tasks from multiple resource-constrained user equippments (UEs) to the base station (BS), while preserving local data privacy.
Areon: Latency-Friendly and Resilient Multi-Proposer ConsensusÁlvaro Castro-Castilla, Marcin Pawlowski, Hong-Sheng Zhou2025-11-28下载We present Areon, a family of latency-friendly, stake-weighted, multi-proposer proof-of-stake consensus protocols. By allowing multiple proposers per slot and organizing blocks into a directed acyclic...
Serving Heterogeneous LoRA Adapters in Distributed LLM Inference SystemsShashwat Jaiswal, Shrikara Arun, Anjaly Parayil, Ankur Mallick, Spyros Mastorakis, Alind Khare, Chloi Alverti, Renee St Amant, Chetan Bansal, Victor Rühle, Josep Torrellas2025-11-28下载Low-Rank Adaptation (LoRA) has become the de facto method for parameter-efficient fine-tuning of large language models (LLMs), enabling rapid adaptation to diverse domains.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Analysis of the operation of a TSN switch and other devices using executable QR codesStefano Scanzio, Pietro Chiavassa, Gianluca Cena2025-11-28下载Executable QR codes, also known as sQRy, are a technology aimed at inserting executable programs in a QR code. Through a concrete example, in this paper, we demonstrate their usage in the context of i...
On the Prediction of Wi-Fi Performance through Deep LearningGabriele Formis, Amanda Ericson, Stefan Forsstrom, Kyi Thar, Gianluca Cena, Stefano Scanzio2025-11-28下载Ensuring reliable and predictable communications is one of the main goals in modern industrial systems that rely on Wi-Fi networks, especially in scenarios where continuity of operation and low latenc...
Mesh Augmentation of LoRaWAN-based IoT NetworksRam Ramanathan, Dmitrii Dugaev, Liang Tan, Warren Ramanathan2025-11-28下载LoRaWAN is a leading standard and technology for low-power, long-range Internet-of-Things (IoT) communications. However, its single-hop architecture results in limited effective range and excessive po...
Quantum Private Distributed Matrix Multiplication With Degree TablesMohamed Nomeir, Alptug Aytekin, Lei Hu, Sennur Ulukus2025-11-28下载In this paper, we explore how quantum resources can be used to increase the rate of private distributed matrix multiplication (PDMM). In PDMM, a user who has two high-dimensional matrices, AA and BB...
Joint Resource Allocation to Transparently Integrate 5G TDD Uplink with Time-Aware TSNLaura Becker, Yash Deshpande, Wolfgang Kellerer2025-11-28下载To enable mobility in industrial communication systems, the seamless integration of 5G with Time-Sensitive Networking (TSN) is a promising approach.
Performance Evaluation of Multi-Armed Bandit Algorithms for Wi-Fi Channel AccessMiguel Casasnovas, Francesc Wilhelmi, Richard Combes, Maksymilian Wojnar, Katarzyna Kosek-Szott, Szymon Szott, Anders Jonsson, Luis Esteve, Boris Bellalta2025-11-28下载The adoption of dynamic, self-learning solutions for real-time wireless network optimization has recently gained significant attention due to the limited adaptability of existing protocols.
RetryGuard: Preventing Self-Inflicted Retry Storms in Cloud Microservices ApplicationsJhonatan Tavori, Anat Bremler-Barr, Hanoch Levy, Ofek Lavi2025-11-28下载Modern cloud applications are built on independent, diverse microservices, offering scalability, flexibility, and usage-based billing. However, the structural design of these varied services, along wi...
IoTEdu: Access Control, Detection, and Automatic Incident Response in Academic IoT NetworksJoner Assolin, Diego Kreutz, Leandro Bertholdo2025-11-28下载The growing presence of IoT devices in academic environments has increased operational complexity and exposed security weaknesses, especially in academic institutions without unified policies for regi...
Efficient Asynchronous Federated Evaluation with Strategy Similarity Awareness for Intent-Based Networking in Industrial Internet of ThingsShaowen Qin, Jianfeng Zeng, Haodong Guo, Xiaohuan Li, Jiawen Kang, Qian Chen, Dusit Niyato2025-11-28下载Intent-Based Networking (IBN) offers a promising paradigm for intelligent and automated network control in Industrial Internet of Things (IIoT) environments by translating high-level user intents into...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Fixed-Priority and EDF Schedules for ROS2 Graphs on UniprocessorOren Bell, Harun Teper, Mario Günzel, Chris Gill, Jian-Jia Chen2025-11-28下载This paper addresses limitations of current scheduling methods in the Robot Operating System (ROS)2, focusing on scheduling tasks beyond simple chains and analyzing arbitrary Directed Acyclic Graphs (...

基于 VitePress 构建