2025-03-21

cs.AR - Hardware Architecture

Title	Authors	Published	PDF	Abstract
Improving Quantization with Post-Training Model Expansion	Giuseppe Franco, Pablo Monteagudo-Lago, Ian Colbert, Nicholas Fraser, Michaela Blott	2025-03-21	Download	The size of a model has been a strong predictor of its quality, as well as its cost. As such, the trade-off between model cost and quality has been well-studied.
Register Dispersion: Reducing the Footprint of the Vector Register File in Vector Engines of Low-Cost RISC-V CPUs	Vasileios Titopoulos, George Alexakis, Kosmas Alexandridis, Chrysostomos Nicopoulos, Giorgos Dimitrakopoulos	2025-03-21	Download	The deployment of Machine Learning (ML) applications at the edge on resource-constrained devices has accentuated the need for efficient ML processing on low-cost processors.
Achieving Dependability of AI Execution with Radiation Hardened Processors	Carlos Rafael Tordoya Taquichiri, Hans Dermot Doran, Pablo Ghiglino, Mandar Harshe	2025-03-21	Download	The reliance on radiation-hardened hardware, essential for domains requiring high-dependability such as space, nuclear energy and medical applications, severely restricts the choice of components avai...
Arm DynamIQ Shared Unit and Real-Time: An Empirical Evaluation	Ashutosh Pradhan, Daniele Ottaviano, Yi Jiang, Haozheng Huang, Alexander Zuepke, Andrea Bastoni, Marco Caccamo	2025-03-21	Download	The increasing complexity of embedded hardware platforms poses significant challenges for real-time workloads. Architectural features such as Intel RDT, Arm QoS, and Arm MPAM are either unavailable on...
Work-In-Progress: Accelerating Numpy With OpenBLAS For Open-Source RISC-V Chips	Cyril Koenig, Enrico Zelioli, Frank K. Gürkaynak, Luca Benini	2025-03-21	Download	RISC-V allows for building general-purpose computing platforms with programmable accelerators around a single open-source ISA. However, leveraging heterogeneous SoCs within high-level applications is ...
Fused-Tiled Layers: Minimizing Data Movement on RISC-V SoCs with Software-Managed Caches	Victor J. B. Jung, Alessio Burrello, Francesco Conti, Luca Benini	2025-03-21	Download	The success of DNNs and their high computational requirements pushed for large codesign efforts aiming at DNN acceleration. Since DNNs can be represented as static computational graphs, static memory ...
MemPool Flavors: Between Versatility and Specialization in a RISC-V Manycore Cluster	Sergio Mazzola, Yichao Zhang, Marco Bertuletti, Diyou Shen, Luca Benini	2025-03-21	Download	As computational paradigms evolve, applications such as attention-based models, wireless telecommunications, and computer vision impose increasingly challenging requirements on computer architectures:...
On-Sensor Convolutional Neural Networks with Early-Exits	Hazem Hesham Yousef Shalby, Arianna De Vecchi, Alice Scandelli, Pietro Bartoli, Diana Trojaniello, Manuel Roveri, Federica Villa	2025-03-21	Download	Tiny Machine Learning (TinyML) is a novel research field aiming at integrating Machine Learning (ML) within embedded devices with limited memory, computation, and energy.

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Intelligent Resource Allocation Optimization for Cloud Computing via Machine Learning	Yuqing Wang, Xiao Yang	2025-03-21	Download	With the rapid expansion of cloud computing applications, optimizing resource allocation has become crucial for improving system performance and cost efficiency.
Serinv: A Scalable Library for the Selected Inversion of Block-Tridiagonal with Arrowhead Matrices	Vincent Maillou, Lisa Gaedke-Merzhaeuser, Alexandros Nikolaos Ziogas, Olaf Schenk, Mathieu Luisier	2025-03-21	Download	The inversion of structured sparse matrices is a key but computationally and memory-intensive operation in many scientific applications. There are cases, however, where only particular entries of the ...
Energy Efficiency trends in HPC: what high-energy and astrophysicists need to know	Estela Suarez, Jorge Amaya, Martin Frank, Oliver Freyermuth, Maria Girone, Bartosz Kostrzewa, Susanne Pfalzner	2025-03-21	Download	The growing energy demands of HPC systems have made energy efficiency a critical concern for system developers and operators. However, HPC users are generally less aware of how these energy concerns i...
Achieving Dependability of AI Execution with Radiation Hardened Processors	Carlos Rafael Tordoya Taquichiri, Hans Dermot Doran, Pablo Ghiglino, Mandar Harshe	2025-03-21	Download	The reliance on radiation-hardened hardware, essential for domains requiring high-dependability such as space, nuclear energy and medical applications, severely restricts the choice of components avai...
Analyzing Performance Bottlenecks in Zero-Knowledge Proof Based Rollups on Ethereum	Md. Ahsan Habib	2025-03-21	Download	Blockchain technology is rapidly evolving, with scalability remaining one of its most significant challenges. While various solutions have been proposed and continue to be developed, it is essential t...
LoGoFair: Post-Processing for Local and Global Fairness in Federated Learning	Li Zhang, Chaochao Chen, Zhongxuan Han, Qiyong Zhong, Xiaolin Zheng	2025-03-21	Download	Federated learning (FL) has garnered considerable interest for its capability to learn from decentralized data sources. Given the increasing application of FL in decision-making scenarios, addressing ...
Robustness of deep learning classification to adversarial input on GPUs: asynchronous parallel accumulation is a source of vulnerability	Sanjif Shanmugavelu, Mathieu Taillefumier, Christopher Culver, Vijay Ganesh, Oscar Hernandez, Ada Sedova	2025-03-21	Download	The ability of machine learning (ML) classification models to resist small, targeted input perturbations -- known as adversarial attacks -- is a key measure of their safety and reliability.
MemPool Flavors: Between Versatility and Specialization in a RISC-V Manycore Cluster	Sergio Mazzola, Yichao Zhang, Marco Bertuletti, Diyou Shen, Luca Benini	2025-03-21	Download	As computational paradigms evolve, applications such as attention-based models, wireless telecommunications, and computer vision impose increasingly challenging requirements on computer architectures:...
Improving the End-to-End Efficiency of Offline Inference for Multi-LLM Applications Based on Sampling and Simulation	Jingzhi Fang, Yanyan Shen, Yue Wang, Lei Chen	2025-03-21	Download	As large language models (LLMs) have shown great success in many tasks, they are used in various applications. While a lot of works have focused on the efficiency of single-LLM application (e.g.
Federated Cross-Domain Click-Through Rate Prediction With Large Language Model Augmentation	Jiangcheng Qin, Xueyuan Zhang, Baisong Liu, Jiangbo Qian, Yangyang Wang	2025-03-21	Download	Accurately predicting click-through rates (CTR) under stringent privacy constraints poses profound challenges, particularly when user-item interactions are sparse and fragmented across domains.
DeFT: Mitigating Data Dependencies for Flexible Communication Scheduling in Distributed Training	Lin Meng, Yuzhong Sun	2025-03-21	Download	Communication scheduling aims to reduce communication bottlenecks in data parallel training (DP) by maximizing the overlap between computation and communication.
Local Ratio based Real-time Job Offloading and Resource Allocation in Mobile Edge Computing	Chuanchao Gao, Arvind Easwaran	2025-03-21	Download	Mobile Edge Computing (MEC) has emerged as a promising paradigm enabling vehicles to handle computation-intensive and time-sensitive applications for intelligent transportation.
CoBRA: A Universal Strategyproof Confirmation Protocol for Quorum-based Proof-of-Stake Blockchains	Zeta Avarikioti, Eleftherios Kokoris Kogias, Ray Neiheiser, Christos Stefo	2025-03-21	Download	The security of many Proof-of-Stake (PoS) payment systems relies on quorum-based State Machine Replication (SMR) protocols. While classical analyses assume purely Byzantine faults, real-world systems ...

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
P4sim: Programming Protocol-independent Packet Processors in ns-3	Mingyu Ma, Giang T. Nguyen	2025-03-21	Download	Programmable data planes enable users to design data plane algorithms for network devices, providing extensive flexibility for network customization.
Commercial Dishes Can Be My Ladder: Sustainable and Collaborative Data Offloading in LEO Satellite Networks	Yi Ching Chou, Long Chen, Hengzhi Wang, Feng Wang, Hao Fang, Haoyuan Zhao, Miao Zhang, Xiaoyi Fan, Jiangchuan Liu	2025-03-21	Download	Low Earth Orbit (LEO) satellite networks, characterized by their high data throughput and low latency, have gained significant interest from both industry and academia.
Transfer Learning for EDFA Gain Modeling: A Semi-Supervised Approach Using Internal Amplifier Features	Agastya Raj, Dan Kilper, Marco Ruffini	2025-03-21	Download	The gain spectrum of an Erbium-Doped Fiber Amplifier (EDFA) has a complex dependence on channel loading, pump power, and operating mode, making accurate modeling difficult to achieve.
Interference Identification in Multi-User Optical Spectrum as a Service using Convolutional Neural Networks	Agastya Raj, Zehao Wang, Frank Slyne, Tingjun Chen, Dan Kilper, Marco Ruffini	2025-03-21	Download	We introduce a ML-based architecture for network operators to detect impairments from specific OSaaS users while blind to the users' internal spectrum details.
Multi-Span Optical Power Spectrum Evolution Modeling using ML-based Multi-Decoder Attention Framework	Agastya Raj, Zehao Wang, Frank Slyne, Tingjun Chen, Dan Kilper, Marco Ruffini	2025-03-21	Download	We implement a ML-based attention framework with component-specific decoders, improving optical power spectrum prediction in multi-span networks.
Demonstration of Cooperative Transport Interface over Open Source 7.2 split RAN and Virtualised Open PON Network	Merim Dzaferagic, Kevin O'Sullivan, Bruce Richardson, Brendan Ryan, Niall Power, Robin Giller, Marco Ruffini	2025-03-21	Download	We demonstrate end-to-end 5G Open RAN over PON using off-the-shelf open networking hardware and open source RAN software. The implementation of the Cooperative Transport Interface provides timely sync...
Governance of Ledger-Anchored Decentralized Identifiers	Sandro Rodriguez Garzon, Carlo Segat, Axel Küpper	2025-03-21	Download	A Decentralized Identifier (DID) empowers an entity to prove control over a unique and self-issued identifier without relying on any identity provider.
Joint Beamforming and Trajectory Optimization for Multi-UAV-Assisted Integrated Sensing and Communication Systems	Yan Kyaw Tun, Nway Nway Ei, Sheikh Salman Hassan, Cedomir Stefanovic, Nguyen Van Huynh, Madyan Alsenwi, Choong Seon Hong	2025-03-21	Download	In this paper, we investigate beamforming design and trajectory optimization for a multi-unmanned aerial vehicle (UAV)-assisted integrated sensing and communication (ISAC) system.
Indoor Localization Based on MSC Map	Łukasz Kułacz, Adrian Kliks, Julius Ruseckas, Gediminas Molis	2025-03-21	Download	In this short paper, we propose a technique for AI-based identification of modulation and coding schemes (MCS) in surrounding cellular signals.
Rotatable RIS-Assisted Edge Computing: Orientation, Task Offloading, and Resource Optimization	Bin Li, Dongdong Yang, Lei Liu	2025-03-21	Download	The rotatable reconfigurable intelligent surface (RIS) can enhance mobile edge computing (MEC) performance by optimizing its orientation to improve the gain of received and transmitted signals.
Betweenness Centrality Based Dynamic Source Routing for Flying Ad Hoc Networks in Marching Formation	Shaoshi Yang, Wei Zhao, Chu-Meng Wang, Wen-Yu Dong, Xiaojie Ju	2025-03-21	Download	Designing high-performance routing protocols for flying ad hoc networks (FANETs) is challenging due to the diversity of applications and the dynamics of network topology.

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
Serinv: A Scalable Library for the Selected Inversion of Block-Tridiagonal with Arrowhead Matrices	Vincent Maillou, Lisa Gaedke-Merzhaeuser, Alexandros Nikolaos Ziogas, Olaf Schenk, Mathieu Luisier	2025-03-21	Download	The inversion of structured sparse matrices is a key but computationally and memory-intensive operation in many scientific applications. There are cases, however, where only particular entries of the ...
Arm DynamIQ Shared Unit and Real-Time: An Empirical Evaluation	Ashutosh Pradhan, Daniele Ottaviano, Yi Jiang, Haozheng Huang, Alexander Zuepke, Andrea Bastoni, Marco Caccamo	2025-03-21	Download	The increasing complexity of embedded hardware platforms poses significant challenges for real-time workloads. Architectural features such as Intel RDT, Arm QoS, and Arm MPAM are either unavailable on...
V-Seek: Accelerating LLM Reasoning on Open-hardware Server-class RISC-V Platforms	Javier J. Poveda Rodrigo, Mohamed Amine Ahmdi, Alessio Burrello, Daniele Jahier Pagliari, Luca Benini	2025-03-21	Download	The recent exponential growth of Large Language Models (LLMs) has relied on GPU-based systems. However, CPUs are emerging as a flexible and lower-cost alternative, especially when targeting inference ...