Appearance
2025-03-21
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Improving Quantization with Post-Training Model Expansion | Giuseppe Franco, Pablo Monteagudo-Lago, Ian Colbert, Nicholas Fraser, Michaela Blott | 2025-03-21 | 下载 | The size of a model has been a strong predictor of its quality, as well as its cost. As such, the trade-off between model cost and quality has been well-studied. |
| Register Dispersion: Reducing the Footprint of the Vector Register File in Vector Engines of Low-Cost RISC-V CPUs | Vasileios Titopoulos, George Alexakis, Kosmas Alexandridis, Chrysostomos Nicopoulos, Giorgos Dimitrakopoulos | 2025-03-21 | 下载 | The deployment of Machine Learning (ML) applications at the edge on resource-constrained devices has accentuated the need for efficient ML processing on low-cost processors. |
| Achieving Dependability of AI Execution with Radiation Hardened Processors | Carlos Rafael Tordoya Taquichiri, Hans Dermot Doran, Pablo Ghiglino, Mandar Harshe | 2025-03-21 | 下载 | The reliance on radiation-hardened hardware, essential for domains requiring high-dependability such as space, nuclear energy and medical applications, severely restricts the choice of components avai... |
| Arm DynamIQ Shared Unit and Real-Time: An Empirical Evaluation | Ashutosh Pradhan, Daniele Ottaviano, Yi Jiang, Haozheng Huang, Alexander Zuepke, Andrea Bastoni, Marco Caccamo | 2025-03-21 | 下载 | The increasing complexity of embedded hardware platforms poses significant challenges for real-time workloads. Architectural features such as Intel RDT, Arm QoS, and Arm MPAM are either unavailable on... |
| Work-In-Progress: Accelerating Numpy With OpenBLAS For Open-Source RISC-V Chips | Cyril Koenig, Enrico Zelioli, Frank K. Gürkaynak, Luca Benini | 2025-03-21 | 下载 | RISC-V allows for building general-purpose computing platforms with programmable accelerators around a single open-source ISA. However, leveraging heterogeneous SoCs within high-level applications is ... |
| Fused-Tiled Layers: Minimizing Data Movement on RISC-V SoCs with Software-Managed Caches | Victor J. B. Jung, Alessio Burrello, Francesco Conti, Luca Benini | 2025-03-21 | 下载 | The success of DNNs and their high computational requirements pushed for large codesign efforts aiming at DNN acceleration. Since DNNs can be represented as static computational graphs, static memory ... |
| MemPool Flavors: Between Versatility and Specialization in a RISC-V Manycore Cluster | Sergio Mazzola, Yichao Zhang, Marco Bertuletti, Diyou Shen, Luca Benini | 2025-03-21 | 下载 | As computational paradigms evolve, applications such as attention-based models, wireless telecommunications, and computer vision impose increasingly challenging requirements on computer architectures:... |
| On-Sensor Convolutional Neural Networks with Early-Exits | Hazem Hesham Yousef Shalby, Arianna De Vecchi, Alice Scandelli, Pietro Bartoli, Diana Trojaniello, Manuel Roveri, Federica Villa | 2025-03-21 | 下载 | Tiny Machine Learning (TinyML) is a novel research field aiming at integrating Machine Learning (ML) within embedded devices with limited memory, computation, and energy. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Intelligent Resource Allocation Optimization for Cloud Computing via Machine Learning | Yuqing Wang, Xiao Yang | 2025-03-21 | 下载 | With the rapid expansion of cloud computing applications, optimizing resource allocation has become crucial for improving system performance and cost efficiency. |
| Serinv: A Scalable Library for the Selected Inversion of Block-Tridiagonal with Arrowhead Matrices | Vincent Maillou, Lisa Gaedke-Merzhaeuser, Alexandros Nikolaos Ziogas, Olaf Schenk, Mathieu Luisier | 2025-03-21 | 下载 | The inversion of structured sparse matrices is a key but computationally and memory-intensive operation in many scientific applications. There are cases, however, where only particular entries of the ... |
| Energy Efficiency trends in HPC: what high-energy and astrophysicists need to know | Estela Suarez, Jorge Amaya, Martin Frank, Oliver Freyermuth, Maria Girone, Bartosz Kostrzewa, Susanne Pfalzner | 2025-03-21 | 下载 | The growing energy demands of HPC systems have made energy efficiency a critical concern for system developers and operators. However, HPC users are generally less aware of how these energy concerns i... |
| Achieving Dependability of AI Execution with Radiation Hardened Processors | Carlos Rafael Tordoya Taquichiri, Hans Dermot Doran, Pablo Ghiglino, Mandar Harshe | 2025-03-21 | 下载 | The reliance on radiation-hardened hardware, essential for domains requiring high-dependability such as space, nuclear energy and medical applications, severely restricts the choice of components avai... |
| Analyzing Performance Bottlenecks in Zero-Knowledge Proof Based Rollups on Ethereum | Md. Ahsan Habib | 2025-03-21 | 下载 | Blockchain technology is rapidly evolving, with scalability remaining one of its most significant challenges. While various solutions have been proposed and continue to be developed, it is essential t... |
| LoGoFair: Post-Processing for Local and Global Fairness in Federated Learning | Li Zhang, Chaochao Chen, Zhongxuan Han, Qiyong Zhong, Xiaolin Zheng | 2025-03-21 | 下载 | Federated learning (FL) has garnered considerable interest for its capability to learn from decentralized data sources. Given the increasing application of FL in decision-making scenarios, addressing ... |
| Robustness of deep learning classification to adversarial input on GPUs: asynchronous parallel accumulation is a source of vulnerability | Sanjif Shanmugavelu, Mathieu Taillefumier, Christopher Culver, Vijay Ganesh, Oscar Hernandez, Ada Sedova | 2025-03-21 | 下载 | The ability of machine learning (ML) classification models to resist small, targeted input perturbations -- known as adversarial attacks -- is a key measure of their safety and reliability. |
| MemPool Flavors: Between Versatility and Specialization in a RISC-V Manycore Cluster | Sergio Mazzola, Yichao Zhang, Marco Bertuletti, Diyou Shen, Luca Benini | 2025-03-21 | 下载 | As computational paradigms evolve, applications such as attention-based models, wireless telecommunications, and computer vision impose increasingly challenging requirements on computer architectures:... |
| Improving the End-to-End Efficiency of Offline Inference for Multi-LLM Applications Based on Sampling and Simulation | Jingzhi Fang, Yanyan Shen, Yue Wang, Lei Chen | 2025-03-21 | 下载 | As large language models (LLMs) have shown great success in many tasks, they are used in various applications. While a lot of works have focused on the efficiency of single-LLM application (e.g. |
| Federated Cross-Domain Click-Through Rate Prediction With Large Language Model Augmentation | Jiangcheng Qin, Xueyuan Zhang, Baisong Liu, Jiangbo Qian, Yangyang Wang | 2025-03-21 | 下载 | Accurately predicting click-through rates (CTR) under stringent privacy constraints poses profound challenges, particularly when user-item interactions are sparse and fragmented across domains. |
| DeFT: Mitigating Data Dependencies for Flexible Communication Scheduling in Distributed Training | Lin Meng, Yuzhong Sun | 2025-03-21 | 下载 | Communication scheduling aims to reduce communication bottlenecks in data parallel training (DP) by maximizing the overlap between computation and communication. |
| Local Ratio based Real-time Job Offloading and Resource Allocation in Mobile Edge Computing | Chuanchao Gao, Arvind Easwaran | 2025-03-21 | 下载 | Mobile Edge Computing (MEC) has emerged as a promising paradigm enabling vehicles to handle computation-intensive and time-sensitive applications for intelligent transportation. |
| CoBRA: A Universal Strategyproof Confirmation Protocol for Quorum-based Proof-of-Stake Blockchains | Zeta Avarikioti, Eleftherios Kokoris Kogias, Ray Neiheiser, Christos Stefo | 2025-03-21 | 下载 | The security of many Proof-of-Stake (PoS) payment systems relies on quorum-based State Machine Replication (SMR) protocols. While classical analyses assume purely Byzantine faults, real-world systems ... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| P4sim: Programming Protocol-independent Packet Processors in ns-3 | Mingyu Ma, Giang T. Nguyen | 2025-03-21 | 下载 | Programmable data planes enable users to design data plane algorithms for network devices, providing extensive flexibility for network customization. |
| Commercial Dishes Can Be My Ladder: Sustainable and Collaborative Data Offloading in LEO Satellite Networks | Yi Ching Chou, Long Chen, Hengzhi Wang, Feng Wang, Hao Fang, Haoyuan Zhao, Miao Zhang, Xiaoyi Fan, Jiangchuan Liu | 2025-03-21 | 下载 | Low Earth Orbit (LEO) satellite networks, characterized by their high data throughput and low latency, have gained significant interest from both industry and academia. |
| Transfer Learning for EDFA Gain Modeling: A Semi-Supervised Approach Using Internal Amplifier Features | Agastya Raj, Dan Kilper, Marco Ruffini | 2025-03-21 | 下载 | The gain spectrum of an Erbium-Doped Fiber Amplifier (EDFA) has a complex dependence on channel loading, pump power, and operating mode, making accurate modeling difficult to achieve. |
| Interference Identification in Multi-User Optical Spectrum as a Service using Convolutional Neural Networks | Agastya Raj, Zehao Wang, Frank Slyne, Tingjun Chen, Dan Kilper, Marco Ruffini | 2025-03-21 | 下载 | We introduce a ML-based architecture for network operators to detect impairments from specific OSaaS users while blind to the users' internal spectrum details. |
| Multi-Span Optical Power Spectrum Evolution Modeling using ML-based Multi-Decoder Attention Framework | Agastya Raj, Zehao Wang, Frank Slyne, Tingjun Chen, Dan Kilper, Marco Ruffini | 2025-03-21 | 下载 | We implement a ML-based attention framework with component-specific decoders, improving optical power spectrum prediction in multi-span networks. |
| Demonstration of Cooperative Transport Interface over Open Source 7.2 split RAN and Virtualised Open PON Network | Merim Dzaferagic, Kevin O'Sullivan, Bruce Richardson, Brendan Ryan, Niall Power, Robin Giller, Marco Ruffini | 2025-03-21 | 下载 | We demonstrate end-to-end 5G Open RAN over PON using off-the-shelf open networking hardware and open source RAN software. The implementation of the Cooperative Transport Interface provides timely sync... |
| Governance of Ledger-Anchored Decentralized Identifiers | Sandro Rodriguez Garzon, Carlo Segat, Axel Küpper | 2025-03-21 | 下载 | A Decentralized Identifier (DID) empowers an entity to prove control over a unique and self-issued identifier without relying on any identity provider. |
| Joint Beamforming and Trajectory Optimization for Multi-UAV-Assisted Integrated Sensing and Communication Systems | Yan Kyaw Tun, Nway Nway Ei, Sheikh Salman Hassan, Cedomir Stefanovic, Nguyen Van Huynh, Madyan Alsenwi, Choong Seon Hong | 2025-03-21 | 下载 | In this paper, we investigate beamforming design and trajectory optimization for a multi-unmanned aerial vehicle (UAV)-assisted integrated sensing and communication (ISAC) system. |
| Indoor Localization Based on MSC Map | Łukasz Kułacz, Adrian Kliks, Julius Ruseckas, Gediminas Molis | 2025-03-21 | 下载 | In this short paper, we propose a technique for AI-based identification of modulation and coding schemes (MCS) in surrounding cellular signals. |
| Rotatable RIS-Assisted Edge Computing: Orientation, Task Offloading, and Resource Optimization | Bin Li, Dongdong Yang, Lei Liu | 2025-03-21 | 下载 | The rotatable reconfigurable intelligent surface (RIS) can enhance mobile edge computing (MEC) performance by optimizing its orientation to improve the gain of received and transmitted signals. |
| Betweenness Centrality Based Dynamic Source Routing for Flying Ad Hoc Networks in Marching Formation | Shaoshi Yang, Wei Zhao, Chu-Meng Wang, Wen-Yu Dong, Xiaojie Ju | 2025-03-21 | 下载 | Designing high-performance routing protocols for flying ad hoc networks (FANETs) is challenging due to the diversity of applications and the dynamics of network topology. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Serinv: A Scalable Library for the Selected Inversion of Block-Tridiagonal with Arrowhead Matrices | Vincent Maillou, Lisa Gaedke-Merzhaeuser, Alexandros Nikolaos Ziogas, Olaf Schenk, Mathieu Luisier | 2025-03-21 | 下载 | The inversion of structured sparse matrices is a key but computationally and memory-intensive operation in many scientific applications. There are cases, however, where only particular entries of the ... |
| Arm DynamIQ Shared Unit and Real-Time: An Empirical Evaluation | Ashutosh Pradhan, Daniele Ottaviano, Yi Jiang, Haozheng Huang, Alexander Zuepke, Andrea Bastoni, Marco Caccamo | 2025-03-21 | 下载 | The increasing complexity of embedded hardware platforms poses significant challenges for real-time workloads. Architectural features such as Intel RDT, Arm QoS, and Arm MPAM are either unavailable on... |
| V-Seek: Accelerating LLM Reasoning on Open-hardware Server-class RISC-V Platforms | Javier J. Poveda Rodrigo, Mohamed Amine Ahmdi, Alessio Burrello, Daniele Jahier Pagliari, Luca Benini | 2025-03-21 | 下载 | The recent exponential growth of Large Language Models (LLMs) has relied on GPU-based systems. However, CPUs are emerging as a flexible and lower-cost alternative, especially when targeting inference ... |