2025-09-22

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Chiplet-Based RISC-V SoC with Modular AI Acceleration | Suhas Suresh Bharadwaj, Prerana Ramkumar | 2025-09-22 | Download | Achieving high performance, energy efficiency, and cost-effectiveness while maintaining architectural flexibility is a critical challenge in the development and deployment of edge AI devices. |
| Lightweight Congruence Profiling for Early Design Exploration of Heterogeneous FPGAs | Allen Boston, Biruk Seyoum, Luca Carloni, Pierre-Emmanuel Gaillardon | 2025-09-22 | Download | Field-Programmable Gate Arrays (FPGAs) have evolved from uniform logic arrays into heterogeneous fabrics integrating digital signal processors (DSPs), memories, and specialized accelerators to support... |
| Single-Cell Universal Logic-in-Memory Using 2T-nC FeRAM: An Area and Energy-Efficient Approach for Bulk Bitwise Computation | Rudra Biswas, Jiahui Duan, Shan Deng, Xuezhong Niu, Yixin Qin, Prapti Panigrahi, Varun Parekh, Rajiv Joshi, Kai Ni, Vijaykrishnan Narayanan | 2025-09-22 | Download | This work presents a novel approach to configure 2T-nC ferroelectric RAM (FeRAM) for performing single cell logic-in-memory operations, highlighting its advantages in energy-efficient computation over... |
| Minimal Neuron Circuits: Bursters | Amr Nabil, T. Nandha Kumar, Haider Abbas F. Almurib | 2025-09-22 | Download | This work introduces a novel methodology for designing biologically plausible bursting neuron circuits using a minimal number of components. We hypothesize that to design circuits capable of bursting,... |
| Overcoming challenges in bamboo connections: A review of mechanical properties and structural considerations | Pierre Boucher, Victor Fréchard, Diego Ramirez-Cardona, Claudiane Ouellet-Plamondon | 2025-09-22 | Download | Over the past decades, bamboo has increasingly gained attention as a sustainable construction material, through its rapid growth, naturally optimized shape, high mechanical properties, and significant... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Bridging Simulation and Silicon: A Study of RISC-V Hardware and FireSim Simulation | Atanu Barai, Kamalavasan Kamalakkannan, Patrick Diehl, Maxim Moraru, Jered Dominguez-Trujillo, Howard Pritchard, Nandakishore Santhi, Farzad Fatollahi-Fard, Galen Shipman | 2025-09-22 | Download | RISC-V ISA-based processors have recently emerged as both powerful and energy-efficient computing platforms. The release of the MILK-V Pioneer marked a significant milestone as the first desktop-grade... |
| Intelligent Load Balancing in Cloud Computer Systems | Leszek Sliwko | 2025-09-22 | Download | Cloud computing is an established technology allowing users to share resources on a large scale, never before seen in IT history. A cloud system connects multiple individual servers in order to proces... |
| XaaS Containers: Performance-Portable Representation With Source and IR Containers | Marcin Copik, Eiman Alnuaimi, Alok Kamatar, Valerie Hayot-Sasson, Alberto Madonna, Todd Gamblin, Kyle Chard, Ian Foster, Torsten Hoefler | 2025-09-22 | Download | High-performance computing (HPC) systems and cloud data centers are converging, and containers are becoming the default method of portable software deployment. |
| Expert-as-a-Service: Towards Efficient, Scalable, and Robust Large-scale MoE Serving | Ziming Liu, Boyu Tian, Guoteng Wang, Zhen Jiang, Peng Sun, Zhenhua Han, Tian Tang, Xiaohe Hu, Yanmin Jia, Yan Zhang, He Liu, Mingjun Zhang, Yiqi Zhang, Qiaoling Chen, Shenggan Cheng, Mingyu Gao, Yang You, Siyuan Feng | 2025-09-22 | Download | Mixture-of-Experts (MoE) models challenge serving infrastructures with dynamic, sparse expert utilization, causing instability on conventional systems designed for dense architectures. |
| A Lightweight Approach for State Machine Replication | Christian Cachin, Jinfeng Dou, Christian Scheideler, Philipp Schneider | 2025-09-22 | Download | We present a lightweight solution for state machine replication with commitment certificates. Specifically, we adapt and analyze a median rule for the stabilizing consensus problem [Doerr11] to operat... |
| A Comparison of Low and high-Order Methods for the Simulation of Supersonic Jet Flows | D. F. Abreu, C. Junqueira-Junior, E. T. V. Dauricio, J. L. F. Azevedo | 2025-09-22 | Download | The present work compares results for different numerical methods in search of alternatives to improve the quality of large-eddy simulations for the problem of supersonic turbulent jet flows. |
| Cluster Workload Allocation: A Predictive Approach Leveraging Machine Learning Efficiency | Leszek Sliwko | 2025-09-22 | Download | This research investigates how Machine Learning (ML) algorithms can assist in workload allocation strategies by detecting tasks with node affinity operators (referred to as constraint operators), whic... |
| Enhancing Cluster Scheduling in HPC: A Continuous Transfer Learning for Real-Time Optimization | Leszek Sliwko, Jolanta Mizera-Pietraszko | 2025-09-22 | Download | This study presents a machine learning-assisted approach to optimize task scheduling in cluster systems, focusing on node-affinity constraints. |
| Disaggregated Prefill and Decoding Inference System for Large Language Model Serving on Multi-Vendor GPUs | Xing Chen, Rong Shi, Lu Zhao, Lingbin Wang, Xiao Jin, Yueqiang Chen, Hongfeng Sun | 2025-09-22 | Download | LLM-based applications have been widely used in various industries, but with the increasing of models size, an efficient large language model (LLM) inference system is an urgent problem to be solved f... |
| TACTFL: Temporal Contrastive Training for Multi-modal Federated Learning with Similarity-guided Model Aggregation | Guanxiong Sun, Majid Mirmehdi, Zahraa Abdallah, Raul Santos-Rodriguez, Ian Craddock, Telmo de Menezes e Silva Filho | 2025-09-22 | Download | Real-world federated learning faces two key challenges: limited access to labelled data and the presence of heterogeneous multi-modal inputs. This paper proposes TACTFL, a unified framework for semi-s... |
| pBeeGees: A Prudent Approach to Certificate-Decoupled BFT Consensus | Kaiji Yang, Jingjing Zhang, Junyao Zheng, Qiwen Liu, Weigang Wu, Jieying Zhou | 2025-09-22 | Download | Pipelined Byzantine Fault Tolerant (BFT) consensus is fundamental to permissioned blockchains. However, many existing protocols are limited by the requirement for view-consecutive quorum certificates ... |
| Prefetching in Deep Memory Hierarchies with NVRAM as Main Memory | Manel Lurbe, Miguel Avargues, Salvador Petit, Maria E. Gomez, Rui Yang, Guanhao Wang, Julio Sahuquillo | 2025-09-22 | Download | Emerging applications, such as big data analytics and machine learning, require increasingly large amounts of main memory, often exceeding the capacity of current commodity processors built on DRAM te... |
| Cortex: Achieving Low-Latency, Cost-Efficient Remote Data Access For LLM via Semantic-Aware Knowledge Caching | Chaoyi Ruan, Chao Bi, Kaiwen Zheng, Ziji Shi, Xinyi Wan, Jialin Li | 2025-09-22 | Download | Large Language Model (LLM) agents tackle data-intensive tasks such as deep research and code generation. However, their effectiveness depends on frequent interactions with knowledge sources across rem... |
| Cronus: Efficient LLM inference on Heterogeneous GPU Clusters via Partially Disaggregated Prefill | Yunzhao Liu, Qiang Xu, Y. Charlie Hu | 2025-09-22 | Download | Efficient LLM inference is critical for real-world applications, especially within heterogeneous GPU clusters commonly found in organizations and on-premise datacenters as GPU architecture rapidly evo... |
| Institutional Research Computing Capabilities in Australia: 2024 | Slava Kitaeff, Luc Betbeder-Matibet, Jake Carroll, Stephen Giugni, David Abramson, John Zaitseff, Sarah Walters, David Powell, Chris Bording, Trung Nguyen, Angus Macoustra, Fabien Voisin, Bowen Chen, Jarrod Hurley | 2025-09-22 | Download | Institutional research computing infrastructure plays a vital role in Australia's research ecosystem, complementing and extending national facilities. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Using Age of Information for Throughput Optimal Spectrum Sharing | Hongjae Nam, Vishrant Tripathi, David J. Love | 2025-09-22 | Download | We consider a spectrum sharing problem where two users attempt to communicate over N channels. The Primary User (PU) has prioritized transmissions and its occupancy on each channel over time can be mo... |
| A User-to-User Resource Reselling Game in Open RAN with Buffer Rollover | Ruide Cao, Marie Siew, David Yau | 2025-09-22 | Download | The development of the Open RAN (O-RAN) framework helps enable network slicing through its virtualization, interoperability, and flexibility. To improve spectral efficiency and better meet users' dyna... |
| 5GC-Bench: A Framework for Stress-Testing and Benchmarking 5G Core VNFs | Ioannis Panitsas, Tolga O. Atalay, Dragoslav Stojadinovic, Angelos Stavrou, Leandros Tassiulas | 2025-09-22 | Download | The disaggregated, cloud-native design of the 5G Core (5GC) enables flexibility and scalability but introduces significant challenges. Control-plane procedures involve complex interactions across mult... |
| LIFY: IoT System for Monitoring Vital Signs of Elderly People | Sara Gonzalez, Martin Vasquez, Wilder Castellanos | 2025-09-22 | Download | This article describes the implementation of a technological solution aimed at improving the recording of physiological signals in the elderly population residing in geriatric facilities. |
| Optimal Service Mode Assignment in a Simple Computation Offloading System: Extended Version | Darin Jeff, Eytan Modiano | 2025-09-22 | Download | We consider a simple computation offloading model where jobs can either be fully processed in the cloud or be partially processed at a local server before being sent to the cloud to complete processin... |
| Detection of Misreporting Attacks on Software-Defined Immersive Environments | Sourya Saha, Md Nurul Absur, Shima Yousefi, Saptarshi Debroy | 2025-09-22 | Download | The ability to centrally control network infrastructure using a programmable middleware has made Software-Defined Networking (SDN) ideal for emerging applications, such as immersive environments. |
| Building Transparency in Deep Learning-Powered Network Traffic Classification: A Traffic-Explainer Framework | Riya Ponraj, Ram Durairajan, Yu Wang | 2025-09-22 | Download | Recent advancements in deep learning have significantly enhanced the performance and efficiency of traffic classification in networking systems. |
| GLo-MAPPO: A Multi-Agent Proximal Policy Optimization for Energy Efficiency in UAV-Assisted LoRa Networks | Abdullahi Isa Ahmed, Jamal Bentahar, El Mehdi Amhoud | 2025-09-22 | Download | Long Range (LoRa) based low-power wide area networks (LPWANs) are crucial for enabling next-generation IoT (NG-IoT) applications in 5G/6G ecosystems due to their long-range, low-power, and low-cost ch... |
| BiLCNet : BiLSTM-Conformer Network for Encrypted Traffic Classification with 5G SA Physical Channel Records | Ke Ma, Jialiang Lu, Philippe Martins | 2025-09-22 | Download | Accurate and efficient traffic classification is vital for wireless network management, especially under encrypted payloads and dynamic application behavior, where traditional methods such as port-bas... |
| Optimizing Split Federated Learning with Unstable Client Participation | Wei Wei, Zheng Lin, Xihui Liu, Hongyang Du, Dusit Niyato, Xianhao Chen | 2025-09-22 | Download | To enable training of large artificial intelligence (AI) models at the network edge, split federated learning (SFL) has emerged as a promising approach by distributing computation between edge devices... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| On the Design of Capacity-Achieving Distributions for Discrete-Time Poisson Channel with Low-Precision ADCs | Qianqian Li, Lintao Li, Lixiang Liu, Lei Yang, Caihong Gong, Hua Li, Shiya Hao, Xiaoming Dai | 2025-09-22 | Download | This paper investigates the design of the capacity-achieving input distribution for the discrete-time Poisson channel (DTPC) under dark current effects with low-precision analog-to-digital converters ... |