Skip to content

2025-10-14

cs.AR - Architecture

标题作者发布日期PDF摘要
Wavefront Coding for Accommodation-Invariant Near-Eye DisplaysUgur Akpinar, Erdem Sahin, Tina M. Hayward, Apratim Majumder, Rajesh Menon, Atanas Gotchev2025-10-14下载We present a new computational near-eye display method that addresses the vergence-accommodation conflict problem in stereoscopic displays through accommodation-invariance.
A Direct Memory Access Controller (DMAC) for Irregular Data Transfers on RISC-V Linux SystemsThomas Benz, Axel Vanoni, Michael Rogenmoser, Luca Benini2025-10-14下载With the ever-growing heterogeneity in computing systems, driven by modern machine learning applications, pressure is increasing on memory systems to handle arbitrary and more demanding transfers effi...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Dodoor: Efficient Randomized Decentralized Scheduling with Load Caching for Heterogeneous Tasks and ClustersWei Da, Evangelia Kalyvianaki2025-10-14下载This paper introduces Dodoor, an efficient randomized decentralized scheduler designed for task scheduling in modern data centers. Dodoor leverages advanced research on the weighted balls-into-bins mo...
Personalized Federated Fine-Tuning of Vision Foundation Models for HealthcareAdam Tupper, Christian Gagné2025-10-14下载Foundation models open up new possibilities for the use of AI in healthcare. However, even when pre-trained on health data, they still need to be fine-tuned for specific downstream tasks.
Hierarchical Federated Learning for Crop Yield Prediction in Smart Agricultural Production SystemsAnas Abouaomar, Mohammed El hanjri, Abdellatif Kobbane, Anis Laouiti, Khalid Nafil2025-10-14下载In this paper, we presents a novel hierarchical federated learning architecture specifically designed for smart agricultural production systems and crop yield prediction.
Accelerating Bidiagonalization of Banded Matrices through Memory-Aware Bulge-Chasing on GPUsEvelyne Ringoot, Rabab Alomairy, Alan Edelman2025-10-14下载The reduction of a banded matrix to bidiagonal form is a critical step in the calculation of Singular Values, a cornerstone of scientific computing and AI.
Laminar: A Scalable Asynchronous RL Post-Training FrameworkGuangming Sheng, Yuxuan Tong, Borui Wan, Wang Zhang, Chaobo Jia, Xibin Wu, Yuqi Wu, Xiang Li, Chi Zhang, Yanghua Peng, Haibin Lin, Xin Liu, Chuan Wu2025-10-14下载Reinforcement learning (RL) post-training for Large Language Models (LLMs) is now scaling to large clusters and running for extended durations to enhance model reasoning performance.
Low Latency, High Bandwidth Streaming of Experimental Data with EJFATIlya Baldin, Michael Goodrich, Vardan Gyurjyan, Graham Heyes, Derek Howard, Yatish Kumar, David Lawrence, Brad Sawatzky, Stacey Sheldon, Carl Timmer2025-10-14下载Thomas Jefferson National Accelerator Facility (JLab) has partnered with Energy Sciences Network (ESnet) to define and implement an edge to compute cluster computational load balancing acceleration ar...
PubSub-VFL: Towards Efficient Two-Party Split Learning in Heterogeneous Environments via Publisher/Subscriber ArchitectureYi Liu, Yang Liu, Leqian Zheng, Jue Hong, Junjie Shi, Qingyou Yang, Ye Wu, Cong Wang2025-10-14下载With the rapid advancement of the digital economy, data collaboration between organizations has become a well-established business model, driving the growth of various industries.
Proof of Cloud: Data Center Execution Assurance for Confidential VMsFilip Rezabek, Moe Mahhouk, Andrew Miller, Quintus Kilbourn, Georg Carle, Jonathan Passerat-Palmbach2025-10-14下载Confidential Virtual Machines (CVMs) protect data in use by running workloads within hardware-enforced Trusted Execution Environments (TEEs). However, existing CVM attestation mechanisms only certify ...
TALP-Pages: An easy-to-integrate continuous performance monitoring frameworkValentin Seitz, Jordy Trilaksono, Marta Garcia-Gasulla2025-10-14下载Ensuring good performance is a key aspect in the development of codes that target HPC machines. As these codes are under active development, the necessity to detect performance degradation early in th...
Should I Run My Cloud Benchmark on Black Friday?Sören Henning, Adriano Vogel, Esteban Perez-Wohlfeil, Otmar Ertl, Rick Rabiser2025-10-14下载Benchmarks and performance experiments are frequently conducted in cloud environments. However, their results are often treated with caution, as the presumed high variability of performance in the clo...
A Non-Intrusive Framework for Deferred Integration of Cloud Patterns in Energy-Efficient Data-Sharing PipelinesSepideh Masoudi, Mark Edward Michael Daly, Jannis Kiesel, Stefan Tai2025-10-14下载As data mesh architectures gain traction in federated environments, organizations are increasingly building consumer-specific data-sharing pipelines using modular, cloud-native transformation services...
Metronome: Efficient Scheduling for Periodic Traffic Jobs with Network and Priority AwarenessHao Jiang, Meng Qin, Ruijie Kuai, Dandan Liang, Yue Gao2025-10-14下载With the rapid growth in computing power demand, cloud native networks have emerged as a promising solution to address the challenges of efficient resource coordination, particularly in coping with th...
GPU-Accelerated Algorithms for Process MappingPetr Samoldekin, Christian Schulz, Henning Woydt2025-10-14下载Process mapping asks to assign vertices of a task graph to processing elements of a supercomputer such that the computational workload is balanced while the communication cost is minimized.
Comparing Cross-Platform Performance via Node-to-Node Scaling StudiesKenneth Weiss, Thomas M. Stitt, Daryl Hawkins, Olga Pearce, Stephanie Brink, Robert N. Rieben2025-10-14下载Due to the increasing diversity of high-performance computing architectures, researchers and practitioners are increasingly interested in comparing a code's performance and scalability across differen...
nuGPR: GPU-Accelerated Gaussian Process Regression with Iterative Algorithms and Low-Rank ApproximationsZiqi Zhao, Vivek Sarin2025-10-14下载Gaussian Process Regression (GPR) is an important type of supervised machine learning model with inherent uncertainty measure in its predictions.
Deploying Atmospheric and Oceanic AI Models on Chinese Hardware and Framework: Migration Strategies, Performance Optimization and AnalysisYuze Sun, Wentao Luo, Yanfei Xiang, Jiancheng Pan, Jiahao Li, Quan Zhang, Xiaomeng Huang2025-10-14下载With the growing role of artificial intelligence in climate and weather research, efficient model training and inference are in high demand. Current models like FourCastNet and AI-GOMS depend heavily ...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Towards xApp Conflict Evaluation with Explainable Machine Learning and Causal Inference in O-RANPragya Sharma, Shihua Sun, Shachi Deshpande, Angelos Stavrou, Haining Wang2025-10-14下载The Open Radio Access Network (O-RAN) architecture enables a flexible, vendor-neutral deployment of 5G networks by disaggregating base station components and supporting third-party xApps for near real...
Millimeter Wave Inverse Pinhole ImagingAkarsh Prabhakara, Yawen Liu, Aswin C. Sankaranarayanan, Anthony Rowe, Swarun Kumar2025-10-14下载Millimeter wave (mmWave) radars are popular for perception in vision-denied contexts due to their compact size. This paper explores emerging use-cases that involve static mount or momentarily-static c...
Toward Hyper-Dimensional Connectivity in Beyond 6G: A Conceptual FrameworkEkram Hossain, Angelo Vera-Rivera2025-10-14下载Cellular wireless networks enable mobile broadband connectivity for Internet-based applications through their radio access and core network infrastructure.
CAMNet: Leveraging Cooperative Awareness Messages for Vehicle Trajectory PredictionMattia Grasselli, Angelo Porrello, Carlo Augusto Grazia2025-10-14下载Autonomous driving remains a challenging task, particularly due to safety concerns. Modern vehicles are typically equipped with expensive sensors such as LiDAR, cameras, and radars to reduce the risk ...
AMHRP: Adaptive Multi-Hop Routing Protocol to Improve Network Lifetime for Multi-Hop Wireless Body Area NetworkMuhammad Mateen Yaqoob, Kulsoom Fatima, Shahab Shamshirband, Amir Mosavi, Waqar Khurshid2025-10-14下载This paper presents a protocol for enhancement of life time of WBAN network as well other protocol related issues such as throughput, path loss, and residual energy.
Noisy Neighbor: Exploiting RDMA for Resource Exhaustion Attacks in Containerized CloudsGunwoo Kim, Taejune Park, Jinwoo Kim2025-10-14下载In modern containerized cloud environments, the adoption of RDMA (Remote Direct Memory Access) has expanded to reduce CPU overhead and enable high-performance data exchange.
A Network Digital Twin of a 5G Private Network: Designing a Proof-of-Concept from Theory to PracticeCristina Emilia Costa, Tatenda Horiro Zhou, Fabrizio Granelli2025-10-14下载Network Digital Twins represent a key technology in future networks, expected to provide the capability to perform accurate analysis and predictions about the behaviour of 6G mobile networks.
Human-in-the-Loop Bandwidth Estimation for Quality of Experience Optimization in Real-Time Video CommunicationSami Khairy, Gabriel Mittag, Vishak Gopal, Ross Cutler2025-10-14下载The quality of experience (QoE) delivered by video conferencing systems is significantly influenced by accurately estimating the time-varying available bandwidth between the sender and receiver.
GeoPipe: a Geo-distributed LLM Training Framework with enhanced Pipeline Parallelism in a Lossless RDMA-enabled Datacenter Optical Transport NetworkJun Dai, Xiaorun Wang, Kexiong Fang, Zheng Yang, Yuefeng Ji, Jiawei Zhang2025-10-14下载The proliferation of Large Language Models (LLMs) with exponentially growing parameters is making cross-data center (DC) training an inevitable trend.
Over-Threshold Multiparty Private Set Intersection for Collaborative Network Intrusion DetectionOnur Eren Arpaci, Raouf Boutaba, Florian Kerschbaum2025-10-14下载An important function of collaborative network intrusion detection is to analyze the network logs of the collaborators for joint IP addresses.

cs.PF - Performance

标题作者发布日期PDF摘要
TALP-Pages: An easy-to-integrate continuous performance monitoring frameworkValentin Seitz, Jordy Trilaksono, Marta Garcia-Gasulla2025-10-14下载Ensuring good performance is a key aspect in the development of codes that target HPC machines. As these codes are under active development, the necessity to detect performance degradation early in th...
Should I Run My Cloud Benchmark on Black Friday?Sören Henning, Adriano Vogel, Esteban Perez-Wohlfeil, Otmar Ertl, Rick Rabiser2025-10-14下载Benchmarks and performance experiments are frequently conducted in cloud environments. However, their results are often treated with caution, as the presumed high variability of performance in the clo...
Analysis and Evaluation of Using Microsecond-Latency Memory for In-Memory Indices and Caches in SSD-Based Key-Value StoresYosuke Bando, Akinobu Mita, Kazuhiro Hiwada, Shintaro Sano, Tomoya Suzuki, Yu Nakanishi, Kazutaka Tomida, Hirotsugu Kajihara, Akiyuki Kaneko, Daisuke Taki, Yukimasa Miyamoto, Tomokazu Yoshida, Tatsuo Shiozawa2025-10-14下载When key-value (KV) stores use SSDs for storing a large number of items, oftentimes they also require large in-memory data structures including indices and caches to be traversed to reduce IOs.

基于 VitePress 构建