
2025-07-29

cs.AR - Architecture

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| A Customized Memory-aware Architecture for Biological Sequence Alignment | Nasrin Akbari, Mehdi Modarressi, Alireza Khadem | 2025-07-29 | Sequence alignment is a fundamental process in computational biology which identifies regions of similarity in biological sequences. With the exponential growth in the volume of data in bioinformatics... |
| A Multi-Agent Generative AI Framework for IC Module-Level Verification Automation | Wenbo Liu, Forbes Hou, Jon Zhang, Hong Liu, Allen Lei | 2025-07-29 | As large language models demonstrate enormous potential in the field of Electronic Design Automation (EDA), generative AI-assisted chip design is attracting widespread attention from academia and indu... |
| No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering | Linye Wei, Jiajun Tang, Fan Fei, Boxin Shi, Runsheng Wang, Meng Li | 2025-07-29 | 3D Gaussian Splatting (3DGS) enables high-quality rendering of 3D scenes and is getting increasing adoption in domains like autonomous driving and embodied intelligence. |
| SLTarch: Towards Scalable Point-Based Neural Rendering by Taming Workload Imbalance and Memory Irregularity | Xingyang Li, Jie Jiang, Yu Feng, Yiming Gan, Jieru Zhao, Zihan Liu, Jingwen Leng, Minyi Guo | 2025-07-29 | Rendering is critical in fields like 3D modeling, AR/VR, and autonomous driving, where high-quality, real-time output is essential. Point-based neural rendering (PBNR) offers a photorealistic and effi... |
| Forecasting LLM Inference Performance via Hardware-Agnostic Analytical Modeling | Rajeev Patwari, Ashish Sirasao, Devleena Das | 2025-07-29 | Large language models (LLMs) have been increasingly deployed as local agents on personal devices with CPUs, NPUs and integrated GPUs. However, forecasting inference performance on devices with such he... |
| A2H-MAS: An Algorithm-to-HLS Multi-Agent System for Automated and Reliable FPGA Implementation | Jie Lei, Ruofan Jia, J. Andrew Zhang, Hao Zhang | 2025-07-29 | Bridging the gap between algorithm development and hardware realization remains a persistent challenge, particularly in latency- and resource-constrained domains such as wireless communication. |
| Automated HEMT Model Construction from Datasheets via Multi-Modal Intelligence and Prior-Knowledge-Free Optimization | Yuang Peng, Jiarui Zhong, Yang Zhang, Hong Cai Chen | 2025-07-29 | Parameter extraction for industry-standard device models like ASM-HEMT is crucial in circuit design workflows. However, many manufacturers do not provide such models, leaving users to build them using... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Minimizing CGYRO HPC Communication Costs in Ensembles with XGYRO by Sharing the Collisional Constant Tensor Structure | Igor Sfiligoi, Emily A. Belli, Jeff Candy | 2025-07-29 | First-principles fusion plasma simulations are both compute and memory intensive, and CGYRO is no exception. The use of many HPC nodes to fit the problem in the available memory thus results in signif... |
| OpenRASE: Service Function Chain Emulation | Theviyanthan Krishnamohan, Paul Harvey | 2025-07-29 | Service Function Chains (SFCs) are one of the key enablers in providing programmable computer networks, paving the way for network autonomy. However, this also introduces new challenges, such as resou... |
| Large-Scale Linear Energy System Optimization: A Systematic Review on Parallelization Strategies via Decomposition | Lars Hadidi, Leonard Göke, Maximilian Hoffmann, Mario Klostermeier, Shima Sasanpour, Tim Varelmann, Vassilios Yfantis, Jochen Linßen, Detlef Stolten, Jann M. Weinand | 2025-07-29 | As renewable energy integration, sector coupling, and spatiotemporal detail increase, energy system optimization models grow in size and complexity, often pushing solvers to their performance limits. |
| The Performance of Low-Synchronization Variants of Reorthogonalized Block Classical Gram-Schmidt | Erin Carson, Yuxin Ma | 2025-07-29 | Numerous applications, such as Krylov subspace solvers, make extensive use of the block classical Gram-Schmidt (BCGS) algorithm and its reorthogonalized variants for orthogonalizing a set of vectors. |
| Collaborative State Machines: A Better Programming Model for the Cloud-Edge-IoT Continuum | Marlon Etheredge, Thomas Fahringer, Felix Erlacher, Elias Kohler, Stefan Pedratscher, Juan Aznar-Poveda, Nishant Saurabh, Adrien Lebre | 2025-07-29 | The development of Cloud-Edge-IoT applications requires robust programming models. Existing models often struggle to manage the dynamic and stateful nature of these applications effectively. |
| Bridging Cache-Friendliness and Concurrency: A Locality-Optimized In-Memory B-Skiplist | Yicong Luo, Senhe Hao, Brian Wheatman, Prashant Pandey, Helen Xu | 2025-07-29 | Skiplists are widely used for in-memory indexing in many key-value stores, such as RocksDB and LevelDB, due to their ease of implementation and simple concurrency control mechanisms. |
| GlideinBenchmark: collecting resource information to optimize provisioning | Marco Mambelli, Shrijan Swaminathan | 2025-07-29 | Choosing the right resource can speed up job completion, better utilize the available hardware, and visibly reduce costs, especially when renting computers in the cloud. |
| Using Containers to Speed Up Development, to Run Integration Tests and to Teach About Distributed Systems | Marco Mambelli, Bruno Moreira Coimbra, Namratha Urs, Ilya Baburashvili | 2025-07-29 | GlideinWMS is a workload manager provisioning resources for many experiments, including CMS and DUNE. The software is distributed both as native packages and specialized production containers. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Programmable Data Planes for Network Security | Gursimran Singh, H. B. Acharya, Minseok Kwon | 2025-07-29 | The emergence of programmable data planes, and particularly switches supporting the P4 language, has transformed network security by enabling customized, line-rate packet processing. |
| OpenRASE: Service Function Chain Emulation | Theviyanthan Krishnamohan, Paul Harvey | 2025-07-29 | Service Function Chains (SFCs) are one of the key enablers in providing programmable computer networks, paving the way for network autonomy. However, this also introduces new challenges, such as resou... |
| Not Here, Go There: Analyzing Redirection Patterns on the Web | Kritika Garg, Sawood Alam, Dietrich Ayala, Michele C. Weigle, Michael L. Nelson | 2025-07-29 | URI redirections are integral to web management, supporting structural changes, SEO optimization, and security. However, their complexities affect usability, SEO performance, and digital preservation. |
| Reasoning Language Models for Root Cause Analysis in 5G Wireless Networks | Mohamed Sana, Nicola Piovesan, Antonio De Domenico, Yibin Kang, Haozhe Zhang, Merouane Debbah, Fadhel Ayed | 2025-07-29 | Root Cause Analysis (RCA) in mobile networks remains a challenging task due to the need for interpretability, domain expertise, and causal reasoning. |
| Blockchain-Based Decentralized Domain Name System | Guang Yang, Peter Trinh, Alma Nkemla, Amuru Serikyaku, Edward Tatchim, Osman Sharaf | 2025-07-29 | The current Domain Name System (DNS) infrastructure faces critical vulnerabilities including poisoning attacks, censorship mechanisms, and centralized points of failure that compromise internet freedo... |
| RRTO: A High-Performance Transparent Offloading System for Model Inference in Mobile Edge Computing | Zekai Sun, Xiuxian Guan, Zheng Lin, Yuhao Qing, Haoze Song, Zihan Fang, Zhe Chen, Fangming Liu, Heming Cui, Wei Ni, Jun Luo | 2025-07-29 | Deploying Machine Learning (ML) applications on resource-constrained mobile devices remains challenging due to limited computational resources and poor platform compatibility. |
| Generalized few-shot transfer learning architecture for modeling the EDFA gain spectrum | Agastya Raj, Zehao Wang, Tingjun Chen, Daniel C Kilper, Marco Ruffini | 2025-07-29 | Accurate modeling of the gain spectrum in Erbium-Doped Fiber Amplifiers (EDFAs) is essential for optimizing optical network performance, particularly as networks evolve toward multi-vendor solutions. |
| Hybrid activation functions for deep neural networks: S3 and S4 -- a novel approach to gradient flow optimization | Sergii Kavun | 2025-07-29 | Activation functions are critical components in deep neural networks, directly influencing gradient flow, training stability, and model performance. |

cs.PF - Performance

| Title | Authors | Published | Abstract |
| --- | --- | --- | --- |
| Beamforming-based Achievable Rate Maximization in ISAC System for Multi-UAV Networking | Shengcai Zhou, Luping Xiang, Kun Yang, Kai Kit Wong, Dapeng Oliver Wu, Chan-Byoung Chae | 2025-07-29 | Airborne mobile Integrated Sensing and Communication (ISAC) base stations have garnered significant attention recently, with ISAC technology being a crucial application for 6G networks. |
| Forecasting LLM Inference Performance via Hardware-Agnostic Analytical Modeling | Rajeev Patwari, Ashish Sirasao, Devleena Das | 2025-07-29 | Large language models (LLMs) have been increasingly deployed as local agents on personal devices with CPUs, NPUs and integrated GPUs. However, forecasting inference performance on devices with such he... |
