2025-07-30

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| KLLM: Fast LLM Inference with K-Means Quantization | Xueying Wu, Baijun Zhou, Zhihui Gao, Yuzhe Fu, Qilin Zheng, Yintao He, Hai Li | 2025-07-30 | Download | Large language model (LLM) inference poses significant challenges due to its intensive memory and computation demands. Weight and activation quantization (WAQ) offers a promising solution by reducing ... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Data Readiness for Scientific AI at Scale | Wesley Brewer, Patrick Widener, Valentine Anantharaj, Feiyi Wang, Tom Beck, Arjun Shankar, Sarp Oral | 2025-07-30 | Download | This paper examines how Data Readiness for AI (DRAI) principles apply to leadership-scale scientific datasets used to train foundation models. |
| DSPE: Profit Maximization in Edge-Cloud Storage System using Dynamic Space Partitioning with Erasure Code | Shubhradeep Roy, Suvarthi Sarkar, Vivek Verma, Aryabartta Sahu | 2025-07-30 | Download | Edge Storage Systems have emerged as a critical enabler of low latency data access in modern cloud networks by bringing storage and computation closer to end users. |
| A DataOps Toolbox Enabling Continuous Semantic Integration of Devices for Edge-Cloud AI Applications | Mario Scrocca, Marco Grassi, Alessio Carenini, Jean-Paul Calbimonte, Darko Anicic, Irene Celino | 2025-07-30 | Download | The implementation of AI-based applications in complex environments often requires the collaboration of several devices spanning from edge to cloud. |
| Leveraging Caliper and Benchpark to Analyze MPI Communication Patterns: Insights from AMG2023, Kripke, and Laghos | Grace Nansamba, Evelyn Namugwanya, David Boehme, Dewi Yokelson, Riley Shipley, Derek Schafer, Michael McKinsey, Olga Pearce, Anthony Skjellum | 2025-07-30 | Download | We introduce "communication regions" into the widely used Caliper HPC profiling tool. A communication region is an annotation enabling capture of metrics about the data being communicated (including... |
| Low-Communication Resilient Distributed Estimation Algorithm Based on Memory Mechanism | Wei Li, Limei Hu, Feng Chen, Ye Yao | 2025-07-30 | Download | In multi-task adversarial networks, the accurate estimation of unknown parameters in a distributed algorithm is hindered by attacked nodes or links. |
| A Semi-Supervised Federated Learning Framework with Hierarchical Clustering Aggregation for Heterogeneous Satellite Networks | Zhuocheng Liu, Zhishu Shen, Qiushi Zheng, Tiehua Zhang, Zheng Lei, Jiong Jin | 2025-07-30 | Download | Low Earth Orbit (LEO) satellites are emerging as key components of 6G networks, with many already deployed to support large-scale Earth observation and sensing related tasks. |
| Hypernetworks for Model-Heterogeneous Personalized Federated Learning | Chen Zhang, Husheng Li, Xiang Liu, Linshan Jiang, Danxin Wang | 2025-07-30 | Download | Recent advances in personalized federated learning have focused on addressing client model heterogeneity. However, most existing methods still require external data, rely on model decoupling, or adopt... |
| Towards Experiment Execution in Support of Community Benchmark Workflows for HPC | Gregor von Laszewski, Wesley Brewer, Sean R. Wilkinson, Andrew Shao, J. P. Fleischer, Harshad Pitkar, Christine R. Kirkpatrick, Geoffrey C. Fox | 2025-07-30 | Download | A key hurdle is demonstrating compute resource capability with limited benchmarks. We propose workflow templates as a solution, offering adaptable designs for specific scientific applications. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| PRIME: Pseudo-Random Integrated Multi-Part Entropy for Adaptive Packet Spraying in AI/ML Data centers | Ashkan Sobhani, Sogand Sadrhaghighi, Xingjun Chu | 2025-07-30 | Download | Large-scale distributed training in production data centers places significant demands on network infrastructure. In particular, significant load balancing challenges arise when processing AI/ML worklo... |
| Morph: ChirpTransformer-based Encoder-decoder Co-design for Reliable LoRa Communication | Yidong Ren, Maolin Gan, Chenning Li, Shakhrul Iman Siam, Mi Zhang, Shigang Chen, Zhichao Cao | 2025-07-30 | Download | In this paper, we propose Morph, a LoRa encoder-decoder co-design to enhance communication reliability while improving its computation efficiency in extremely-low signal-to-noise ratio (SNR) situation... |
| OFCnetLLM: Large Language Model for Network Monitoring and Alertness | Hong-Jun Yoon, Mariam Kiran, Danial Ebling, Joe Breen | 2025-07-30 | Download | The rapid evolution of network infrastructure is bringing new challenges and opportunities for efficient network management, optimization, and security. |
| An Architecture for Spatial Networking | Josh Millar, Ryan Gibb, Roy Ang, Hamed Haddadi, Anil Madhavapeddy | 2025-07-30 | Download | Physical spaces are increasingly dense with networked devices, promising seamless coordination and ambient intelligence. Yet today, cloud-first architectures force all communication through wide-area ... |
| 802.11bf Multiband Passive Sensing: Reusing Wi-Fi Signaling for Sensing | Pablo Picazo-Martinez, Carlos Barroso-Fernández, Alejandro Calvillo-Fernandez, Milan Groshev, Carlos J. Bernardos, Antonio de la Oliva, Alain Mourad | 2025-07-30 | Download | This paper presents a novel multiband passive sensing system that leverages IEEE 802.11bf Wi-Fi signals for environmental sensing, focusing on both sub-7 GHz and millimeter-wave (mmWave) bands. |
| Scalable Spectrum Availability Prediction using a Markov Chain Framework and ITU-R Propagation Models | Abir Ray | 2025-07-30 | Download | Spectrum resources are often underutilized across time and space, motivating dynamic spectrum access strategies that allow secondary users to exploit unused frequencies. |
| AdapSCA-PSO: An Adaptive Localization Algorithm with AI-Based Hybrid SCA-PSO for IoT WSNs | Ze Zhang, Qian Dong, Wenhan Wang | 2025-07-30 | Download | The accurate localization of sensor nodes is a fundamental requirement for the practical application of the Internet of Things (IoT). To enable robust localization across diverse environments, this pa... |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| From Tracepoints to Timeliness: A Semi-Markov Framework for Predictive Runtime Analysis | Benno Bielmeier, Ralf Ramsauer, Takahiro Yoshida, Wolfgang Mauerer | 2025-07-30 | Download | Detecting and resolving violations of temporal constraints in real-time systems is both time-consuming and resource-intensive, particularly in complex software environments. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| On the Sustainability of AI Inferences in the Edge | Ghazal Sobhani, Md. Monzurul Amin Ifath, Tushar Sharma, Israat Haque | 2025-07-30 | Download | The proliferation of the Internet of Things (IoT) and its cutting-edge AI-enabled applications (e.g., autonomous vehicles and smart industries) combine two paradigms: data-driven systems and their dep... |
| Ecoscape: Fault Tolerance Benchmark for Adaptive Remediation Strategies in Real-Time Edge ML | Hendrik Reiter, Ahmad Rzgar Hamid, Florian Schlösser, Mikkel Baun Kjærgaard, Wilhelm Hasselbring | 2025-07-30 | Download | Edge computing offers significant advantages for realtime data processing tasks, such as object recognition, by reducing network latency and bandwidth usage. |
| Dissecting RISC-V Performance: Practical PMU Profiling and Hardware-Agnostic Roofline Analysis on Emerging Platforms | Alexander Batashev | 2025-07-30 | Download | As RISC-V architectures proliferate across embedded and high-performance domains, developers face persistent challenges in performance optimization due to fragmented tooling, immature hardware feature... |