2025-06-25

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
FINN-GL: Generalized Mixed-Precision Extensions for FPGA-Accelerated LSTMs	Shashwat Khandelwal, Jakoba Petri-Koenig, Thomas B. Preußer, Michaela Blott, Shreejith Shanker	2025-06-25	下载	Recurrent neural networks (RNNs), particularly LSTMs, are effective for time-series tasks like sentiment analysis and short-term stock prediction.
Characterization and Mitigation of Training Instabilities in Microscaling Formats	Huangyuan Su, Mujin Kwun, Stephanie Gil, Sham Kakade, Nikhil Anand	2025-06-25	下载	Training large language models is an expensive, compute-bound process that must be repeated as models scale, algorithms improve, and new data is collected.
When Servers Meet Species: A Fab-to-Grave Lens on Computing's Biodiversity Impact	Tianyao Shi, Ritbik Kumar, Inez Hua, Yi Ding	2025-06-25	下载	Biodiversity loss is a critical planetary boundary, yet its connection to computing remains largely unexamined. Prior sustainability efforts in computing have focused on carbon and water, overlooking ...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
SuperSONIC: Cloud-Native Infrastructure for ML Inferencing	Dmitry Kondratyev, Benedikt Riedel, Yuan-Tang Chou, Miles Cochran-Branson, Noah Paladino, David Schultz, Mia Liu, Javier Duarte, Philip Harris, Shih-Chieh Hsu	2025-06-25	下载	The increasing computational demand from growing data rates and complex machine learning (ML) algorithms in large-scale scientific experiments has driven the adoption of the Services for Optimized Net...
Hear No Evil: Detecting Gradient Leakage by Malicious Servers in Federated Learning	Fei Wang, Baochun Li	2025-06-25	下载	Recent work has shown that gradient updates in federated learning (FL) can unintentionally reveal sensitive information about a client's local data.
AIMeter: Measuring, Analyzing, and Visualizing Energy and Carbon Footprint of AI Workloads	Hongzhen Huang, Kunming Zhang, Hanlong Liao, Kui Wu, Guoming Tang	2025-06-25	下载	The rapid advancement of AI, particularly large language models (LLMs), has raised significant concerns about the energy use and carbon emissions associated with model training and inference.
Collaborative Batch Size Optimization for Federated Learning	Arno Geimer, Karthick Panner Selvam, Beltran Fiz Pontiveros	2025-06-25	下载	Federated Learning (FL) is a decentralized collaborative Machine Learning framework for training models without collecting data in a centralized location.
When Servers Meet Species: A Fab-to-Grave Lens on Computing's Biodiversity Impact	Tianyao Shi, Ritbik Kumar, Inez Hua, Yi Ding	2025-06-25	下载	Biodiversity loss is a critical planetary boundary, yet its connection to computing remains largely unexamined. Prior sustainability efforts in computing have focused on carbon and water, overlooking ...
PAT: a new algorithm for all-gather and reduce-scatter operations at scale	Sylvain Jeaugey	2025-06-25	下载	This paper describes a new algorithm called PAT, for Parallel Aggregated Trees, and which can be used to implement all-gather and reduce-scatter operations.
On the $h$ -majority dynamics with many opinions	Francesco d'Amore, Niccolò D'Archivio, George Giakkoupis, Emanuele Natale	2025-06-25	下载	We present the first upper bound on the convergence time to consensus of the well-known $h$ -majority dynamics with $k$ opinions, in the synchronous setting, for $h$ and $k$ that are both non-constant ...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Drift-Adaptive Slicing-Based Resource Management for Cooperative ISAC Networks	Shisheng Hu, Jie Gao, Xue Qin, Conghao Zhou, Xinyu Huang, Mushu Li, Mingcheng He, Xuemin Shen	2025-06-25	下载	In this paper, we propose a novel drift-adaptive slicing-based resource management scheme for cooperative integrated sensing and communication (ISAC) networks.
Generative AI for Vulnerability Detection in 6G Wireless Networks: Advances, Case Study, and Future Directions	Shuo Yang, Xinran Zheng, Jinfeng Xu, Jinze Li, Danyang Song, Zheyu Chen, Edith C. H. Ngai	2025-06-25	下载	The rapid advancement of 6G wireless networks, IoT, and edge computing has significantly expanded the cyberattack surface, necessitating more intelligent and adaptive vulnerability detection mechanism...
Semantic Caching for Improving Web Affordability	Hafsa Akbar, Danish Athar, Muhammad Ayain Fida Rana, Chaudhary Hammad Javed, Zartash Afzal Uzmi, Ihsan Ayyub Qazi, Zafar Ayyub Qazi	2025-06-25	下载	The rapid growth of web content has led to increasingly large webpages, posing significant challenges for Internet affordability, especially in developing countries where data costs remain prohibitive...
A Detailed Measurement View on IPv6 Scanners and Their Adaption to BGP Signals	Isabell Egloff, Raphael Hiesgen, Maynard Koch, Thomas C. Schmidt, Matthias Wählisch	2025-06-25	下载	Scanners are daily visitors of public IPv4 hosts. Scanning IPv6 nodes successfully is still a challenge, which an increasing crowd of actors tries to master.
A clusterability test for directed graphs	Mario R. Guarracino, Pierre Miasnikof, Alexander Y. Shestopaloff, Houyem Demni, Cristián Bravo, Yuri Lawryshyn	2025-06-25	下载	In this article, we extend a statistical test of graph clusterability, the δ test, to directed graphs with no self loops. The δ test, originally designed for undirected graphs, is based on the pre...
The Impact of the Russia-Ukraine Conflict on the Cloud Computing Risk Landscape	Malikussaid, Sutiyo	2025-06-25	下载	This study examines how geopolitical tensions catalyze IT risk evolution through systematic analysis of the conflict's impact on data sovereignty, cybersecurity paradigms, and cloud infrastructure str...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Breaking the Boundaries of Long-Context LLM Inference: Adaptive KV Management on a Single Commodity GPU	He Sun, Li Li, Mingjun Xiao, Chengzhong Xu	2025-06-25	下载	Advanced Large Language Models (LLMs) have achieved impressive performance across a wide range of complex and long-context natural language tasks.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
GPU Kernel Scientist: An LLM-Driven Framework for Iterative Kernel Optimization	Martin Andrews, Sam Witteveen	2025-06-25	下载	Optimizing GPU kernels for high performance is a complex task, often demanding deep architectural knowledge, extensive profiling, and iterative experimentation.