Skip to content

2025-06-25

cs.AR - Architecture

标题作者发布日期PDF摘要
FINN-GL: Generalized Mixed-Precision Extensions for FPGA-Accelerated LSTMsShashwat Khandelwal, Jakoba Petri-Koenig, Thomas B. Preußer, Michaela Blott, Shreejith Shanker2025-06-25下载Recurrent neural networks (RNNs), particularly LSTMs, are effective for time-series tasks like sentiment analysis and short-term stock prediction.
Characterization and Mitigation of Training Instabilities in Microscaling FormatsHuangyuan Su, Mujin Kwun, Stephanie Gil, Sham Kakade, Nikhil Anand2025-06-25下载Training large language models is an expensive, compute-bound process that must be repeated as models scale, algorithms improve, and new data is collected.
When Servers Meet Species: A Fab-to-Grave Lens on Computing's Biodiversity ImpactTianyao Shi, Ritbik Kumar, Inez Hua, Yi Ding2025-06-25下载Biodiversity loss is a critical planetary boundary, yet its connection to computing remains largely unexamined. Prior sustainability efforts in computing have focused on carbon and water, overlooking ...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
SuperSONIC: Cloud-Native Infrastructure for ML InferencingDmitry Kondratyev, Benedikt Riedel, Yuan-Tang Chou, Miles Cochran-Branson, Noah Paladino, David Schultz, Mia Liu, Javier Duarte, Philip Harris, Shih-Chieh Hsu2025-06-25下载The increasing computational demand from growing data rates and complex machine learning (ML) algorithms in large-scale scientific experiments has driven the adoption of the Services for Optimized Net...
Hear No Evil: Detecting Gradient Leakage by Malicious Servers in Federated LearningFei Wang, Baochun Li2025-06-25下载Recent work has shown that gradient updates in federated learning (FL) can unintentionally reveal sensitive information about a client's local data.
AIMeter: Measuring, Analyzing, and Visualizing Energy and Carbon Footprint of AI WorkloadsHongzhen Huang, Kunming Zhang, Hanlong Liao, Kui Wu, Guoming Tang2025-06-25下载The rapid advancement of AI, particularly large language models (LLMs), has raised significant concerns about the energy use and carbon emissions associated with model training and inference.
Collaborative Batch Size Optimization for Federated LearningArno Geimer, Karthick Panner Selvam, Beltran Fiz Pontiveros2025-06-25下载Federated Learning (FL) is a decentralized collaborative Machine Learning framework for training models without collecting data in a centralized location.
When Servers Meet Species: A Fab-to-Grave Lens on Computing's Biodiversity ImpactTianyao Shi, Ritbik Kumar, Inez Hua, Yi Ding2025-06-25下载Biodiversity loss is a critical planetary boundary, yet its connection to computing remains largely unexamined. Prior sustainability efforts in computing have focused on carbon and water, overlooking ...
PAT: a new algorithm for all-gather and reduce-scatter operations at scaleSylvain Jeaugey2025-06-25下载This paper describes a new algorithm called PAT, for Parallel Aggregated Trees, and which can be used to implement all-gather and reduce-scatter operations.
On the hh-majority dynamics with many opinionsFrancesco d'Amore, Niccolò D'Archivio, George Giakkoupis, Emanuele Natale2025-06-25下载We present the first upper bound on the convergence time to consensus of the well-known hh-majority dynamics with kk opinions, in the synchronous setting, for hh and kk that are both non-constant ...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Drift-Adaptive Slicing-Based Resource Management for Cooperative ISAC NetworksShisheng Hu, Jie Gao, Xue Qin, Conghao Zhou, Xinyu Huang, Mushu Li, Mingcheng He, Xuemin Shen2025-06-25下载In this paper, we propose a novel drift-adaptive slicing-based resource management scheme for cooperative integrated sensing and communication (ISAC) networks.
Generative AI for Vulnerability Detection in 6G Wireless Networks: Advances, Case Study, and Future DirectionsShuo Yang, Xinran Zheng, Jinfeng Xu, Jinze Li, Danyang Song, Zheyu Chen, Edith C. H. Ngai2025-06-25下载The rapid advancement of 6G wireless networks, IoT, and edge computing has significantly expanded the cyberattack surface, necessitating more intelligent and adaptive vulnerability detection mechanism...
Semantic Caching for Improving Web AffordabilityHafsa Akbar, Danish Athar, Muhammad Ayain Fida Rana, Chaudhary Hammad Javed, Zartash Afzal Uzmi, Ihsan Ayyub Qazi, Zafar Ayyub Qazi2025-06-25下载The rapid growth of web content has led to increasingly large webpages, posing significant challenges for Internet affordability, especially in developing countries where data costs remain prohibitive...
A Detailed Measurement View on IPv6 Scanners and Their Adaption to BGP SignalsIsabell Egloff, Raphael Hiesgen, Maynard Koch, Thomas C. Schmidt, Matthias Wählisch2025-06-25下载Scanners are daily visitors of public IPv4 hosts. Scanning IPv6 nodes successfully is still a challenge, which an increasing crowd of actors tries to master.
A clusterability test for directed graphsMario R. Guarracino, Pierre Miasnikof, Alexander Y. Shestopaloff, Houyem Demni, Cristián Bravo, Yuri Lawryshyn2025-06-25下载In this article, we extend a statistical test of graph clusterability, the δ test, to directed graphs with no self loops. The δ test, originally designed for undirected graphs, is based on the pre...
The Impact of the Russia-Ukraine Conflict on the Cloud Computing Risk LandscapeMalikussaid, Sutiyo2025-06-25下载This study examines how geopolitical tensions catalyze IT risk evolution through systematic analysis of the conflict's impact on data sovereignty, cybersecurity paradigms, and cloud infrastructure str...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Breaking the Boundaries of Long-Context LLM Inference: Adaptive KV Management on a Single Commodity GPUHe Sun, Li Li, Mingjun Xiao, Chengzhong Xu2025-06-25下载Advanced Large Language Models (LLMs) have achieved impressive performance across a wide range of complex and long-context natural language tasks.

cs.PF - Performance

标题作者发布日期PDF摘要
GPU Kernel Scientist: An LLM-Driven Framework for Iterative Kernel OptimizationMartin Andrews, Sam Witteveen2025-06-25下载Optimizing GPU kernels for high performance is a complex task, often demanding deep architectural knowledge, extensive profiling, and iterative experimentation.

基于 VitePress 构建