Skip to content

2025-12-09

cs.AR - Architecture

标题作者发布日期PDF摘要
A Hybrid Residue Floating Numerical Architecture for High Precision Arithmetic on FPGAsMostafa Darvishi2025-12-09下载Floating point arithmetic remains expensive on FPGA platforms due to wide datapaths and normalization logic, motivating alternative representations that preserve dynamic range at lower cost.
Chopper: A Multi-Level GPU Characterization Tool & Derived Insights Into LLM Training InefficiencyMarco Kurzynski, Shaizeen Aga, Di Wu2025-12-09下载Training large language models (LLMs) efficiently requires a deep understanding of how modern GPU systems behave under real-world distributed training workloads.
LayerPipe2: Multistage Pipelining and Weight Recompute via Improved Exponential Moving Average for Training Neural NetworksNanda K. Unnikrishnan, Keshab K. Parhi2025-12-09下载In our prior work, LayerPipe, we had introduced an approach to accelerate training of convolutional, fully connected, and spiking neural networks by overlapping forward and backward computation.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
When Quantum Federated Learning Meets Blockchain in 6G NetworksDinh C. Nguyen, Md Bokhtiar Al Zami, Ratun Rahman, Shaba Shaon, Tuy Tan Nguyen, Fatemeh Afghah2025-12-09下载Quantum federated learning (QFL) is emerging as a key enabler for intelligent, secure, and privacy-preserving model training in next-generation 6G networks.
CloudFix: Automated Policy Repair for Cloud Access Control Policies Using Large Language ModelsBethel Hall, Owen Ungaro, William Eiers2025-12-09下载Access control policies are vital for securing modern cloud computing, where organizations must manage access to sensitive data across thousands of users in distributed system settings.
Skewness-Guided Pruning of Multimodal Swin Transformers for Federated Skin Lesion Classification on Edge DevicesKuniko Paxton, Koorosh Aslansefat, Dhavalkumar Thakker, Yiannis Papadopoulos2025-12-09下载In recent years, high-performance computer vision models have achieved remarkable success in medical imaging, with some skin lesion classification systems even surpassing dermatology specialists in di...
Parallel Batch Dynamic Vertex Coloring in O(\log Δ) Amortized Update TimeChase Hutton, Adam Melrod2025-12-09下载We present the first parallel batch-dynamic algorithm for maintaining a proper (Δ+ 1)-vertex coloring. Our approach builds on a new sequential dynamic algorithm inspired by the work of Bhattacharya ...
A Task Parallel Orthonormalization Multigrid Method For Multiphase Elliptic ProblemsTeoman Toprak, Florian Kummer2025-12-09下载Multigrid methods have been a popular approach for solving linear systems arising from the discretization of partial differential equations (PDEs) for several decades.
Spatio-Temporal Shifting to Reduce Carbon, Water, and Land-Use Footprints of Cloud WorkloadsGiulio Attenni, Youssef Moawad, Novella Bartolini, Lauritz Thamsen2025-12-09下载In this paper, we investigate the potential of spatial and temporal cloud workload shifting to reduce carbon, water, and land use footprints. Specifically, we perform a simulation study leveraging pub...
Model-based Testing of Practical Distributed Systems in Actor ModelIlya Kokorin, Evgeny Chernatskiy, Vitaly Aksenov2025-12-09下载Designing and implementing distributed systems correctly can be quite challenging. Although these systems are often accompanied by formal specifications that are verified using model-checking techniqu...
Basic Lock Algorithms in Lightweight Thread EnvironmentsTaras Skazhenik, Nikolai Korobenikov, Andrei Churbanov, Anton Malakhov, Vitaly Aksenov2025-12-09下载Traditionally, multithreaded data structures have been designed for access by the threads of Operating Systems (OS). However, implementations for access by programmable alternatives known as lightweig...
A scalable high-order multigrid-FFT Poisson solver for unbounded domains on adaptive multiresolution gridsGilles Poncelet, Jonathan Lambrechts, Thomas Gillis, Philippe Chatelain2025-12-09下载Multigrid solvers are among the most efficient methods for solving the Poisson equation, which is ubiquitous in computational physics. For example, in the context of incompressible flows, it is typica...
Magneton: Optimizing Energy Efficiency of ML Systems via Differential Energy DebuggingYi Pan, Wenbo Qian, Dedong Xie, Ruiyan Hu, Yigong Hu, Baris Kasikci2025-12-09下载The training and deployment of machine learning (ML) models have become extremely energy-intensive. While existing optimization efforts focus primarily on hardware energy efficiency, a significant but...
Emulation of Complex Matrix Multiplication based on the Chinese Remainder TheoremYuki Uchino, Qianxiang Ma, Toshiyuki Imamura, Katsuhisa Ozaki, Patrick Lars Gutsche2025-12-09下载Modern computing architectures feature low-precision matrix multiplication units that achieve substantially higher throughput than their high-precision counterparts.
Synergizing Monetization, Orchestration, and Semantics in Computing ContinuumChinmaya Kumar Dehury, Lauri Lovén, Praveen Kumar Donta, Ilir Murturi, Schahram Dustdar2025-12-09下载Industry demands are growing for hyper-distributed applications that span from the cloud to the edge in domains such as smart manufacturing, transportation, and agriculture.
Chopper: A Multi-Level GPU Characterization Tool & Derived Insights Into LLM Training InefficiencyMarco Kurzynski, Shaizeen Aga, Di Wu2025-12-09下载Training large language models (LLMs) efficiently requires a deep understanding of how modern GPU systems behave under real-world distributed training workloads.
Dora: QoE-Aware Hybrid Parallelism for Distributed Edge AIJianli Jin, Ziyang Lin, Qianli Dong, Yi Chen, Jayanth Srinivasa, Myungjin Lee, Zhaowei Tan, Fan Lai2025-12-09下载With the proliferation of edge AI applications, satisfying user quality of experience (QoE) requirements, such as model inference latency, has become a first class objective, as these models operate i...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Should AI Become an Intergenerational Civil Right?Jon Crowcroft, Rute C. Sofia, Dirk Trossen, Vassilis Tsaoussidis2025-12-09下载Artificial Intelligence (AI) is rapidly becoming a foundational layer of social, economic, and cognitive infrastructure. At the same time, the training and large-scale deployment of AI systems rely on...
WikIPedia: Unearthing a 20-Year History of IPv6 Client AddressingErik Rye, Dave Levin2025-12-09下载Due to their article editing policies, Wikimedia sites like Wikipedia have become inadvertent time capsules for IPv6 addresses. When Wikimedia users make edits without signing into an account, their I...
Delay-Oriented Distributed Scheduling with TransGNNBoxuan Wen, Junyu Luo2025-12-09下载Minimizing transmission delay in wireless multi-hop networks is a fundamental yet challenging task due to the complex coupling among interference, queue dynamics, and distributed control.
ITU-T Y.2325: NGN Evolution Towards FutureRashmi Kamran, Shwetha Kiran, Pranav Jha, Rashmi Yadav, Abhay Karandikar, Prasanna Chaporkar2025-12-09下载International Telecommunications Union (ITU) defined Next Generation Network (NGN) underlies most wireline and wireless packet-based telecommunications networks.
Inferring Causal Relationships to Improve Caching for Clients with Correlated Requests: Applications to VRAgrim Bari, Gustavo de Veciana, Yuqi Zhou2025-12-09下载Efficient edge caching reduces latency and alleviates backhaul congestion in modern networks. Traditional caching policies, such as Least Recently Used (LRU) and Least Frequently Used (LFU), perform w...
A Deep-SIC Channel Estimator Scheme in NOMA NetworkSumita Majhi, Kaushal Shelke, Pinaki Mitra2025-12-09下载In 5G and next-generation mobile ad-hoc networks, reliable handover is a key requirement, which guarantees continuity in connectivity, especially for mobile users and in high-density scenarios.
Improvement and Stabilization of Output Voltages in a Vertical Tidal Turbine Using Intelligent Control StrategiesFanambinantsoa Philibert Andriniriniaimalaza, Nour Murad, Randriamaitso Telesphore, Bilal Habachi, Randriatefison Nirilalaina, Manasina Ruffin, Andrianirina Charles Bernard, Ravelo Blaise2025-12-09下载This article investigates on the improvement and stabilization of alternating current (AC) and direct current (DC) output voltages in a Permanent Magnet Synchronous Generator (PMSG) driven by a vertic...
Turning Threat into Opportunity: DRL-Powered Anti-Jamming via Energy Harvesting in UAV-Disrupted ChannelsNgoc-Tan Nguyen, Thi-Thu Hoang, Trung-Dung Hoang, Thai-Duong Nguyen2025-12-09下载The open and broadcast nature of wireless communication systems, while enabling ubiquitous connectivity, also exposes them to jamming attacks that may critically compromise network performance or disr...
Multi-Agent Deep Reinforcement Learning for Collaborative UAV Relay Networks under Jamming AtatcksThai Duong Nguyen, Ngoc-Tan Nguyen, Thanh-Dao Nguyen, Nguyen Van Huynh, Dinh-Hieu Tran, Symeon Chatzinotas2025-12-09下载The deployment of Unmanned Aerial Vehicle (UAV) swarms as dynamic communication relays is critical for next-generation tactical networks. However, operating in contested environments requires solving ...
Collaborative Intelligence for UAV-Satellite Network Slicing: Towards a Joint QoS-Energy-Fairness MADRL OptimizationThanh-Dao Nguyen, Ngoc-Tan Nguyen, Thai-Duong Nguyen, Nguyen Van Huynh, Dinh-Hieu Tran, Symeon Chatzinotas2025-12-09下载Non terrestrial networks are critical for achieving global 6G coverage, yet efficient resource management in aerial and space environments remains challenging due to limited onboard power and dynamic ...
Evaluating Vulnerabilities of Connected Vehicles Under Cyber Attacks by Attack-Defense TreeMuhammad Baqer Mollah, Honggang Wang, Hua Fang2025-12-09下载Connected vehicles represent a key enabler of intelligent transportation systems, where vehicles are equipped with advanced communication, sensing, and computing technologies to interact not only with...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
NecoFuzz: Effective Fuzzing of Nested Virtualization via Fuzz-Harness Virtual MachinesReima Ishii, Takaaki Fukai, Takahiro Shinagawa2025-12-09下载Nested virtualization is now widely supported by major cloud vendors, allowing users to leverage virtualization-based technologies in the cloud.

cs.PF - Performance

标题作者发布日期PDF摘要
LLMs for Analog Circuit Design Continuum (ACDC)Yasaman Esfandiari, Jocelyn Rego, Austin Meyer, Jonathan Gallagher, Mia Levy2025-12-09下载Large Language Models (LLMs) and transformer architectures have shown impressive reasoning and generation capabilities across diverse natural language tasks.
Multi-domain performance analysis with scores tailored to user preferencesSébastien Piérard, Adrien Deliège, Marc Van Droogenbroeck2025-12-09下载The performance of algorithms, methods, and models tends to depend heavily on the distribution of cases on which they are applied, this distribution being specific to the applicative domain.
High-performance computing enabled contingency analysis for modern power networksAlexandre Gracia-Calvo, Francesca Rossi, Eduardo Iraola, Juan Carlos Olives-Camps, Eduardo Prieto-Araujo2025-12-09下载Modern power networks face increasing vulnerability to cascading failures due to high complexity and the growing penetration of intermittent resources, necessitating rigorous security assessment beyon...

基于 VitePress 构建