Skip to content

2025-09-20

cs.AR - Architecture

标题作者发布日期PDF摘要
FG-Attn: Leveraging Fine-Grained Sparsity In Diffusion TransformersSankeerth Durvasula, Kavya Sreedhar, Zain Moustafa, Suraj Kothawade, Ashish Gondimalla, Suvinay Subramanian, Narges Shahidi, Nandita Vijaykumar2025-09-20下载Generating realistic videos with diffusion transformers demands significant computation, with attention layers the central bottleneck; even producing a short clip requires running a transformer over a...
Vision-Based Perception for Autonomous Vehicles in Off-Road Environment Using Deep LearningNelson Alves Ferreira Neto2025-09-20下载Low-latency intelligent systems are required for autonomous driving on non-uniform terrain in open-pit mines and developing countries. This work proposes a perception system for autonomous vehicles on...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Trace Replay Simulation of MIT SuperCloud for Studying Optimal Sustainability PoliciesWesley Brewer, Matthias Maiterth, Damien Fay2025-09-20下载The rapid growth of AI supercomputing is creating unprecedented power demands, with next-generation GPU datacenters requiring hundreds of megawatts and producing fast, large swings in consumption.
orb-QFL: Orbital Quantum Federated LearningDev Gurung, Shiva Raj Pokhrel2025-09-20下载Recent breakthroughs in quantum computing present transformative opportunities for advancing Federated Learning (FL), particularly in non-terrestrial environments characterized by stringent communicat...
sat-QFL: Secure Quantum Federated Learning for Low Orbit SatellitesDev Gurung, Shiva Raj Pokhrel2025-09-20下载Low Earth orbit (LEO) constellations violate core assumptions of standard (quantum) federated learning (FL): client-server connectivity is intermittent, participation is time varying, and latency budg...
Shift Parallelism: Low-Latency, High-Throughput LLM Inference for Dynamic WorkloadsMert Hidayetoglu, Aurick Qiao, Michael Wyatt, Jeff Rasley, Yuxiong He, Samyam Rajbhandari2025-09-20下载Efficient parallelism is necessary for achieving low-latency, high-throughput inference with large language models (LLMs). Tensor parallelism (TP) is the state-of-the-art method for reducing LLM respo...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
A Comprehensive Protocol Stack for Quantum Networks with a Global Entanglement ModuleXiaojie Fan, C. R. Ramakrishnan, Himanshu Gupta2025-09-20下载The development of large-scale quantum networks requires not only advances in physical-layer technologies but also a comprehensive protocol stack that integrates communication, control, and resource m...
Vehicular Multistatic OTFS-ISAC: A Geometry-Aware Deployment and Kalman-Based TrackingJyotsna Rani, Kuntal Deka, Ganesh Prasad, Zilong Liu2025-09-20下载Integrated sensing and communication (ISAC) is a promising paradigm for next-generation vehicular networks, yet existing orthogonal frequency-division multiplexing (OFDM)-based designs suffer from lim...
Spatial Encoding of Flow Spaces for Intelligent SDN ApplicationsAbdur Rouf, Murat Yuksel2025-09-20下载Efficient encoding of network flow spaces while preserving spatial locality is essential for intelligent Software-Defined Networking (SDN) applications, particularly those employing reinforcement lear...

基于 VitePress 构建