2025-09-20

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
FG-Attn: Leveraging Fine-Grained Sparsity In Diffusion Transformers	Sankeerth Durvasula, Kavya Sreedhar, Zain Moustafa, Suraj Kothawade, Ashish Gondimalla, Suvinay Subramanian, Narges Shahidi, Nandita Vijaykumar	2025-09-20	下载	Generating realistic videos with diffusion transformers demands significant computation, with attention layers the central bottleneck; even producing a short clip requires running a transformer over a...
Vision-Based Perception for Autonomous Vehicles in Off-Road Environment Using Deep Learning	Nelson Alves Ferreira Neto	2025-09-20	下载	Low-latency intelligent systems are required for autonomous driving on non-uniform terrain in open-pit mines and developing countries. This work proposes a perception system for autonomous vehicles on...

标题	作者	发布日期	PDF	摘要
Trace Replay Simulation of MIT SuperCloud for Studying Optimal Sustainability Policies	Wesley Brewer, Matthias Maiterth, Damien Fay	2025-09-20	下载	The rapid growth of AI supercomputing is creating unprecedented power demands, with next-generation GPU datacenters requiring hundreds of megawatts and producing fast, large swings in consumption.
orb-QFL: Orbital Quantum Federated Learning	Dev Gurung, Shiva Raj Pokhrel	2025-09-20	下载	Recent breakthroughs in quantum computing present transformative opportunities for advancing Federated Learning (FL), particularly in non-terrestrial environments characterized by stringent communicat...
sat-QFL: Secure Quantum Federated Learning for Low Orbit Satellites	Dev Gurung, Shiva Raj Pokhrel	2025-09-20	下载	Low Earth orbit (LEO) constellations violate core assumptions of standard (quantum) federated learning (FL): client-server connectivity is intermittent, participation is time varying, and latency budg...
Shift Parallelism: Low-Latency, High-Throughput LLM Inference for Dynamic Workloads	Mert Hidayetoglu, Aurick Qiao, Michael Wyatt, Jeff Rasley, Yuxiong He, Samyam Rajbhandari	2025-09-20	下载	Efficient parallelism is necessary for achieving low-latency, high-throughput inference with large language models (LLMs). Tensor parallelism (TP) is the state-of-the-art method for reducing LLM respo...

标题	作者	发布日期	PDF	摘要
A Comprehensive Protocol Stack for Quantum Networks with a Global Entanglement Module	Xiaojie Fan, C. R. Ramakrishnan, Himanshu Gupta	2025-09-20	下载	The development of large-scale quantum networks requires not only advances in physical-layer technologies but also a comprehensive protocol stack that integrates communication, control, and resource m...
Vehicular Multistatic OTFS-ISAC: A Geometry-Aware Deployment and Kalman-Based Tracking	Jyotsna Rani, Kuntal Deka, Ganesh Prasad, Zilong Liu	2025-09-20	下载	Integrated sensing and communication (ISAC) is a promising paradigm for next-generation vehicular networks, yet existing orthogonal frequency-division multiplexing (OFDM)-based designs suffer from lim...
Spatial Encoding of Flow Spaces for Intelligent SDN Applications	Abdur Rouf, Murat Yuksel	2025-09-20	下载	Efficient encoding of network flow spaces while preserving spatial locality is essential for intelligent Software-Defined Networking (SDN) applications, particularly those employing reinforcement lear...