2026-01-22

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A Case for Hypergraphs to Model and Map SNNs on Neuromorphic Hardware | Marco Ronzani, Cristina Silvano | 2026-01-22 | Download | Executing Spiking Neural Networks (SNNs) on neuromorphic hardware poses the problem of mapping neurons to cores. SNNs operate by propagating spikes between neurons that form a graph through synapses. |
| FlexLLM: Composable HLS Library for Flexible Hybrid LLM Accelerator Design | Jiahao Zhang, Zifan He, Nicholas Fraser, Michaela Blott, Yizhou Sun, Jason Cong | 2026-01-22 | Download | We present FlexLLM, a composable High-Level Synthesis (HLS) library for rapid development of domain-specific LLM accelerators. FlexLLM exposes key architectural degrees of freedom for stage-customized... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| ZEUS: An Efficient GPU Optimization Method Integrating PSO, BFGS, and Automatic Differentiation | Dominik Soos, Marc Paterno, Desh Ranjan, Mohammad Zubair | 2026-01-22 | Download | We introduce a novel, efficient computational method, ZEUS, for numerical optimization, and provide an open-source implementation. It has four key ingredients: (1) particle swarm optimization (PSO), (... |
| Space Filling Curves is All You Need: Communication-Avoiding Matrix Multiplication Made Simple | Evangelos Georganas, Alexander Heinecke, Pradeep Dubey | 2026-01-22 | Download | General Matrix Multiplication (GEMM) is the cornerstone of Deep Learning and HPC workloads; accordingly, academia and industry have heavily optimized this kernel. |
| Scaling Sample-Based Quantum Diagonalization on GPU-Accelerated Systems using OpenMP Offload | Robert Walkup, Juha Jäykkä, Igor Pasichnyk, Zachary Streeter, Kasia Świrydowicz, Mikko Tukiainen, Yasuko Eckert, Luke Bertels, Daniel Claudino, Peter Groszkowski, Travis S. Humble, Constantinos Evangelinos, Javier Robledo-Moreno, William Kirby, Antonio Mezzacapo, Antonio Córcoles, Seetharami Seelam | 2026-01-22 | Download | Hybrid quantum-HPC algorithms advance research by delegating complex tasks to quantum processors and using HPC systems to orchestrate workflows and complementary computations. |
| DSFedMed: Dual-Scale Federated Medical Image Segmentation via Mutual Distillation Between Foundation and Lightweight Models | Hanwen Zhang, Qiaojin Shen, Yuxi Liu, Yuesheng Zhu, Guibo Luo | 2026-01-22 | Download | Foundation Models (FMs) have demonstrated strong generalization across diverse vision tasks. However, their deployment in federated settings is hindered by high computational demands, substantial comm... |
| Advancing RT Core-Accelerated Fixed-Radius Nearest Neighbor Search | Enzo Meneses, Hugo Bec, Cristóbal A. Navarro, Benoît Crespin, Felipe A. Quezada, Nancy Hitschfeld, Heinich Porro, Maxime Maria | 2026-01-22 | Download | In this work we introduce three ideas that can further improve particle FRNN physics simulations running on RT Cores; i) a real-time update/rebuild ratio optimizer for the bounding volume hierarchy (B... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Multi-User Content Diversity in Wireless Networks | Belal Korany, Peerapol Tinnakornsrisuphap, Saadallah Kassir, Prashanth Hande, Hyun Yong Lee, Thomas Stockhammer, Hemanth Sampath | 2026-01-22 | Download | Immersive applications such as eXtended Reality (XR), cloud gaming, and real-time video streaming are central to the vision of 6G networks. These applications require not only low latency and high dat... |
| Low-altitude Multi-UAV-assisted Data Collection and Semantic Forwarding for Post-Disaster Relief | Xiaoya Zheng, Geng Sun, Jiahui Li, Jiacheng Wang, Weijie Yuan, Qingqing Wu, Dusit Niyato, Abbas Jamalipour | 2026-01-22 | Download | The low-altitude economy (LAE) is an emerging economic paradigm which fosters integrated development across multiple fields. As a pivotal component of the LAE, low-altitude uncrewed aerial vehicles (U... |
| Dynamic Server Allocation Under Stochastic Switchover on Time-Varying Links | Hossein Mohammadalizadeh, Holger Karl | 2026-01-22 | Download | Dynamic resource allocation to parallel queues is a cornerstone of network scheduling, yet classical solutions often fail when accounting for the overhead of switching delays to queues with superior l... |
| RF Intelligence for Health: Classification of SmartBAN Signals in overcrowded ISM band | Nicola Gallucci, Giacomo Aragnetti, Matteo Malagrinò, Francesco Linsalata, Maurizio Magarini, Lorenzo Mucchi | 2026-01-22 | Download | Accurate classification of Radio-Frequency (RF) signals is essential for reliable wearable health-monitoring systems, providing awareness of the interference conditions in which medical protocols oper... |
| MapViT: A Two-Stage ViT-Based Framework for Real-Time Radio Quality Map Prediction in Dynamic Environments | Cyril Shih-Huan Hsu, Xi Li, Lanfranco Zanzi, Zhiheng Yang, Chrysa Papagianni, Xavier Costa Pérez | 2026-01-22 | Download | Recent advancements in mobile and wireless networks are unlocking the full potential of robotic autonomy, enabling robots to take advantage of ultra-low latency, high data throughput, and ubiquitous c... |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Sawtooth Wavefront Reordering: Enhanced CuTile FlashAttention on NVIDIA GB10 | Yifan Zhu, Yekai Pan, Chen Ding | 2026-01-22 | Download | High-performance attention kernels are essential for Large Language Models. This paper presents analysis of CuTile-based Flash Attention memory behavior and a technique to improve its cache performanc... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Sawtooth Wavefront Reordering: Enhanced CuTile FlashAttention on NVIDIA GB10 | Yifan Zhu, Yekai Pan, Chen Ding | 2026-01-22 | Download | High-performance attention kernels are essential for Large Language Models. This paper presents analysis of CuTile-based Flash Attention memory behavior and a technique to improve its cache performanc... |
| Class Confidence Aware Reweighting for Long Tailed Learning | Brainard Philemon Jagati, Jitendra Tembhurne, Harsh Goud, Rudra Pratap Singh, Chandrashekhar Meshram | 2026-01-22 | Download | Deep neural network models degrade significantly in the long-tailed data distribution, with the overall training data dominated by a small set of classes in the head, and the tail classes obtaining le... |
