2026-01-22

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
A Case for Hypergraphs to Model and Map SNNs on Neuromorphic Hardware	Marco Ronzani, Cristina Silvano	2026-01-22	下载	Executing Spiking Neural Networks (SNNs) on neuromorphic hardware poses the problem of mapping neurons to cores. SNNs operate by propagating spikes between neurons that form a graph through synapses.
FlexLLM: Composable HLS Library for Flexible Hybrid LLM Accelerator Design	Jiahao Zhang, Zifan He, Nicholas Fraser, Michaela Blott, Yizhou Sun, Jason Cong	2026-01-22	下载	We present FlexLLM, a composable High-Level Synthesis (HLS) library for rapid development of domain-specific LLM accelerators. FlexLLM exposes key architectural degrees of freedom for stage-customized...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
ZEUS: An Efficient GPU Optimization Method Integrating PSO, BFGS, and Automatic Differentiation	Dominik Soos, Marc Paterno, Desh Ranjan, Mohammad Zubair	2026-01-22	下载	We introduce a novel, efficient computational method, ZEUS, for numerical optimization, and provide an open-source implementation. It has four key ingredients: (1) particle swarm optimization (PSO), (...
Space Filling Curves is All You Need: Communication-Avoiding Matrix Multiplication Made Simple	Evangelos Georganas, Alexander Heinecke, Pradeep Dubey	2026-01-22	下载	General Matrix Multiplication (GEMM) is the cornerstone of Deep Learning and HPC workloads; accordingly, academia and industry have heavily optimized this kernel.
Scaling Sample-Based Quantum Diagonalization on GPU-Accelerated Systems using OpenMP Offload	Robert Walkup, Juha Jäykkä, Igor Pasichnyk, Zachary Streeter, Kasia Świrydowicz, Mikko Tukiainen, Yasuko Eckert, Luke Bertels, Daniel Claudino, Peter Groszkowski, Travis S. Humble, Constantinos Evangelinos, Javier Robledo-Moreno, William Kirby, Antonio Mezzacapo, Antonio Córcoles, Seetharami Seelam	2026-01-22	下载	Hybrid quantum-HPC algorithms advance research by delegating complex tasks to quantum processors and using HPC systems to orchestrate workflows and complementary computations.
DSFedMed: Dual-Scale Federated Medical Image Segmentation via Mutual Distillation Between Foundation and Lightweight Models	Hanwen Zhang, Qiaojin Shen, Yuxi Liu, Yuesheng Zhu, Guibo Luo	2026-01-22	下载	Foundation Models (FMs) have demonstrated strong generalization across diverse vision tasks. However, their deployment in federated settings is hindered by high computational demands, substantial comm...
Advancing RT Core-Accelerated Fixed-Radius Nearest Neighbor Search	Enzo Meneses, Hugo Bec, Cristóbal A. Navarro, Benoît Crespin, Felipe A. Quezada, Nancy Hitschfeld, Heinich Porro, Maxime Maria	2026-01-22	下载	In this work we introduce three ideas that can further improve particle FRNN physics simulations running on RT Cores; i) a real-time update/rebuild ratio optimizer for the bounding volume hierarchy (B...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Multi-User Content Diversity in Wireless Networks	Belal Korany, Peerapol Tinnakornsrisuphap, Saadallah Kassir, Prashanth Hande, Hyun Yong Lee, Thomas Stockhammer, Hemanth Sampath	2026-01-22	下载	Immersive applications such as eXtended Reality (XR), cloud gaming, and real-time video streaming are central to the vision of 6G networks. These applications require not only low latency and high dat...
Low-altitude Multi-UAV-assisted Data Collection and Semantic Forwarding for Post-Disaster Relief	Xiaoya Zheng, Geng Sun, Jiahui Li, Jiacheng Wang, Weijie Yuan, Qingqing Wu, Dusit Niyato, Abbas Jamalipour	2026-01-22	下载	The low-altitude economy (LAE) is an emerging economic paradigm which fosters integrated development across multiple fields. As a pivotal component of the LAE, low-altitude uncrewed aerial vehicles (U...
Dynamic Server Allocation Under Stochastic Switchover on Time-Varying Links	Hossein Mohammadalizadeh, Holger Karl	2026-01-22	下载	Dynamic resource allocation to parallel queues is a cornerstone of network scheduling, yet classical solutions often fail when accounting for the overhead of switching delays to queues with superior l...
RF Intelligence for Health: Classification of SmartBAN Signals in overcrowded ISM band	Nicola Gallucci, Giacomo Aragnetti, Matteo Malagrinò, Francesco Linsalata, Maurizio Magarini, Lorenzo Mucchi	2026-01-22	下载	Accurate classification of Radio-Frequency (RF) signals is essential for reliable wearable health-monitoring systems, providing awareness of the interference conditions in which medical protocols oper...
MapViT: A Two-Stage ViT-Based Framework for Real-Time Radio Quality Map Prediction in Dynamic Environments	Cyril Shih-Huan Hsu, Xi Li, Lanfranco Zanzi, Zhiheng Yang, Chrysa Papagianni, Xavier Costa Pérez	2026-01-22	下载	Recent advancements in mobile and wireless networks are unlocking the full potential of robotic autonomy, enabling robots to take advantage of ultra-low latency, high data throughput, and ubiquitous c...

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Sawtooth Wavefront Reordering: Enhanced CuTile FlashAttention on NVIDIA GB10	Yifan Zhu, Yekai Pan, Chen Ding	2026-01-22	下载	High-performance attention kernels are essential for Large Language Models. This paper presents analysis of CuTile-based Flash Attention memory behavior and a technique to improve its cache performanc...

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Sawtooth Wavefront Reordering: Enhanced CuTile FlashAttention on NVIDIA GB10	Yifan Zhu, Yekai Pan, Chen Ding	2026-01-22	下载	High-performance attention kernels are essential for Large Language Models. This paper presents analysis of CuTile-based Flash Attention memory behavior and a technique to improve its cache performanc...
Class Confidence Aware Reweighting for Long Tailed Learning	Brainard Philemon Jagati, Jitendra Tembhurne, Harsh Goud, Rudra Pratap Singh, Chandrashekhar Meshram	2026-01-22	下载	Deep neural network models degrade significantly in the long-tailed data distribution, with the overall training data dominated by a small set of classes in the head, and the tail classes obtaining le...