Appearance
2026-01-22
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| A Case for Hypergraphs to Model and Map SNNs on Neuromorphic Hardware | Marco Ronzani, Cristina Silvano | 2026-01-22 | 下载 | Executing Spiking Neural Networks (SNNs) on neuromorphic hardware poses the problem of mapping neurons to cores. SNNs operate by propagating spikes between neurons that form a graph through synapses. |
| FlexLLM: Composable HLS Library for Flexible Hybrid LLM Accelerator Design | Jiahao Zhang, Zifan He, Nicholas Fraser, Michaela Blott, Yizhou Sun, Jason Cong | 2026-01-22 | 下载 | We present FlexLLM, a composable High-Level Synthesis (HLS) library for rapid development of domain-specific LLM accelerators. FlexLLM exposes key architectural degrees of freedom for stage-customized... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| ZEUS: An Efficient GPU Optimization Method Integrating PSO, BFGS, and Automatic Differentiation | Dominik Soos, Marc Paterno, Desh Ranjan, Mohammad Zubair | 2026-01-22 | 下载 | We introduce a novel, efficient computational method, ZEUS, for numerical optimization, and provide an open-source implementation. It has four key ingredients: (1) particle swarm optimization (PSO), (... |
| Space Filling Curves is All You Need: Communication-Avoiding Matrix Multiplication Made Simple | Evangelos Georganas, Alexander Heinecke, Pradeep Dubey | 2026-01-22 | 下载 | General Matrix Multiplication (GEMM) is the cornerstone of Deep Learning and HPC workloads; accordingly, academia and industry have heavily optimized this kernel. |
| Scaling Sample-Based Quantum Diagonalization on GPU-Accelerated Systems using OpenMP Offload | Robert Walkup, Juha Jäykkä, Igor Pasichnyk, Zachary Streeter, Kasia Świrydowicz, Mikko Tukiainen, Yasuko Eckert, Luke Bertels, Daniel Claudino, Peter Groszkowski, Travis S. Humble, Constantinos Evangelinos, Javier Robledo-Moreno, William Kirby, Antonio Mezzacapo, Antonio Córcoles, Seetharami Seelam | 2026-01-22 | 下载 | Hybrid quantum-HPC algorithms advance research by delegating complex tasks to quantum processors and using HPC systems to orchestrate workflows and complementary computations. |
| DSFedMed: Dual-Scale Federated Medical Image Segmentation via Mutual Distillation Between Foundation and Lightweight Models | Hanwen Zhang, Qiaojin Shen, Yuxi Liu, Yuesheng Zhu, Guibo Luo | 2026-01-22 | 下载 | Foundation Models (FMs) have demonstrated strong generalization across diverse vision tasks. However, their deployment in federated settings is hindered by high computational demands, substantial comm... |
| Advancing RT Core-Accelerated Fixed-Radius Nearest Neighbor Search | Enzo Meneses, Hugo Bec, Cristóbal A. Navarro, Benoît Crespin, Felipe A. Quezada, Nancy Hitschfeld, Heinich Porro, Maxime Maria | 2026-01-22 | 下载 | In this work we introduce three ideas that can further improve particle FRNN physics simulations running on RT Cores; i) a real-time update/rebuild ratio optimizer for the bounding volume hierarchy (B... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Multi-User Content Diversity in Wireless Networks | Belal Korany, Peerapol Tinnakornsrisuphap, Saadallah Kassir, Prashanth Hande, Hyun Yong Lee, Thomas Stockhammer, Hemanth Sampath | 2026-01-22 | 下载 | Immersive applications such as eXtended Reality (XR), cloud gaming, and real-time video streaming are central to the vision of 6G networks. These applications require not only low latency and high dat... |
| Low-altitude Multi-UAV-assisted Data Collection and Semantic Forwarding for Post-Disaster Relief | Xiaoya Zheng, Geng Sun, Jiahui Li, Jiacheng Wang, Weijie Yuan, Qingqing Wu, Dusit Niyato, Abbas Jamalipour | 2026-01-22 | 下载 | The low-altitude economy (LAE) is an emerging economic paradigm which fosters integrated development across multiple fields. As a pivotal component of the LAE, low-altitude uncrewed aerial vehicles (U... |
| Dynamic Server Allocation Under Stochastic Switchover on Time-Varying Links | Hossein Mohammadalizadeh, Holger Karl | 2026-01-22 | 下载 | Dynamic resource allocation to parallel queues is a cornerstone of network scheduling, yet classical solutions often fail when accounting for the overhead of switching delays to queues with superior l... |
| RF Intelligence for Health: Classification of SmartBAN Signals in overcrowded ISM band | Nicola Gallucci, Giacomo Aragnetti, Matteo Malagrinò, Francesco Linsalata, Maurizio Magarini, Lorenzo Mucchi | 2026-01-22 | 下载 | Accurate classification of Radio-Frequency (RF) signals is essential for reliable wearable health-monitoring systems, providing awareness of the interference conditions in which medical protocols oper... |
| MapViT: A Two-Stage ViT-Based Framework for Real-Time Radio Quality Map Prediction in Dynamic Environments | Cyril Shih-Huan Hsu, Xi Li, Lanfranco Zanzi, Zhiheng Yang, Chrysa Papagianni, Xavier Costa Pérez | 2026-01-22 | 下载 | Recent advancements in mobile and wireless networks are unlocking the full potential of robotic autonomy, enabling robots to take advantage of ultra-low latency, high data throughput, and ubiquitous c... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Sawtooth Wavefront Reordering: Enhanced CuTile FlashAttention on NVIDIA GB10 | Yifan Zhu, Yekai Pan, Chen Ding | 2026-01-22 | 下载 | High-performance attention kernels are essential for Large Language Models. This paper presents analysis of CuTile-based Flash Attention memory behavior and a technique to improve its cache performanc... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Sawtooth Wavefront Reordering: Enhanced CuTile FlashAttention on NVIDIA GB10 | Yifan Zhu, Yekai Pan, Chen Ding | 2026-01-22 | 下载 | High-performance attention kernels are essential for Large Language Models. This paper presents analysis of CuTile-based Flash Attention memory behavior and a technique to improve its cache performanc... |
| Class Confidence Aware Reweighting for Long Tailed Learning | Brainard Philemon Jagati, Jitendra Tembhurne, Harsh Goud, Rudra Pratap Singh, Chandrashekhar Meshram | 2026-01-22 | 下载 | Deep neural network models degrade significantly in the long-tailed data distribution, with the overall training data dominated by a small set of classes in the head, and the tail classes obtaining le... |