2025-11-20
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Vorion: A RISC-V GPU with Hardware-Accelerated 3D Gaussian Rendering and Training | Yipeng Wang, Mengtian Yang, Chieh-pu Lo, Jaydeep P. Kulkarni | 2025-11-20 | Download | 3D Gaussian Splatting (3DGS) has recently emerged as a foundational technique for real-time neural rendering, 3D scene generation, and volumetric video (4D) capture. |
| Unsupervised Graph Neural Network Framework for Balanced Multipatterning in Advanced Electronic Design Automation Layouts | Abdelrahman Helaly, Nourhan Sakr, Kareem Madkour, Ilhami Torunoglu | 2025-11-20 | Download | Multipatterning is an essential decomposition strategy in electronic design automation (EDA) that overcomes lithographic limitations when printing dense circuit layouts. |
| CIMinus: Empowering Sparse DNN Workloads Modeling and Exploration on SRAM-based CIM Architectures | Yingjie Qi, Jianlei Yang, Rubing Yang, Cenlin Duan, Xiaolin He, Ziyan He, Weitao Pan, Weisheng Zhao | 2025-11-20 | Download | Compute-in-memory (CIM) has emerged as a pivotal direction for accelerating workloads in the field of machine learning, such as Deep Neural Networks (DNNs). |
| Mitigating Shared Storage Congestion Using Control Theory | Thomas Collignon, Kouds Halitim, Raphaël Bleuse, Sophie Cerf, Bogdan Robu, Éric Rutten, Lionel Seinturier, Alexandre van Kempen | 2025-11-20 | Download | Efficient data access in High-Performance Computing (HPC) systems is essential to the performance of intensive computing tasks. Traditional optimizations of the I/O stack aim to improve peak performan... |
| KAN-SAs: Efficient Acceleration of Kolmogorov-Arnold Networks on Systolic Arrays | Sohaib Errabii, Olivier Sentieys, Marcello Traiola | 2025-11-20 | Download | Kolmogorov-Arnold Networks (KANs) have garnered significant attention for their promise of improved parameter efficiency and explainability compared to traditional Deep Neural Networks (DNNs). |
| Can Asymmetric Tile Buffering Be Beneficial? | Chengyue Wang, Wesley Pang, Xinrui Wu, Gregory Jun, Luis Romero, Endri Taka, Diana Marculescu, Tony Nowatzki, Pranathi Vasireddy, Joseph Melber, Deming Chen, Jason Cong | 2025-11-20 | Download | General matrix multiplication (GEMM) is the computational backbone of modern AI workloads, and its efficiency is critically dependent on effective tiling strategies. |
| A Scalable NorthPole System with End-to-End Vertical Integration for Low-Latency and Energy-Efficient LLM Inference | Michael V. DeBole, Rathinakumar Appuswamy, Neil McGlohon, Brian Taba, Steven K. Esser, Filipp Akopyan, John V. Arthur, Arnon Amir, Alexander Andreopoulos, Peter J. Carlson, Andrew S. Cassidy, Pallab Datta, Myron D. Flickner, Rajamohan Gandhasri, Guillaume J. Garreau, Megumi Ito, Jennifer L. Klamo, Jeffrey A. Kusnitz, Nathaniel J. McClatchey, Jeffrey L. McKinstry, Tapan K. Nayak, Carlos Ortega Otero, Hartmut Penner, William P. Risk, Jun Sawada, Jay Sivagnaname, Daniel F. Smith, Rafael Sousa, Ignacio Terrizzano, Takanori Ueda, Trent Gray-Donald, David Cox, Dharmendra S. Modha | 2025-11-20 | Download | A vertically integrated, end-to-end, research prototype system combines 288 NorthPole neural inference accelerator cards, offline training algorithms, a high-performance runtime stack, and a container... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter | Qinghao Hu, Shang Yang, Junxian Guo, Xiaozhe Yao, Yujun Lin, Yuxian Gu, Han Cai, Chuang Gan, Ana Klimovic, Song Han | 2025-11-20 | Download | The emergence of Large Language Models (LLMs) with strong reasoning capabilities marks a significant milestone, unlocking new frontiers in complex problem-solving. |
| Distributed MIS Algorithms for Rational Agents using Games | Nithin Salevemula, Shreyas Pai | 2025-11-20 | Download | We study the problem of computing a Maximal Independent Set (MIS) in distributed networks where each node is a rational agent whose payoff depends on whether it joins the MIS. |
| Optimizing Federated Learning in the Era of LLMs: Message Quantization and Streaming | Ziyue Xu, Zhihong Zhang, Holger R. Roth, Chester Chen, Yan Cheng, Andrew Feng | 2025-11-20 | Download | Federated Learning (FL) offers a promising solution for training machine learning models across distributed data sources while preserving data privacy. |
| A Fast Relax-and-Round Approach to Unit Commitment for Data Center Own Generation | Shaked Regev, Eve Tsybina, Slaven Peles | 2025-11-20 | Download | The rapid growth of data centers increasingly requires data center operators to "bring own generation" to complement the available utility power plants to supply all or part of data center load. |
| Optimizations on Graph-Level for Domain Specific Computations in Julia and Application to QED | Anton Reinhard, Simeon Ehrig, René Widera, Michael Bussmann, Uwe Hernandez Acosta | 2025-11-20 | Download | Complex computational problems in science often consist of smaller parts that can have largely distinct compute requirements from one another. |
| Fast LLM Post-training via Decoupled and Fastest-of-N Speculation | Rongxin Cheng, Kai Zhou, Xingda Wei, Siyuan Liu, Mingcong Han, Mingjing Ai, Yeju Zhou, Baoquan Zhong, Wencong Xiao, Rong Chen, Haibo Chen | 2025-11-20 | Download | Rollout dominates the training time in large language model (LLM) post-training, where the trained model is used to generate tokens given a batch of prompts. |
| Mitigating Shared Storage Congestion Using Control Theory | Thomas Collignon, Kouds Halitim, Raphaël Bleuse, Sophie Cerf, Bogdan Robu, Éric Rutten, Lionel Seinturier, Alexandre van Kempen | 2025-11-20 | Download | Efficient data access in High-Performance Computing (HPC) systems is essential to the performance of intensive computing tasks. Traditional optimizations of the I/O stack aim to improve peak performan... |
| Pipelined Dense Symmetric Eigenvalue Decomposition on Multi-GPU Architectures | Hansheng Wang, Ruiyi Zhan, Dajun Huang, Xingchen Liu, Qiao Li, Hancong Duan, Dingwen Tao, Guangming Tan, Shaoshuai Zhang | 2025-11-20 | Download | Large symmetric eigenvalue problems are commonly observed in many disciplines such as Chemistry and Physics, and several libraries including cuSOLVERMp, MAGMA and ELPA support computing large eigenval... |
| Can Asymmetric Tile Buffering Be Beneficial? | Chengyue Wang, Wesley Pang, Xinrui Wu, Gregory Jun, Luis Romero, Endri Taka, Diana Marculescu, Tony Nowatzki, Pranathi Vasireddy, Joseph Melber, Deming Chen, Jason Cong | 2025-11-20 | Download | General matrix multiplication (GEMM) is the computational backbone of modern AI workloads, and its efficiency is critically dependent on effective tiling strategies. |
| Digital Agriculture Sandbox for Collaborative Research | Osama Zafar, Rosemarie Santa González, Alfonso Morales, Erman Ayday | 2025-11-20 | Download | Digital agriculture is transforming the way we grow food by utilizing technology to make farming more efficient, sustainable, and productive. This modern approach to agriculture generates a wealth of ... |
| Efficient Chromosome Parallelization for Precision Medicine Genomic Workflows | Daniel Mas Montserrat, Ray Verma, Míriam Barrabés, Francisco M. de la Vega, Carlos D. Bustamante, Alexander G. Ioannidis | 2025-11-20 | Download | Large-scale genomic workflows used in precision medicine can process datasets spanning tens to hundreds of gigabytes per sample, leading to high memory spikes, intensive disk I/O, and task failures du... |
| Optimizing Communication in Byzantine Agreement Protocols with Slim-HBBFT | Nasit S Sony, Xianzhong Ding | 2025-11-20 | Download | Byzantine agreement protocols in asynchronous networks have received renewed interest because they do not rely on network behavior to achieve termination. |
| A Scalable NorthPole System with End-to-End Vertical Integration for Low-Latency and Energy-Efficient LLM Inference | Michael V. DeBole, Rathinakumar Appuswamy, Neil McGlohon, Brian Taba, Steven K. Esser, Filipp Akopyan, John V. Arthur, Arnon Amir, Alexander Andreopoulos, Peter J. Carlson, Andrew S. Cassidy, Pallab Datta, Myron D. Flickner, Rajamohan Gandhasri, Guillaume J. Garreau, Megumi Ito, Jennifer L. Klamo, Jeffrey A. Kusnitz, Nathaniel J. McClatchey, Jeffrey L. McKinstry, Tapan K. Nayak, Carlos Ortega Otero, Hartmut Penner, William P. Risk, Jun Sawada, Jay Sivagnaname, Daniel F. Smith, Rafael Sousa, Ignacio Terrizzano, Takanori Ueda, Trent Gray-Donald, David Cox, Dharmendra S. Modha | 2025-11-20 | Download | A vertically integrated, end-to-end, research prototype system combines 288 NorthPole neural inference accelerator cards, offline training algorithms, a high-performance runtime stack, and a container... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| A streaming algorithm and hardware accelerator for top-K flow detection in network traffic | Carolina Gallardo-Pavesi, Yaime Fernández, Javier E. Soto, Cecilia Hernández, Miguel Figueroa | 2025-11-20 | Download | Identifying the largest K flows in network traffic is an important task for applications such as flow scheduling and anomaly detection, which aim to improve network efficiency and security. |
| Performance Comparison of 5G NR Uplink MIMO and Uplink Carrier Aggregations on Commercial Network | Henry Shao, Kasidis Arunruangsirilert | 2025-11-20 | Download | Demands for uplink on mobile networks are increasing with the rapid development of social media platforms, 4K/8K content creation, IoT applications, and Fixed Wireless Access (FWA) broadband. |
| Optimizing Quantum Key Distribution Network Performance using Graph Neural Networks | Akshit Pramod Anchan, Ameiy Acharya, Leki Chom Thungon | 2025-11-20 | Download | This paper proposes an optimization of Quantum Key Distribution (QKD) Networks using a Graph Neural Network (GNN) framework. Today, the development of quantum computers threatens the security systems o... |
| Payment-failure times for random Lightning paths | Taki E. M. Abedesselam, Fabio Giacomelli, Francesco Pasquale, Michele Salvi | 2025-11-20 | Download | We study a random process over graphs inspired by the way payments are executed in the Lightning Network, the main layer-two solution on top of Bitcoin. |
| Reasoning Meets Representation: Envisioning Neuro-Symbolic Wireless Foundation Models | Jaron Fontaine, Mohammad Cheraghinia, John Strassner, Adnan Shahid, Eli De Poorter | 2025-11-20 | Download | Recent advances in Wireless Physical Layer Foundation Models (WPFMs) promise a new paradigm of universal Radio Frequency (RF) representations. |
| Toward hyper-adaptive AI-enabled 6G networks for energy efficiency: techniques, classifications and tradeoffs | Mariem Zayene, Oussama Habachi, Gerard Chalhoub | 2025-11-20 | Download | Energy efficiency is shaping up to be one of the most challenging issues for 6G networks. The reason is fairly straightforward: Networks will need to meet extreme service demands while remaining susta... |
| Multi-Band Wireless Access-and-Backhaul (WAB) for 5G: Implementation and Experiments | Chiara Rubaltelli, Marcello Morini, Eugenio Moro, Ilario Filippini | 2025-11-20 | Download | Highly dynamic and mobile applications, such as vehicular networks, require stable connectivity, which is often challenging to achieve. Network densification is a key approach to address this issue an... |
| Green Distributed AI Training: Orchestrating Compute Across Renewable-Powered Micro Datacenters | Giuseppe Tomei, Andrea Mayer, Giuseppe Alcini, Stefano Salsano | 2025-11-20 | Download | The accelerating expansion of AI workloads is colliding with an energy landscape increasingly dominated by intermittent renewable generation. While vast quantities of zero-carbon energy are routinely ... |
| Bio-inspired Integrated Networking and Control for Large-Scale Swarm: A Hierarchical Co-design | Huan Lin, Dakai Liu, Lianghui Ding, Lin Wang, Feng Yang | 2025-11-20 | Download | Unmanned aerial vehicle (UAV) swarms encounter the challenge of high overhead due to both network management and formation control requirements. |
| Modeling Pointing, Acquisition, and Tracking Delays in Free-Space Optical Satellite Networks | Jason Gerard, Juan A. Fraire, Sandra Céspedes | 2025-11-20 | Download | Free-space optical inter-satellite links (OISLs) enable high-capacity space communications but require precise Pointing, Acquisition, and Tracking (PAT) between links. |
| Graph-Aware Temporal Encoder Based Service Migration and Resource Allocation in Satellite Networks | Haotong Wang, Jun Du, Chunxiao Jiang, Jintao Wang, Mérouane Debbah, Zhu Han | 2025-11-20 | Download | The rapid expansion of latency-sensitive applications has sparked renewed interest in deploying edge computing capabilities aboard satellite constellations, aiming to achieve truly global and seamless... |
| Machine Learning Epidemic Predictions Using Agent-based Wireless Sensor Network Models | Chukwunonso Henry Nwokoye, Blessing Oluchi, Sharna Waldron, Peace Ezzeh | 2025-11-20 | Download | The lack of epidemiological data in wireless sensor networks (WSNs) is a fundamental difficulty in constructing robust models to forecast and mitigate threats such as viruses and worms. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Algorithms and optimizations for global non-linear hybrid fluid-kinetic finite element stellarator simulations | Luca Venerando Greco | 2025-11-20 | Download | Predictive modeling of stellarator plasmas is crucial for advancing nuclear fusion energy, yet it faces unique computational difficulties. One of the main challenges is accurately simulating the dynam... |
| Optimizations on Graph-Level for Domain Specific Computations in Julia and Application to QED | Anton Reinhard, Simeon Ehrig, René Widera, Michael Bussmann, Uwe Hernandez Acosta | 2025-11-20 | Download | Complex computational problems in science often consist of smaller parts that can have largely distinct compute requirements from one another. |
| Can Asymmetric Tile Buffering Be Beneficial? | Chengyue Wang, Wesley Pang, Xinrui Wu, Gregory Jun, Luis Romero, Endri Taka, Diana Marculescu, Tony Nowatzki, Pranathi Vasireddy, Joseph Melber, Deming Chen, Jason Cong | 2025-11-20 | Download | General matrix multiplication (GEMM) is the computational backbone of modern AI workloads, and its efficiency is critically dependent on effective tiling strategies. |
| Efficient Chromosome Parallelization for Precision Medicine Genomic Workflows | Daniel Mas Montserrat, Ray Verma, Míriam Barrabés, Francisco M. de la Vega, Carlos D. Bustamante, Alexander G. Ioannidis | 2025-11-20 | Download | Large-scale genomic workflows used in precision medicine can process datasets spanning tens to hundreds of gigabytes per sample, leading to high memory spikes, intensive disk I/O, and task failures du... |