Appearance
2026-04-12
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| The xPU-athalon: Quantifying the Competition of AI Acceleration | Alicia Golden, Carole-Jean Wu, Gu-Yeon Wei, David Brooks | 2026-04-12 | 下载 | The push for greater efficiency in AI computation has given rise to an array of accelerator architectures that increasingly challenge the GPU's long-standing dominance. |
| Harnessing Photonics for Machine Intelligence | Hanqing Zhu, Shupeng Ning, Hongjian Zhou, Ziang Yin, Ray T. Chen, Jiaqi Gu, David Z. Pan | 2026-04-12 | 下载 | The exponential growth of machine-intelligence workloads is colliding with the power, memory, and interconnect limits of the post-Moore era, motivating compute substrates that scale beyond transistor ... |
| EMSpice 3: Full-chip Temperature-Aware Multiphysics Electromigration and IR-Drop Analysis | Haotian Lu, Sheldon X. -D. Tan | 2026-04-12 | 下载 | In this work, we present EMSpice 3, a full-chip temperature-aware multiphysics framework for coupled electromigration (EM), thermomigration (TM), and IR-drop analysis of power-grid networks. |
| L-PCN: A Point Cloud Accelerator Exploiting Spatial Locality through Octree-based Islandization | Yiming Gao, Jieming Yin, Yuxiang Wang, Xiangru Chen, Zhilei Chai, Bowen Jiang, Jiliang Zhang, Herman Lam | 2026-04-12 | 下载 | Existing Point Cloud Networks (PCNs) have proven to achieve great success in many point cloud tasks such as object part segmentation, shape classification, and so on. |
| From Characterization to Microarchitecture: Designing an Elegant and Reliable BFP-Based NPU | Jie Zhang, Jiapeng Guan, Hao Zhou, Xiaomeng Han, Tinglue Wang, Ran Wei, Zhe Jiang | 2026-04-12 | 下载 | Block Floating-Point (BFP) is emerging as an attractive data format for edge Neural Processing Units (NPUs), combining wide dynamic range with high hardware efficiency. |
| Strix: Re-thinking NPU Reliability from a System Perspective | Jiapeng Guan, Jie Zhang, Hao Zhou, Ran Wei, Dean You, Hui Wang, Yingquan Wang, Tinglue Wang, Xudong Zhao, Jing Li, Zhe Jiang | 2026-04-12 | 下载 | DNNs and LLMs increasingly rely on hardware accelerators, including in safety-critical domains, while technology scaling and growing model complexity make hardware faults more frequent. |
| LLM-PRISM: Characterizing Silent Data Corruption from Permanent GPU Faults in LLM Training | Abhishek Tyagi, Saurabh Hukerikar, Nirmal Saxena, Yanxiang Huang, Philip Shirvani, Chung-Hsuan Tung, Yuhao Zhu | 2026-04-12 | 下载 | Large-scale LLM training is increasingly susceptible to hardware defects stemming from manufacturing escapes and silicon aging. These defects manifest as Silent Data Corruption (SDC) that perturb grad... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Understanding Communication Backends in Cross-Silo Federated Learning | Amir Ziashahabi, Chaoyang He, Salman Avestimehr | 2026-04-12 | 下载 | Federated learning (FL) has emerged as a practical means for privacy-preserving distributed machine learning. FL's versatile design makes it suitable for various training settings, from IoT edge devic... |
| Workload composition smooths aggregate power demand while sustaining short-horizon ramps in AI data centers | Subir Majumder, Minlan Yu, Le Xie | 2026-04-12 | 下载 | Artificial intelligence (AI) is driving rapid growth in electricity demand, yet the grid-facing power dynamics of AI data centers remain poorly understood. |
| Bipartite matching under communication constraints | Moonmoon Mohanty, Gautham Bolar, Preetam Patil, Ayalvadi Ganesh, Jean-Francois Chamberland, Parimal Parag | 2026-04-12 | 下载 | In modern data center networks, thousands of hosts contend for shared link capacity; the scale of these systems makes centralized scheduling impractical. |
| COD-ssi: Enforcing Mutual Privacy for Credential Oblivious Disclosure in Self Sovereign Identity | Elia Onofri, Andrea De Salve, Paolo Mori, Laura Emilia Maria Ricci, Roberto Di Pietro | 2026-04-12 | 下载 | The Self-Sovereign Identity (SSI) paradigm is instrumental for decentralised identity management, allowing an entity to create, manage, and present their digital credentials without relying on central... |
| FEDBUD: Joint Incentive and Privacy Optimization for Resource-Constrained Federated Learning | Tao Liu, Xuehe Wang | 2026-04-12 | 下载 | Federated learning has become a popular paradigm for privacy protection and edge-based machine learning. However, defending against differential attacks and devising incentive strategies remain signif... |
| CIR: Lightweight Container Image for Cross-Platform Deployment | Fengzhi Li, Xiaohui Peng, Qingru Xu, Qisong Shi, Tuo Zhou, Yongxuan Dai, Yifan Wang, Ninghui Sun, Zhiwei Xu | 2026-04-12 | 下载 | In modern cloud and heterogeneous distributed infrastructures, container images are widely used as the deployment unit for machine learning applications. |
| Leveraging Mathematical Reasoning of LLMs for Efficient GPU Thread Mapping | Jose Maureira, Cristóbal A. Navarro, Hector Ferrada, Luis Veas-Castillo | 2026-04-12 | 下载 | Mapping parallel threads onto non-box-shaped domains is a known challenge in GPU computing that, if done efficiently, can prevent severe performance penalties from allocating unnecessary computational... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Bipartite matching under communication constraints | Moonmoon Mohanty, Gautham Bolar, Preetam Patil, Ayalvadi Ganesh, Jean-Francois Chamberland, Parimal Parag | 2026-04-12 | 下载 | In modern data center networks, thousands of hosts contend for shared link capacity; the scale of these systems makes centralized scheduling impractical. |