Appearance
2026-01-27
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| How Much Progress Has There Been in NVIDIA Datacenter GPUs? | Emanuele Del Sozzo, Martin Fleming, Kenneth Flamm, Neil Thompson | 2026-01-27 | 下载 | Graphics Processing Units (GPUs) are the state-of-the-art architecture for essential tasks, ranging from rendering 2D/3D graphics to accelerating workloads in supercomputing centers and, of course, Ar... |
| A Paradigm for Generalized Multi-Level Priority Encoders | Maxwell Phillips, Firas Hassan, Ahmed Ammar | 2026-01-27 | 下载 | Priority encoders are typically considered expensive hardware components in terms of complexity, especially at high bit precisions or input lengths (e.g., above 512 bits). |
| Primitive-Driven Acceleration of Hyperdimensional Computing for Real-Time Image Classification | Dhruv Parikh, Jebacyril Arockiaraj, Viktor Prasanna | 2026-01-27 | 下载 | Hyperdimensional Computing (HDC) represents data using extremely high-dimensional, low-precision vectors, termed hypervectors (HVs), and performs learning and inference through lightweight, noise-tole... |
| Veri-Sure: A Contract-Aware Multi-Agent Framework with Temporal Tracing and Formal Verification for Correct RTL Code Generation | Jiale Liu, Taiyu Zhou, Tianqi Jiang | 2026-01-27 | 下载 | In the rapidly evolving field of Electronic Design Automation (EDA), the deployment of Large Language Models (LLMs) for Register-Transfer Level (RTL) design has emerged as a promising direction. |
| GenPairX: A Hardware-Algorithm Co-Designed Accelerator for Paired-End Read Mapping | Julien Eudine, Chu Li, Zhuo Cheng, Renzo Andri, Can Firtina, Mohammad Sadrosadati, Nika Mansouri Ghiasi, Konstantina Koliogeorgi, Anirban Nag, Arash Tavakkol, Haiyu Mao, Onur Mutlu, Shai Bergman, Ji Zhang | 2026-01-27 | 下载 | Genome sequencing has become a central focus in computational biology. A genome study typically begins with sequencing, which produces millions to billions of short DNA fragments known as reads. |
| A Reconfigurable Framework for AI-FPGA Agent Integration and Acceleration | Aybars Yunusoglu, Talha Coskun, Hiruna Vishwamith, Murat Isik, I. Can Dikmen | 2026-01-27 | 下载 | Artificial intelligence (AI) is increasingly deployed in real-time and energy-constrained environments, driving demand for hardware platforms that can deliver high performance and power efficiency. |
| M2XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization | Weiming Hu, Zihan Zhang, Haoyan Zhang, Chen Zhang, Cong Guo, Yu Feng, Tianchi Hu, Guanglin Li, Guipeng Hu, Junsong Wang, Jingwen Leng | 2026-01-27 | 下载 | Existing low-bit Microscaling (MX) formats, such as MXFP4, often suffer from substantial accuracy degradation due to the use of a shared scaling factor with the Power-of-Two format. |
| In-Network Collective Operations: Game Changer or Challenge for AI Workloads? | Torsten Hoefler, Mikhail Khalilov, Josiah Clark, Surendra Anubolu, Mohan Kalkunte, Karen Schramm, Eric Spada, Duncan Roweth, Keith Underwood, Adrian Caulfield, Abdul Kabbani, Amirreza Rastegari | 2026-01-27 | 下载 | This paper summarizes the opportunities of in-network collective operations (INC) for accelerated collective operations in AI workloads. We provide sufficient detail to make this important field acces... |
| Probabilistic Sensing: Intelligence in Data Sampling | Ibrahim Albulushi, Saleh Bunaiyan, Suraj S. Cheema, Hesham ElSawy, Feras Al-Dirini | 2026-01-27 | 下载 | Extending the intelligence of sensors to the data-acquisition process - deciding whether to sample or not - can result in transformative energy-efficiency gains. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| A Data-Informed Local Subspaces Method for Error-Bounded Lossy Compression of Large-Scale Scientific Datasets | Arshan Khan, Rohit Deshmukh, Ben O'Neill | 2026-01-27 | 下载 | The growing volume of scientific simulation data presents a significant challenge for storage and transfer. Error-bounded lossy compression has emerged as a critical solution for mitigating these chal... |
| Mapping Gemma3 onto an Edge Dataflow Architecture | Shouyu Du, Miaoxiang Yu, Zhenyu Xu, Zhiheng Ni, Jillian Cai, Qing Yang, Tao Wei | 2026-01-27 | 下载 | We present the first end-to-end deployment of the Gemma3 family of large language and vision models on a tiled edge dataflow architecture (AMD Ryzen AI NPU). |
| Delta Fair Sharing: Performance Isolation for Multi-Tenant Storage Systems | Tyler Griggs, Soujanya Ponnapalli, Dev Bali, Wenjie Ma, James DeLoye, Audrey Cheng, Jaewan Hong, Natacha Crooks, Scott Shenker, Ion Stoica, Matei Zaharia | 2026-01-27 | 下载 | Modern storage systems, often deployed to support multiple tenants in the cloud, must provide performance isolation. Unfortunately, traditional approaches such as fair sharing do not provide performan... |
| Enabling SSI-Compliant Use of EUDI Wallet Credentials through Trusted Execution Environment and Zero-Knowledge Proof | Nacereddine Sitouah, Francesco Bruschi, Stefano De Cillis | 2026-01-27 | 下载 | The passing of the eIDAS amendment marks an important milestone for EU countries and changes how they must manage digital credentials for both public services and businesses. |
| Self-Sovereign Identity and eIDAS 2.0: An Analysis of Control, Privacy, and Legal Implications | Nacereddine Sitouah, Marco Esposito, Francesco Bruschi | 2026-01-27 | 下载 | European digital identity initiatives are grounded in regulatory frameworks designed to ensure interoperability and robust, harmonized security standards. |
| Knowledge-Aware Evolution for Streaming Federated Continual Learning with Category Overlap and without Task Identifiers | Sixing Tan, Xianmin Liu | 2026-01-27 | 下载 | Federated Continual Learning (FCL) leverages inter-client collaboration to balance new knowledge acquisition and prior knowledge retention in non-stationary data. |
| Accelerating radio astronomy imaging with RICK: a step towards SKA-Mid and SKA-Low | Giovanni Lacopo, Emanuele De Rubeis, Claudio Gheller, Giuliano Taffoni, Luca Tornatore | 2026-01-27 | 下载 | The data volumes generated by modern radio interferometers, such as the SKA precursors, present significant computational challenges for imaging pipelines. |
| Convex Hull 3D Filtering with GPU Ray Tracing and Tensor Cores | Roberto Carrasco, Enzo Meneses, Hector Ferrada, Cristobal A. Navarro, Nancy Hitschfeld | 2026-01-27 | 下载 | In recent years, applications such as real-time simulations, autonomous systems, and video games increasingly demand the processing of complex geometric models under stringent time constraints. |
| Modular Foundation Model Inference at the Edge: Network-Aware Microservice Optimization | Juan Zhu, Zixin Wang, Shenghui Song, Jun Zhang, Khaled Ben Letaief | 2026-01-27 | 下载 | Foundation models (FMs) unlock unprecedented multimodal and multitask intelligence, yet their cloud-centric deployment precludes real-time responsiveness and compromises user privacy. |
| NET4EXA: Pioneering the Future of Interconnects for Supercomputing and AI | Michele Martinelli, Roberto Ammendola, Andrea Biagioni, Carlotta Chiarini, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Elena Pastorelli, Pierpaolo Perticaroli, Luca Pontisso, Cristian Rossi, Francesco Simula, Piero Vicini, David Colin, Grégoire Pichon, Alexandre Louvet, John Gliksberg, Claire Chen, Matteo Turisini, Andrea Monterubbiano, Jean-Philippe Nominé, Denis Dutoit, Hugo Taboada, Lilia Zaourar, Mohamed Benazouz, Angelos Bilas, Fabien Chaix, Manolis Katevenis, Nikolaos Chrysos, Evangelos Mageiropoulos, Christos Kozanitis, Thomas Moen, Steffen Persvold, Einar Rustad, Sandro Fiore, Fabrizio Granelli, Simone Pezzuto, Raffaello Potestio, Luca Tubiana, Philippe Velha, Flavio Vella, Daniele De Sensi, Salvatore Pontarelli | 2026-01-27 | 下载 | NET4EXA aims to develop a next-generation high-performance interconnect for HPC and AI systems, addressing the increasing demands of large-scale infrastructures, such as those required for training La... |
| Decentralized Nonsmooth Nonconvex Optimization with Client Sampling | Xinyan Chen, Weiguo Gao, Luo Luo | 2026-01-27 | 下载 | This paper considers decentralized nonsmooth nonconvex optimization problem with Lipschitz continuous local functions. We propose an efficient stochastic first-order method with client sampling, achie... |
| Revisiting Parameter Server in LLM Post-Training | Xinyi Wan, Penghui Qi, Guangxing Huang, Chaoyi Ruan, Min Lin, Jialin Li | 2026-01-27 | 下载 | Modern data parallel (DP) training favors collective communication over parameter servers (PS) for its simplicity and efficiency under balanced workloads. |
| KUBEDIRECT: Unleashing the Full Power of the Cluster Manager for Serverless Computing | Sheng Qi, Zhiquan Zhang, Xuanzhe Liu, Xin Jin | 2026-01-27 | 下载 | FaaS platforms rely on cluster managers like Kubernetes for resource management. Kubernetes is popular due to its state-centric APIs that decouple the control plane into modular controllers. |
| Native LLM and MLLM Inference at Scale on Apple Silicon | Wayner Barrios | 2026-01-27 | 下载 | The growing adoption of Apple Silicon for machine learning development has created demand for efficient inference solutions that leverage its unique unified memory architecture. |
| Axe: A Simple Unified Layout Abstraction for Machine Learning Compilers | Bohan Hou, Hongyi Jin, Guanjie Wang, Jinqi Chen, Yaxing Cai, Lijie Yang, Zihao Ye, Yaoyao Ding, Ruihang Lai, Tianqi Chen | 2026-01-27 | 下载 | Scaling modern deep learning workloads demands coordinated placement of data and compute across device meshes, memory hierarchies, and heterogeneous accelerators. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Quantum Takes Flight: Two-Stage Resilient Topology Optimization for UAV Networks | Huixiang Zhang, Mahzabeen Emu, Octavia A. Dobre | 2026-01-27 | 下载 | Next-generation Unmanned Aerial Vehicle (UAV) communication networks must maintain reliable connectivity under rapid topology changes, fluctuating link quality, and time-critical data exchange. |
| An Agentic AI Control Plane for 6G Network Slice Orchestration, Monitoring, and Trading | Eranga Bandara, Ross Gore, Sachin Shetty, Ravi Mukkamala, Tharaka Hewa, Abdul Rahman, Xueping Liang, Safdar H. Bouk, Amin Hass, Peter Foytik, Ng Wee Keong, Kasun De Zoysa | 2026-01-27 | 下载 | 6G networks are expected to be AI-native, intent-driven, and economically programmable, requiring fundamentally new approaches to network slice orchestration. |
| NET4EXA: Pioneering the Future of Interconnects for Supercomputing and AI | Michele Martinelli, Roberto Ammendola, Andrea Biagioni, Carlotta Chiarini, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Elena Pastorelli, Pierpaolo Perticaroli, Luca Pontisso, Cristian Rossi, Francesco Simula, Piero Vicini, David Colin, Grégoire Pichon, Alexandre Louvet, John Gliksberg, Claire Chen, Matteo Turisini, Andrea Monterubbiano, Jean-Philippe Nominé, Denis Dutoit, Hugo Taboada, Lilia Zaourar, Mohamed Benazouz, Angelos Bilas, Fabien Chaix, Manolis Katevenis, Nikolaos Chrysos, Evangelos Mageiropoulos, Christos Kozanitis, Thomas Moen, Steffen Persvold, Einar Rustad, Sandro Fiore, Fabrizio Granelli, Simone Pezzuto, Raffaello Potestio, Luca Tubiana, Philippe Velha, Flavio Vella, Daniele De Sensi, Salvatore Pontarelli | 2026-01-27 | 下载 | NET4EXA aims to develop a next-generation high-performance interconnect for HPC and AI systems, addressing the increasing demands of large-scale infrastructures, such as those required for training La... |
| Bridging Visual and Wireless Sensing: A Unified Radiation Field for 3D Radio Map Construction | Chaozheng Wen, Jingwen Tong, Zehong Lin, Chenghong Bian, Jun Zhang | 2026-01-27 | 下载 | The emerging applications of next-generation wireless networks (e.g., immersive 3D communication, low-altitude networks, and integrated sensing and communication) necessitate high-fidelity environment... |
| Enabling SLO-Aware 5G Multi-Access Edge Computing with SMEC | Xiao Zhang, Daehyeok Kim | 2026-01-27 | 下载 | Multi-access edge computing (MEC) promises to enable latency-critical applications by bringing computational power closer to mobile devices, but our measurements on commercial MEC deployments reveal f... |
| In-Network Collective Operations: Game Changer or Challenge for AI Workloads? | Torsten Hoefler, Mikhail Khalilov, Josiah Clark, Surendra Anubolu, Mohan Kalkunte, Karen Schramm, Eric Spada, Duncan Roweth, Keith Underwood, Adrian Caulfield, Abdul Kabbani, Amirreza Rastegari | 2026-01-27 | 下载 | This paper summarizes the opportunities of in-network collective operations (INC) for accelerated collective operations in AI workloads. We provide sufficient detail to make this important field acces... |
| UAV-Mounted Aerial Relays in Military Communications: A Comprehensive Survey | Faisal Al-Kamali, Francois Chan, Hussein A. Ammar, James H. Bayes, Claude D'Amours | 2026-01-27 | 下载 | Relays are pivotal in military communication networks, expanding coverage and ensuring reliable connectivity in challenging operational environments. |
| FTA-NTN: Fairness and Throughput Assurance in Non-Terrestrial Networks | Sachin Ravikant Trankatwar, Heiko Straulino, Petar Djukic, Burak Kantarci | 2026-01-27 | 下载 | Designing optimal non-terrestrial network (NTN) constellations is essential for maximizing throughput and ensuring fair resource distribution. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| In-Network Collective Operations: Game Changer or Challenge for AI Workloads? | Torsten Hoefler, Mikhail Khalilov, Josiah Clark, Surendra Anubolu, Mohan Kalkunte, Karen Schramm, Eric Spada, Duncan Roweth, Keith Underwood, Adrian Caulfield, Abdul Kabbani, Amirreza Rastegari | 2026-01-27 | 下载 | This paper summarizes the opportunities of in-network collective operations (INC) for accelerated collective operations in AI workloads. We provide sufficient detail to make this important field acces... |