2025-10-12
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| HYPERDOA: Robust and Efficient DoA Estimation using Hyperdimensional Computing | Rajat Bhattacharjya, Woohyeok Park, Arnab Sarkar, Hyunwoo Oh, Mohsen Imani, Nikil Dutt | 2025-10-12 | Download | Direction of Arrival (DoA) estimation techniques face a critical trade-off, as classical methods often lack accuracy in challenging, low signal-to-noise ratio (SNR) conditions, while modern deep learn... |
| Bhasha-Rupantarika: Algorithm-Hardware Co-design approach for Multilingual Neural Machine Translation | Mukul Lokhande, Tanushree Dewangan, Mohd Sharik Mansoori, Tejas Chaudhari, Akarsh J., Damayanti Lokhande, Adam Teman, Santosh Kumar Vishvakarma | 2025-10-12 | Download | This paper introduces Bhasha-Rupantarika, a light and efficient multilingual translation system tailored through algorithm-hardware codesign for resource-limited settings. |
| ADiP: Adaptive-Precision Systolic Array for Matrix Multiplication Acceleration | Ahmed J. Abdelmaksoud, Cristian Sestito, Shiwei Wang, Themis Prodromakis | 2025-10-12 | Download | Transformers are at the core of modern AI nowadays. They rely heavily on matrix multiplication and require efficient acceleration due to their substantial memory and computational requirements. |
| Self-Attention to Operator Learning-based 3D-IC Thermal Simulation | Zhen Huang, Hong Wang, Wenkai Yang, Muxi Tang, Depeng Xie, Ting-Jung Lin, Yu Zhang, Wei W. Xing, Lei He | 2025-10-12 | Download | Thermal management in 3D ICs is increasingly challenging due to higher power densities. Traditional PDE-solving-based methods, while accurate, are too slow for iterative design. |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| FIDRS: A Novel Framework for Integrated Distributed Reliable Systems | Mehdi Zekriyapanah Gashti | 2025-10-12 | Download | In this paper we represent a new framework for integrated distributed and reliable systems. In the proposed framework we have used three parts to increase Satisfaction and Performance of this framewor... |
| Fair Kernel-Lock-Free Claim/Release Protocol for Shared Object Access in Cooperatively Scheduled Runtimes | Kevin Chalmers, Jan Bækgaard Pedersen | 2025-10-12 | Download | We present the first spin-free, kernel-lock-free mutex that cooperates with user-mode schedulers and is formally proven FIFO-fair and linearizable using CSP/FDR. |
| SPHERE: Spherical partitioning for large-scale routing optimization | Robert Fabian Lindermann, Paul-Niklas Ken Kandora, Simon Caspar Zeller, Adrian Asmund Fessler, Steffen Rebennack | 2025-10-12 | Download | We study shortest-path routing in large weighted, undirected graphs, where expanding search frontiers raise time and memory costs for exact solvers. |
| CPU-Limits kill Performance: Time to rethink Resource Control | Chirag Shetty, Sarthak Chakraborty, Hubertus Franke, Larisa Shwartz, Chandra Narayanaswami, Indranil Gupta, Saurabh Jha | 2025-10-12 | Download | Research in compute resource management for cloud-native applications is dominated by the problem of setting optimal CPU limits -- a fundamental OS mechanism that strictly restricts a container's CPU ... |
| DCP: Addressing Input Dynamism In Long-Context Training via Dynamic Context Parallelism | Chenyu Jiang, Zhenkun Cai, Ye Tian, Zhen Jia, Yida Wang, Chuan Wu | 2025-10-12 | Download | Context parallelism has emerged as a key technique to support long-context training, a growing trend in generative AI for modern large models. |
| Multitask Learning with Learned Task Relationships | Zirui Wan, Stefan Vlaski | 2025-10-12 | Download | Classical consensus-based strategies for federated and decentralized learning are statistically suboptimal in the presence of heterogeneous local data or task distributions. |
| A Verified High-Performance Composable Object Library for Remote Direct Memory Access (Extended Version) | Guillaume Ambal, George Hodgkins, Mark Madler, Gregory Chockler, Brijesh Dongol, Joseph Izraelevitz, Azalea Raad, Viktor Vafeiadis | 2025-10-12 | Download | Remote Direct Memory Access (RDMA) is a memory technology that allows remote devices to directly write to and read from each other's memory, bypassing components such as the CPU and operating system. |
| FLAMMABLE: A Multi-Model Federated Learning Framework with Multi-Model Engagement and Adaptive Batch Sizes | Shouxu Lin, Zimeng Pan, Yuhang Yao, Haeyoung Noh, Pei Zhang, Carlee Joe-Wong | 2025-10-12 | Download | Multi-Model Federated Learning (MMFL) is an emerging direction in Federated Learning (FL) where multiple models are trained in parallel, generally on various datasets. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| A Framework for AI-Native Semantic-Based Dynamic Slicing for 6G Networks | Mayukh Roy Chowdhury, Eman Hammad, Lauri Loven, Susanna Pirttikangas, Aloizio P da Silva, Walid Saad | 2025-10-12 | Download | In the ensuing ultra-dense and diverse environment in future 6G communication networks, it will be critical to optimize network resources via mechanisms that recognize and cater to the diversity,... |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| CPU-Limits kill Performance: Time to rethink Resource Control | Chirag Shetty, Sarthak Chakraborty, Hubertus Franke, Larisa Shwartz, Chandra Narayanaswami, Indranil Gupta, Saurabh Jha | 2025-10-12 | Download | Research in compute resource management for cloud-native applications is dominated by the problem of setting optimal CPU limits -- a fundamental OS mechanism that strictly restricts a container's CPU ... |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| CPU-Limits kill Performance: Time to rethink Resource Control | Chirag Shetty, Sarthak Chakraborty, Hubertus Franke, Larisa Shwartz, Chandra Narayanaswami, Indranil Gupta, Saurabh Jha | 2025-10-12 | Download | Research in compute resource management for cloud-native applications is dominated by the problem of setting optimal CPU limits -- a fundamental OS mechanism that strictly restricts a container's CPU ... |
| CAPSim: A Fast CPU Performance Simulator Using Attention-based Predictor | Buqing Xu, Jianfeng Zhu, Yichi Zhang, Qinyi Cai, Guanhua Li, Shaojun Wei, Leibo Liu | 2025-10-12 | Download | CPU simulators are vital for computer architecture research, primarily for estimating performance under different programs. This poses challenges for fast and accurate simulation of modern CPUs, espec... |