Appearance
2025-08-26
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| GENIE-ASI: Generative Instruction and Executable Code for Analog Subcircuit Identification | Phuoc Pham, Arun Venkitaraman, Chia-Yu Hsieh, Andrea Bonetti, Stefan Uhlich, Markus Leibl, Simon Hofmann, Eisaku Ohbuchi, Lorenzo Servadei, Ulf Schlichtmann, Robert Wille | 2025-08-26 | 下载 | Analog subcircuit identification is a core task in analog design, essential for simulation, sizing, and layout. Traditional methods often require extensive human expertise, rule-based encoding, or lar... |
| Architecting Distributed Quantum Computers: Design Insights from Resource Estimation | Dmitry Filippov, Peter Yang, Prakash Murali | 2025-08-26 | 下载 | To enable practically useful quantum computing, we require hundreds to thousands of logical qubits (collections of physical qubits with error correction). |
| Building an Open CGRA Ecosystem for Agile Innovation | Rohan Juneja, Pranav Dangi, Thilini Kaushalya Bandara, Zhaoying Li, Dhananjaya Wijerathne, Li-Shiuan Peh, Tulika Mitra | 2025-08-26 | 下载 | Modern computing workloads, particularly in AI and edge applications, demand hardware-software co-design to meet aggressive performance and energy targets. |
| APT-LLM: Exploiting Arbitrary-Precision Tensor Core Computing for LLM Acceleration | Shaobo Ma, Chao Fang, Haikuo Shao, Zhongfeng Wang | 2025-08-26 | 下载 | Large language models (LLMs) have revolutionized AI applications, yet their enormous computational demands severely limit deployment and real-time performance. |
| TaiBai: A fully programmable brain-inspired processor with topology-aware efficiency | Qianpeng Li, Yu Song, Xin Liu, Wenna Song, Boshi Zhao, Zhichao Wang, Aoxin Chen, Tielin Zhang, Liang Chen | 2025-08-26 | 下载 | Brain-inspired computing has emerged as a promising paradigm to overcome the energy-efficiency limitations of conventional intelligent systems by emulating the brain's partitioned architecture and eve... |
| SeDA: Secure and Efficient DNN Accelerators with Hardware/Software Synergy | Wei Xuan, Zhongrui Wang, Lang Feng, Ning Lin, Zihao Xuan, Rongliang Fu, Tsung-Yi Ho, Yuzhong Jiao, Luhong Liang | 2025-08-26 | 下载 | Ensuring the confidentiality and integrity of DNN accelerators is paramount across various scenarios spanning autonomous driving, healthcare, and finance. |
| Beyond Tokens: Enhancing RTL Quality Estimation via Structural Graph Learning | Yi Liu, Hongji Zhang, Yiwen Wang, Dimitris Tsaras, Lei Chen, Mingxuan Yuan, Qiang Xu | 2025-08-26 | 下载 | Estimating the quality of register transfer level (RTL) designs is crucial in the electronic design automation (EDA) workflow, as it enables instant feedback on key metrics like area and delay without... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Formal Modeling and Verification of the Algorand Consensus Protocol in CADP | Andrea Esposito, Francesco P. Rossi, Marco Bernardo, Francesco Fabris, Hubert Garavel | 2025-08-26 | 下载 | Algorand is a scalable and secure permissionless blockchain that achieves proof-of-stake consensus via cryptographic self-sortition and binary Byzantine agreement. |
| HAP: Hybrid Adaptive Parallelism for Efficient Mixture-of-Experts Inference | Haoran Lin, Xianzhi Yu, Kang Zhao, Han Bao, Zongyuan Zhan, Ting Hu, Wulong Liu, Zekun Yin, Xin Li, Weiguo Liu | 2025-08-26 | 下载 | Current inference systems for Mixture-of-Experts (MoE) models primarily employ static parallelization strategies. However, these static approaches cannot consistently achieve optimal performance acros... |
| Architecting Distributed Quantum Computers: Design Insights from Resource Estimation | Dmitry Filippov, Peter Yang, Prakash Murali | 2025-08-26 | 下载 | To enable practically useful quantum computing, we require hundreds to thousands of logical qubits (collections of physical qubits with error correction). |
| Ab-initio Quantum Transport with the GW Approximation, 42,240 Atoms, and Sustained Exascale Performance | Nicolas Vetsch, Alexander Maeder, Vincent Maillou, Anders Winka, Jiang Cao, Grzegorz Kwasniewski, Leonard Deuschle, Torsten Hoefler, Alexandros Nikolaos Ziogas, Mathieu Luisier | 2025-08-26 | 下载 | Designing nanoscale electronic devices such as the currently manufactured nanoribbon field-effect transistors (NRFETs) requires advanced modeling tools capturing all relevant quantum mechanical effect... |
| Federated Fine-Tuning of Sparsely-Activated Large Language Models on Resource-Constrained Devices | Fahao Chen, Jie Wan, Peng Li, Zhou Su, Dongxiao Yu | 2025-08-26 | 下载 | Federated fine-tuning of Mixture-of-Experts (MoE)-based large language models (LLMs) is challenging due to their massive computational requirements and the resource constraints of participants. |
| CARMA: Collocation-Aware Resource Manager | Ehsan Yousefzadeh-Asl-Miandoab, Florina M. Ciorba, Pınar Tözün | 2025-08-26 | 下载 | GPUs running deep learning (DL) workloads are frequently underutilized. Collocating multiple DL training tasks on the same GPU can improve utilization but introduces two key risks: (1) out-of-memory (... |
| Dual-Distilled Heterogeneous Federated Learning with Adaptive Margins for Trainable Global Prototypes | Fatema Siddika, Md Anwar Hossen, Wensheng Zhang, Anuj Sharma, Juan Pablo Muñoz, Ali Jannesari | 2025-08-26 | 下载 | Heterogeneous Federated Learning (HFL) has gained significant attention for its capacity to handle both model and data heterogeneity across clients. |
| Deep Learning-Enabled Supercritical Flame Simulation at Detailed Chemistry and Real-Fluid Accuracy Towards Trillion-Cell Scale | Zhuoqiang Guo, Runze Mao, Lijun Liu, Guangming Tan, Weile Jia, Zhi X. Chen | 2025-08-26 | 下载 | For decades, supercritical flame simulations incorporating detailed chemistry and real-fluid transport have been limited to millions of cells, constraining the resolved spatial and temporal scales of ... |
| SIREN: Software Identification and Recognition in HPC Systems | Thomas Jakobsche, Fredrik Robertsén, Jessica R. Jones, Utz-Uwe Haus, Florina M. Ciorba | 2025-08-26 | 下载 | HPC systems use monitoring and operational data analytics to ensure efficiency, performance, and orderly operations. Application-specific insights are crucial for analyzing the increasing complexity a... |
| ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive | Xinhao Luo, Zihan Liu, Yangjie Zhou, Shihan Fang, Ziyu Huang, Yu Feng, Chen Zhang, Shixuan Sun, Zhenzhe Zheng, Jingwen Leng, Minyi Guo | 2025-08-26 | 下载 | Large language model (LLM) decoding suffers from high latency due to fragmented execution across operators and heavy reliance on off-chip memory for data exchange and reduction. |
| Examining MPI and its Extensions for Asynchronous Multithreaded Communication | Jiakun Yan, Marc Snir, Yanfei Guo | 2025-08-26 | 下载 | The increasing complexity of HPC architectures and the growing adoption of irregular scientific algorithms demand efficient support for asynchronous, multithreaded communication. |
| History Rhymes: Accelerating LLM Reinforcement Learning with RhymeRL | Jingkai He, Tianjian Li, Erhu Feng, Dong Du, Qian Liu, Tao Liu, Yubin Xia, Haibo Chen | 2025-08-26 | 下载 | With the rapid advancement of large language models (LLMs), reinforcement learning (RL) has emerged as a pivotal methodology for enhancing the reasoning capabilities of LLMs. |
| Strata: Hierarchical Context Caching for Long Context Language Model Serving | Zhiqiang Xie, Ziyi Xu, Mark Zhao, Yuwei An, Vikram Sharma Mailthody, Scott Mahlke, Michael Garland, Christos Kozyrakis | 2025-08-26 | 下载 | Large Language Models (LLMs) with expanding context windows face significant performance hurdles. While caching key-value (KV) states is critical for avoiding redundant computation, the storage footpr... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Connectivity Analysis of LoRaWAN-Based Non-Terrestrial Networks for Subterranean mMTC | Kaiqiang Lin, Mohamed-Slim Alouini | 2025-08-26 | 下载 | Wireless underground sensor networks (WUSNs) offer significant social and economic benefits by enabling the monitoring of subterranean entities. |
| A Theory of Goal-Oriented Medium Access: Protocol Design and Distributed Bandit Learning | Federico Chiariotti, Andrea Zanella | 2025-08-26 | 下载 | The Goal-oriented Communication (GoC) paradigm breaks the separation between communication and the content of the data, tailoring communication decisions to the specific needs of the receiver and targ... |
| Sharing is Caring: Analysis of Hybrid Network Sharing Strategies for Energy Efficient Multi-Operator Cellular Systems | Laura Finarelli, Maoquan Ni, Michela Meo, Falko Dressler, Gianluca Rizzo | 2025-08-26 | 下载 | This paper introduces a novel analytical framework for evaluating energy-efficient, QoS-aware network-sharing strategies in cellular networks. |
| OrbCC: High-Throughput and Low-Latency Data Transport for LEO Satellite Networks | Aiden Valentine, Ian Wakeman, George Parisis | 2025-08-26 | 下载 | The highly dynamic nature of Low-Earth Orbit (LEO) satellite networks introduces challenges that existing transport protocols fail to address, including non-congestive latency variation and loss, tran... |
| Adaptive 6G Networks-in-Network Management for Industrial Applications | Daniel Lindenschmitt, Paul Seehofer, Marius Schmitz, Jan Mertes, Roland Bless, Martina Zitterbart, Jan C. Aurich, Hans D. Schotten | 2025-08-26 | 下载 | This paper presents the application of Dynamic Spectrum Management (DSM) for future 6G industrial networks, establishing an efficient controller for the Networks-in-Network (NiN) concept. |
| Combining Static and Dynamic Traffic with Delay Guarantees in Time-Sensitive Networking | Lisa Maile, Kai-Steffen Hielscher, Reinhard German | 2025-08-26 | 下载 | To support reliable and low-latency communication, Time-Sensitive Networking introduced protocols and interfaces for resource allocation in Ethernet. |
| Saving Energy with Relaxed Latency Constraints: A Study on Data Compression and Communication | Pietro Talli, Anup Mishra, Federico Chiariotti, Israel Leyva-Mayorga, Andrea Zanella, Petar Popovski | 2025-08-26 | 下载 | With the advent of edge computing, data generated by end devices can be pre-processed before transmission, possibly saving transmission time and energy. |
| Network Calculus Results for TSN: An Introduction | Lisa Maile, Kai-Steffen Hielscher, Reinhard German | 2025-08-26 | 下载 | Time-Sensitive Networking (TSN) is a set of standards that enables the industry to provide real-time guarantees for time-critical communications with Ethernet hardware. |
| A Survey on Cloud-Edge-Terminal Collaborative Intelligence in AIoT Networks | Jiaqi Wu, Jing Liu, Yang Liu, Lixu Wang, Zehua Wang, Wei Chen, Zijian Tian, Richard Yu, Victor C. M. Leung | 2025-08-26 | 下载 | The proliferation of Internet of things (IoT) devices in smart cities, transportation, healthcare, and industrial applications, coupled with the explosive growth of AI-driven services, has increased d... |
| Toward Edge General Intelligence with Agentic AI and Agentification: Concepts, Technologies, and Future Directions | Ruichen Zhang, Guangyuan Liu, Yinqiu Liu, Changyuan Zhao, Jiacheng Wang, Yunting Xu, Dusit Niyato, Jiawen Kang, Yonghui Li, Shiwen Mao, Sumei Sun, Xuemin Shen, Dong In Kim | 2025-08-26 | 下载 | The rapid expansion of sixth-generation (6G) wireless networks and the Internet of Things (IoT) has catalyzed the evolution from centralized cloud intelligence towards decentralized edge general intel... |
| Dynamic Trajectory Optimization and Power Control for Hierarchical UAV Swarms in 6G Aerial Access Network | Ziye Jia, Jia He, Lijun He, Min Sheng, Junyu Liu, Qihui Wu, Zhu Han | 2025-08-26 | 下载 | Unmanned aerial vehicles (UAVs) can serve as aerial base stations (BSs) to extend the ubiquitous connectivity for ground users (GUs) in the sixth-generation (6G) era. |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| UrgenGo: Urgency-Aware Transparent GPU Kernel Launching for Autonomous Driving | Hanqi Zhu, Wuyang Zhang, Xinran Zhang, Ziyang Tao, Xinrui Lin, Yu Zhang, Jianmin Ji, Yanyong Zhang | 2025-08-26 | 下载 | The rapid advancements in autonomous driving have introduced increasingly complex, real-time GPU-bound tasks critical for reliable vehicle operation. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Robust Recursive Query Parallelism in Graph Database Management Systems | Anurag Chakraborty, Semih Salihoğlu | 2025-08-26 | 下载 | Efficient multi-core parallel processing of recursive join queries is critical for achieving good performance in graph database management systems (GDBMSs). Prior work adopts two broad approaches. |
| Exact Persistent Stochastic Non-Interference | Carla Piazza, Riccardo Romanello, Sabina Rossi | 2025-08-26 | 下载 | Persistent Stochastic Non-Interference (PSNI) was introduced to capture a quantitative security property in stochastic process algebras, ensuring that a high-level process does not influence the obser... |
| CARMA: Collocation-Aware Resource Manager | Ehsan Yousefzadeh-Asl-Miandoab, Florina M. Ciorba, Pınar Tözün | 2025-08-26 | 下载 | GPUs running deep learning (DL) workloads are frequently underutilized. Collocating multiple DL training tasks on the same GPU can improve utilization but introduces two key risks: (1) out-of-memory (... |