Skip to content

2025-03-28

cs.AR - Architecture

标题作者发布日期PDF摘要
Learning Library Cell Representations in Vector SpaceRongjian Liang, Yi-Chen Lu, Wen-Hao Liu, Haoxing Ren2025-03-28下载We propose Lib2Vec, a novel self-supervised framework to efficiently learn meaningful vector representations of library cells, enabling ML models to capture essential cell semantics.
Benchmarking Ultra-Low-Power μNPUsJosh Millar, Yushan Huang, Sarab Sethi, Hamed Haddadi, Anil Madhavapeddy2025-03-28下载Efficient on-device neural network (NN) inference offers predictable latency, improved privacy and reliability, and lower operating costs for vendors than cloud-based inference.
NLS: Natural-Level Synthesis for Hardware Implementation Through GenAIKaiyuan Yang, Huang Ouyang, Xinyi Wang, Bingjie Lu, Yanbo Wang, Charith Abhayaratne, Sizhao Li, Long Jin, Tiantai Deng2025-03-28下载This paper introduces Natural-Level Synthesis, an innovative approach for generating hardware using generative artificial intelligence on both the system level and component-level.
A Survey of Circuit Foundation Model: Foundation AI Models for VLSI Circuit Design and EDAWenji Fang, Jing Wang, Yao Lu, Shang Liu, Yuchao Wu, Yuzhe Ma, Zhiyao Xie2025-03-28下载Artificial intelligence (AI)-driven electronic design automation (EDA) techniques have been extensively explored for VLSI circuit design applications.
CIMR-V: An End-to-End SRAM-based CIM Accelerator with RISC-V for AI Edge DeviceYan-Cheng Guo and, Tian-Sheuan Chang, Chih-Sheng Lin, Bo-Cheng Chiou, Chih-Ming Lai, Shyh-Shyuan Sheu, Wei-Chung Lo, Shih-Chieh Chang2025-03-28下载Computing-in-memory (CIM) is renowned in deep learning due to its high energy efficiency resulting from highly parallel computing with minimal data movement.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
A Pilot Study on Tunable Precision Emulation via Automatic BLAS OffloadingHang Liu, Junjie Li, Yinzhi Wang2025-03-28下载This study explores the use of automatic BLAS offloading and INT8-based emulation for accelerating traditional HPC workloads on modern GPU architectures.
A Performance Analysis of Task Scheduling for UQ Workflows on HPC SystemsChung Ming Loi, Anne Reinarz, Mikkel Lykkegaard, William Hornsby, James Buchanan, Linus Seelinger2025-03-28下载Uncertainty Quantification (UQ) workloads are becoming increasingly common in science and engineering. They involve the submission of thousands or even millions of similar tasks with potentially unpre...
Hiding Latencies in Network-Based Image Loading for Deep LearningFrancesco Versaci, Giovanni Busonera2025-03-28下载In the last decades, the computational power of GPUs has grown exponentially, allowing current deep learning (DL) applications to handle increasingly large amounts of data at a progressively higher th...
Niyama : Breaking the Silos of LLM Inference ServingKanishk Goel, Jayashree Mohan, Nipun Kwatra, Ravi Shreyas Anupindi, Ramachandran Ramjee2025-03-28下载The widespread adoption of Large Language Models (LLMs) has enabled diverse applications with very different latency requirements. Existing LLM serving frameworks rely on siloed infrastructure with co...
On the Solvability of Byzantine-tolerant Reliable Communication in Dynamic NetworksSilvia Bonomi, Giovanni Farina, Sébastien Tixeuil2025-03-28下载A reliable communication primitive guarantees the delivery, integrity, and authorship of messages exchanged between correct processes of a distributed system.
Memory-aware Adaptive Scheduling of Scientific Workflows on Heterogeneous ArchitecturesSvetlana Kulagina, Anne Benoit, Henning Meyerhenke2025-03-28下载The analysis of massive scientific data often happens in the form of workflows with interdependent tasks. When such a scientific workflow needs to be scheduled on a parallel or distributed system, one...
SimDC: A High-Fidelity Device Simulation Platform for Device-Cloud Collaborative ComputingRuiguang Pei, Junjie Wu, Dan Peng, Min Fang, Jianan Zhang, Zhihui Fu, Jun Wang2025-03-28下载The advent of edge intelligence and escalating concerns for data privacy protection have sparked a surge of interest in device-cloud collaborative computing.
CAT: A GPU-Accelerated FHE Framework with Its Application to High-Precision Private Dataset QueryQirui Li, Rui Zong2025-03-28下载We introduce an open-source GPU-accelerated fully homomorphic encryption (FHE) framework CAT, which surpasses existing solutions in functionality and efficiency.
Route-and-Aggregate Decentralized Federated Learning Under Communication ErrorsWeicai Li, Tiejun Lv, Wei Ni, Jingbo Zhao, Ekram Hossain, H. Vincent Poor2025-03-28下载Decentralized federated learning (D-FL) allows clients to aggregate learning models locally, offering flexibility and scalability. Existing D-FL methods use gossip protocols, which are inefficient whe...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
NetSSM: Multi-Flow and State-Aware Network Trace Generation using State Space ModelsAndrew Chu, Xi Jiang, Shinan Liu, Arjun Bhagoji, Francesco Bronzino, Paul Schmitt, Nick Feamster2025-03-28下载Access to raw network traffic data is essential for many computer networking tasks, from traffic modeling to performance evaluation. Unfortunately, this data is scarce due to high collection costs and...
Training Large Language Models for Advanced Typosquatting DetectionJackson Welch2025-03-28下载Typosquatting is a long-standing cyber threat that exploits human error in typing URLs to deceive users, distribute malware, and conduct phishing attacks.
Privacy-Preserving Secure Neighbor Discovery for Wireless NetworksAhmed Mohamed Hussain, Panos Papadimitratos2025-03-28下载Traditional Neighbor Discovery (ND) and Secure Neighbor Discovery (SND) are key elements for network functionality. SND is a hard problem, satisfying not only typical security properties (authenticati...
Route-and-Aggregate Decentralized Federated Learning Under Communication ErrorsWeicai Li, Tiejun Lv, Wei Ni, Jingbo Zhao, Ekram Hossain, H. Vincent Poor2025-03-28下载Decentralized federated learning (D-FL) allows clients to aggregate learning models locally, offering flexibility and scalability. Existing D-FL methods use gossip protocols, which are inefficient whe...
QoS-Aware Service Restoration in 5G Optical Transport NetworksZahra Sharifi Soltani, Arash Rezaee, Orlando Arias, Vinod M Vokkarane2025-03-28下载Only the chairs can edit The rapid growth of high-bandwidth applications in fifth-generation (5G) networks and beyond has driven a substantial increase in traffic within transport optical networks.
Saving Storage Space Using Files on the WebKevin Saric, Gowri Sankar Ramachandran, Raja Jurdak, Surya Nepal2025-03-28下载As conventional storage density reaches its physical limits, the cost of a gigabyte of storage is no longer plummeting, but rather has remained mostly flat for the past decade.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Saving Storage Space Using Files on the WebKevin Saric, Gowri Sankar Ramachandran, Raja Jurdak, Surya Nepal2025-03-28下载As conventional storage density reaches its physical limits, the cost of a gigabyte of storage is no longer plummeting, but rather has remained mostly flat for the past decade.

cs.PF - Performance

标题作者发布日期PDF摘要
Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space ModelsHung-Yueh Chiang, Chi-Chih Chang, Natalia Frumkin, Kai-Chiang Wu, Mohamed S. Abdelfattah, Diana Marculescu2025-03-28下载State Space Models (SSMs) are emerging as a compelling alternative to Transformers because of their consistent memory usage and high performance.
A Pilot Study on Tunable Precision Emulation via Automatic BLAS OffloadingHang Liu, Junjie Li, Yinzhi Wang2025-03-28下载This study explores the use of automatic BLAS offloading and INT8-based emulation for accelerating traditional HPC workloads on modern GPU architectures.
Service-the-Longest-Queue Among d Choices Policy for Quantum Entanglement SwitchingGuo Xian Yau, Thirupathaiah Vasantam, Gayane Vardoyan2025-03-28下载An Entanglement Generation Switch (EGS) is a quantum network hub that provides entangled states to a set of connected nodes by enabling them to share a limited number of hub resources.

基于 VitePress 构建