2025-04-18

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| DataMaestro: A Versatile and Efficient Data Streaming Engine Bringing Decoupled Memory Access To Dataflow Accelerators | Xiaoling Yi, Yunhao Deng, Ryan Antonio, Fanchen Kong, Guilherme Paim, Marian Verhelst | 2025-04-18 | Download | Deep Neural Networks (DNNs) have achieved remarkable success across various intelligent tasks but encounter performance and energy challenges in inference execution due to data movement bottlenecks. |
| A CMOS Probabilistic Computing Chip With In-situ hardware Aware Learning | Jinesh Jhonsa, William Whitehead, David McCarthy, Shuvro Chowdhury, Kerem Camsari, Luke Theogarajan | 2025-04-18 | Download | This paper demonstrates a probabilistic bit physics inspired solver with 440 spins configured in a Chimera graph, occupying an area of 0.44 mm^2. |
| MetaDSE: A Few-shot Meta-learning Framework for Cross-workload CPU Design Space Exploration | Runzhen Xue, Hao Wu, Mingyu Yan, Ziheng Xiao, Xiaochun Ye, Dongrui Fan | 2025-04-18 | Download | Cross-workload design space exploration (DSE) is crucial in CPU architecture design. Existing DSE methods typically employ the transfer learning technique to leverage knowledge from source workloads, ... |
| HPU: High-Bandwidth Processing Unit for Scalable, Cost-effective LLM Inference via GPU Co-processing | Myunghyun Rhee, Joonseop Sim, Taeyoung Ahn, Seungyong Lee, Daegun Yoon, Euiseok Kim, Kyoung Park, Youngpyo Joo, Hoshik Kim | 2025-04-18 | Download | The attention layer, a core component of Transformer-based LLMs, brings out inefficiencies in current GPU systems due to its low operational intensity and the substantial memory requirements of KV cac... |
| EXAM: Exploiting Exclusive System-Level Cache in Apple M-Series SoCs for Enhanced Cache Occupancy Attacks | Tianhong Xu, Aidong Adam Ding, Yunsi Fei | 2025-04-18 | Download | Cache occupancy attacks exploit the shared nature of cache hierarchies to infer a victim's activities by monitoring overall cache usage, unlike access-driven cache attacks that focus on specific cache... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Cloud based DevOps Framework for Identifying Risk Factors of Hospital Utilization | Monojit Banerjee, Akaash Vishal Hazarika, Mahak Shah | 2025-04-18 | Download | A scalable and reliable system is required to analyze the National Health and Nutrition Examination Survey (NHANES) data efficiently to understand hospital utilization risk factors. |
| Toward Portable GPU Performance: Julia Recursive Implementation of TRMM and TRSM | Vicki Carrica, Maxwell Onyango, Rabab Alomairy, Evelyne Ringoot, James Schloss, Alan Edelman | 2025-04-18 | Download | This paper presents a performant and portable recursive implementation of triangular matrix-matrix multiplication (TRMM) and triangular solve (TRSM) in Julia for GPUs, two kernels that underlie many l... |
| Robust Decentralized Quantum Kernel Learning for Noisy and Adversarial Environment | Wenxuan Ma, Kuan-Cheng Chen, Shang Yu, Mengxiang Liu, Ruilong Deng | 2025-04-18 | Download | This paper proposes a general decentralized framework for quantum kernel learning (QKL). It has robustness against quantum noise and can also be designed to defend adversarial information attacks form... |
| Robust Distributed Arrays: Provably Secure Networking for Data Availability Sampling | Dankrad Feist, Gottfried Herold, Mark Simkin, Benedikt Wagner | 2025-04-18 | Download | Data Availability Sampling (DAS), a central component of Ethereum's roadmap, enables clients to verify data availability without requiring any single client to download the entire dataset. |
| High-Throughput LLM inference on Heterogeneous Clusters | Yi Xiong, Jinqi Huang, Wenjie Huang, Xuebing Yu, Entong Li, Zhixiong Ning, Jinhua Zhou, Li Zeng, Xin Chen | 2025-04-18 | Download | Nowadays, many companies possess various types of AI accelerators, forming heterogeneous clusters. Efficiently leveraging these clusters for high-throughput large language model (LLM) inference servic... |
| SFL-LEO: Asynchronous Split-Federated Learning Design for LEO Satellite-Ground Network Framework | Jiasheng Wu, Jingjing Zhang, Zheng Lin, Zhe Chen, Xiong Wang, Wenjun Zhu, Yue Gao | 2025-04-18 | Download | Recently, the rapid development of LEO satellite networks spurs another widespread concern: data processing at satellites. However, achieving efficient computation at LEO satellites in highly dynamic s... |
| Trust, but verify | Michael J. Yuan, Carlos Lospoy, Sydney Lai, James Snewin, Ju Long | 2025-04-18 | Download | Decentralized AI agent networks, such as Gaia, allow individuals to run customized LLMs on their own computers and then provide services to the public. |
| HPU: High-Bandwidth Processing Unit for Scalable, Cost-effective LLM Inference via GPU Co-processing | Myunghyun Rhee, Joonseop Sim, Taeyoung Ahn, Seungyong Lee, Daegun Yoon, Euiseok Kim, Kyoung Park, Youngpyo Joo, Hoshik Kim | 2025-04-18 | Download | The attention layer, a core component of Transformer-based LLMs, brings out inefficiencies in current GPU systems due to its low operational intensity and the substantial memory requirements of KV cac... |
| Quantum repeaters enhanced by vacuum beam guides | Yu Gan, Mohadeseh Azari, Nitish Kumar Chandra, Xin Jin, Jinglei Cheng, Kaushik P. Seshadreesan, Junyu Liu | 2025-04-18 | Download | The development of large-scale quantum communication networks faces critical challenges due to photon loss and decoherence in optical fiber channels. |
| Bibliometric Analysis of Scientific Publications on Blockchain Research and Applications | Lingfeng Bao, Jiameng Yang, Xiaohu Yang, Chunming Rong | 2025-04-18 | Download | Since the introduction of Bitcoin in 2008, blockchain technology has garnered widespread attention. Scholars from various research fields, countries, and institutions have published a significant numb... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| 5Guard: Isolation-aware End-to-End Slicing of 5G Networks | Mehdi Bolourian, Noura Limam, Mohammad Ali Salahuddin, Raouf Boutaba | 2025-04-18 | Download | Network slicing logically partitions the 5G infrastructure to cater to diverse verticals with varying requirements. However, resource sharing exposes the slices to threats and performance degradation,... |
| The Effect of the Network in Cutting Carbon for Geo-shifted Workloads | Yibo Guo, Amanda Tomlinson, Runlong Su, George Porter | 2025-04-18 | Download | Organizations are increasingly offloading their workloads to cloud platforms. For workloads with relaxed deadlines, this presents an opportunity to reduce the total carbon footprint of these computati... |
| Statistical Analysis and End-to-End Performance Evaluation of Traffic Models for Automotive Data | Marcello Bullo, Amir Ashtari Gargari, Paolo Testolina, Michele Zorzi, Marco Giordani | 2025-04-18 | Download | Autonomous driving is a major paradigm shift in transportation, with the potential to enhance safety, optimize traffic congestion, and reduce fuel consumption. |
| Joint Optimization of Controller Placement and Switch Assignment in SDN-based LEO Satellite Networks | Zhiyun Jiang, Wei Li, Menglong Yang | 2025-04-18 | Download | Software-defined networking (SDN) based low earth orbit (LEO) satellite networks leverage the SDN's benefits of the separation of data plane and control plane, control plane programmability, and centr... |
| Towards End-to-End Network Intent Management with Large Language Models | Lam Dinh, Sihem Cherrared, Xiaofeng Huang, Fabrice Guillemin | 2025-04-18 | Download | Large Language Models (LLMs) are likely to play a key role in Intent-Based Networking (IBN) as they show remarkable performance in interpreting human language as well as code generation, enabling the ... |
| SFL-LEO: Asynchronous Split-Federated Learning Design for LEO Satellite-Ground Network Framework | Jiasheng Wu, Jingjing Zhang, Zheng Lin, Zhe Chen, Xiong Wang, Wenjun Zhu, Yue Gao | 2025-04-18 | Download | Recently, the rapid development of LEO satellite networks spurs another widespread concern: data processing at satellites. However, achieving efficient computation at LEO satellites in highly dynamic s... |
| Decentralized Handover Parameter Optimization with MARL for Load Balancing in 5G Networks | Yang Shen, Shuqi Chai, Bing Li, Xiaodong Luo, Qingjiang Shi, Rongqing Zhang | 2025-04-18 | Download | In cellular networks, cell handover refers to the process where a device switches from one base station to another, and this mechanism is crucial for balancing the load among different cells. |
| Quantum repeaters enhanced by vacuum beam guides | Yu Gan, Mohadeseh Azari, Nitish Kumar Chandra, Xin Jin, Jinglei Cheng, Kaushik P. Seshadreesan, Junyu Liu | 2025-04-18 | Download | The development of large-scale quantum communication networks faces critical challenges due to photon loss and decoherence in optical fiber channels. |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Terminal Lucidity: Envisioning the Future of the Terminal | Michael MacInnis, Olga Baysal, Michele Lanza | 2025-04-18 | Download | The Unix terminal, or just simply, the terminal, can be found being applied in almost every facet of computing. It is available across all major platforms and often integrated into other applications. |
