Skip to content

2025-08-30

cs.AR - Architecture

标题作者发布日期PDF摘要
On the Thermal Vulnerability of 3D-Stacked High-Bandwidth Memory ArchitecturesMehdi Elahi, Mohamed R. Elshamy, Abdel-Hameed A. Badawy, Ahmad Patooghy2025-08-30下载3D-stacked High Bandwidth Memory (HBM) architectures provide high-performance memory interactions to address the well-known performance challenge, namely the memory wall.
COMET: A Framework for Modeling Compound Operation Dataflows with Explicit CollectivesShubham Negi, Manik Singhal, Aayush Ankit, Sudeep Bhoja, Kaushik Roy2025-08-30下载Modern machine learning accelerators are designed to efficiently execute deep neural networks (DNNs) by optimizing data movement, memory hierarchy, and compute throughput.
Real-Time Piano Note Frequency Detection Using FPGA and FFT CoreShafayet M. Anik, D. G. Perera2025-08-30下载Real-time frequency analysis of musical instruments, such as the piano, is an essential feature in areas like electronic tuners, music visualizers, and live sound monitoring.
Bit Transition Reduction by Data Transmission Ordering in NoC-based DNN AcceleratorYizhi Chen, Jingwei Li, Wenyao Zhu, Zhonghai Lu2025-08-30下载As Deep Neural Networks (DNN) are becoming essential, Network-on-Chip (NoC)-based DNN accelerators gained increasing popularity. To save link power in NoC, many researchers focus on reducing the Bit T...
AGS: Accelerating 3D Gaussian Splatting SLAM via CODEC-Assisted Frame Covisibility DetectionHoushu He, Naifeng Jing, Li Jiang, Xiaoyao Liang, Zhuoran Song2025-08-30下载Simultaneous Localization and Mapping (SLAM) is a critical task that enables autonomous vehicles to construct maps and localize themselves in unknown environments.
FlexLink: Boosting your NVLink Bandwidth by 27% without accuracy concernAo Shen, Rui Zhang, Junping Zhao2025-08-30下载As large language models (LLMs) continue to scale, multi-node deployment has become a necessity. Consequently, communication has become a critical performance bottleneck.
DarwinWafer: A Wafer-Scale Neuromorphic ChipXiaolei Zhu, Xiaofei Jin, Ziyang Kang, Chonghui Sun, Junjie Feng, Dingwen Hu, Zengyi Wang, Hanyue Zhuang, Qian Zheng, Huajin Tang, Shi Gu, Xin Du, De Ma, Gang Pan2025-08-30下载Neuromorphic computing promises brain-like efficiency, yet today's multi-chip systems scale over PCBs and incur orders-of-magnitude penalties in bandwidth, latency, and energy, undermining biological ...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Federated Survival Analysis with Node-Level Differential Privacy: Private Kaplan-Meier CurvesNarasimha Raghavan Veeraragavan, Jan Franz Nygård2025-08-30下载We investigate how to calculate Kaplan-Meier survival curves across multiple health-care jurisdictions while protecting patient privacy with node-level differential privacy.
COMET: A Framework for Modeling Compound Operation Dataflows with Explicit CollectivesShubham Negi, Manik Singhal, Aayush Ankit, Sudeep Bhoja, Kaushik Roy2025-08-30下载Modern machine learning accelerators are designed to efficiently execute deep neural networks (DNNs) by optimizing data movement, memory hierarchy, and compute throughput.
KVComp: A High-Performance, LLM-Aware, Lossy Compression Framework for KV CacheBo Jiang, Taolue Yang, Youyuan Liu, Chengming Zhang, Xubin He, Sian Jin2025-08-30下载Transformer-based large language models (LLMs) demonstrate impressive potential in various practical applications. However, long context inference poses a significant challenge due to the enormous mem...
FlexLink: Boosting your NVLink Bandwidth by 27% without accuracy concernAo Shen, Rui Zhang, Junping Zhao2025-08-30下载As large language models (LLMs) continue to scale, multi-node deployment has become a necessity. Consequently, communication has become a critical performance bottleneck.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
FLEET: A Federated Learning Emulation and Evaluation Testbed for Holistic ResearchOsama Abu Hamdan, Hao Che, Engin Arslan, Md Arifuzzaman2025-08-30下载Federated Learning (FL) presents a robust paradigm for privacy-preserving, decentralized machine learning. However, a significant gap persists between the theoretical design of FL algorithms and their...
SmartFLow: A Communication-Efficient SDN Framework for Cross-Silo Federated LearningOsama Abu Hamdan, Hao Che, Engin Arslan, Md Arifuzzaman2025-08-30下载Cross-silo Federated Learning (FL) enables multiple institutions to collaboratively train machine learning models while preserving data privacy.
Dual Actor DDPG for Airborne STAR-RIS Assisted CommunicationsDanish Rizvi, David Boyle2025-08-30下载This study departs from the prevailing assumption of independent Transmission and Reflection Coefficients (TRC) in Airborne Simultaneous Transmit and Reflect Reconfigurable Intelligent Surface (STAR-R...
Interference Between FM Cell Sites and CDMA Cell SitesP. Kumar2025-08-30下载Interference is the major problem now days in telecommunication sector. One type of interference which is very common now days is FM Cell sites interference between CDMA Cell sites.
SpliDT: Partitioned Decision Trees for Scalable Stateful Inference at Line RateMurayyiam Parvez, Annus Zulfiqar, Roman Beltiukov, Shir Landau Feibish, Walter Willinger, Arpit Gupta, Muhammad Shahbaz2025-08-30下载Machine learning (ML) is increasingly being deployed in programmable data planes (switches and SmartNICs) to enable real-time traffic analysis, security monitoring, and in-network decision-making.
SABR: A Stable Adaptive Bitrate Framework Using Behavior Cloning Pretraining and Reinforcement Learning Fine-TuningPengcheng Luo, Yunyang Zhao, Bowen Zhang, Genke Yang, Boon-Hee Soong, Chau Yuen2025-08-30下载With the advent of 5G, the internet has entered a new video-centric era. From short-video platforms like TikTok to long-video platforms like Bilibili, online video services are reshaping user consumpt...
Intelligent Spectrum Management in Satellite CommunicationsRakshitha De Silva, Shiva Raj Pokhrel, Jonathan Kua, Sithamparanathan Kandeepan2025-08-30下载Satellite Communication (SatCom) networks represent a fundamental pillar in modern global connectivity, facilitating reliable service and extensive coverage across a plethora of applications.

cs.PF - Performance

标题作者发布日期PDF摘要
Efficient Graph Knowledge Distillation from GNNs to Kolmogorov--Arnold Networks via Self-Attention Dynamic SamplingCan Cui, Zilong Fu, Penghe Huang, Yuanyuan Li, Wu Deng, Dongyan Li2025-08-30下载Recent success of graph neural networks (GNNs) in modeling complex graph-structured data has fueled interest in deploying them on resource-constrained edge devices.

基于 VitePress 构建