Skip to content

2025-08-04

cs.AR - Architecture

标题作者发布日期PDF摘要
ReGate: Enabling Power Gating in Neural Processing UnitsYuqi Xue, Jian Huang2025-08-04下载The energy efficiency of neural processing units (NPU) is playing a critical role in developing sustainable data centers. Our study with different generations of NPU chips reveals that 30%-72% of thei...
ASDR: Exploiting Adaptive Sampling and Data Reuse for CIM-based Instant Neural RenderingFangxin Liu, Haomin Li, Bowen Zhu, Zongwu Wang, Zhuoran Song, Habing Guan, Li Jiang2025-08-04下载Neural Radiance Fields (NeRF) offer significant promise for generating photorealistic images and videos. However, existing mainstream neural rendering models often fall short in meeting the demands fo...
Revisit Choice Network for Synthesis and Technology MappingChen Chen, Jiaqi Yin, Cunxi Yu2025-08-04下载Choice network construction is a critical technique for alleviating structural bias issues in Boolean optimization, equivalence checking, and technology mapping.
GSIM: Accelerating RTL Simulation for Large-Scale DesignsLu Chen, Dingyi Zhao, Zihao Yu, Ninghui Sun, Yungang Bao2025-08-04下载Register Transfer Level (RTL) simulation is widely used in design space exploration, verification, debugging, and preliminary performance evaluation for hardware design.
Revelator: Rapid Data Fetching via OS-Driven Hash-based Speculative Address TranslationKonstantinos Kanellopoulos, Konstantinos Sgouras, Andreas Kosmas Kakolyris, Vlad-Petru Nitu, Berkin Kerim Konar, Rahul Bera, Onur Mutlu2025-08-04下载Address translation is a major performance bottleneck in modern computing systems. Speculative address translation can hide this latency by predicting the physical address (PA) of requested data early...
GPU in the Blind Spot: Overlooked Security Risks in TransportationSefatun-Noor Puspa, Mashrur Chowdhury2025-08-04下载Graphics processing units (GPUs) are becoming an essential part of the intelligent transportation system (ITS) for enabling video-based and artificial intelligence (AI) based applications.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic WorkflowsRenan Souza, Amal Gueroudji, Stephen DeWitt, Daniel Rosendo, Tirthankar Ghosal, Robert Ross, Prasanna Balaprakash, Rafael Ferreira da Silva2025-08-04下载Large Language Models (LLMs) and other foundation models are increasingly used as the core of AI agents. In agentic workflows, these agents plan tasks, interact with humans and peers, and influence sc...
Fully Decentralised Consensus for Extreme-scale BlockchainSiamak Abdi, Giuseppe Di Fatta, Atta Badii, Giancarlo Fortino2025-08-04下载Blockchain is a decentralised, immutable ledger technology that has been widely adopted in many sectors for various applications such as cryptocurrencies, smart contracts and supply chain management.
Blockchain Epidemic Consensus for Large-Scale NetworksSiamak Abdi, Giuseppe Di Fatta, Atta Badii, Giancarlo Fortino2025-08-04下载Blockchain is a distributed ledger technology that has applications in many domains such as cryptocurrency, smart contracts, supply chain management, and many others.
Communication and Computation Efficient Split Federated Learning in O-RANShunxian Gu, Chaoqun You, Bangbang Ren, Deke Guo2025-08-04下载The hierarchical architecture of Open Radio Access Network (O-RAN) has enabled a new Federated Learning (FL) paradigm that trains models using data from non- and near-real-time (near-RT) Radio Intelli...
Huawei Cloud Model-as-a-Service on the CloudMatrix384 SuperPodAo Xiao, Bangzheng He, Baoquan Zhang, Baoxing Huai, Bingji Wang, Bo Wang, Bo Xu, Boyi Hou, Chan Yang, Changhong Liu, Cheng Cui, Chenyu Zhu, Cong Feng, Daohui Wang, Dayun Lin, Duo Zhao, Fengshao Zou, Fu Wang, Gangqiang Zhang, Gengyuan Dan, Guanjie Chen, Guodong Guan, Guodong Yang, Haifeng Li, Haipei Zhu, Haley Li, Hao Feng, Hao Huang, Hao Xu, Hengrui Ma, Hengtao Fan, Hui Liu, Jia Li, Jiang Liu, Jiang Xu, Jie Meng, Jinhan Xin, Junhao Hu, Juwei Chen, Lan Yu, Lanxin Miao, Liang Liu, Linan Jing, Lu Zhou, Meina Han, Mingkun Deng, Mingyu Deng, Naitian Deng, Nizhong Lin, Peihan Zhao, Peng Pan, Pengfei Shen, Ping Li, Qi Zhang, Qian Wang, Qin ZhC Qingrong Xia, Qingyi Zhang, Qunchao Fu, Ren Guo, Ruimin Gao, Shaochun Li, Sheng Long, Shentian Li, Shining Wan, Shuai Shen, Shuangfu Zeng, Shuming Jing, Siqi Yang, Song Zhang, Tao Xu, Tianlin Du, Ting Chen, Wanxu Wu, Wei Jiang, Weinan Tong, Weiwei Chen, Wen Peng, Wenli Zhou, Wenquan Yang, Wenxin Liang, Xiang Liu, Xiaoli Zhou, Xin Jin, Xinyu Duan, Xu Li, Xu Zhang, Xusheng Chen, Yalong Shan, Yang Gan, Yao Lu, Yi Deng, Yi Zheng, Ying Xiong, Yingfei Zheng, Yiyun Zheng, Yizhou Shan, Yong Gao, Yong Zhang, Yongqiang Yang, Yuanjin Gong, Yue Yu, Yuetao Chen, Yukun Zhu, Yulong He, Yusu Zhao, Yuyan Wu, Zenan Zhang, Zhaojin Zhuo, Zhaoyang Ji, Zhefeng Wang, Zheng Wang, Zhenan Fan, Zhenhua Yang, Zhenli Sheng, Zhibin Yu, Zhigang Ji, Zhihao Ren, Zhipeng Bian, Zhixia Liu, Zhiyu Dong, Zhonghua Li, Zhou Yu, Zhuoming Shen, Zhuwei Peng, Zi Ye, Zihao Xiang, Zimin Fu, Zixuan Zhang2025-08-04下载Scaled-out MoE LLMs and scaled-up SuperPods create new systems challenges for production Model-as-a-Service (MaaS), requiring disaggregation, low-latency communication, and decentralized serving.
TeraNoC: A Multi-Channel 32-bit Fine-Grained, Hybrid Mesh-Crossbar NoC for Efficient Scale-up of 1000+ Core Shared-L1-Memory ClustersYichao Zhang, Zexin Fu, Tim Fischer, Yinrong Li, Marco Bertuletti, Luca Benini2025-08-04下载A key challenge in on-chip interconnect design is to scale up bandwidth while maintaining low latency and high area efficiency. 2D-meshes scale with low wiring area and congestion overhead; however, t...
FlashCommunication V2: Bit Splitting and Spike Reserving for Any Bit CommunicationQingyuan Li, Bo Zhang, Hui Kang, Tianhao Xu, Yulei Qian, Yuchen Xie, Lin Ma2025-08-04下载Nowadays, communication bottlenecks have emerged as a critical challenge in the distributed training and deployment of large language models (LLMs).
On Effectiveness of Graph Neural Network Architectures for Network Digital Twins (NDTs)Iulisloi Zacarias, Oussama Ben Taarit, Admela Jukan2025-08-04下载Future networks, such as 6G, will need to support a vast and diverse range of interconnected devices and applications, each with its own set of requirements.
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe ZooQianli Ma, Yaowei Zheng, Zhelun Shi, Zhongkai Zhao, Bin Jia, Ziyue Huang, Zhiqi Lin, Youjie Li, Jiacheng Yang, Yanghua Peng, Zhi Zhang, Xin Liu2025-08-04下载Recent advances in large language models (LLMs) have driven impressive progress in omni-modal understanding and generation. However, training omni-modal LLMs remains a significant challenge due to the...
PUSHtap: PIM-based In-Memory HTAP with Unified Data Storage FormatYilong Zhao, Mingyu Gao, Huanchen Zhang, Fangxin Liu, Gongye Chen, He Xian, Haibing Guan, Li Jiang2025-08-04下载Hybrid transaction/analytical processing (HTAP) is an emerging database paradigm that supports both online transaction processing (OLTP) and online analytical processing (OLAP) workloads.
FedAPTA: Federated Multi-task Learning for Heterogeneous Devices with Adaptive Layer-wise Pruning and Task-aware AggregationZhen Yu, Yachao Yuan, Jin Wang, Zhipeng Cheng, Jianhua Hu2025-08-04下载Federated Learning (FL) has shown considerable promise in Machine Learning (ML) across numerous devices for privacy protection, efficient data utilization, and dynamic collaboration.
Self-assessment approach for resource management protocols in heterogeneous computational systemsRui Eduardo Lopes, Duarte Raposo, Pedro V. Teixeira, Susana Sargento2025-08-04下载With an ever growing number of heterogeneous applicational services running on equally heterogeneous computational systems, the problem of resource management becomes more essential.
DySTopYizhou Shi, Qianpiao Ma, Yan Xu, Junlong Zhou, Ming Hu, Yunming Liao, Hongli Xu2025-08-04下载Federated Learning (FL) has emerged as a potential distributed learning paradigm that enables model training on edge devices (i.e., workers) while preserving data privacy.
Prefill-Decode Aggregation or Disaggregation? Unifying Both for Goodput-Optimized LLM ServingChao Wang, Pengfei Zuo, Zhangyu Chen, Yunkai Liang, Zhou Yu, Ming-Chang Yang2025-08-04下载An ongoing debate considers whether prefill-decode (PD) aggregation or disaggregation is superior for serving large language models (LLMs). This has driven optimizations for both approaches, each show...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
A Reinforcement Learning Framework for Mobility Control of gNBs in Dynamic Radio Access NetworksPedro Duarte, André Coelho, Manuel Ricardo2025-08-04下载The increasing complexity of wireless environments, characterized by user mobility and dynamic obstructions, poses challenges for the maintenance of Line-of-Sight (LoS) connectivity.
Secure mmWave Beamforming with Proactive-ISAC Defense Against Beam-Stealing AttacksSeyed Bagher Hashemi Natanzi, Hossein Mohammadi, Bo Tang, Vuk Marojevic2025-08-04下载Millimeter-wave (mmWave) communication systems face increasing susceptibility to advanced beam-stealing attacks, posing a significant physical layer security threat.
RC-Gossip: Information Freshness in Clustered Networks with Rate-Changing GossipIrtiza Hasan, Ahmed Arafa2025-08-04下载A clustered gossip network is considered in which a source updates its information over time, and end-nodes, organized in clusters through clusterheads, are keeping track of it.
Fully Decentralised Consensus for Extreme-scale BlockchainSiamak Abdi, Giuseppe Di Fatta, Atta Badii, Giancarlo Fortino2025-08-04下载Blockchain is a decentralised, immutable ledger technology that has been widely adopted in many sectors for various applications such as cryptocurrencies, smart contracts and supply chain management.
ASINT: Learning AS-to-Organization Mapping from Internet MetadataYongzhe Xu, Weitong Li, Eeshan Umrani, Taejoong Chung2025-08-04下载Accurate AS-to-organization mapping underpins Internet measurement and security, yet registries are fragmented, PeeringDB is narrow, and routing views reflect connectivity rather than ownership.
On Effectiveness of Graph Neural Network Architectures for Network Digital Twins (NDTs)Iulisloi Zacarias, Oussama Ben Taarit, Admela Jukan2025-08-04下载Future networks, such as 6G, will need to support a vast and diverse range of interconnected devices and applications, each with its own set of requirements.
Distillation-Enhanced Clustering Acceleration for Encrypted Traffic ClassificationZiyue Huang, Chungang Lin, Weiyao Zhang, Xuying Meng, Yujun Zhang2025-08-04下载Traffic classification plays a significant role in network service management. The advancement of deep learning has established pretrained models as a robust approach for this task.
Balancing Information Accuracy and Response Timeliness in Networked LLMsYigit Turkmen, Baturalp Buyukates, Melih Bastopcu2025-08-04下载Recent advancements in Large Language Models (LLMs) have transformed many fields including scientific discovery, content generation, biomedical text mining, and educational technology.
5G Core Fault Detection and Root Cause Analysis using Machine Learning and Generative AIJoseph H. R. Isaac, Harish Saradagam, Nallamothu Pardhasaradhi2025-08-04下载With the advent of 5G networks and technologies, ensuring the integrity and performance of packet core traffic is paramount. During network analysis, test files such as Packet Capture (PCAP) files and...
PRIME: Plasticity-Robust Incremental Model for Encrypted Traffic Classification in Dynamic Network EnvironmentsTian Qin, Guang Cheng, Zihan Chen, Yuyang Zhou2025-08-04下载With the continuous development of network environments and technologies, ensuring cyber security and governance is increasingly challenging. Network traffic classification(ETC) can analyzes attribute...
Convolutions are Competitive with Transformers for Encrypted Traffic Classification with Pre-trainingChungang Lin, Weiyao Zhang, Tianyu Zuo, Chao Zha, Yilong Jiang, Ruiqi Meng, Haitong Luo, Xuying Meng, Yujun Zhang2025-08-04下载Encrypted traffic classification is vital for modern network management and security. To reduce reliance on handcrafted features and labeled data, recent methods focus on learning generic representati...
Physiological Signal-Driven QoE Optimization for Wireless Virtual Reality TransmissionChang Wu, Yuang Chen, Yiyuan Chen, Fengqian Guo, Xiaowei Qin, Hancheng Lu2025-08-04下载Abrupt resolution changes in virtual reality (VR) streaming can significantly impair the quality-of-experience (QoE) of users, particularly during transitions from high to low resolutions.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Revelator: Rapid Data Fetching via OS-Driven Hash-based Speculative Address TranslationKonstantinos Kanellopoulos, Konstantinos Sgouras, Andreas Kosmas Kakolyris, Vlad-Petru Nitu, Berkin Kerim Konar, Rahul Bera, Onur Mutlu2025-08-04下载Address translation is a major performance bottleneck in modern computing systems. Speculative address translation can hide this latency by predicting the physical address (PA) of requested data early...

基于 VitePress 构建