2024-11-04

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation	Mufei Li, Viraj Shitole, Eli Chien, Changhai Man, Zhaodong Wang, Srinivas Sridharan, Ying Zhang, Tushar Krishna, Pan Li	2024-11-04	下载	Directed acyclic graphs (DAGs) serve as crucial data representations in domains such as hardware synthesis and compiler/program optimization for computing systems.
CXL-DMSim: A Full-System CXL Disaggregated Memory Simulator With Comprehensive Silicon Validation	Yanjing Wang, Lizhou Wu, Wentao Hong, Yang Ou, Zicong Wang, Sunfeng Gao, Jie Zhang, Sheng Ma, Dezun Dong, Xingyun Qi, Mingche Lai, Nong Xiao	2024-11-04	下载	Compute eXpress Link (CXL) has emerged as a key enabler of memory disaggregation for future heterogeneous computing systems to expand memory on-demand and improve resource utilization.
AssertLLM: Generating Hardware Verification Assertions from Design Specifications via Multi-LLMs	Zhiyuan Yan, Wenji Fang, Mengming Li, Min Li, Shang Liu, Zhiyao Xie, Hongce Zhang	2024-11-04	下载	Assertion-based verification (ABV) is a critical method to ensure logic designs comply with their architectural specifications. ABV requires assertions, which are generally converted from specificatio...

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
Taming the Beast of User-Programmed Transactions on Blockchains: A Declarative Transaction Approach	Nodirbek Korchiev, Akash Pateria, Vodelina Samatova, Sogolsadat Mansouri, Kemafor Anyanwu	2024-11-04	下载	Blockchains are being positioned as the "technology of trust" that can be used to mediate transactions between non-trusting parties without the need for a central authority.
Configurable Non-uniform All-to-all Algorithms	Ke Fan, Jens Domke, Seydou Ba, Sidharth Kumar	2024-11-04	下载	MPI_Alltoallv generalizes the uniform all-to-all communication (MPI_Alltoall) by enabling the exchange of data blocks of varied sizes among processes.
PipeLLM: Fast and Confidential Large Language Model Services with Speculative Pipelined Encryption	Yifan Tan, Cheng Tan, Zeyu Mi, Haibo Chen	2024-11-04	下载	Confidential computing on GPUs, like NVIDIA H100, mitigates the security risks of outsourced Large Language Models (LLMs) by implementing strong isolation and data encryption.
Fast and Robust Information Spreading in the Noisy PULL Model	Niccolò D'Archivio, Amos Korman, Emanuele Natale, Robin Vacus	2024-11-04	下载	Understanding how information can efficiently spread in distributed systems under noisy communications is a fundamental question in both biological research and artificial system design.
Benchmarking Accuracy in an Emulated Memory Experiment	Tim Chan	2024-11-04	下载	This note proposes a simpler method to extract the logical error rate from an emulated surface code memory experiment.
Discrete the solving model of time-variant standard Sylvester-conjugate matrix equations using Euler-forward formula	Jiakuang He, Dongqing Wu	2024-11-04	下载	Time-variant standard Sylvester-conjugate matrix equations are presented as early time-variant versions of the complex conjugate matrix equations.
LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation	Mufei Li, Viraj Shitole, Eli Chien, Changhai Man, Zhaodong Wang, Srinivas Sridharan, Ying Zhang, Tushar Krishna, Pan Li	2024-11-04	下载	Directed acyclic graphs (DAGs) serve as crucial data representations in domains such as hardware synthesis and compiler/program optimization for computing systems.
Memory-Efficient Community Detection on Large Graphs Using Weighted Sketches	Subhajit Sahu	2024-11-04	下载	Community detection in graphs identifies groups of nodes with denser connections within the groups than between them, and while existing studies often focus on optimizing detection performance, memory...
FedMoE-DA: Federated Mixture of Experts via Domain Aware Fine-grained Aggregation	Ziwei Zhan, Wenkuan Zhao, Yuanqing Li, Weijie Liu, Xiaoxi Zhang, Chee Wei Tan, Chuan Wu, Deke Guo, Xu Chen	2024-11-04	下载	Federated learning (FL) is a collaborative machine learning approach that enables multiple clients to train models without sharing their private data.
Real-time and Downtime-tolerant Fault Diagnosis for Railway Turnout Machines (RTMs) Empowered with Cloud-Edge Pipeline Parallelism	Fan Wu, Muhammad Bilal, Haolong Xiang, Heng Wang, Jinjun Yu, Xiaolong Xu	2024-11-04	下载	Railway Turnout Machines (RTMs) are mission-critical components of the railway transportation infrastructure, responsible for directing trains onto desired tracks.
Against Multifaceted Graph Heterogeneity via Asymmetric Federated Prompt Learning	Zhuoning Guo, Ruiqian Han, Hao Liu	2024-11-04	下载	Federated Graph Learning (FGL) aims to collaboratively and privately optimize graph models on divergent data for different tasks. A critical challenge in FGL is to enable effective yet efficient feder...
FaaSTube: Optimizing GPU-oriented Data Transfer for Serverless Computing	Hao Wu, Junxiao Deng, Minchen Yu, Yue Yu, Yaochen Liu, Hao Fan, Song Wu, Wei Wang	2024-11-04	下载	Serverless computing has gained significant traction for machine learning inference applications, which are often deployed as serverless workflows consisting of multiple CPU and GPU functions with dat...
FedReMa: Improving Personalized Federated Learning via Leveraging the Most Relevant Clients	Han Liang, Ziwei Zhan, Weijie Liu, Xiaoxi Zhang, Chee Wei Tan, Xu Chen	2024-11-04	下载	Federated Learning (FL) is a distributed machine learning paradigm that achieves a globally robust model through decentralized computation and periodic model synthesis, primarily focusing on the globa...
Minder: Faulty Machine Detection for Large-scale Distributed Model Training	Yangtao Deng, Xiang Shi, Zhuo Jiang, Xingjian Zhang, Lei Zhang, Zhang Zhang, Bo Li, Zuquan Song, Hang Zhu, Gaohong Liu, Fuliang Li, Shuguang Wang, Haibin Lin, Jianxi Ye, Minlan Yu	2024-11-04	下载	Large-scale distributed model training requires simultaneous training on up to thousands of machines. Faulty machine detection is critical when an unexpected fault occurs in a machine.
Context Parallelism for Scalable Million-Token Inference	Amy Yang, Jingyi Yang, Aya Ibrahim, Xinfeng Xie, Bangsheng Tang, Grigory Sizov, Jeremy Reizenstein, Jongsoo Park, Jianyu Huang	2024-11-04	下载	We present context parallelism for long-context large language model inference, which achieves near-linear scaling for long-context prefill latency with up to 128 H100 GPUs across 16 nodes.
xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism	Jiarui Fang, Jinzhe Pan, Xibo Sun, Aoyu Li, Jiannan Wang	2024-11-04	下载	Diffusion models are pivotal for generating high-quality images and videos. Inspired by the success of OpenAI's Sora, the backbone of diffusion models is evolving from U-Net to Transformer, known as D...

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network	Nouf Alabbasi, Omar Erak, Omar Alhussein, Ismail Lotfi, Sami Muhaidat, Merouane Debbah	2024-11-04	下载	The telecommunications industry's rapid evolution demands intelligent systems capable of managing complex networks and adapting to emerging technologies.
LLM-based Continuous Intrusion Detection Framework for Next-Gen Networks	Frederic Adjewa, Moez Esseghir, Leila Merghem-Boulahia	2024-11-04	下载	In this paper, we present an adaptive framework designed for the continuous detection, identification and classification of emerging attacks in network traffic.
Technical Report: Performance Comparison of Service Mesh Frameworks: the MTLS Test Case	Anat Bremler Barr, Ofek Lavi, Yaniv Naor, Sanjeev Rampal, Jhonatan Tavori	2024-11-04	下载	Service Mesh has become essential for modern cloud-native applications by abstracting communication between microservices and providing zero-trust security, observability, and advanced traffic control...
A Survey on AI-driven Energy Optimisation in Terrestrial Next Generation Radio Access Networks	Kishan Sthankiya, Nagham Saeed, Greg McSorley, Mona Jaber, Richard G. Clegg	2024-11-04	下载	This survey uncovers the tension between AI techniques designed for energy saving in mobile networks and the energy demands those same techniques create.
Optimizing AoI at Query in Multiuser Wireless Uplink Networks: A Whittle Index Approach	Jingwei Liu, He Chen	2024-11-04	下载	In this paper, we explore how to schedule multiple users to optimize information freshness in a pull-based wireless network, where the status updates from users are requested by randomly arriving quer...
Real-time and Downtime-tolerant Fault Diagnosis for Railway Turnout Machines (RTMs) Empowered with Cloud-Edge Pipeline Parallelism	Fan Wu, Muhammad Bilal, Haolong Xiang, Heng Wang, Jinjun Yu, Xiaolong Xu	2024-11-04	下载	Railway Turnout Machines (RTMs) are mission-critical components of the railway transportation infrastructure, responsible for directing trains onto desired tracks.
Adaptive Optimization of TLS Overhead for Wireless Communication in Critical Infrastructure	Jörn Bodenhausen, Laurenz Grote, Michael Rademacher, Martin Henze	2024-11-04	下载	With critical infrastructure increasingly relying on wireless communication, using end-to-end security such as TLS becomes imperative. However, TLS introduces significant overhead for resource-constra...
A new control- and management architecture for SDN-enabled quantum key distribution networks	Peter Horoschenkoff, Jasper Rödiger, Martin Wilske	2024-11-04	下载	This paper aims to address the challenge of designing secure and high performance Quantum Key Distribution Networks (QKDN), which are essential for encrypted communication in the era of quantum comput...
Fairness-Utilization Trade-off in Wireless Networks with Explainable Kolmogorov-Arnold Networks	Masoud Shokrnezhad, Hamidreza Mazandarani, Tarik Taleb	2024-11-04	下载	The effective distribution of user transmit powers is essential for the significant advancements that the emergence of 6G wireless networks brings.
Connection Performance Modeling and Analysis of a Radiosonde Network in a Typhoon	Hanyi Liu, Xianbin Cao, Peng Yang, Zehui Xiong, Tony Q. S. Quek, Dapeng Oliver Wu	2024-11-04	下载	This paper is concerned with the theoretical modeling and analysis of uplink connection performance of a radiosonde network deployed in a typhoon.
Efficient Conflict Graph Creation for Time-Sensitive Networks with Dynamically Changing Communication Demands	Heiko Geppert, Frank Dürr, Kurt Rothermel	2024-11-04	下载	Many applications of cyber-physical systems require real-time communication: manufacturing, automotive, etc. Recent Ethernet standards for Time Sensitive Networking (TSN) offer time-triggered scheduli...