Skip to content

2024-12-11

cs.AR - Architecture

标题作者发布日期PDF摘要
Security Properties for Open-Source Hardware DesignsJayden Rogers, Niyaz Shakeel, Divya Mankani, Samantha Espinosa, Cade Chabra, Kaki Ryan, Cynthia Sturton2024-12-11下载The hardware security community relies on databases of known vulnerabilities and open-source designs to develop formal verification methods for identifying hardware security flaws.
Empirical Measurements of AI Training Power Demand on a GPU-Accelerated NodeImran Latif, Alex C. Newkirk, Matthew R. Carbone, Arslan Munir, Yuewei Lin, Jonathan Koomey, Xi Yu, Zhiuha Dong2024-12-11下载The expansion of artificial intelligence (AI) applications has driven substantial investment in computational infrastructure, especially by cloud computing providers.
TurboAttention: Efficient Attention Approximation For High Throughputs LLMsHao Kang, Srikant Bharadwaj, James Hensman, Tushar Krishna, Victor Ruhle, Saravan Rajmohan2024-12-11下载Large language model (LLM) inference demands significant amount of computation and memory, especially in the key attention mechanism. While techniques, such as quantization and acceleration algorithms...
Enhancing CGRA Efficiency Through Aligned Compute and Communication ProvisioningZhaoying Li, Pranav Dangi, Chenyang Yin, Thilini Kaushalya Bandara, Rohan Juneja, Cheng Tan, Zhenyu Bai, Tulika Mitra2024-12-11下载Coarse-grained Reconfigurable Arrays (CGRAs) are domain-agnostic accelerators that enhance the energy efficiency of resource-constrained edge devices.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Analytic Roofline Modeling and Energy Analysis of LULESH Proxy Application on Multi-Core ClustersAyesha Afzal, Georg Hager, Gerhard Wellein2024-12-11下载We present a thorough performance and energy consumption analysis of the LULESH proxy application in its OpenMP and MPI variants on two different clusters based on Intel Ice Lake (ICL) and Sapphire Ra...
Protecting Confidentiality, Privacy and Integrity in Collaborative LearningDong Chen, Alice Dethise, Istemi Ekin Akkus, Ivica Rimac, Klaus Satzke, Antti Koskela, Marco Canini, Wei Wang, Ruichuan Chen2024-12-11下载A collaboration between dataset owners and model owners is needed to facilitate effective machine learning (ML) training. During this collaboration, however, dataset owners and model owners want to pr...
Can vehicular cloud replace edge computing?Rosario Patanè, Nadjib Achir, Andrea Araldo, Lila Boukhatem2024-12-11下载Edge computing (EC) consists of deploying computation resources close to the users, thus enabling low-latency applications, such as augmented reality and online gaming.
Federated Learning for Traffic Flow Prediction with Synthetic Data AugmentationFermin Orozco, Pedro Porto Buarque de Gusmão, Hongkai Wen, Johan Wahlström, Man Luo2024-12-11下载Deep-learning based traffic prediction models require vast amounts of data to learn embedded spatial and temporal dependencies. The inherent privacy and commercial sensitivity of such data has encoura...
Pioplat: A Scalable, Low-Cost Framework for Latency Reduction in Ethereum BlockchainKe Wang, Qiao Wang, Yue Li, Zhi Guan, Zhong Chen2024-12-11下载As decentralized applications on permissionless blockchains are prevalent, more and more latency-sensitive usage scenarios emerged, where the lower the latency of sending and receiving messages, the b...
Analyzing the Performance Portability of SYCL across CPUs, GPUs, and Hybrid Systems with SW Sequence AlignmentManuel Costanzo, Enzo Rucci, Carlos García-Sánchez, Marcelo Naiouf, Manuel Prieto-Matías2024-12-11下载The high-performance computing (HPC) landscape is undergoing rapid transformation, with an increasing emphasis on energy-efficient and heterogeneous computing environments.
EaCO: Resource Sharing Dynamics and Its Impact on Energy Efficiency for DNN TrainingKawsar Haghshenas, Mona Hashemi2024-12-11下载Deep Learning Training (DLT) is a growing workload in shared GPU/CPU clusters due to its high computational cost and increasing number of jobs.
Collaborative Inference for Large Models with Task Offloading and Early ExitingZuan Xie, Yang Xu, Hongli Xu, Yunming Liao, Zhiyuan Yao2024-12-11下载In 5G smart cities, edge computing is employed to provide nearby computing services for end devices, and the large-scale models (e.g., GPT and LLaMA) can be deployed at the network edge to boost the s...
Learn How to Query from Unlabeled Data Streams in Federated LearningYuchang Sun, Xinran Li, Tao Lin, Jun Zhang2024-12-11下载Federated learning (FL) enables collaborative learning among decentralized clients while safeguarding the privacy of their local data. Existing studies on FL typically assume offline labeled data avai...
Quantum Simultaneous Protocols without Public Coins using Modified Equality QueriesFrançois Le Gall, Oran Nadler, Harumichi Nishimura, Rotem Oshman2024-12-11下载In this paper we study a quantum version of the multiparty simultaneous message-passing (SMP) model, and we show that in some cases, quantum communication can replace public randomness, even with no e...
Parsl+CWL: Towards Combining the Python and CWL EcosystemsNishchay Karle, Ben Clifford, Yadu Babuji, Ryan Chard, Daniel S. Katz, Kyle Chard2024-12-11下载The Common Workflow Language (CWL) is a widely adopted language for defining and sharing computational workflows. It is designed to be independent of the execution engine on which workflows are execut...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Integrating Optimization Theory with Deep Learning for Wireless Network DesignSinem Coleri, Aysun Gurur Onalan, Marco di Renzo2024-12-11下载Traditional wireless network design relies on optimization algorithms derived from domain-specific mathematical models, which are often inefficient and unsuitable for dynamic, real-time applications d...
Generative Semantic Communication: Architectures, Technologies, and ApplicationsJinke Ren, Yaping Sun, Hongyang Du, Weiwen Yuan, Chongjie Wang, Xianda Wang, Yingbin Zhou, Ziwei Zhu, Fangxin Wang, Shuguang Cui2024-12-11下载This paper delves into the applications of generative artificial intelligence (GAI) in semantic communication (SemCom) and presents a thorough study.
Orderly Management of Packets in RDMA by EunomiaSana Mahmood, Jinqi Lu, Soudeh Ghorbani2024-12-11下载To fulfill the low latency requirements of today's applications, deployment of RDMA in datacenters has become prevalent over the recent years.
ECSeptional DNS Data: Evaluating Nameserver ECS Deployments with Response-Aware ScanningPatrick Sattler, Johannes Zirngibl, Fahad Hilal, Oliver Gasser, Kevin Vermeulen, Georg Carle, Mattijs Jonker2024-12-11下载DNS is one of the cornerstones of the Internet. Nowadays, a substantial fraction of DNS queries are handled by public resolvers (e.g., Google Public DNS and Cisco's OpenDNS) rather than ISP nameserver...
Can vehicular cloud replace edge computing?Rosario Patanè, Nadjib Achir, Andrea Araldo, Lila Boukhatem2024-12-11下载Edge computing (EC) consists of deploying computation resources close to the users, thus enabling low-latency applications, such as augmented reality and online gaming.
GDSG: Graph Diffusion-based Solution Generator for Optimization Problems in MEC NetworksRuihuai Liang, Bo Yang, Pengyu Chen, Xuelin Cao, Zhiwen Yu, Mérouane Debbah, Dusit Niyato, H. Vincent Poor, Chau Yuen2024-12-11下载Optimization is crucial for MEC networks to function efficiently and reliably, most of which are NP-hard and lack efficient approximation algorithms.

cs.PF - Performance

标题作者发布日期PDF摘要
Analytic Roofline Modeling and Energy Analysis of LULESH Proxy Application on Multi-Core ClustersAyesha Afzal, Georg Hager, Gerhard Wellein2024-12-11下载We present a thorough performance and energy consumption analysis of the LULESH proxy application in its OpenMP and MPI variants on two different clusters based on Intel Ice Lake (ICL) and Sapphire Ra...

基于 VitePress 构建