Appearance
2024-12-11
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Security Properties for Open-Source Hardware Designs | Jayden Rogers, Niyaz Shakeel, Divya Mankani, Samantha Espinosa, Cade Chabra, Kaki Ryan, Cynthia Sturton | 2024-12-11 | 下载 | The hardware security community relies on databases of known vulnerabilities and open-source designs to develop formal verification methods for identifying hardware security flaws. |
| Empirical Measurements of AI Training Power Demand on a GPU-Accelerated Node | Imran Latif, Alex C. Newkirk, Matthew R. Carbone, Arslan Munir, Yuewei Lin, Jonathan Koomey, Xi Yu, Zhiuha Dong | 2024-12-11 | 下载 | The expansion of artificial intelligence (AI) applications has driven substantial investment in computational infrastructure, especially by cloud computing providers. |
| TurboAttention: Efficient Attention Approximation For High Throughputs LLMs | Hao Kang, Srikant Bharadwaj, James Hensman, Tushar Krishna, Victor Ruhle, Saravan Rajmohan | 2024-12-11 | 下载 | Large language model (LLM) inference demands significant amount of computation and memory, especially in the key attention mechanism. While techniques, such as quantization and acceleration algorithms... |
| Enhancing CGRA Efficiency Through Aligned Compute and Communication Provisioning | Zhaoying Li, Pranav Dangi, Chenyang Yin, Thilini Kaushalya Bandara, Rohan Juneja, Cheng Tan, Zhenyu Bai, Tulika Mitra | 2024-12-11 | 下载 | Coarse-grained Reconfigurable Arrays (CGRAs) are domain-agnostic accelerators that enhance the energy efficiency of resource-constrained edge devices. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Analytic Roofline Modeling and Energy Analysis of LULESH Proxy Application on Multi-Core Clusters | Ayesha Afzal, Georg Hager, Gerhard Wellein | 2024-12-11 | 下载 | We present a thorough performance and energy consumption analysis of the LULESH proxy application in its OpenMP and MPI variants on two different clusters based on Intel Ice Lake (ICL) and Sapphire Ra... |
| Protecting Confidentiality, Privacy and Integrity in Collaborative Learning | Dong Chen, Alice Dethise, Istemi Ekin Akkus, Ivica Rimac, Klaus Satzke, Antti Koskela, Marco Canini, Wei Wang, Ruichuan Chen | 2024-12-11 | 下载 | A collaboration between dataset owners and model owners is needed to facilitate effective machine learning (ML) training. During this collaboration, however, dataset owners and model owners want to pr... |
| Can vehicular cloud replace edge computing? | Rosario Patanè, Nadjib Achir, Andrea Araldo, Lila Boukhatem | 2024-12-11 | 下载 | Edge computing (EC) consists of deploying computation resources close to the users, thus enabling low-latency applications, such as augmented reality and online gaming. |
| Federated Learning for Traffic Flow Prediction with Synthetic Data Augmentation | Fermin Orozco, Pedro Porto Buarque de Gusmão, Hongkai Wen, Johan Wahlström, Man Luo | 2024-12-11 | 下载 | Deep-learning based traffic prediction models require vast amounts of data to learn embedded spatial and temporal dependencies. The inherent privacy and commercial sensitivity of such data has encoura... |
| Pioplat: A Scalable, Low-Cost Framework for Latency Reduction in Ethereum Blockchain | Ke Wang, Qiao Wang, Yue Li, Zhi Guan, Zhong Chen | 2024-12-11 | 下载 | As decentralized applications on permissionless blockchains are prevalent, more and more latency-sensitive usage scenarios emerged, where the lower the latency of sending and receiving messages, the b... |
| Analyzing the Performance Portability of SYCL across CPUs, GPUs, and Hybrid Systems with SW Sequence Alignment | Manuel Costanzo, Enzo Rucci, Carlos García-Sánchez, Marcelo Naiouf, Manuel Prieto-Matías | 2024-12-11 | 下载 | The high-performance computing (HPC) landscape is undergoing rapid transformation, with an increasing emphasis on energy-efficient and heterogeneous computing environments. |
| EaCO: Resource Sharing Dynamics and Its Impact on Energy Efficiency for DNN Training | Kawsar Haghshenas, Mona Hashemi | 2024-12-11 | 下载 | Deep Learning Training (DLT) is a growing workload in shared GPU/CPU clusters due to its high computational cost and increasing number of jobs. |
| Collaborative Inference for Large Models with Task Offloading and Early Exiting | Zuan Xie, Yang Xu, Hongli Xu, Yunming Liao, Zhiyuan Yao | 2024-12-11 | 下载 | In 5G smart cities, edge computing is employed to provide nearby computing services for end devices, and the large-scale models (e.g., GPT and LLaMA) can be deployed at the network edge to boost the s... |
| Learn How to Query from Unlabeled Data Streams in Federated Learning | Yuchang Sun, Xinran Li, Tao Lin, Jun Zhang | 2024-12-11 | 下载 | Federated learning (FL) enables collaborative learning among decentralized clients while safeguarding the privacy of their local data. Existing studies on FL typically assume offline labeled data avai... |
| Quantum Simultaneous Protocols without Public Coins using Modified Equality Queries | François Le Gall, Oran Nadler, Harumichi Nishimura, Rotem Oshman | 2024-12-11 | 下载 | In this paper we study a quantum version of the multiparty simultaneous message-passing (SMP) model, and we show that in some cases, quantum communication can replace public randomness, even with no e... |
| Parsl+CWL: Towards Combining the Python and CWL Ecosystems | Nishchay Karle, Ben Clifford, Yadu Babuji, Ryan Chard, Daniel S. Katz, Kyle Chard | 2024-12-11 | 下载 | The Common Workflow Language (CWL) is a widely adopted language for defining and sharing computational workflows. It is designed to be independent of the execution engine on which workflows are execut... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Integrating Optimization Theory with Deep Learning for Wireless Network Design | Sinem Coleri, Aysun Gurur Onalan, Marco di Renzo | 2024-12-11 | 下载 | Traditional wireless network design relies on optimization algorithms derived from domain-specific mathematical models, which are often inefficient and unsuitable for dynamic, real-time applications d... |
| Generative Semantic Communication: Architectures, Technologies, and Applications | Jinke Ren, Yaping Sun, Hongyang Du, Weiwen Yuan, Chongjie Wang, Xianda Wang, Yingbin Zhou, Ziwei Zhu, Fangxin Wang, Shuguang Cui | 2024-12-11 | 下载 | This paper delves into the applications of generative artificial intelligence (GAI) in semantic communication (SemCom) and presents a thorough study. |
| Orderly Management of Packets in RDMA by Eunomia | Sana Mahmood, Jinqi Lu, Soudeh Ghorbani | 2024-12-11 | 下载 | To fulfill the low latency requirements of today's applications, deployment of RDMA in datacenters has become prevalent over the recent years. |
| ECSeptional DNS Data: Evaluating Nameserver ECS Deployments with Response-Aware Scanning | Patrick Sattler, Johannes Zirngibl, Fahad Hilal, Oliver Gasser, Kevin Vermeulen, Georg Carle, Mattijs Jonker | 2024-12-11 | 下载 | DNS is one of the cornerstones of the Internet. Nowadays, a substantial fraction of DNS queries are handled by public resolvers (e.g., Google Public DNS and Cisco's OpenDNS) rather than ISP nameserver... |
| Can vehicular cloud replace edge computing? | Rosario Patanè, Nadjib Achir, Andrea Araldo, Lila Boukhatem | 2024-12-11 | 下载 | Edge computing (EC) consists of deploying computation resources close to the users, thus enabling low-latency applications, such as augmented reality and online gaming. |
| GDSG: Graph Diffusion-based Solution Generator for Optimization Problems in MEC Networks | Ruihuai Liang, Bo Yang, Pengyu Chen, Xuelin Cao, Zhiwen Yu, Mérouane Debbah, Dusit Niyato, H. Vincent Poor, Chau Yuen | 2024-12-11 | 下载 | Optimization is crucial for MEC networks to function efficiently and reliably, most of which are NP-hard and lack efficient approximation algorithms. |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Analytic Roofline Modeling and Energy Analysis of LULESH Proxy Application on Multi-Core Clusters | Ayesha Afzal, Georg Hager, Gerhard Wellein | 2024-12-11 | 下载 | We present a thorough performance and energy consumption analysis of the LULESH proxy application in its OpenMP and MPI variants on two different clusters based on Intel Ice Lake (ICL) and Sapphire Ra... |