Appearance
2024-04-29
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis | Andy He, Darren Key, Mason Bulling, Andrew Chang, Skyler Shapiro, Everett Lee | 2024-04-29 | 下载 | Graphics Processing Units (GPUs) have become the leading hardware accelerator for deep learning applications and are used widely in training and inference of transformers; transformers have achieved s... |
| Time Reversal for Near-Field Communications on Multi-chip Wireless Networks | Fátima Rodríguez-Galán, Ama Bandara, Elana Pereira de Santana, Peter Haring Bolívar, Eduard Alarcón, Sergi Abadal | 2024-04-29 | 下载 | Wireless Network-on-Chip (WNoC) has been proposed as a low-latency, versatile, and broadcast-capable complement to current interconnects in the quest for satisfying the ever-increasing communications ... |
| ICMarks: A Robust Watermarking Framework for Integrated Circuit Physical Design IP Protection | Ruisi Zhang, Rachel Selina Rajarathnam, David Z. Pan, Farinaz Koushanfar | 2024-04-29 | 下载 | Physical design watermarking on contemporary integrated circuit (IC) layout encodes signatures without considering the dense connections and design constraints, which could lead to performance degrada... |
| DRAM-Profiler: An Experimental DRAM RowHammer Vulnerability Profiling Mechanism | Ranyang Zhou, Jacqueline T. Liu, Nakul Kochar, Sabbir Ahmed, Adnan Siraj Rakin, Shaahin Angizi | 2024-04-29 | 下载 | RowHammer stands out as a prominent example, potentially the pioneering one, showcasing how a failure mechanism at the circuit level can give rise to a significant and pervasive security vulnerability... |
| Improving Multi-Instance GPU Efficiency via Sub-Entry Sharing TLB Design | Bingyao Li, Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, Xulong Tang | 2024-04-29 | 下载 | NVIDIA's Multi-Instance GPU (MIG) technology enables partitioning GPU computing power and memory into separate hardware instances, providing complete isolation including compute resources, caches, and... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Workload Intelligence: Punching Holes Through the Cloud Abstraction | Lexiang Huang, Anjaly Parayil, Jue Zhang, Xiaoting Qin, Chetan Bansal, Jovan Stojkovic, Pantea Zardoshti, Pulkit Misra, Eli Cortez, Raphael Ghelman, Íñigo Goiri, Saravan Rajmohan, Jim Kleewein, Rodrigo Fonseca, Timothy Zhu, Ricardo Bianchini | 2024-04-29 | 下载 | Today, cloud workloads are essentially opaque to the cloud platform. Typically, the only information the platform receives is the virtual machine (VM) type and possibly a decoration to the type (e.g. |
| HMTRace: Hardware-Assisted Memory-Tagging based Dynamic Data Race Detection | Jaidev Shastri, Xiaoguang Wang, Basavesh Ammanaghatta Shivakumar, Freek Verbeek, Binoy Ravindran | 2024-04-29 | 下载 | Data race, a category of insidious software concurrency bugs, is often challenging and resource-intensive to detect and debug. Existing dynamic race detection tools incur significant execution time an... |
| Optimal Parallel Algorithms for Dendrogram Computation and Single-Linkage Clustering | Laxman Dhulipala, Xiaojun Dong, Kishen N Gowda, Yan Gu | 2024-04-29 | 下载 | Computing a Single-Linkage Dendrogram (SLD) is a key step in the classic single-linkage hierarchical clustering algorithm. Given an input edge-weighted tree , the SLD of is a binary dendrogram ... |
| Performance-Aligned LLMs for Generating Fast Code | Daniel Nichols, Pranav Polasam, Harshitha Menon, Aniruddha Marathe, Todd Gamblin, Abhinav Bhatele | 2024-04-29 | 下载 | Optimizing scientific software is a difficult task because codebases are often large and complex, and performance can depend upon several factors including the algorithm, its implementation, and hardw... |
| Atomicity in Distributed Quantum Computing | Zhicheng Zhang, Mingsheng Ying | 2024-04-29 | 下载 | Atomicity is a ubiquitous assumption in distributed computing, under which actions are indivisible and appear sequential. In classical computing, this assumption has several theoretical and practical ... |
| Converter: A CEAML Reasoner Python package to Streamline Orchestration Across Cloud and Edge Continuum | Ioannis Korontanis, Antonios Makris, Konstantinos Tserpes | 2024-04-29 | 下载 | In recent years, there has been a concerted effort in both industry and research sectors to innovate new approaches to DevOps. The primary aim is to facilitate developers in transitioning their applic... |
| Dflow, a Python framework for constructing cloud-native AI-for-Science workflows | Xinzijian Liu, Yanbo Han, Zhuoyuan Li, Jiahao Fan, Chengqian Zhang, Jinzhe Zeng, Yifan Shan, Yannan Yuan, Wei-Hong Xu, Yun-Pei Liu, Yuzhi Zhang, Tongqi Wen, Darrin M. York, Zhicheng Zhong, Hang Zheng, Jun Cheng, Linfeng Zhang, Han Wang | 2024-04-29 | 下载 | In the AI-for-science era, scientific computing scenarios such as concurrent learning and high-throughput computing demand a new generation of infrastructure that supports scalable computing resources... |
| Reactive Composition of UAV Delivery Services in Urban Environments | Woojin Lee, Babar Shahzaad, Balsam Alkouz, Athman Bouguettaya | 2024-04-29 | 下载 | We propose a novel failure-aware reactive UAV delivery service composition framework. A skyway network infrastructure is presented for the effective provisioning of services in urban areas. |
| Improving Multi-Instance GPU Efficiency via Sub-Entry Sharing TLB Design | Bingyao Li, Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, Xulong Tang | 2024-04-29 | 下载 | NVIDIA's Multi-Instance GPU (MIG) technology enables partitioning GPU computing power and memory into separate hardware instances, providing complete isolation including compute resources, caches, and... |
| FEDQ-Trust: Efficient Data-Driven Trust Prediction for Mobile Edge-Based IoT Systems | Jiahui Bai, Hai Dong, Athman Bouguettaya | 2024-04-29 | 下载 | We introduce FEDQ-Trust, an innovative data-driven trust prediction approach designed for mobile edge-based Internet of Things (IoT) environments. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| High Spectral-Efficiency, Ultra-low MIMO SDM Transmission over a Field-Deployed Multi-Core OAM Fiber | Junyi Liu, Zengquan Xu, Shuqi Mo, Yuming Huang, Yining Huang, Zhenhua Li, Yuying Guo, Lei Shen, Shuo Xu, Ran Gao, Cheng Du, Qian Feng, Jie Luo, Jie Liu, Siyuan Yu | 2024-04-29 | 下载 | Few-mode multi-core fiber (FM-MCF) based Space-Division Multiplexing (SDM) systems possess the potential to maximize the number of multiplexed spatial channels per fiber by harnessing both the space (... |
| Quantum Backbone Networks for Hybrid Quantum Dataframe Transmission | Francesco Vista, Daniel Holme, Stephen DiAdamo | 2024-04-29 | 下载 | To realize a global quantum Internet, there is a need for communication between quantum subnetworks. To accomplish this task, there have been multiple design proposals for a quantum backbone network a... |
| Mobile Networks on the Move: Optimizing Moving Base Stations Dynamics in Urban Scenarios | Laura Finarelli, Falko Dressler, Marco Marsan Ajmone, Gianluca Rizzo | 2024-04-29 | 下载 | Base station densification is one of the key approaches for delivering high capacity in radio access networks. However, current static deployments are often impractical and financially unsustainable, ... |
| Artificial General Intelligence (AGI)-Native Wireless Systems: A Journey Beyond 6G | Walid Saad, Omar Hashash, Christo Kurisummoottil Thomas, Christina Chaccour, Merouane Debbah, Narayan Mandayam, Zhu Han | 2024-04-29 | 下载 | Building future wireless systems that support services like digital twins (DTs) is challenging to achieve through advances to conventional technologies like meta-surfaces. |
| Decomposition Model Assisted Energy-Saving Design in Radio Access Network | Xiaoxue Zhao, Yijun Yu, Yexing Li, Dong Li, Yao Wang, Chungang Yang | 2024-04-29 | 下载 | The continuous emergence of novel services and massive connections involve huge energy consumption towards ultra-dense radio access networks. Moreover, there exist much more number of controllable par... |
| Network Intent Decomposition and Optimization for Energy-Aware Radio Access Network | Yao Wang, Yijun Yu, Yexing Li, Dong Li, Xiaoxue Zhao, Chungang Yang | 2024-04-29 | 下载 | With recent advancements in the sixth generation (6G) communication technologies, more vertical industries have encountered diverse network services. |
| 6G comprehensive intelligence: network operations and optimization based on Large Language Models | Sifan Long, Fengxiao Tang, Yangfan Li, Tiao Tan, Zhengjie Jin, Ming Zhao, Nei Kato | 2024-04-29 | 下载 | The sixth generation mobile communication standard (6G) can promote the development of Industrial Internet and Internet of Things (IoT). To achieve comprehensive intelligent development of the network... |