
2024-04-29

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis | Andy He, Darren Key, Mason Bulling, Andrew Chang, Skyler Shapiro, Everett Lee | 2024-04-29 | Download | Graphics Processing Units (GPUs) have become the leading hardware accelerator for deep learning applications and are used widely in training and inference of transformers; transformers have achieved s... |
| Time Reversal for Near-Field Communications on Multi-chip Wireless Networks | Fátima Rodríguez-Galán, Ama Bandara, Elana Pereira de Santana, Peter Haring Bolívar, Eduard Alarcón, Sergi Abadal | 2024-04-29 | Download | Wireless Network-on-Chip (WNoC) has been proposed as a low-latency, versatile, and broadcast-capable complement to current interconnects in the quest for satisfying the ever-increasing communications ... |
| ICMarks: A Robust Watermarking Framework for Integrated Circuit Physical Design IP Protection | Ruisi Zhang, Rachel Selina Rajarathnam, David Z. Pan, Farinaz Koushanfar | 2024-04-29 | Download | Physical design watermarking on contemporary integrated circuit (IC) layout encodes signatures without considering the dense connections and design constraints, which could lead to performance degrada... |
| DRAM-Profiler: An Experimental DRAM RowHammer Vulnerability Profiling Mechanism | Ranyang Zhou, Jacqueline T. Liu, Nakul Kochar, Sabbir Ahmed, Adnan Siraj Rakin, Shaahin Angizi | 2024-04-29 | Download | RowHammer stands out as a prominent example, potentially the pioneering one, showcasing how a failure mechanism at the circuit level can give rise to a significant and pervasive security vulnerability... |
| Improving Multi-Instance GPU Efficiency via Sub-Entry Sharing TLB Design | Bingyao Li, Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, Xulong Tang | 2024-04-29 | Download | NVIDIA's Multi-Instance GPU (MIG) technology enables partitioning GPU computing power and memory into separate hardware instances, providing complete isolation including compute resources, caches, and... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Workload Intelligence: Punching Holes Through the Cloud Abstraction | Lexiang Huang, Anjaly Parayil, Jue Zhang, Xiaoting Qin, Chetan Bansal, Jovan Stojkovic, Pantea Zardoshti, Pulkit Misra, Eli Cortez, Raphael Ghelman, Íñigo Goiri, Saravan Rajmohan, Jim Kleewein, Rodrigo Fonseca, Timothy Zhu, Ricardo Bianchini | 2024-04-29 | Download | Today, cloud workloads are essentially opaque to the cloud platform. Typically, the only information the platform receives is the virtual machine (VM) type and possibly a decoration to the type (e.g. |
| HMTRace: Hardware-Assisted Memory-Tagging based Dynamic Data Race Detection | Jaidev Shastri, Xiaoguang Wang, Basavesh Ammanaghatta Shivakumar, Freek Verbeek, Binoy Ravindran | 2024-04-29 | Download | Data race, a category of insidious software concurrency bugs, is often challenging and resource-intensive to detect and debug. Existing dynamic race detection tools incur significant execution time an... |
| Optimal Parallel Algorithms for Dendrogram Computation and Single-Linkage Clustering | Laxman Dhulipala, Xiaojun Dong, Kishen N Gowda, Yan Gu | 2024-04-29 | Download | Computing a Single-Linkage Dendrogram (SLD) is a key step in the classic single-linkage hierarchical clustering algorithm. Given an input edge-weighted tree T, the SLD of T is a binary dendrogram ... |
| Performance-Aligned LLMs for Generating Fast Code | Daniel Nichols, Pranav Polasam, Harshitha Menon, Aniruddha Marathe, Todd Gamblin, Abhinav Bhatele | 2024-04-29 | Download | Optimizing scientific software is a difficult task because codebases are often large and complex, and performance can depend upon several factors including the algorithm, its implementation, and hardw... |
| Atomicity in Distributed Quantum Computing | Zhicheng Zhang, Mingsheng Ying | 2024-04-29 | Download | Atomicity is a ubiquitous assumption in distributed computing, under which actions are indivisible and appear sequential. In classical computing, this assumption has several theoretical and practical ... |
| Converter: A CEAML Reasoner Python package to Streamline Orchestration Across Cloud and Edge Continuum | Ioannis Korontanis, Antonios Makris, Konstantinos Tserpes | 2024-04-29 | Download | In recent years, there has been a concerted effort in both industry and research sectors to innovate new approaches to DevOps. The primary aim is to facilitate developers in transitioning their applic... |
| Dflow, a Python framework for constructing cloud-native AI-for-Science workflows | Xinzijian Liu, Yanbo Han, Zhuoyuan Li, Jiahao Fan, Chengqian Zhang, Jinzhe Zeng, Yifan Shan, Yannan Yuan, Wei-Hong Xu, Yun-Pei Liu, Yuzhi Zhang, Tongqi Wen, Darrin M. York, Zhicheng Zhong, Hang Zheng, Jun Cheng, Linfeng Zhang, Han Wang | 2024-04-29 | Download | In the AI-for-science era, scientific computing scenarios such as concurrent learning and high-throughput computing demand a new generation of infrastructure that supports scalable computing resources... |
| Reactive Composition of UAV Delivery Services in Urban Environments | Woojin Lee, Babar Shahzaad, Balsam Alkouz, Athman Bouguettaya | 2024-04-29 | Download | We propose a novel failure-aware reactive UAV delivery service composition framework. A skyway network infrastructure is presented for the effective provisioning of services in urban areas. |
| Improving Multi-Instance GPU Efficiency via Sub-Entry Sharing TLB Design | Bingyao Li, Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, Xulong Tang | 2024-04-29 | Download | NVIDIA's Multi-Instance GPU (MIG) technology enables partitioning GPU computing power and memory into separate hardware instances, providing complete isolation including compute resources, caches, and... |
| FEDQ-Trust: Efficient Data-Driven Trust Prediction for Mobile Edge-Based IoT Systems | Jiahui Bai, Hai Dong, Athman Bouguettaya | 2024-04-29 | Download | We introduce FEDQ-Trust, an innovative data-driven trust prediction approach designed for mobile edge-based Internet of Things (IoT) environments. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| High Spectral-Efficiency, Ultra-low MIMO SDM Transmission over a Field-Deployed Multi-Core OAM Fiber | Junyi Liu, Zengquan Xu, Shuqi Mo, Yuming Huang, Yining Huang, Zhenhua Li, Yuying Guo, Lei Shen, Shuo Xu, Ran Gao, Cheng Du, Qian Feng, Jie Luo, Jie Liu, Siyuan Yu | 2024-04-29 | Download | Few-mode multi-core fiber (FM-MCF) based Space-Division Multiplexing (SDM) systems possess the potential to maximize the number of multiplexed spatial channels per fiber by harnessing both the space (... |
| Quantum Backbone Networks for Hybrid Quantum Dataframe Transmission | Francesco Vista, Daniel Holme, Stephen DiAdamo | 2024-04-29 | Download | To realize a global quantum Internet, there is a need for communication between quantum subnetworks. To accomplish this task, there have been multiple design proposals for a quantum backbone network a... |
| Mobile Networks on the Move: Optimizing Moving Base Stations Dynamics in Urban Scenarios | Laura Finarelli, Falko Dressler, Marco Marsan Ajmone, Gianluca Rizzo | 2024-04-29 | Download | Base station densification is one of the key approaches for delivering high capacity in radio access networks. However, current static deployments are often impractical and financially unsustainable, ... |
| Artificial General Intelligence (AGI)-Native Wireless Systems: A Journey Beyond 6G | Walid Saad, Omar Hashash, Christo Kurisummoottil Thomas, Christina Chaccour, Merouane Debbah, Narayan Mandayam, Zhu Han | 2024-04-29 | Download | Building future wireless systems that support services like digital twins (DTs) is challenging to achieve through advances to conventional technologies like meta-surfaces. |
| Decomposition Model Assisted Energy-Saving Design in Radio Access Network | Xiaoxue Zhao, Yijun Yu, Yexing Li, Dong Li, Yao Wang, Chungang Yang | 2024-04-29 | Download | The continuous emergence of novel services and massive connections involve huge energy consumption towards ultra-dense radio access networks. Moreover, there exist much more number of controllable par... |
| Network Intent Decomposition and Optimization for Energy-Aware Radio Access Network | Yao Wang, Yijun Yu, Yexing Li, Dong Li, Xiaoxue Zhao, Chungang Yang | 2024-04-29 | Download | With recent advancements in the sixth generation (6G) communication technologies, more vertical industries have encountered diverse network services. |
| 6G comprehensive intelligence: network operations and optimization based on Large Language Models | Sifan Long, Fengxiao Tang, Yangfan Li, Tiao Tan, Zhengjie Jin, Ming Zhao, Nei Kato | 2024-04-29 | Download | The sixth generation mobile communication standard (6G) can promote the development of Industrial Internet and Internet of Things (IoT). To achieve comprehensive intelligent development of the network... |
