Appearance
2024-04-25
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Record Acceleration of the Two-Dimensional Ising Model Using High-Performance Wafer Scale Engine | Dirk Van Essendelft, Hayl Almolyki, Wei Shi, Terry Jordan, Mei-Yu Wang, Wissam A. Saidi | 2024-04-25 | 下载 | The versatility and wide-ranging applicability of the Ising model, originally introduced to study phase transitions in magnetic materials, have made it a cornerstone in statistical physics and a valua... |
| Implementing and Optimizing the Scaled Dot-Product Attention on Streaming Dataflow | Gina Sohn, Nathan Zhang, Kunle Olukotun | 2024-04-25 | 下载 | Transformer models serve as the backbone of many state-ofthe-art language models, and most use the scaled dot-product attention (SDPA) mechanism to capture relationships between tokens. |
| Digital ASIC Design with Ongoing LLMs: Strategies and Prospects | Maoyang Xiang, Emil Goh, T. Hui Teo | 2024-04-25 | 下载 | The escalating complexity of modern digital systems has imposed significant challenges on integrated circuit (IC) design, necessitating tools that can simplify the IC design flow. |
| FLAASH: Flexible Accelerator Architecture for Sparse High-Order Tensor Contraction | Gabriel Kulp, Andrew Ensinger, Lizhong Chen | 2024-04-25 | 下载 | Tensors play a vital role in machine learning (ML) and often exhibit properties best explored while maintaining high-order. Efficiently performing ML computations requires taking advantage of sparsity... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Portable, Massively Parallel Implementation of a Material Point Method for Compressible Flows | Paolo Joseph Baioni, Tommaso Benacchio, Luigi Capone, Carlo de Falco | 2024-04-25 | 下载 | The recent evolution of software and hardware technologies is leading to a renewed computational interest in Particle-In-Cell (PIC) methods such as the Material Point Method (MPM). |
| Large Scale Multi-GPU Based Parallel Traffic Simulation for Accelerated Traffic Assignment and Propagation | Xuan Jiang, Raja Sengupta, James Demmel, Samuel Williams | 2024-04-25 | 下载 | Traffic propagation simulation is crucial for urban planning, enabling congestion analysis, travel time estimation, and route optimization. Traditional micro-simulation frameworks are limited to main ... |
| ESG: Pipeline-Conscious Efficient Scheduling of DNN Workflows on Serverless Platforms with Shareable GPUs | Xinning Hui, Yuanchao Xu, Zhishan Guo, Xipeng Shen | 2024-04-25 | 下载 | Recent years have witnessed increasing interest in machine learning inferences on serverless computing for its auto-scaling and cost effective properties. |
| A Communication- and Memory-Aware Model for Load Balancing Tasks | Jonathan Lifflander, Philippe P. Pebay, Nicole L. Slattengren, Pierre L. Pebay, Robert A. Pfeiffer, Joseph D. Kotulski, Sean T. McGovern | 2024-04-25 | 下载 | While load balancing in distributed-memory computing has been well-studied, we present an innovative approach to this problem: a unified, reduced-order model that combines three key components to desc... |
| Hybrid Heterogeneous Clusters Can Lower the Energy Consumption of LLM Inference Workloads | Grant Wilkins, Srinivasan Keshav, Richard Mortier | 2024-04-25 | 下载 | Both the training and use of Large Language Models (LLMs) require large amounts of energy. Their increasing popularity, therefore, raises critical concerns regarding the energy efficiency and sustaina... |
| Blockchain-enabled Energy Trading and Battery-based Sharing in Microgrids | Abdulrezzak Zekiye, Ouns Bouachir, Öznur Özkasap, Moayad Aloqaily | 2024-04-25 | 下载 | Carbon footprint reduction can be achieved through various methods, including the adoption of renewable energy sources. The installation of such sources, like photovoltaic panels, while environmentall... |
| On Software Ageing Indicators in OpenStack | Yevhen Yazvinskyi, Jasmin Bogatinovski, Jorge Cardoso, Odej Kao | 2024-04-25 | 下载 | Distributed systems in general and cloud systems in particular, are susceptible to failures that can lead to substantial economic and data losses, security breaches, and even potential threats to huma... |
| An Open-Source Fast Parallel Routing Approach for Commercial FPGAs | Xinshi Zang, Wenhao Lin, Shiju Lin, Jinwei Liu, Evangeline F. Y. Young | 2024-04-25 | 下载 | In the face of escalating complexity and size of contemporary FPGAs and circuits, routing emerges as a pivotal and time-intensive phase in FPGA compilation flows. |
| Tightening I/O Lower Bounds through the Hourglass Dependency Pattern | Lionel Eyraud-Dubois, Guillaume Iooss, Julien Langou, Fabrice Rastello | 2024-04-25 | 下载 | When designing an algorithm, one cares about arithmetic/computational complexity, but data movement (I/O) complexity plays an increasingly important role that highly impacts performance and energy con... |
| Dirigent: Lightweight Serverless Orchestration | Lazar Cvetković, François Costa, Mihajlo Djokic, Michal Friedman, Ana Klimovic | 2024-04-25 | 下载 | While Function as a Service (FaaS) platforms can initialize function sandboxes on worker nodes in 10-100s of milliseconds, the latency to schedule functions in real FaaS clusters can be orders of magn... |
| Byzantine Attacks Exploiting Penalties in Ethereum PoS | Ulysse Pavloff, Yackolley Amoussou-Genou, Sara Tucci-Piergiovanni | 2024-04-25 | 下载 | In May 2023, the Ethereum blockchain experienced its first inactivity leak, a mechanism designed to reinstate chain finalization amid persistent network disruptions. |
| Parallel and (Nearly) Work-Efficient Dynamic Programming | Xiangyun Ding, Yan Gu, Yihan Sun | 2024-04-25 | 下载 | The idea of dynamic programming (DP), proposed by Bellman in the 1950s, is one of the most important algorithmic techniques. However, in parallel, many fundamental and sequentially simple problems bec... |
| Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services | Jiachen Liu, Jae-Won Chung, Zhiyu Wu, Fan Lai, Myungjin Lee, Mosharaf Chowdhury | 2024-04-25 | 下载 | Large language models (LLMs) are now at the core of conversational AI services such as real-time translation and chatbots, which provide live user interaction by incrementally streaming text to the us... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Compiler for Distributed Quantum Computing: a Reinforcement Learning Approach | Panagiotis Promponas, Akrit Mudvari, Luca Della Chiesa, Paul Polakos, Louis Samuel, Leandros Tassiulas | 2024-04-25 | 下载 | The practical realization of quantum programs that require large-scale qubit systems is hindered by current technological limitations. Distributed Quantum Computing (DQC) presents a viable path to sca... |
| CarbonCP: Carbon-Aware DNN Partitioning with Conformal Prediction for Sustainable Edge Intelligence | Hongyu Ke, Wanxin Jin, Haoxin Wang | 2024-04-25 | 下载 | This paper presents a solution to address carbon emission mitigation for end-to-end edge computing systems, including the computing at battery-powered edge devices and servers, as well as the communic... |
| Enhancing Quality of Experience in Telecommunication Networks: A Review of Frameworks and Machine Learning Algorithms | Parsa H. S. Panahi, Amir H. Jalilvand, Abolfazl Diyanat | 2024-04-25 | 下载 | The Internet service provider industry is currently experiencing intense competition as companies strive to provide top-notch services to their customers. |
| Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks | Shufan Wang, Guojun Xiong, Shichen Zhang, Huacheng Zeng, Jian Li, Shivendra Panwar | 2024-04-25 | 下载 | We study the data packet transmission problem (mmDPT) in dense cell-free millimeter wave (mmWave) networks, i.e., users sending data packet requests to access points (APs) via uplinks and APs transmit... |
| Multi-Band mm-Wave Measurement Platform Towards Environment-Aware Beam Management | Aleksandar Ichkov, Aron Schott, Niklas Beckmann, Ljiljana Simić | 2024-04-25 | 下载 | Agile beam management is key for providing seamless millimeter wave (mm-wave) connectivity given the site-specific spatio-temporal variations of the mm-wave channel. |
| Energy Efficient Service Placement for IoT Networks | Mohammed A. Alshahrani, Ahmad Adnan Qidan, Taisir E. H. El-Gorashi, Jaafar M. H. Elmirghani | 2024-04-25 | 下载 | In recent years, there has been a significant expansion in the Internet of Things (IoT), with a growing number of devices being connected to the internet. |
| Exploring the Dynamics of Data Transmission in 5G Networks: A Conceptual Analysis | Nikita Smirnov, Sven Tomforde | 2024-04-25 | 下载 | This conceptual analysis examines the dynamics of data transmission in 5G networks. It addresses various aspects of sending data from cameras and LiDARs installed on a remote-controlled ferry to a lan... |
| Quantum-assisted trustworthiness for the Quantum Internet | Agustin Zaballos, Adria Mallorqui, Joan Navarro | 2024-04-25 | 下载 | Device redundancy is one of the most well-known mechanisms in distributed systems to increase the overall system fault tolerance and, consequently, trustworthiness. |
| An Open-Source Fast Parallel Routing Approach for Commercial FPGAs | Xinshi Zang, Wenhao Lin, Shiju Lin, Jinwei Liu, Evangeline F. Y. Young | 2024-04-25 | 下载 | In the face of escalating complexity and size of contemporary FPGAs and circuits, routing emerges as a pivotal and time-intensive phase in FPGA compilation flows. |
| Integration of Mixture of Experts and Multimodal Generative AI in Internet of Vehicles: A Survey | Minrui Xu, Dusit Niyato, Jiawen Kang, Zehui Xiong, Abbas Jamalipour, Yuguang Fang, Dong In Kim, Xuemin, Shen | 2024-04-25 | 下载 | Generative AI (GAI) can enhance the cognitive, reasoning, and planning capabilities of intelligent modules in the Internet of Vehicles (IoV) by synthesizing augmented datasets, completing sensor data,... |
| Timely Communications for Remote Inference | Md Kamran Chowdhury Shisher, Yin Sun, I-Hong Hou | 2024-04-25 | 下载 | In this paper, we analyze the impact of data freshness on remote inference systems, where a pre-trained neural network blue infers a time-varying target (e.g. |
| Spectrum Sharing Policy in the Asia-Pacific Region | Zhiyong Feng, Zhiqing Wei | 2024-04-25 | 下载 | In this chapter, we investigate the spectrum measurement results in Asia-Pacific region. Then the spectrum sharing policy in the Asia-Pacific region is reviewed in details, where the national projects... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Dirigent: Lightweight Serverless Orchestration | Lazar Cvetković, François Costa, Mihajlo Djokic, Michal Friedman, Ana Klimovic | 2024-04-25 | 下载 | While Function as a Service (FaaS) platforms can initialize function sandboxes on worker nodes in 10-100s of milliseconds, the latency to schedule functions in real FaaS clusters can be orders of magn... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| CarbonCP: Carbon-Aware DNN Partitioning with Conformal Prediction for Sustainable Edge Intelligence | Hongyu Ke, Wanxin Jin, Haoxin Wang | 2024-04-25 | 下载 | This paper presents a solution to address carbon emission mitigation for end-to-end edge computing systems, including the computing at battery-powered edge devices and servers, as well as the communic... |