
2024-04-25

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Record Acceleration of the Two-Dimensional Ising Model Using High-Performance Wafer Scale Engine | Dirk Van Essendelft, Hayl Almolyki, Wei Shi, Terry Jordan, Mei-Yu Wang, Wissam A. Saidi | 2024-04-25 | Download | The versatility and wide-ranging applicability of the Ising model, originally introduced to study phase transitions in magnetic materials, have made it a cornerstone in statistical physics and a valua... |
| Implementing and Optimizing the Scaled Dot-Product Attention on Streaming Dataflow | Gina Sohn, Nathan Zhang, Kunle Olukotun | 2024-04-25 | Download | Transformer models serve as the backbone of many state-of-the-art language models, and most use the scaled dot-product attention (SDPA) mechanism to capture relationships between tokens. |
| Digital ASIC Design with Ongoing LLMs: Strategies and Prospects | Maoyang Xiang, Emil Goh, T. Hui Teo | 2024-04-25 | Download | The escalating complexity of modern digital systems has imposed significant challenges on integrated circuit (IC) design, necessitating tools that can simplify the IC design flow. |
| FLAASH: Flexible Accelerator Architecture for Sparse High-Order Tensor Contraction | Gabriel Kulp, Andrew Ensinger, Lizhong Chen | 2024-04-25 | Download | Tensors play a vital role in machine learning (ML) and often exhibit properties best explored while maintaining high-order. Efficiently performing ML computations requires taking advantage of sparsity... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Portable, Massively Parallel Implementation of a Material Point Method for Compressible Flows | Paolo Joseph Baioni, Tommaso Benacchio, Luigi Capone, Carlo de Falco | 2024-04-25 | Download | The recent evolution of software and hardware technologies is leading to a renewed computational interest in Particle-In-Cell (PIC) methods such as the Material Point Method (MPM). |
| Large Scale Multi-GPU Based Parallel Traffic Simulation for Accelerated Traffic Assignment and Propagation | Xuan Jiang, Raja Sengupta, James Demmel, Samuel Williams | 2024-04-25 | Download | Traffic propagation simulation is crucial for urban planning, enabling congestion analysis, travel time estimation, and route optimization. Traditional micro-simulation frameworks are limited to main ... |
| ESG: Pipeline-Conscious Efficient Scheduling of DNN Workflows on Serverless Platforms with Shareable GPUs | Xinning Hui, Yuanchao Xu, Zhishan Guo, Xipeng Shen | 2024-04-25 | Download | Recent years have witnessed increasing interest in machine learning inferences on serverless computing for its auto-scaling and cost effective properties. |
| A Communication- and Memory-Aware Model for Load Balancing Tasks | Jonathan Lifflander, Philippe P. Pebay, Nicole L. Slattengren, Pierre L. Pebay, Robert A. Pfeiffer, Joseph D. Kotulski, Sean T. McGovern | 2024-04-25 | Download | While load balancing in distributed-memory computing has been well-studied, we present an innovative approach to this problem: a unified, reduced-order model that combines three key components to desc... |
| Hybrid Heterogeneous Clusters Can Lower the Energy Consumption of LLM Inference Workloads | Grant Wilkins, Srinivasan Keshav, Richard Mortier | 2024-04-25 | Download | Both the training and use of Large Language Models (LLMs) require large amounts of energy. Their increasing popularity, therefore, raises critical concerns regarding the energy efficiency and sustaina... |
| Blockchain-enabled Energy Trading and Battery-based Sharing in Microgrids | Abdulrezzak Zekiye, Ouns Bouachir, Öznur Özkasap, Moayad Aloqaily | 2024-04-25 | Download | Carbon footprint reduction can be achieved through various methods, including the adoption of renewable energy sources. The installation of such sources, like photovoltaic panels, while environmentall... |
| On Software Ageing Indicators in OpenStack | Yevhen Yazvinskyi, Jasmin Bogatinovski, Jorge Cardoso, Odej Kao | 2024-04-25 | Download | Distributed systems in general and cloud systems in particular, are susceptible to failures that can lead to substantial economic and data losses, security breaches, and even potential threats to huma... |
| An Open-Source Fast Parallel Routing Approach for Commercial FPGAs | Xinshi Zang, Wenhao Lin, Shiju Lin, Jinwei Liu, Evangeline F. Y. Young | 2024-04-25 | Download | In the face of escalating complexity and size of contemporary FPGAs and circuits, routing emerges as a pivotal and time-intensive phase in FPGA compilation flows. |
| Tightening I/O Lower Bounds through the Hourglass Dependency Pattern | Lionel Eyraud-Dubois, Guillaume Iooss, Julien Langou, Fabrice Rastello | 2024-04-25 | Download | When designing an algorithm, one cares about arithmetic/computational complexity, but data movement (I/O) complexity plays an increasingly important role that highly impacts performance and energy con... |
| Dirigent: Lightweight Serverless Orchestration | Lazar Cvetković, François Costa, Mihajlo Djokic, Michal Friedman, Ana Klimovic | 2024-04-25 | Download | While Function as a Service (FaaS) platforms can initialize function sandboxes on worker nodes in 10-100s of milliseconds, the latency to schedule functions in real FaaS clusters can be orders of magn... |
| Byzantine Attacks Exploiting Penalties in Ethereum PoS | Ulysse Pavloff, Yackolley Amoussou-Genou, Sara Tucci-Piergiovanni | 2024-04-25 | Download | In May 2023, the Ethereum blockchain experienced its first inactivity leak, a mechanism designed to reinstate chain finalization amid persistent network disruptions. |
| Parallel and (Nearly) Work-Efficient Dynamic Programming | Xiangyun Ding, Yan Gu, Yihan Sun | 2024-04-25 | Download | The idea of dynamic programming (DP), proposed by Bellman in the 1950s, is one of the most important algorithmic techniques. However, in parallel, many fundamental and sequentially simple problems bec... |
| Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services | Jiachen Liu, Jae-Won Chung, Zhiyu Wu, Fan Lai, Myungjin Lee, Mosharaf Chowdhury | 2024-04-25 | Download | Large language models (LLMs) are now at the core of conversational AI services such as real-time translation and chatbots, which provide live user interaction by incrementally streaming text to the us... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Compiler for Distributed Quantum Computing: a Reinforcement Learning Approach | Panagiotis Promponas, Akrit Mudvari, Luca Della Chiesa, Paul Polakos, Louis Samuel, Leandros Tassiulas | 2024-04-25 | Download | The practical realization of quantum programs that require large-scale qubit systems is hindered by current technological limitations. Distributed Quantum Computing (DQC) presents a viable path to sca... |
| CarbonCP: Carbon-Aware DNN Partitioning with Conformal Prediction for Sustainable Edge Intelligence | Hongyu Ke, Wanxin Jin, Haoxin Wang | 2024-04-25 | Download | This paper presents a solution to address carbon emission mitigation for end-to-end edge computing systems, including the computing at battery-powered edge devices and servers, as well as the communic... |
| Enhancing Quality of Experience in Telecommunication Networks: A Review of Frameworks and Machine Learning Algorithms | Parsa H. S. Panahi, Amir H. Jalilvand, Abolfazl Diyanat | 2024-04-25 | Download | The Internet service provider industry is currently experiencing intense competition as companies strive to provide top-notch services to their customers. |
| Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks | Shufan Wang, Guojun Xiong, Shichen Zhang, Huacheng Zeng, Jian Li, Shivendra Panwar | 2024-04-25 | Download | We study the data packet transmission problem (mmDPT) in dense cell-free millimeter wave (mmWave) networks, i.e., users sending data packet requests to access points (APs) via uplinks and APs transmit... |
| Multi-Band mm-Wave Measurement Platform Towards Environment-Aware Beam Management | Aleksandar Ichkov, Aron Schott, Niklas Beckmann, Ljiljana Simić | 2024-04-25 | Download | Agile beam management is key for providing seamless millimeter wave (mm-wave) connectivity given the site-specific spatio-temporal variations of the mm-wave channel. |
| Energy Efficient Service Placement for IoT Networks | Mohammed A. Alshahrani, Ahmad Adnan Qidan, Taisir E. H. El-Gorashi, Jaafar M. H. Elmirghani | 2024-04-25 | Download | In recent years, there has been a significant expansion in the Internet of Things (IoT), with a growing number of devices being connected to the internet. |
| Exploring the Dynamics of Data Transmission in 5G Networks: A Conceptual Analysis | Nikita Smirnov, Sven Tomforde | 2024-04-25 | Download | This conceptual analysis examines the dynamics of data transmission in 5G networks. It addresses various aspects of sending data from cameras and LiDARs installed on a remote-controlled ferry to a lan... |
| Quantum-assisted trustworthiness for the Quantum Internet | Agustin Zaballos, Adria Mallorqui, Joan Navarro | 2024-04-25 | Download | Device redundancy is one of the most well-known mechanisms in distributed systems to increase the overall system fault tolerance and, consequently, trustworthiness. |
| An Open-Source Fast Parallel Routing Approach for Commercial FPGAs | Xinshi Zang, Wenhao Lin, Shiju Lin, Jinwei Liu, Evangeline F. Y. Young | 2024-04-25 | Download | In the face of escalating complexity and size of contemporary FPGAs and circuits, routing emerges as a pivotal and time-intensive phase in FPGA compilation flows. |
| Integration of Mixture of Experts and Multimodal Generative AI in Internet of Vehicles: A Survey | Minrui Xu, Dusit Niyato, Jiawen Kang, Zehui Xiong, Abbas Jamalipour, Yuguang Fang, Dong In Kim, Xuemin Shen | 2024-04-25 | Download | Generative AI (GAI) can enhance the cognitive, reasoning, and planning capabilities of intelligent modules in the Internet of Vehicles (IoV) by synthesizing augmented datasets, completing sensor data,... |
| Timely Communications for Remote Inference | Md Kamran Chowdhury Shisher, Yin Sun, I-Hong Hou | 2024-04-25 | Download | In this paper, we analyze the impact of data freshness on remote inference systems, where a pre-trained neural network infers a time-varying target (e.g. |
| Spectrum Sharing Policy in the Asia-Pacific Region | Zhiyong Feng, Zhiqing Wei | 2024-04-25 | Download | In this chapter, we investigate the spectrum measurement results in the Asia-Pacific region. Then the spectrum sharing policy in the Asia-Pacific region is reviewed in detail, where the national projects... |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Dirigent: Lightweight Serverless Orchestration | Lazar Cvetković, François Costa, Mihajlo Djokic, Michal Friedman, Ana Klimovic | 2024-04-25 | Download | While Function as a Service (FaaS) platforms can initialize function sandboxes on worker nodes in 10-100s of milliseconds, the latency to schedule functions in real FaaS clusters can be orders of magn... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| CarbonCP: Carbon-Aware DNN Partitioning with Conformal Prediction for Sustainable Edge Intelligence | Hongyu Ke, Wanxin Jin, Haoxin Wang | 2024-04-25 | Download | This paper presents a solution to address carbon emission mitigation for end-to-end edge computing systems, including the computing at battery-powered edge devices and servers, as well as the communic... |
