Skip to content

2024-04-08

cs.AR - Architecture

标题作者发布日期PDF摘要
Resistive Memory-based Neural Differential Equation Solver for Score-based Diffusion ModelJichang Yang, Hegan Chen, Jia Chen, Songqi Wang, Shaocong Wang, Yifei Yu, Xi Chen, Bo Wang, Xinyuan Zhang, Binbin Cui, Yi Li, Ning Lin, Meng Xu, Yi Li, Xiaoxin Xu, Xiaojuan Qi, Zhongrui Wang, Xumeng Zhang, Dashan Shang, Han Wang, Qi Liu, Kwang-Ting Cheng, Ming Liu2024-04-08下载Human brains image complicated scenes when reading a novel. Replicating this imagination is one of the ultimate goals of AI-Generated Content (AIGC).
The Argument for Meta-Modeling-Based Approaches to Hardware Generation LanguagesJohannes Schreiner, Daniel Gerl, Robert Kunzelmann, Paritosh Kumar Sinha, Wolfgang Ecker2024-04-08下载The rapid evolution of Integrated Circuit (IC) development necessitates innovative methodologies such as code generation to manage complexity and increase productivity.
Design and implementation of a synchronous Hardware Performance Monitor for a RISC-V space-oriented processorMiguel Jiménez Arribas, Agustín Martínez Hellín, Manuel Prieto Mateo, Iván Gamino del Río, Andrea Fernandez Gallego, Oscar Rodríguez Polo, Antonio da Silva, Pablo Parra, Sebastián Sánchez2024-04-08下载The ability to collect statistics about the execution of a program within a CPU is of the utmost importance across all fields of computing since it allows characterizing the timing performance of a pr...
Exploring Quantization and Mapping Synergy in Hardware-Aware Deep Neural Network AcceleratorsJan Klhufek, Miroslav Safar, Vojtech Mrazek, Zdenek Vasicek, Lukas Sekanina2024-04-08下载Energy efficiency and memory footprint of a convolutional neural network (CNN) implemented on a CNN inference accelerator depend on many factors, including a weight quantization strategy (i.e.
SARIS: Accelerating Stencil Computations on Energy-Efficient RISC-V Compute Clusters with Indirect Stream RegistersPaul Scheffler, Luca Colagrande, Luca Benini2024-04-08下载Stencil codes are performance-critical in many compute-intensive applications, but suffer from significant address calculation and irregular memory access overheads.
SRAM-PG: Power Delivery Network Benchmarks from SRAM CircuitsShan Shen, Zhiqiang Liu, Wenjian Yu2024-04-08下载Designing the power delivery network (PDN) in very large-scale integrated (VLSI) circuits is increasingly important, especially for nowadays low-power integrated circuit (IC) design.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Space-time deterministic graph rewritingPablo Arrighi, Marin Costes, Gilles Dowek, Luidnel Maignan2024-04-08下载We study non-terminating graph rewriting models, whose local rules are applied non-deterministically -- and yet enjoy a strong form of determinism, namely space-time determinism.
Measuring Arbitrage Losses and Profitability of AMM LiquidityRobin Fritsch, Andrea Canidio2024-04-08下载This paper presents the results of a comprehensive empirical study of losses to arbitrageurs (following the formalization of loss-versus-rebalancing by [Milionis et al.
KaMPIng: Flexible and (Near) Zero-Overhead C++ Bindings for MPITim Niklas Uhl, Matthias Schimek, Lukas Hübner, Demian Hespe, Florian Kurpicz, Christoph Stelz, Peter Sanders2024-04-08下载The Message-Passing Interface (MPI) and C++ form the backbone of high-performance computing, but MPI only provides C and Fortran bindings. While this offers great language interoperability, high-level...
Hook-in Privacy Techniques for gRPC-based Microservice CommunicationLouis Loechel, Siar-Remzi Akbayin, Elias Grünewald, Jannis Kiesel, Inga Strelnikova, Thomas Janke, Frank Pallas2024-04-08下载gRPC is at the heart of modern distributed system architectures. Based on HTTP/2 and Protocol Buffers, it provides highly performant, standardized, and polyglot communication across loosely coupled mi...
Improved Decision Module Selection for Hierarchical Inference in Resource-Constrained Edge DevicesAdarsh Prasad Behera, Roberto Morabito, Joerg Widmer, Jaya Prakash Champati2024-04-08下载The Hierarchical Inference (HI) paradigm employs a tiered processing: the inference from simple data samples are accepted at the end device, while complex data samples are offloaded to the central ser...
Optimal Allocation of Tasks and Price of Anarchy of Distributed Optimization in Networked Computing FacilitiesVincenzo Mancuso, Paolo Castagno, Leonardo Badia, Matteo Sereno, Marco Ajmone Marsan2024-04-08下载The allocation of computing tasks for networked distributed services poses a question to service providers on whether centralized allocation management be worth its cost.
Efficient Distributed Data Structures for Future Many-core ArchitecturesPanagiota Fatourou, Nikolaos D. Kallimanis, Eleni Kanellou, Odysseas Makridakis, Christi Symeonidou2024-04-08下载We study general techniques for implementing distributed data structures on top of future many-core architectures with non cache-coherent or partially cache-coherent memory.
Towards Reconfigurable Linearizable ReadsMyles Thiessen, Aleksey Panas, Guy Khazma, Eyal de Lara2024-04-08下载Linearizable datastores are desirable because they provide users with the illusion that the datastore is run on a single machine that performs client operations one at a time.
DLoRA: Distributed Parameter-Efficient Fine-Tuning Solution for Large Language ModelChao Gao, Sai Qian Zhang2024-04-08下载To enhance the performance of large language models (LLM) on downstream tasks, one solution is to fine-tune certain LLM parameters and make it better align with the characteristics of the training dat...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
AI-Enabled System for Efficient and Effective Cyber Incident Detection and Response in Cloud EnvironmentsMohammed Ashfaaq M. Farzaan, Mohamed Chahine Ghanem, Ayman El-Hajjar, Deepthi N. Ratnayake2024-04-08下载The escalating sophistication and volume of cyber threats in cloud environments necessitate a paradigm shift in strategies. Recognising the need for an automated and precise response to cyber threats,...
Optimal Flow Admission Control in Edge Computing via Safe Reinforcement LearningA. Fox, F. De Pellegrini, F. Faticanti, E. Altman, F. Bronzino2024-04-08下载With the uptake of intelligent data-driven applications, edge computing infrastructures necessitate a new generation of admission control algorithms to maximize system performance under limited and hi...
Session Types for the Transport Layer: Towards an Implementation of TCPSamuel Cavoj, Ivan Nikitin, Colin Perkins, Ornela Dardha2024-04-08下载Session types are a typing discipline used to formally describe communication-driven applications with the aim of fewer errors and easier debugging later into the life cycle of the software.
Liquid Neural Network-based Adaptive Learning vs. Incremental Learning for Link Load Prediction amid Concept Drift due to Network FailuresOmran Ayoub, Davide Andreoletti, Aleksandra Knapińska, Róża Goścień, Piotr Lechowicz, Tiziano Leidi, Silvia Giordano, Cristina Rottondi, Krzysztof Walkowiak2024-04-08下载Adapting to concept drift is a challenging task in machine learning, which is usually tackled using incremental learning techniques that periodically re-fit a learning model leveraging newly available...
Can Edge Computing fulfill the requirements of automated vehicular services using 5G network ?Wendlasida Ouedraogo, Andrea Araldo, Badii Jouaber, Hind Castel, Remy Grunblatt2024-04-08下载Communication and computation services supporting Connected and Automated Vehicles (CAVs) are characterized by stringent requirements, in terms of response time and reliability.
Towards a Partial Computation offloading in In-networking Computing-Assisted MEC: A Digital Twin ApproachIbrahim Aliyu, Awwal Arigi, Seungmin Oh, Tai-Won Um, Jinsul Kim2024-04-08下载This paper addresses the problem of minimizing latency with partial computation offloading within Industrial Internet-of-Things (IoT) systems in in-network computing (COIN)-assisted Multiaccess Edge C...
Exact Analysis of the Age of Information in the Multi-Source M/GI/1 Queueing SystemYoshiaki Inoue, Tetsuya Takine2024-04-08下载We consider a situation that multiple monitoring applications (each with a different sensor-monitor pair) compete for a common service resource such as a communication link.

cs.PF - Performance

标题作者发布日期PDF摘要
Optimal Allocation of Tasks and Price of Anarchy of Distributed Optimization in Networked Computing FacilitiesVincenzo Mancuso, Paolo Castagno, Leonardo Badia, Matteo Sereno, Marco Ajmone Marsan2024-04-08下载The allocation of computing tasks for networked distributed services poses a question to service providers on whether centralized allocation management be worth its cost.

基于 VitePress 构建