Skip to content

2024-07-31

cs.AR - Architecture

标题作者发布日期PDF摘要
Towards Error Correction for Computing in Racetrack MemoryPreston Brazzle, Benjamin F. Morris, Evan McKinney, Peipei Zhou, Jingtong Hu, Asif Ali Khan, Alex K. Jones2024-07-31下载Computing-in-memory (CIM) promises to alleviate the Von Neumann bottleneck and accelerate data-intensive applications. Depending on the underlying technology and configuration, CIM enables implementin...
Machine Learning In-Sensors: Computation-enabled Intelligent Sensors For Next Generation of IoTAndrea Ronco, Lukas Schulthess, David Zehnder, Michele Magno2024-07-31下载Smart sensors are an emerging technology that allows combining the data acquisition with the elaboration directly on the Edge device, very close to the sensors.
Blink: Fast Automated Design of Run-Time Power Monitors on FPGA-Based Computing PlatformsAndrea Galimberti, Michele Piccoli, Davide Zoni2024-07-31下载The current over-provisioned heterogeneous multi-cores require effective run-time optimization strategies, and the run-time power monitoring subsystem is paramount for their success.
EdgeLLM: A Highly Efficient CPU-FPGA Heterogeneous Edge Accelerator for Large Language ModelsMingqiang Huang, Ao Shen, Kai Li, Haoxiang Peng, Boyu Li, Yupeng Su, Hao Yu2024-07-31下载The rapid advancements in artificial intelligence (AI), particularly the Large Language Models (LLMs), have profoundly affected our daily work and communication forms.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Algorithms for Collaborative Machine Learning under Statistical HeterogeneitySeok-Ju Hahn2024-07-31下载Learning from distributed data without accessing them is undoubtedly a challenging and non-trivial task. Nevertheless, the necessity for distributed training of a statistical model has been increasing...
Ponder: Online Prediction of Task Memory Requirements for Scientific WorkflowsFabian Lehmann, Jonathan Bader, Ninon De Mecquenem, Xing Wang, Vasilis Bountris, Florian Friederici, Ulf Leser, Lauritz Thamsen2024-07-31下载Scientific workflows are used to analyze large amounts of data. These workflows comprise numerous tasks, many of which are executed repeatedly, running the same custom program on different inputs.
FTuner: A Fast Dynamic Shape Tensors Program Auto-Tuner for Deep Learning CompilersPengyu Mu, Linquan Wei, Yi Liu, Rui Wang2024-07-31下载Many artificial intelligence models process input data of different lengths and resolutions, making the shape of the tensors dynamic. The performance of these models depends on the shape of the tensor...
DDU-Net: A Domain Decomposition-Based CNN for High-Resolution Image Segmentation on Multiple GPUsCorné Verburg, Alexander Heinlein, Eric C. Cyr2024-07-31下载The segmentation of ultra-high resolution images poses challenges such as loss of spatial information or computational inefficiency. In this work, a novel approach that combines encoder-decoder archit...
AQUA: Network-Accelerated Memory Offloading for LLMs in Scale-Up GPU DomainsAbhishek Vijaya Kumar, Gianni Antichi, Rachee Singh2024-07-31下载Inference on large-language models (LLMs) is constrained by GPU memory capacity. A sudden increase in the number of inference requests to a cloud-hosted LLM can deplete GPU memory, leading to contenti...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Discovery of 6G Services and Resources in Edge-Cloud-ContinuumMohammad Farhoudi, Masoud Shokrnezhad, Tarik Taleb, Richard Li, JaeSeung Song2024-07-31下载The advent of 6G networks will present a pivotal juncture in the evolution of telecommunications, marked by the proliferation of devices, dynamic service requests, and the integration of edge and clou...
Post-Quantum Cryptography (PQC) Network Instrument: Measuring PQC Adoption Rates and Identifying Migration PathwaysJakub Sowa, Bach Hoang, Advaith Yeluru, Steven Qie, Anita Nikolich, Ravishankar Iyer, Phuong Cao2024-07-31下载The problem of adopting quantum-resistant cryptographic network protocols or post-quantum cryptography (PQC) is critically important to democratizing quantum computing.
REPS: Recycled Entropy Packet Spraying for Adaptive Load Balancing and Failure MitigationTommaso Bonato, Abdul Kabbani, Ahmad Ghalayini, Michael Papamichael, Mohammad Dohadwala, Lukas Gianinazzi, Mikhail Khalilov, Elias Achermann, Daniele De Sensi, Torsten Hoefler2024-07-31下载Next-generation datacenters require highly efficient network load balancing to manage the growing scale of artificial intelligence (AI) training and general datacenter traffic.
A New Horizon of Data Communication through Quantum EntanglementS. M. Rashadul Islam, Md Manirul Islam, Umme Salsabil2024-07-31下载By the blessing of our existing data communication system, we can communicate or share our information with each other in every nook and corner of the world within some few seconds but there are some ...
Kuramoto oscillators in random networksAgostino Funel2024-07-31下载By means of numerical analysis conducted with the aid of the computer, the collective synchronization of coupled phase oscillators in the Kuramoto model in the connected regime of random networks of v...
Semantic Enabled 6G LEO Satellite Communication for Earth Observation: A Resource-Constrained Network OptimizationSheikh Salman Hassan, Loc X. Nguyen, Yan Kyaw Tun, Zhu Han, Choong Seon Hong2024-07-31下载Earth observation satellites generate large amounts of real-time data for monitoring and managing time-critical events such as disaster relief missions.
Priority and Stackelberg Game-Based Incentive Task Allocation for Device-Assisted MEC NetworksYang Li, Xing Zhang, Bo Lei, Zheyan Qu, Wenbo Wang2024-07-31下载Mobile edge computing (MEC) is a promising computing paradigm that offers users proximity and instant computing services for various applications, and it has become an essential component of the Inter...
Pushing the Limits of In-Network Caching for Key-Value StoresGyuyeong Kim2024-07-31下载We present OrbitCache, a new in-network caching architecture that can cache variable-length items to balance a wide range of key-value workloads.

cs.PF - Performance

标题作者发布日期PDF摘要
Accelerating Transfer Function Update for Distance Map based Volume RenderingMichael Rauter, Lukas Zimmermann, Markus Zeilinger2024-07-31下载Direct volume rendering using ray-casting is widely used in practice. By using GPUs and applying acceleration techniques as empty space skipping, high frame rates are possible on modern hardware.

基于 VitePress 构建