Skip to content

2024-12-04

cs.AR - Architecture

标题作者发布日期PDF摘要
Designing DNNs for a trade-off between robustness and processing performance in embedded devicesJon Gutiérrez-Zaballa, Koldo Basterretxea, Javier Echanobe2024-12-04下载Machine learning-based embedded systems employed in safety-critical applications such as aerospace and autonomous driving need to be robust against perturbations produced by soft errors.
BinSparX: Sparsified Binary Neural Networks for Reduced Hardware Non-Idealities in Xbar ArraysAkul Malhotra, Sumeet Kumar Gupta2024-12-04下载Compute-in-memory (CiM)-based binary neural network (CiM-BNN) accelerators marry the benefits of CiM and ultra-low precision quantization, making them highly suitable for edge computing.
Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspectiveJon Gutiérrez-Zaballa, Koldo Basterretxea, Javier Echanobe2024-12-04下载As the deployment of artifical intelligence (AI) algorithms at edge devices becomes increasingly prevalent, enhancing the robustness and reliability of autonomous AI-based perception and decision syst...
BOSS: Blocking algorithm for optimizing shuttling scheduling in Ion TrapXian Wu, Chenghong Zhu, Jingbo Wang, Xin Wang2024-12-04下载Ion traps stand at the forefront of quantum hardware technology, presenting unparalleled benefits for quantum computing, such as high-fidelity gates, extensive connectivity, and prolonged coherence ti...
IMPACT:InMemory ComPuting Architecture Based on Y-FlAsh Technology for Coalesced Tsetlin Machine InferenceOmar Ghazal, Wei Wang, Shahar Kvatinsky, Farhad Merchant, Alex Yakovlev, Rishad Shafik2024-12-04下载The increasing demand for processing large volumes of data for machine learning models has pushed data bandwidth requirements beyond the capability of traditional von Neumann architecture.
Online Soft Error Tolerance in ReRAM Crossbars for Deep Learning AcceleratorsBenyamin Khezeli, Hamid Reza Zarandi, Elham Cheshmikhani2024-12-04下载Resistive Random-Access Memory (ReRAM) crossbar arrays are promising candidates for in-situ matrix-vector multiplication (MVM), a frequent operation in Deep Learning algorithms.
SPICE-PIDE: A Methodology for Design and Optimization of Integrated CircuitsJehan Taraporewalla, Arun KP, Sugata Ghosh, Abhishek Agarwal, Bijaydoot Basak, Dipankar Saha2024-12-04下载In application-specific designs, owing to the trade-off between power consumption and speed, optimization of various circuit parameters has become a challenging task.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Securing RC Based P2P Networks: A Blockchain-based Access Control Framework utilizing Ethereum Smart Contracts for IoT and Web 3.0Saurav Ghosh, Reshmi Mitra, Indranil Roy, Bidyut Gupta2024-12-04下载Ensuring security for highly dynamic peer-to-peer (P2P) networks has always been a challenge, especially for services like online transactions and smart devices.
Reactive Orchestration for Hierarchical Federated Learning Under a Communication Cost BudgetIvan Čilić, Anna Lackinger, Pantelis Frangoudis, Ivana Podnar Žarko, Alireza Furutanpey, Ilir Murturi, Schahram Dustdar2024-12-04下载Deploying a Hierarchical Federated Learning (HFL) pipeline across the computing continuum (CC) requires careful organization of participants into a hierarchical structure with intermediate aggregation...
Seamless Optical Cloud Computing across Edge-Metro Network for Generative AISizhe Xing, Aolong Sun, Chengxi Wang, Yizhi Wang, Boyu Dong, Junhui Hu, Xuyu Deng, An Yan, Yingjun Liu, Fangchen Hu, Zhongya Li, Ouhan Huang, Junhao Zhao, Yingjun Zhou, Ziwei Li, Jianyang Shi, Xi Xiao, Richard Penty, Qixiang Cheng, Nan Chi, Junwen Zhang2024-12-04下载The rapid advancement of generative artificial intelligence (AI) in recent years has profoundly reshaped modern lifestyles, necessitating a revolutionary architecture to support the growing demands fo...
Semi-decentralized Training of Spatio-Temporal Graph Neural Networks for Traffic PredictionIvan Kralj, Lodovico Giaretta, Gordan Ježić, Ivana Podnar Žarko, Šarūnas Girdzijauskas2024-12-04下载In smart mobility, large networks of geographically distributed sensors produce vast amounts of high-frequency spatio-temporal data that must be processed in real time to avoid major disruptions.
Resource Slicing through Intelligent Orchestration of Energy-aware IoT services in Edge-Cloud ContinuumHafiz Faheem Shahid, Erkki Harjula2024-12-04下载The rapid growth of the Internet of Things (IoT) applications inflicts high requirements for computing resources and network bandwidth. A growing number of service providers are applying edge-cloud co...
DiffKV: Differentiated Memory Management for Large Language Models with Parallel KV CompactionYanqi Zhang, Yuwei Hu, Runyuan Zhao, John C. S. Lui, Haibo Chen2024-12-04下载Large language models (LLMs) demonstrate remarkable capabilities but face substantial serving costs due to their high memory demands, with the key-value (KV) cache being a primary bottleneck.
Cost-Performance Evaluation of General Compute Instances: AWS, Azure, GCP, and OCIJay Tharwani, Arnab A Purkayastha2024-12-04下载Cloud computing has become the cornerstone of modern IT infrastructure, offering a wide range of general-purpose instances optimized for diverse workloads.
Edge System Design Using Containers and Unikernels for IoT ApplicationsShahidullah Kaiser, Ali Saman Tosun, Turgay Korkmaz2024-12-04下载Edge computing is emerging as a key enabler of low-latency, high-efficiency processing for the Internet of Things (IoT) and other real-time applications.
Exploring the Viability of Unikernels for ARM-powered Edge ComputingShahidullah Kaiser, Ali Saman Tosun, Turgay Korkmaz2024-12-04下载The rapid expansion of IoT devices and their real-time applications have driven a growing need for edge computing. To meet this need, efficient and secure solutions are required for running such appli...
Partially Conditioned Patch Parallelism for Accelerated Diffusion Model InferenceXiuYu Zhang, Zening Luo, Michelle E. Lu2024-12-04下载Diffusion models have exhibited exciting capabilities in generating images and are also very promising for video creation. However, the inference speed of diffusion models is limited by the slow sampl...
BGTplanner: Maximizing Training Accuracy for Differentially Private Federated Recommenders via Strategic Privacy Budget AllocationXianzhi Zhang, Yipeng Zhou, Miao Hu, Di Wu, Pengshan Liao, Mohsen Guizani, Michael Sheng2024-12-04下载To mitigate the rising concern about privacy leakage, the federated recommender (FR) paradigm emerges, in which decentralized clients co-train the recommendation model without exposing their raw user-...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Securing RC Based P2P Networks: A Blockchain-based Access Control Framework utilizing Ethereum Smart Contracts for IoT and Web 3.0Saurav Ghosh, Reshmi Mitra, Indranil Roy, Bidyut Gupta2024-12-04下载Ensuring security for highly dynamic peer-to-peer (P2P) networks has always been a challenge, especially for services like online transactions and smart devices.
JPPO++: Joint Power and Denoising-inspired Prompt Optimization for Mobile LLM ServicesFeiran You, Hongyang Du, Kaibin Huang, Abbas Jamalipour2024-12-04下载Large Language Models (LLMs) are increasingly integrated into mobile services over wireless networks to support complex user requests. This trend has led to longer prompts, which improve LLMs' perform...
Reactive Orchestration for Hierarchical Federated Learning Under a Communication Cost BudgetIvan Čilić, Anna Lackinger, Pantelis Frangoudis, Ivana Podnar Žarko, Alireza Furutanpey, Ilir Murturi, Schahram Dustdar2024-12-04下载Deploying a Hierarchical Federated Learning (HFL) pipeline across the computing continuum (CC) requires careful organization of participants into a hierarchical structure with intermediate aggregation...
Coordinated Multi-Armed Bandits for Improved Spatial Reuse in Wi-FiFrancesc Wilhelmi, Boris Bellalta, Szymon Szott, Katarzyna Kosek-Szott, Sergio Barrachina-Muñoz2024-12-04下载Multi-Access Point Coordination (MAPC) and Artificial Intelligence and Machine Learning (AI/ML) are expected to be key features in future Wi-Fi, such as the forthcoming IEEE 802.

cs.PF - Performance

标题作者发布日期PDF摘要
ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable CompressionGuangda Liu, Chengwei Li, Jieru Zhao, Chenqi Zhang, Minyi Guo2024-12-04下载Large Language Models (LLMs) have been widely deployed in a variety of applications, and the context length is rapidly increasing to handle tasks such as long-document QA and complex logical reasoning...
Cost-Performance Evaluation of General Compute Instances: AWS, Azure, GCP, and OCIJay Tharwani, Arnab A Purkayastha2024-12-04下载Cloud computing has become the cornerstone of modern IT infrastructure, offering a wide range of general-purpose instances optimized for diverse workloads.

基于 VitePress 构建