2024-12-04

cs.AR - Architecture

Title	Authors	Published	PDF	Abstract
Designing DNNs for a trade-off between robustness and processing performance in embedded devices	Jon Gutiérrez-Zaballa, Koldo Basterretxea, Javier Echanobe	2024-12-04	Download	Machine learning-based embedded systems employed in safety-critical applications such as aerospace and autonomous driving need to be robust against perturbations produced by soft errors.
BinSparX: Sparsified Binary Neural Networks for Reduced Hardware Non-Idealities in Xbar Arrays	Akul Malhotra, Sumeet Kumar Gupta	2024-12-04	Download	Compute-in-memory (CiM)-based binary neural network (CiM-BNN) accelerators marry the benefits of CiM and ultra-low precision quantization, making them highly suitable for edge computing.
Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective	Jon Gutiérrez-Zaballa, Koldo Basterretxea, Javier Echanobe	2024-12-04	Download	As the deployment of artificial intelligence (AI) algorithms at edge devices becomes increasingly prevalent, enhancing the robustness and reliability of autonomous AI-based perception and decision syst...
BOSS: Blocking algorithm for optimizing shuttling scheduling in Ion Trap	Xian Wu, Chenghong Zhu, Jingbo Wang, Xin Wang	2024-12-04	Download	Ion traps stand at the forefront of quantum hardware technology, presenting unparalleled benefits for quantum computing, such as high-fidelity gates, extensive connectivity, and prolonged coherence ti...
IMPACT:InMemory ComPuting Architecture Based on Y-FlAsh Technology for Coalesced Tsetlin Machine Inference	Omar Ghazal, Wei Wang, Shahar Kvatinsky, Farhad Merchant, Alex Yakovlev, Rishad Shafik	2024-12-04	Download	The increasing demand for processing large volumes of data for machine learning models has pushed data bandwidth requirements beyond the capability of traditional von Neumann architecture.
Online Soft Error Tolerance in ReRAM Crossbars for Deep Learning Accelerators	Benyamin Khezeli, Hamid Reza Zarandi, Elham Cheshmikhani	2024-12-04	Download	Resistive Random-Access Memory (ReRAM) crossbar arrays are promising candidates for in-situ matrix-vector multiplication (MVM), a frequent operation in Deep Learning algorithms.
SPICE-PIDE: A Methodology for Design and Optimization of Integrated Circuits	Jehan Taraporewalla, Arun KP, Sugata Ghosh, Abhishek Agarwal, Bijaydoot Basak, Dipankar Saha	2024-12-04	Download	In application-specific designs, owing to the trade-off between power consumption and speed, optimization of various circuit parameters has become a challenging task.

cs.DC - Distributed, Parallel, and Cluster Computing

Title	Authors	Published	PDF	Abstract
Securing RC Based P2P Networks: A Blockchain-based Access Control Framework utilizing Ethereum Smart Contracts for IoT and Web 3.0	Saurav Ghosh, Reshmi Mitra, Indranil Roy, Bidyut Gupta	2024-12-04	Download	Ensuring security for highly dynamic peer-to-peer (P2P) networks has always been a challenge, especially for services like online transactions and smart devices.
Reactive Orchestration for Hierarchical Federated Learning Under a Communication Cost Budget	Ivan Čilić, Anna Lackinger, Pantelis Frangoudis, Ivana Podnar Žarko, Alireza Furutanpey, Ilir Murturi, Schahram Dustdar	2024-12-04	Download	Deploying a Hierarchical Federated Learning (HFL) pipeline across the computing continuum (CC) requires careful organization of participants into a hierarchical structure with intermediate aggregation...
Seamless Optical Cloud Computing across Edge-Metro Network for Generative AI	Sizhe Xing, Aolong Sun, Chengxi Wang, Yizhi Wang, Boyu Dong, Junhui Hu, Xuyu Deng, An Yan, Yingjun Liu, Fangchen Hu, Zhongya Li, Ouhan Huang, Junhao Zhao, Yingjun Zhou, Ziwei Li, Jianyang Shi, Xi Xiao, Richard Penty, Qixiang Cheng, Nan Chi, Junwen Zhang	2024-12-04	Download	The rapid advancement of generative artificial intelligence (AI) in recent years has profoundly reshaped modern lifestyles, necessitating a revolutionary architecture to support the growing demands fo...
Semi-decentralized Training of Spatio-Temporal Graph Neural Networks for Traffic Prediction	Ivan Kralj, Lodovico Giaretta, Gordan Ježić, Ivana Podnar Žarko, Šarūnas Girdzijauskas	2024-12-04	Download	In smart mobility, large networks of geographically distributed sensors produce vast amounts of high-frequency spatio-temporal data that must be processed in real time to avoid major disruptions.
Resource Slicing through Intelligent Orchestration of Energy-aware IoT services in Edge-Cloud Continuum	Hafiz Faheem Shahid, Erkki Harjula	2024-12-04	Download	The rapid growth of the Internet of Things (IoT) applications inflicts high requirements for computing resources and network bandwidth. A growing number of service providers are applying edge-cloud co...
DiffKV: Differentiated Memory Management for Large Language Models with Parallel KV Compaction	Yanqi Zhang, Yuwei Hu, Runyuan Zhao, John C. S. Lui, Haibo Chen	2024-12-04	Download	Large language models (LLMs) demonstrate remarkable capabilities but face substantial serving costs due to their high memory demands, with the key-value (KV) cache being a primary bottleneck.
Cost-Performance Evaluation of General Compute Instances: AWS, Azure, GCP, and OCI	Jay Tharwani, Arnab A Purkayastha	2024-12-04	Download	Cloud computing has become the cornerstone of modern IT infrastructure, offering a wide range of general-purpose instances optimized for diverse workloads.
Edge System Design Using Containers and Unikernels for IoT Applications	Shahidullah Kaiser, Ali Saman Tosun, Turgay Korkmaz	2024-12-04	Download	Edge computing is emerging as a key enabler of low-latency, high-efficiency processing for the Internet of Things (IoT) and other real-time applications.
Exploring the Viability of Unikernels for ARM-powered Edge Computing	Shahidullah Kaiser, Ali Saman Tosun, Turgay Korkmaz	2024-12-04	Download	The rapid expansion of IoT devices and their real-time applications have driven a growing need for edge computing. To meet this need, efficient and secure solutions are required for running such appli...
Partially Conditioned Patch Parallelism for Accelerated Diffusion Model Inference	XiuYu Zhang, Zening Luo, Michelle E. Lu	2024-12-04	Download	Diffusion models have exhibited exciting capabilities in generating images and are also very promising for video creation. However, the inference speed of diffusion models is limited by the slow sampl...
BGTplanner: Maximizing Training Accuracy for Differentially Private Federated Recommenders via Strategic Privacy Budget Allocation	Xianzhi Zhang, Yipeng Zhou, Miao Hu, Di Wu, Pengshan Liao, Mohsen Guizani, Michael Sheng	2024-12-04	Download	To mitigate the rising concern about privacy leakage, the federated recommender (FR) paradigm emerges, in which decentralized clients co-train the recommendation model without exposing their raw user-...

cs.NI - Networking and Internet Architecture

Title	Authors	Published	PDF	Abstract
Securing RC Based P2P Networks: A Blockchain-based Access Control Framework utilizing Ethereum Smart Contracts for IoT and Web 3.0	Saurav Ghosh, Reshmi Mitra, Indranil Roy, Bidyut Gupta	2024-12-04	Download	Ensuring security for highly dynamic peer-to-peer (P2P) networks has always been a challenge, especially for services like online transactions and smart devices.
JPPO++: Joint Power and Denoising-inspired Prompt Optimization for Mobile LLM Services	Feiran You, Hongyang Du, Kaibin Huang, Abbas Jamalipour	2024-12-04	Download	Large Language Models (LLMs) are increasingly integrated into mobile services over wireless networks to support complex user requests. This trend has led to longer prompts, which improve LLMs' perform...
Reactive Orchestration for Hierarchical Federated Learning Under a Communication Cost Budget	Ivan Čilić, Anna Lackinger, Pantelis Frangoudis, Ivana Podnar Žarko, Alireza Furutanpey, Ilir Murturi, Schahram Dustdar	2024-12-04	Download	Deploying a Hierarchical Federated Learning (HFL) pipeline across the computing continuum (CC) requires careful organization of participants into a hierarchical structure with intermediate aggregation...
Coordinated Multi-Armed Bandits for Improved Spatial Reuse in Wi-Fi	Francesc Wilhelmi, Boris Bellalta, Szymon Szott, Katarzyna Kosek-Szott, Sergio Barrachina-Muñoz	2024-12-04	Download	Multi-Access Point Coordination (MAPC) and Artificial Intelligence and Machine Learning (AI/ML) are expected to be key features in future Wi-Fi, such as the forthcoming IEEE 802.

cs.PF - Performance

Title	Authors	Published	PDF	Abstract
ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression	Guangda Liu, Chengwei Li, Jieru Zhao, Chenqi Zhang, Minyi Guo	2024-12-04	Download	Large Language Models (LLMs) have been widely deployed in a variety of applications, and the context length is rapidly increasing to handle tasks such as long-document QA and complex logical reasoning...
Cost-Performance Evaluation of General Compute Instances: AWS, Azure, GCP, and OCI	Jay Tharwani, Arnab A Purkayastha	2024-12-04	Download	Cloud computing has become the cornerstone of modern IT infrastructure, offering a wide range of general-purpose instances optimized for diverse workloads.