2024-12-04
cs.AR - Hardware Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Designing DNNs for a trade-off between robustness and processing performance in embedded devices | Jon Gutiérrez-Zaballa, Koldo Basterretxea, Javier Echanobe | 2024-12-04 | Download | Machine learning-based embedded systems employed in safety-critical applications such as aerospace and autonomous driving need to be robust against perturbations produced by soft errors. |
| BinSparX: Sparsified Binary Neural Networks for Reduced Hardware Non-Idealities in Xbar Arrays | Akul Malhotra, Sumeet Kumar Gupta | 2024-12-04 | Download | Compute-in-memory (CiM)-based binary neural network (CiM-BNN) accelerators marry the benefits of CiM and ultra-low precision quantization, making them highly suitable for edge computing. |
| Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective | Jon Gutiérrez-Zaballa, Koldo Basterretxea, Javier Echanobe | 2024-12-04 | Download | As the deployment of artificial intelligence (AI) algorithms at edge devices becomes increasingly prevalent, enhancing the robustness and reliability of autonomous AI-based perception and decision syst... |
| BOSS: Blocking algorithm for optimizing shuttling scheduling in Ion Trap | Xian Wu, Chenghong Zhu, Jingbo Wang, Xin Wang | 2024-12-04 | Download | Ion traps stand at the forefront of quantum hardware technology, presenting unparalleled benefits for quantum computing, such as high-fidelity gates, extensive connectivity, and prolonged coherence ti... |
| IMPACT:InMemory ComPuting Architecture Based on Y-FlAsh Technology for Coalesced Tsetlin Machine Inference | Omar Ghazal, Wei Wang, Shahar Kvatinsky, Farhad Merchant, Alex Yakovlev, Rishad Shafik | 2024-12-04 | Download | The increasing demand for processing large volumes of data for machine learning models has pushed data bandwidth requirements beyond the capability of traditional von Neumann architecture. |
| Online Soft Error Tolerance in ReRAM Crossbars for Deep Learning Accelerators | Benyamin Khezeli, Hamid Reza Zarandi, Elham Cheshmikhani | 2024-12-04 | Download | Resistive Random-Access Memory (ReRAM) crossbar arrays are promising candidates for in-situ matrix-vector multiplication (MVM), a frequent operation in Deep Learning algorithms. |
| SPICE-PIDE: A Methodology for Design and Optimization of Integrated Circuits | Jehan Taraporewalla, Arun KP, Sugata Ghosh, Abhishek Agarwal, Bijaydoot Basak, Dipankar Saha | 2024-12-04 | Download | In application-specific designs, owing to the trade-off between power consumption and speed, optimization of various circuit parameters has become a challenging task. |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Securing RC Based P2P Networks: A Blockchain-based Access Control Framework utilizing Ethereum Smart Contracts for IoT and Web 3.0 | Saurav Ghosh, Reshmi Mitra, Indranil Roy, Bidyut Gupta | 2024-12-04 | Download | Ensuring security for highly dynamic peer-to-peer (P2P) networks has always been a challenge, especially for services like online transactions and smart devices. |
| Reactive Orchestration for Hierarchical Federated Learning Under a Communication Cost Budget | Ivan Čilić, Anna Lackinger, Pantelis Frangoudis, Ivana Podnar Žarko, Alireza Furutanpey, Ilir Murturi, Schahram Dustdar | 2024-12-04 | Download | Deploying a Hierarchical Federated Learning (HFL) pipeline across the computing continuum (CC) requires careful organization of participants into a hierarchical structure with intermediate aggregation... |
| Seamless Optical Cloud Computing across Edge-Metro Network for Generative AI | Sizhe Xing, Aolong Sun, Chengxi Wang, Yizhi Wang, Boyu Dong, Junhui Hu, Xuyu Deng, An Yan, Yingjun Liu, Fangchen Hu, Zhongya Li, Ouhan Huang, Junhao Zhao, Yingjun Zhou, Ziwei Li, Jianyang Shi, Xi Xiao, Richard Penty, Qixiang Cheng, Nan Chi, Junwen Zhang | 2024-12-04 | Download | The rapid advancement of generative artificial intelligence (AI) in recent years has profoundly reshaped modern lifestyles, necessitating a revolutionary architecture to support the growing demands fo... |
| Semi-decentralized Training of Spatio-Temporal Graph Neural Networks for Traffic Prediction | Ivan Kralj, Lodovico Giaretta, Gordan Ježić, Ivana Podnar Žarko, Šarūnas Girdzijauskas | 2024-12-04 | Download | In smart mobility, large networks of geographically distributed sensors produce vast amounts of high-frequency spatio-temporal data that must be processed in real time to avoid major disruptions. |
| Resource Slicing through Intelligent Orchestration of Energy-aware IoT services in Edge-Cloud Continuum | Hafiz Faheem Shahid, Erkki Harjula | 2024-12-04 | Download | The rapid growth of the Internet of Things (IoT) applications inflicts high requirements for computing resources and network bandwidth. A growing number of service providers are applying edge-cloud co... |
| DiffKV: Differentiated Memory Management for Large Language Models with Parallel KV Compaction | Yanqi Zhang, Yuwei Hu, Runyuan Zhao, John C. S. Lui, Haibo Chen | 2024-12-04 | Download | Large language models (LLMs) demonstrate remarkable capabilities but face substantial serving costs due to their high memory demands, with the key-value (KV) cache being a primary bottleneck. |
| Cost-Performance Evaluation of General Compute Instances: AWS, Azure, GCP, and OCI | Jay Tharwani, Arnab A Purkayastha | 2024-12-04 | Download | Cloud computing has become the cornerstone of modern IT infrastructure, offering a wide range of general-purpose instances optimized for diverse workloads. |
| Edge System Design Using Containers and Unikernels for IoT Applications | Shahidullah Kaiser, Ali Saman Tosun, Turgay Korkmaz | 2024-12-04 | Download | Edge computing is emerging as a key enabler of low-latency, high-efficiency processing for the Internet of Things (IoT) and other real-time applications. |
| Exploring the Viability of Unikernels for ARM-powered Edge Computing | Shahidullah Kaiser, Ali Saman Tosun, Turgay Korkmaz | 2024-12-04 | Download | The rapid expansion of IoT devices and their real-time applications have driven a growing need for edge computing. To meet this need, efficient and secure solutions are required for running such appli... |
| Partially Conditioned Patch Parallelism for Accelerated Diffusion Model Inference | XiuYu Zhang, Zening Luo, Michelle E. Lu | 2024-12-04 | Download | Diffusion models have exhibited exciting capabilities in generating images and are also very promising for video creation. However, the inference speed of diffusion models is limited by the slow sampl... |
| BGTplanner: Maximizing Training Accuracy for Differentially Private Federated Recommenders via Strategic Privacy Budget Allocation | Xianzhi Zhang, Yipeng Zhou, Miao Hu, Di Wu, Pengshan Liao, Mohsen Guizani, Michael Sheng | 2024-12-04 | Download | To mitigate the rising concern about privacy leakage, the federated recommender (FR) paradigm emerges, in which decentralized clients co-train the recommendation model without exposing their raw user-... |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Securing RC Based P2P Networks: A Blockchain-based Access Control Framework utilizing Ethereum Smart Contracts for IoT and Web 3.0 | Saurav Ghosh, Reshmi Mitra, Indranil Roy, Bidyut Gupta | 2024-12-04 | Download | Ensuring security for highly dynamic peer-to-peer (P2P) networks has always been a challenge, especially for services like online transactions and smart devices. |
| JPPO++: Joint Power and Denoising-inspired Prompt Optimization for Mobile LLM Services | Feiran You, Hongyang Du, Kaibin Huang, Abbas Jamalipour | 2024-12-04 | Download | Large Language Models (LLMs) are increasingly integrated into mobile services over wireless networks to support complex user requests. This trend has led to longer prompts, which improve LLMs' perform... |
| Reactive Orchestration for Hierarchical Federated Learning Under a Communication Cost Budget | Ivan Čilić, Anna Lackinger, Pantelis Frangoudis, Ivana Podnar Žarko, Alireza Furutanpey, Ilir Murturi, Schahram Dustdar | 2024-12-04 | Download | Deploying a Hierarchical Federated Learning (HFL) pipeline across the computing continuum (CC) requires careful organization of participants into a hierarchical structure with intermediate aggregation... |
| Coordinated Multi-Armed Bandits for Improved Spatial Reuse in Wi-Fi | Francesc Wilhelmi, Boris Bellalta, Szymon Szott, Katarzyna Kosek-Szott, Sergio Barrachina-Muñoz | 2024-12-04 | Download | Multi-Access Point Coordination (MAPC) and Artificial Intelligence and Machine Learning (AI/ML) are expected to be key features in future Wi-Fi, such as the forthcoming IEEE 802. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression | Guangda Liu, Chengwei Li, Jieru Zhao, Chenqi Zhang, Minyi Guo | 2024-12-04 | Download | Large Language Models (LLMs) have been widely deployed in a variety of applications, and the context length is rapidly increasing to handle tasks such as long-document QA and complex logical reasoning... |
| Cost-Performance Evaluation of General Compute Instances: AWS, Azure, GCP, and OCI | Jay Tharwani, Arnab A Purkayastha | 2024-12-04 | Download | Cloud computing has become the cornerstone of modern IT infrastructure, offering a wide range of general-purpose instances optimized for diverse workloads. |