2024-08-06

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
LLM-Aided Compilation for Tensor Accelerators	Charles Hong, Sahil Bhatia, Altan Haan, Shengjun Kris Dong, Dima Nikiforov, Alvin Cheung, Yakun Sophia Shao	2024-08-06	下载	Hardware accelerators, in particular accelerators for tensor processing, have many potential application domains. However, they currently lack the software infrastructure to support the majority of do...
HeTraX: Energy Efficient 3D Heterogeneous Manycore Architecture for Transformer Acceleration	Pratyush Dhingra, Janardhan Rao Doppa, Partha Pratim Pande	2024-08-06	下载	Transformers have revolutionized deep learning and generative modeling to enable unprecedented advancements in natural language processing tasks and beyond.
Potential and Limitation of High-Frequency Cores and Caches	Kunal Pai, Anusheel Nand, Jason Lowe-Power	2024-08-06	下载	This paper explores the potential of cryogenic semiconductor computing and superconductor electronics as promising alternatives to traditional semiconductor devices.
Static IR Drop Prediction with Attention U-Net and Saliency-Based Explainability	Lizi Zhang, Azadeh Davoodi	2024-08-06	下载	There has been significant recent progress to reduce the computational effort of static IR drop analysis using neural networks, and modeling as an image-to-image translation task.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
FLASH: Federated Learning-Based LLMs for Advanced Query Processing in Social Networks through RAG	Sai Puppala, Ismail Hossain, Md Jahangir Alam, Sajedul Talukder	2024-08-06	下载	Our paper introduces a novel approach to social network information retrieval and user engagement through a personalized chatbot system empowered by Federated Learning GPT.
Masked Random Noise for Communication Efficient Federated Learning	Shiwei Li, Yingyi Cheng, Haozhao Wang, Xing Tang, Shijie Xu, Weihong Luo, Yuhua Li, Dugang Liu, Xiuqiang He, Ruixuan Li	2024-08-06	下载	Federated learning is a promising distributed training paradigm that effectively safeguards data privacy. However, it may involve significant communication costs, which hinders training efficiency.
FedBAT: Communication-Efficient Federated Learning via Learnable Binarization	Shiwei Li, Wenchao Xu, Haozhao Wang, Xing Tang, Yining Qi, Shijie Xu, Weihong Luo, Yuhua Li, Xiuqiang He, Ruixuan Li	2024-08-06	下载	Federated learning is a promising distributed machine learning paradigm that can effectively exploit large-scale data without exposing users' privacy.
DaVE -- A Curated Database of Visualization Examples	Jens Koenen, Marvin Petersen, Christoph Garth, Tim Gerrits	2024-08-06	下载	Visualization, from simple line plots to complex high-dimensional visual analysis systems, has established itself throughout numerous domains to explore, analyze, and evaluate data.
Large-Scale Graphs Community Detection using Spark GraphFrames	Elena-Simona Apostol, Adrian-Cosmin Cojocaru, Ciprian-Octavian Truică	2024-08-06	下载	With the emergence of social networks, online platforms dedicated to different use cases, and sensor networks, the emergence of large-scale graph community detection has become a steady field of resea...
The State of FaaS: An analysis of public Functions-as-a-Service providers	Nnamdi Ekwe-Ekwe, Lucas Amos	2024-08-06	下载	Serverless computing is a growing and maturing field that is the focus of much research, industry interest and adoption. Previous works exploring Functions-as-a-Service providers have focused primaril...
Reinforcement Learning based Workflow Scheduling in Cloud and Edge Computing Environments: A Taxonomy, Review and Future Directions	Amanda Jayanetti, Saman Halgamuge, Rajkumar Buyya	2024-08-06	下载	Deep Reinforcement Learning (DRL) techniques have been successfully applied for solving complex decision-making and control tasks in multiple fields including robotics, autonomous driving, healthcare ...
A Deep Reinforcement Learning Approach for Cost Optimized Workflow Scheduling in Cloud Computing Environments	Amanda Jayanetti, Saman Halgamuge, Rajkumar Buyya	2024-08-06	下载	Cost optimization is a common goal of workflow schedulers operating in cloud computing environments. The use of spot instances is a potential means of achieving this goal, as they are offered by cloud...
Enabling High-Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring	Jeremy J. Williams, Daniel Medeiros, Stefan Costea, David Tskhakaya, Franz Poeschel, René Widera, Axel Huebl, Scott Klasky, Norbert Podhorszki, Leon Kos, Ales Podolnik, Jakub Hromadka, Tapish Narwal, Klaus Steiniger, Michael Bussmann, Erwin Laure, Stefano Markidis	2024-08-06	下载	Large-scale HPC simulations of plasma dynamics in fusion devices require efficient parallel I/O to avoid slowing down the simulation and to enable the post-processing of critical information.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
Optimizing NOMA Transmissions to Advance Federated Learning in Vehicular Networks	Ziru Chen, Zhou Ni, Peiyuan Guan, Lu Wang, Lin X. Cai, Morteza Hashemi, Zongzhi Li	2024-08-06	下载	Diverse critical data, such as location information and driving patterns, can be collected by IoT devices in vehicular networks to improve driving experiences and road safety.
Communication-Aware Consistent Edge Selection for Mobile Users and Autonomous Vehicles	Nazish Tahir, Ramviyas Parasuraman, Haijian Sun	2024-08-06	下载	Offloading time-sensitive, computationally intensive tasks-such as advanced learning algorithms for autonomous driving-from vehicles to nearby edge servers, vehicle-to-infrastructure (V2I) systems, or...
DRL-Assisted Dynamic QoT-Aware Service Provisioning in Multi-Band Elastic Optical Networks	Yiran Teng, Carlos Natalino, Farhad Arpanaei, Alfonso Sánchez-Macián, Paolo Monti, Shuangyi Yan, Dimitra Simeonidou	2024-08-06	下载	We propose a DRL-assisted approach for service provisioning in multi-band elastic optical networks. Our simulation environment uses an accurate QoT estimator based on the GN/EGN model.
Congestion or No Congestion: Packet Loss Identification and Prediction Using Machine Learning	Inayat Ali, Seungwoo Hong, Taesik Cheung	2024-08-06	下载	Packet losses in the network significantly impact network performance. Most TCP variants reduce the transmission rate when detecting packet losses, assuming network congestion, resulting in lower thro...
Towards Smart Microfarming in an Urban Computing Continuum	Marla Grunewald, Mounir Bensalem, Jasenka Dizdarević, Admela Jukan	2024-08-06	下载	Microfarming and urban computing have evolved as two distinct sustainability pillars of urban living today. In this paper, we combine these two concepts, while majorly extending them jointly towards n...
Rate-Splitting for Joint Unicast and Multicast Transmission in LEO Satellite Networks with Non-Uniform Traffic Demand	Jaehyup Seong, Juha Park, Dong-Hyun Jung, Jeonghun Park, Wonjae Shin	2024-08-06	下载	Low Earth orbit (LEO) satellite communications (SATCOM) with ubiquitous global connectivity is deemed a pivotal catalyst in advancing wireless communication systems for 5G and beyond.

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
Boosting File Systems Elegantly: A Transparent NVM Write-ahead Log for Disk File Systems	Guoyu Wang, Xilong Che, Haoyang Wei, Shuo Chen, Puyi He, Juncheng Hu	2024-08-06	下载	We propose NVLog, an NVM-based write-ahead log for disk file systems, designed to transparently harness the high performance of NVM within the legacy storage stack.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Enabling High-Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring	Jeremy J. Williams, Daniel Medeiros, Stefan Costea, David Tskhakaya, Franz Poeschel, René Widera, Axel Huebl, Scott Klasky, Norbert Podhorszki, Leon Kos, Ales Podolnik, Jakub Hromadka, Tapish Narwal, Klaus Steiniger, Michael Bussmann, Erwin Laure, Stefano Markidis	2024-08-06	下载	Large-scale HPC simulations of plasma dynamics in fusion devices require efficient parallel I/O to avoid slowing down the simulation and to enable the post-processing of critical information.