2024-08-06

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
|---|---|---|---|---|
| LLM-Aided Compilation for Tensor Accelerators | Charles Hong, Sahil Bhatia, Altan Haan, Shengjun Kris Dong, Dima Nikiforov, Alvin Cheung, Yakun Sophia Shao | 2024-08-06 | Download | Hardware accelerators, in particular accelerators for tensor processing, have many potential application domains. However, they currently lack the software infrastructure to support the majority of do... |
| HeTraX: Energy Efficient 3D Heterogeneous Manycore Architecture for Transformer Acceleration | Pratyush Dhingra, Janardhan Rao Doppa, Partha Pratim Pande | 2024-08-06 | Download | Transformers have revolutionized deep learning and generative modeling to enable unprecedented advancements in natural language processing tasks and beyond. |
| Potential and Limitation of High-Frequency Cores and Caches | Kunal Pai, Anusheel Nand, Jason Lowe-Power | 2024-08-06 | Download | This paper explores the potential of cryogenic semiconductor computing and superconductor electronics as promising alternatives to traditional semiconductor devices. |
| Static IR Drop Prediction with Attention U-Net and Saliency-Based Explainability | Lizi Zhang, Azadeh Davoodi | 2024-08-06 | Download | There has been significant recent progress to reduce the computational effort of static IR drop analysis using neural networks, and modeling as an image-to-image translation task. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
|---|---|---|---|---|
| FLASH: Federated Learning-Based LLMs for Advanced Query Processing in Social Networks through RAG | Sai Puppala, Ismail Hossain, Md Jahangir Alam, Sajedul Talukder | 2024-08-06 | Download | Our paper introduces a novel approach to social network information retrieval and user engagement through a personalized chatbot system empowered by Federated Learning GPT. |
| Masked Random Noise for Communication Efficient Federated Learning | Shiwei Li, Yingyi Cheng, Haozhao Wang, Xing Tang, Shijie Xu, Weihong Luo, Yuhua Li, Dugang Liu, Xiuqiang He, Ruixuan Li | 2024-08-06 | Download | Federated learning is a promising distributed training paradigm that effectively safeguards data privacy. However, it may involve significant communication costs, which hinders training efficiency. |
| FedBAT: Communication-Efficient Federated Learning via Learnable Binarization | Shiwei Li, Wenchao Xu, Haozhao Wang, Xing Tang, Yining Qi, Shijie Xu, Weihong Luo, Yuhua Li, Xiuqiang He, Ruixuan Li | 2024-08-06 | Download | Federated learning is a promising distributed machine learning paradigm that can effectively exploit large-scale data without exposing users' privacy. |
| DaVE -- A Curated Database of Visualization Examples | Jens Koenen, Marvin Petersen, Christoph Garth, Tim Gerrits | 2024-08-06 | Download | Visualization, from simple line plots to complex high-dimensional visual analysis systems, has established itself throughout numerous domains to explore, analyze, and evaluate data. |
| Large-Scale Graphs Community Detection using Spark GraphFrames | Elena-Simona Apostol, Adrian-Cosmin Cojocaru, Ciprian-Octavian Truică | 2024-08-06 | Download | With the emergence of social networks, online platforms dedicated to different use cases, and sensor networks, the emergence of large-scale graph community detection has become a steady field of resea... |
| The State of FaaS: An analysis of public Functions-as-a-Service providers | Nnamdi Ekwe-Ekwe, Lucas Amos | 2024-08-06 | Download | Serverless computing is a growing and maturing field that is the focus of much research, industry interest and adoption. Previous works exploring Functions-as-a-Service providers have focused primaril... |
| Reinforcement Learning based Workflow Scheduling in Cloud and Edge Computing Environments: A Taxonomy, Review and Future Directions | Amanda Jayanetti, Saman Halgamuge, Rajkumar Buyya | 2024-08-06 | Download | Deep Reinforcement Learning (DRL) techniques have been successfully applied for solving complex decision-making and control tasks in multiple fields including robotics, autonomous driving, healthcare ... |
| A Deep Reinforcement Learning Approach for Cost Optimized Workflow Scheduling in Cloud Computing Environments | Amanda Jayanetti, Saman Halgamuge, Rajkumar Buyya | 2024-08-06 | Download | Cost optimization is a common goal of workflow schedulers operating in cloud computing environments. The use of spot instances is a potential means of achieving this goal, as they are offered by cloud... |
| Enabling High-Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring | Jeremy J. Williams, Daniel Medeiros, Stefan Costea, David Tskhakaya, Franz Poeschel, René Widera, Axel Huebl, Scott Klasky, Norbert Podhorszki, Leon Kos, Ales Podolnik, Jakub Hromadka, Tapish Narwal, Klaus Steiniger, Michael Bussmann, Erwin Laure, Stefano Markidis | 2024-08-06 | Download | Large-scale HPC simulations of plasma dynamics in fusion devices require efficient parallel I/O to avoid slowing down the simulation and to enable the post-processing of critical information. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
|---|---|---|---|---|
| Optimizing NOMA Transmissions to Advance Federated Learning in Vehicular Networks | Ziru Chen, Zhou Ni, Peiyuan Guan, Lu Wang, Lin X. Cai, Morteza Hashemi, Zongzhi Li | 2024-08-06 | Download | Diverse critical data, such as location information and driving patterns, can be collected by IoT devices in vehicular networks to improve driving experiences and road safety. |
| Communication-Aware Consistent Edge Selection for Mobile Users and Autonomous Vehicles | Nazish Tahir, Ramviyas Parasuraman, Haijian Sun | 2024-08-06 | Download | Offloading time-sensitive, computationally intensive tasks-such as advanced learning algorithms for autonomous driving-from vehicles to nearby edge servers, vehicle-to-infrastructure (V2I) systems, or... |
| DRL-Assisted Dynamic QoT-Aware Service Provisioning in Multi-Band Elastic Optical Networks | Yiran Teng, Carlos Natalino, Farhad Arpanaei, Alfonso Sánchez-Macián, Paolo Monti, Shuangyi Yan, Dimitra Simeonidou | 2024-08-06 | Download | We propose a DRL-assisted approach for service provisioning in multi-band elastic optical networks. Our simulation environment uses an accurate QoT estimator based on the GN/EGN model. |
| Congestion or No Congestion: Packet Loss Identification and Prediction Using Machine Learning | Inayat Ali, Seungwoo Hong, Taesik Cheung | 2024-08-06 | Download | Packet losses in the network significantly impact network performance. Most TCP variants reduce the transmission rate when detecting packet losses, assuming network congestion, resulting in lower thro... |
| Towards Smart Microfarming in an Urban Computing Continuum | Marla Grunewald, Mounir Bensalem, Jasenka Dizdarević, Admela Jukan | 2024-08-06 | Download | Microfarming and urban computing have evolved as two distinct sustainability pillars of urban living today. In this paper, we combine these two concepts, while majorly extending them jointly towards n... |
| Rate-Splitting for Joint Unicast and Multicast Transmission in LEO Satellite Networks with Non-Uniform Traffic Demand | Jaehyup Seong, Juha Park, Dong-Hyun Jung, Jeonghun Park, Wonjae Shin | 2024-08-06 | Download | Low Earth orbit (LEO) satellite communications (SATCOM) with ubiquitous global connectivity is deemed a pivotal catalyst in advancing wireless communication systems for 5G and beyond. |

cs.OS - Operating Systems

| Title | Authors | Published | PDF | Abstract |
|---|---|---|---|---|
| Boosting File Systems Elegantly: A Transparent NVM Write-ahead Log for Disk File Systems | Guoyu Wang, Xilong Che, Haoyang Wei, Shuo Chen, Puyi He, Juncheng Hu | 2024-08-06 | Download | We propose NVLog, an NVM-based write-ahead log for disk file systems, designed to transparently harness the high performance of NVM within the legacy storage stack. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
|---|---|---|---|---|
| Enabling High-Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring | Jeremy J. Williams, Daniel Medeiros, Stefan Costea, David Tskhakaya, Franz Poeschel, René Widera, Axel Huebl, Scott Klasky, Norbert Podhorszki, Leon Kos, Ales Podolnik, Jakub Hromadka, Tapish Narwal, Klaus Steiniger, Michael Bussmann, Erwin Laure, Stefano Markidis | 2024-08-06 | Download | Large-scale HPC simulations of plasma dynamics in fusion devices require efficient parallel I/O to avoid slowing down the simulation and to enable the post-processing of critical information. |
