Skip to content

2024-08-26

cs.AR - Architecture

标题作者发布日期PDF摘要
Sparsity-Aware Hardware-Software Co-Design of Spiking Neural Networks: An OverviewIlkin Aliyev, Kama Svoboda, Tosiron Adegbija, Jean-Marc Fellous2024-08-26下载Spiking Neural Networks (SNNs) are inspired by the sparse and event-driven nature of biological neural processing, and offer the potential for ultra-low-power artificial intelligence.
Synergistic and Efficient Edge-Host Communication for Energy Harvesting Wireless Sensor NetworksCyan Subhra Mishra, Jack Sampson, Mahmut Taylan Kandmeir, Vijaykrishnan Narayanan, Chita R Das2024-08-26下载There is an increasing demand for intelligent processing on ultra-low-power internet of things (IoT) device. Recent works have shown substantial efficiency boosts by executing inferences directly on t...
Exploring GPU-to-GPU Communication: Insights into Supercomputer InterconnectsDaniele De Sensi, Lorenzo Pichetti, Flavio Vella, Tiziano De Matteis, Zebin Ren, Luigi Fusco, Matteo Turisini, Daniele Cesarini, Kurt Lust, Animesh Trivedi, Duncan Roweth, Filippo Spiga, Salvatore Di Girolamo, Torsten Hoefler2024-08-26下载Multi-GPU nodes are increasingly common in the rapidly evolving landscape of exascale supercomputers. On these systems, GPUs on the same node are connected through dedicated networks, with bandwidths ...
HAPM -- Hardware Aware Pruning Method for CNN hardware accelerators in resource constrained devicesFederico Nicolas Peccia, Luciano Ferreyro, Alejandro Furfaro2024-08-26下载During the last years, algorithms known as Convolutional Neural Networks (CNNs) had become increasingly popular, expanding its application range to several areas.
Towards Battery-Free Wireless Sensing via Radio-Frequency Energy HarvestingTao Ni, Zehua Sun, Mingda Han, Guohao Lan, Yaxiong Xie, Zhenjiang Li, Tao Gu, Weitao Xu2024-08-26下载Diverse Wi-Fi-based wireless applications have been proposed, ranging from daily activity recognition to vital sign monitoring. Despite their remarkable sensing accuracy, the high energy consumption a...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Scalable, reproducible, and cost-effective processing of large-scale medical imaging datasetsMichael E. Kim, Karthik Ramadass, Chenyu Gao, Praitayini Kanakaraj, Nancy R. Newlin, Gaurav Rudravaram, Kurt G. Schilling, Blake E. Dewey, Derek Archer, Timothy J. Hohman, Zhiyuan Li, Shunxing Bao, Bennett A. Landman, Nazirah Mohd Khairi2024-08-26下载Curating, processing, and combining large-scale medical imaging datasets from national studies is a non-trivial task due to the intense computation and data throughput required, variability of acquire...
A sparsity-aware distributed-memory algorithm for sparse-sparse matrix multiplicationYuxi Hong, Aydin Buluc2024-08-26下载Multiplying two sparse matrices (SpGEMM) is a common computational primitive used in many areas including graph algorithms, bioinformatics, algebraic multigrid solvers, and randomized sketching.
Employing Artificial Intelligence to Steer Exascale Workflows with ColmenaLogan Ward, J. Gregory Pauloski, Valerie Hayot-Sasson, Yadu Babuji, Alexander Brace, Ryan Chard, Kyle Chard, Rajeev Thakur, Ian Foster2024-08-26下载Computational workflows are a common class of application on supercomputers, yet the loosely coupled and heterogeneous nature of workflows often fails to take full advantage of their capabilities.
Hyperdimensional Computing Empowered Federated Foundation Model over Wireless Networks for MetaverseYahao Ding, Wen Shang, Minrui Xu, Zhaohui Yang, Ye Hu, Dusit Niyato, Mohammad Shikh-Bahaei2024-08-26下载The Metaverse, a burgeoning collective virtual space merging augmented reality and persistent virtual worlds, necessitates advanced artificial intelligence (AI) and communication technologies to suppo...
Adaptive Resolution Inference (ARI): Energy-Efficient Machine Learning for Internet of ThingsZiheng Wang, Pedro Reviriego, Farzad Niknia, Javier Conde, Shanshan Liu, Fabrizio Lombardi2024-08-26下载The implementation of machine learning in Internet of Things devices poses significant operational challenges due to limited energy and computation resources.
Resource Efficient Asynchronous Federated Learning for Digital Twin Empowered IoT NetworkShunfeng Chu, Jun Li, Jianxin Wang, Yiyang Ni, Kang Wei, Wen Chen, Shi Jin2024-08-26下载As an emerging technology, digital twin (DT) can provide real-time status and dynamic topology mapping for Internet of Things (IoT) devices. However, DT and its implementation within industrial IoT ne...
Exploiting ray tracing technology through OptiX to compute particle interactions with cutoff in a 3D environment on GPUAlgis David, Bérenger Bramas2024-08-26下载Computing on graphics processing units (GPUs) has become standard in scientific computing, allowing for incredible performance gains over classical CPUs for many computational methods.
Optimizing STAR Aligner for High Throughput Computing in the CloudPiotr Kica, Sabina Lichołai, Michał Orzechowski, Maciej Malawski2024-08-26下载We propose a scalable, cloud-native architecture designed for Transcriptomics Atlas Pipeline, using a resource-intensive STAR aligner and processing tens or hundreds of terabytes of RNA-seq data.
Celtibero: Robust Layered Aggregation for Federated LearningBorja Molina-Coronado2024-08-26下载Federated Learning (FL) is an innovative approach to distributed machine learning. While FL offers significant privacy advantages, it also faces security challenges, particularly from poisoning attack...
LIMO: Load-balanced Offloading with MAPE and Particle Swarm Optimization in Mobile Fog NetworksYasaman Seraj, Soheil Fadaei, Bardia Safaei, Ali Javadi, Amir Mahdi Hosseini Monazzah, Ali Mohammad Afshin Hemmatyar2024-08-26下载Fog computing is essentially the expansion of cloud computing towards the network edge, reducing user access time to computing resources and services.
Dynamic Pricing for Electric Vehicle ChargingArun Kumar Kalakanti, Shrisha Rao2024-08-26下载Dynamic pricing is a promising strategy to address the challenges of smart charging, as traditional time-of-use (ToU) rates and stationary pricing (SP) do not dynamically react to changes in operating...
Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep LearningWei An, Xiao Bi, Guanting Chen, Shanhuang Chen, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Wenjun Gao, Kang Guan, Jianzhong Guo, Yongqiang Guo, Zhe Fu, Ying He, Panpan Huang, Jiashi Li, Wenfeng Liang, Xiaodong Liu, Xin Liu, Yiyuan Liu, Yuxuan Liu, Shanghao Lu, Xuan Lu, Xiaotao Nie, Tian Pei, Junjie Qiu, Hui Qu, Zehui Ren, Zhangli Sha, Xuecheng Su, Xiaowen Sun, Yixuan Tan, Minghui Tang, Shiyu Wang, Yaohui Wang, Yongji Wang, Ziwei Xie, Yiliang Xiong, Yanhong Xu, Shengfeng Ye, Shuiping Yu, Yukun Zha, Liyue Zhang, Haowei Zhang, Mingchuan Zhang, Wentao Zhang, Yichao Zhang, Chenggang Zhao, Yao Zhao, Shangyan Zhou, Shunfeng Zhou, Yuheng Zou2024-08-26下载The rapid progress in Deep Learning (DL) and Large Language Models (LLMs) has exponentially increased demands of computational power and bandwidth.
Neighborhood and Global Perturbations Supported SAM in Federated Learning: From Local Tweaks To Global AwarenessBoyuan Li, Zihao Peng, Yafei Li, Mingliang Xu, Shengbo Chen, Baofeng Ji, Cong Shen2024-08-26下载Federated Learning (FL) can be coordinated under the orchestration of a central server to collaboratively build a privacy-preserving model without the need for data exchange.
Hierarchical Learning and Computing over Space-Ground Integrated NetworksJingyang Zhu, Yuanming Shi, Yong Zhou, Chunxiao Jiang, Linling Kuang2024-08-26下载Space-ground integrated networks hold great promise for providing global connectivity, particularly in remote areas where large amounts of valuable data are generated by Internet of Things (IoT) devic...
Rorqual: Speeding up Narwhal with TEEsLuciano Freitas, Shashank Motepalli, Matej Pavlovic, Benjamin Livshits2024-08-26下载In this paper, we introduce Rorqual, a protocol designed to enhance the performance of the Narwhal Mempool by integrating Trusted Execution Environments (TEEs).
Exploring GPU-to-GPU Communication: Insights into Supercomputer InterconnectsDaniele De Sensi, Lorenzo Pichetti, Flavio Vella, Tiziano De Matteis, Zebin Ren, Luigi Fusco, Matteo Turisini, Daniele Cesarini, Kurt Lust, Animesh Trivedi, Duncan Roweth, Filippo Spiga, Salvatore Di Girolamo, Torsten Hoefler2024-08-26下载Multi-GPU nodes are increasingly common in the rapidly evolving landscape of exascale supercomputers. On these systems, GPUs on the same node are connected through dedicated networks, with bandwidths ...
Revisiting time-variant complex conjugate matrix equations with their corresponding real field time-variant large-scale linear equations, neural hypercomplex numbers space compressive approximation approachJiakuang He, Dongqing Wu2024-08-26下载Large-scale linear equations and high dimension have been hot topics in deep learning, machine learning, control,and scientific computing. Because of special conjugate operation characteristics, time-...
Decentralized Federated Learning with Model Caching on Mobile AgentsXiaoyu Wang, Guojun Xiong, Houwei Cao, Jian Li, Yong Liu2024-08-26下载Federated Learning (FL) trains a shared model using data and computation power on distributed agents coordinated by a central server. Decentralized FL (DFL) utilizes local model exchange and aggregati...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Anomaly Detection Within Mission-Critical Call ProcessingSean Doris, Iosif Salem, Stefan Schmid2024-08-26下载With increasingly larger and more complex telecommunication networks, there is a need for improved monitoring and reliability. Requirements increase further when working with mission-critical systems ...
Cloud-Based Federation Framework and Prototype for Open, Scalable, and Shared Access to NextG and IoT TestbedsMaxwell McManus, Tenzin Rinchen, Annoy Dey, Sumanth Thota, Zhaoxi Zhang, Jiangqi Hu, Xi Wang, Mingyue Ji, Nicholas Mastronarde, Elizabeth Serena Bentley, Michael Medley, Zhangyu Guan2024-08-26下载In this work, we present a new federation framework for UnionLabs, an innovative cloud-based resource-sharing infrastructure designed for next-generation (NextG) and Internet of Things (IoT) over-the-...
Synergistic and Efficient Edge-Host Communication for Energy Harvesting Wireless Sensor NetworksCyan Subhra Mishra, Jack Sampson, Mahmut Taylan Kandmeir, Vijaykrishnan Narayanan, Chita R Das2024-08-26下载There is an increasing demand for intelligent processing on ultra-low-power internet of things (IoT) device. Recent works have shown substantial efficiency boosts by executing inferences directly on t...
User-Access Point Association for High Density MIMO Wireless LANsPhillip B. Oni, Steven D. Blostein2024-08-26下载Wireless local area network (WLAN) access points (APs) are being deployed in high density to improve coverage and throughput. The emerging multiple-input multiple-output (MIMO) implementation for upli...
Scalable Multivariate Fronthaul Quantization for Cell-Free Massive MIMOSangwoo Park, Ahmet Hasim Gokceoglu, Li Wang, Osvaldo Simeone2024-08-26下载The conventional approach to the fronthaul design for cell-free massive MIMO system follows the compress-and-precode (CP) paradigm. Accordingly, encoded bits and precoding coefficients are shared by t...
LIMO: Load-balanced Offloading with MAPE and Particle Swarm Optimization in Mobile Fog NetworksYasaman Seraj, Soheil Fadaei, Bardia Safaei, Ali Javadi, Amir Mahdi Hosseini Monazzah, Ali Mohammad Afshin Hemmatyar2024-08-26下载Fog computing is essentially the expansion of cloud computing towards the network edge, reducing user access time to computing resources and services.
Hierarchical Learning and Computing over Space-Ground Integrated NetworksJingyang Zhu, Yuanming Shi, Yong Zhou, Chunxiao Jiang, Linling Kuang2024-08-26下载Space-ground integrated networks hold great promise for providing global connectivity, particularly in remote areas where large amounts of valuable data are generated by Internet of Things (IoT) devic...
Exploring GPU-to-GPU Communication: Insights into Supercomputer InterconnectsDaniele De Sensi, Lorenzo Pichetti, Flavio Vella, Tiziano De Matteis, Zebin Ren, Luigi Fusco, Matteo Turisini, Daniele Cesarini, Kurt Lust, Animesh Trivedi, Duncan Roweth, Filippo Spiga, Salvatore Di Girolamo, Torsten Hoefler2024-08-26下载Multi-GPU nodes are increasingly common in the rapidly evolving landscape of exascale supercomputers. On these systems, GPUs on the same node are connected through dedicated networks, with bandwidths ...
Towards Battery-Free Wireless Sensing via Radio-Frequency Energy HarvestingTao Ni, Zehua Sun, Mingda Han, Guohao Lan, Yaxiong Xie, Zhenjiang Li, Tao Gu, Weitao Xu2024-08-26下载Diverse Wi-Fi-based wireless applications have been proposed, ranging from daily activity recognition to vital sign monitoring. Despite their remarkable sensing accuracy, the high energy consumption a...

cs.PF - Performance

标题作者发布日期PDF摘要
Exploring GPU-to-GPU Communication: Insights into Supercomputer InterconnectsDaniele De Sensi, Lorenzo Pichetti, Flavio Vella, Tiziano De Matteis, Zebin Ren, Luigi Fusco, Matteo Turisini, Daniele Cesarini, Kurt Lust, Animesh Trivedi, Duncan Roweth, Filippo Spiga, Salvatore Di Girolamo, Torsten Hoefler2024-08-26下载Multi-GPU nodes are increasingly common in the rapidly evolving landscape of exascale supercomputers. On these systems, GPUs on the same node are connected through dedicated networks, with bandwidths ...

基于 VitePress 构建