Skip to content

2024-07-12

cs.AR - Architecture

标题作者发布日期PDF摘要
MonoSparse-CAM: Efficient Tree Model Processing via Monotonicity and Sparsity in CAMsTergel Molom-Ochir, Brady Taylor, Hai Li, Yiran Chen2024-07-12下载While the tree-based machine learning (TBML) models exhibit superior performance compared to neural networks on tabular data and hold promise for energy-efficient acceleration using aCAM arrays, their...
Weight Block Sparsity: Training, Compilation, and AI Engine AcceleratorsPaolo D'Alberto, Taehee Jeong, Akshai Jain, Shreyas Manjunath, Mrinal Sarmah, Samuel Hsu, Yaswanth Raparti, Nitesh Pipralia2024-07-12下载Nowadays, increasingly larger Deep Neural Networks (DNNs) are being developed, trained, and utilized. These networks require significant computational resources, putting a strain on both advanced and ...
iMIV: in-Memory Integrity Verification for NVMRajat Jain, Aravinda Prasad, Sreenivas Subramoney, Arkaprava Basu2024-07-12下载Non-volatile Memory (NVM) could bridge the gap between memory and storage. However, NVMs are susceptible to data remanence attacks. Thus, multiple security metadata must persist along with the data to...
Dynamic neural network with memristive CIM and CAM for 2D and 3D visionYue Zhang, Woyu Zhang, Shaocong Wang, Ning Lin, Yifei Yu, Yangu He, Bo Wang, Hao Jiang, Peng Lin, Xiaoxin Xu, Xiaojuan Qi, Zhongrui Wang, Xumeng Zhang, Dashan Shang, Qi Liu, Kwang-Ting Cheng, Ming Liu2024-07-12下载The brain is dynamic, associative and efficient. It reconfigures by associating the inputs with past experiences, with fused memory and processing.
Hybrid Temporal Computing for Lower Power Hardware AcceleratorsMaliha Tasnim, Sachin Sachdeva, Yibo Liu, Sheldon X. -D. Tan2024-07-12下载In this paper, we propose a new hybrid temporal computing (HTC) framework that leverages both pulse rate and temporal data encoding to design ultra-low energy hardware accelerators.
TensorTEE: Unifying Heterogeneous TEE Granularity for Efficient Secure Collaborative Tensor ComputingHusheng Han, Xinyao Zheng, Yuanbo Wen, Yifan Hao, Erhu Feng, Ling Liang, Jianan Mu, Xiaqing Li, Tianyun Ma, Pengwei Jin, Xinkai Song, Zidong Du, Qi Guo, Xing Hu2024-07-12下载Heterogeneous collaborative computing with NPU and CPU has received widespread attention due to its substantial performance benefits. To ensure data confidentiality and integrity during computing, Tru...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Memory Lower Bounds and Impossibility Results for Anonymous Dynamic BroadcastGarrett Parzych, Joshua J. Daymude2024-07-12下载Broadcast is a ubiquitous distributed computing problem that underpins many other system tasks. In static, connected networks, it was recently shown that broadcast is solvable without any node memory ...
Securing Confidential Data For Distributed Software Development Teams: Encrypted Container FileTobias J. Bauer, Andreas Aßmuth2024-07-12下载In the context of modern software engineering, there is a trend towards Cloud-native software development involving international teams with members from all over the world.
Mapping Large Memory-constrained Workflows onto Heterogeneous PlatformsSvetlana Kulagina, Henning Meyerhenke, Anne Benoit2024-07-12下载Scientific workflows are often represented as directed acyclic graphs (DAGs), where vertices correspond to tasks and edges represent the dependencies between them.
Ktirio Urban Building: A Computational Framework for City Energy Simulations Enhanced by CI/CD Innovations on EuroHPC SystemsChristophe Prud'Homme, Vincent Chabannes, Luca Berti, Maryam Maslek, Philippe Pincon, Javier Cladellas, Abdoulaye Diallo2024-07-12下载The building sector in the European Union significantly impacts energy consumption and greenhouse gas emissions. The EU's Horizon 2050 initiative sets ambitious goals to reduce these impacts through e...
Enabling Elastic Model Serving with MultiWorldMyungjin Lee, Akshay Jajoo, Ramana Rao Kompella2024-07-12下载Machine learning models have been exponentially growing in terms of their parameter size over the past few years. We are now seeing the rise of trillion-parameter models.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Managing O-RAN Networks: xApp Development from Zero to HeroJoao F. Santos, Alexandre Huff, Daniel Campos, Kleber V. Cardoso, Cristiano B. Both, Luiz A. DaSilva2024-07-12下载The Open Radio Access Network (O-RAN) Alliance proposes an open architecture that disaggregates the RAN and supports executing custom control logic in near-real time from third-party applications, the...
FedsLLM: Federated Split Learning for Large Language Models over Communication NetworksKai Zhao, Zhaohui Yang, Chongwen Huang, Xiaoming Chen, Zhaoyang Zhang2024-07-12下载Addressing the challenges of deploying large language models in wireless communication networks, this paper combines low-rank adaptation technology (LoRA) with the splitfed learning framework to propo...
Physical Layer Aspects of Quantum Communications: A SurveySeid Koudia, Leonardo Oleynik, Mert Bayraktar, Junaid ur Rehman, Symeon Chatzinotas2024-07-12下载Quantum communication systems support unique applications in the form of distributed quantum computing, distributed quantum sensing, and several cryptographic protocols.
An Adaptive Indoor Localization Approach Using WiFi RSSI Fingerprinting with SLAM-Enabled Robotic Platform and Deep Neural NetworksSeyed Alireza Rahimi Azghadi, Atah Nuh Mih, Asfia Kawnine, Monica Wachowicz, Francis Palma, Hung Cao2024-07-12下载Indoor localization plays a vital role in the era of the IoT and robotics, with WiFi technology being a prominent choice due to its ubiquity. We present a method for creating WiFi fingerprinting datas...
Optimized Federated Multitask Learning in Mobile Edge Networks: A Hybrid Client Selection and Model Aggregation ApproachMoqbel Hamood, Abdullatif Albaseer, Mohamed Abdallah, Ala Al-Fuqaha, Amr Mohamed2024-07-12下载We propose clustered federated multitask learning to address statistical challenges in non-independent and identically distributed data across clients.
Assessing the Efficacy of IoT-based Forest Fire Detection: a Practical Use CaseBelcher Anthony, Esteva Miguel A., Lam Anthea, Ramadhani Rizki, Rayhan Achmad, Xu Wangkun, Tuncer Daphne2024-07-12下载The implementation of early warning mechanisms that can be used to detect forest fires in rural areas is essential to mitigate their deleterious effects, in particular by notifying local fire authorit...
A Bistatic ISAC Framework for LEO Satellite Systems: A Rate-Splitting ApproachJuha Park, Jaehyup Seong, Jaehak Ryu, Yijie Mao, Wonjae Shin2024-07-12下载Aiming to achieve ubiquitous global connectivity and target detection on the same platform with improved spectral/energy efficiency and reduced onboard hardware cost, low Earth orbit (LEO) satellite s...
Redefinition of Digital Twin and its Situation Awareness Framework Designing Towards Fourth Paradigm for Energy Internet of ThingsXing He, Yuezhong Tang, Shuyan Ma, Qian Ai, Fei Tao, Robert Qiu2024-07-12下载Traditional knowledge-based situation awareness (SA) modes struggle to adapt to the escalating complexity of today's Energy Internet of Things (EIoT), necessitating a pivotal paradigm shift.
Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement LearningChuang Zhang, Geng Sun, Jiahui Li, Qingqing Wu, Jiacheng Wang, Dusit Niyato, Yuanwei Liu2024-07-12下载Due to flexibility and low-cost, unmanned aerial vehicles (UAVs) are increasingly crucial for enhancing coverage and functionality of wireless networks.

cs.PF - Performance

标题作者发布日期PDF摘要
Acceleration of Tensor-Product Operations with Tensor CoresCu Cui2024-07-12下载In this paper, we explore the acceleration of tensor product operations in finite element methods, leveraging the computational power of the NVIDIA A100 GPU Tensor Cores.

基于 VitePress 构建