2024-07-12
cs.AR - Architecture
| Title | Authors | Published | | Abstract |
|---|---|---|---|---|
| MonoSparse-CAM: Efficient Tree Model Processing via Monotonicity and Sparsity in CAMs | Tergel Molom-Ochir, Brady Taylor, Hai Li, Yiran Chen | 2024-07-12 | Download | While the tree-based machine learning (TBML) models exhibit superior performance compared to neural networks on tabular data and hold promise for energy-efficient acceleration using aCAM arrays, their... |
| Weight Block Sparsity: Training, Compilation, and AI Engine Accelerators | Paolo D'Alberto, Taehee Jeong, Akshai Jain, Shreyas Manjunath, Mrinal Sarmah, Samuel Hsu, Yaswanth Raparti, Nitesh Pipralia | 2024-07-12 | Download | Nowadays, increasingly larger Deep Neural Networks (DNNs) are being developed, trained, and utilized. These networks require significant computational resources, putting a strain on both advanced and ... |
| iMIV: in-Memory Integrity Verification for NVM | Rajat Jain, Aravinda Prasad, Sreenivas Subramoney, Arkaprava Basu | 2024-07-12 | Download | Non-volatile Memory (NVM) could bridge the gap between memory and storage. However, NVMs are susceptible to data remanence attacks. Thus, multiple security metadata must persist along with the data to... |
| Dynamic neural network with memristive CIM and CAM for 2D and 3D vision | Yue Zhang, Woyu Zhang, Shaocong Wang, Ning Lin, Yifei Yu, Yangu He, Bo Wang, Hao Jiang, Peng Lin, Xiaoxin Xu, Xiaojuan Qi, Zhongrui Wang, Xumeng Zhang, Dashan Shang, Qi Liu, Kwang-Ting Cheng, Ming Liu | 2024-07-12 | Download | The brain is dynamic, associative and efficient. It reconfigures by associating the inputs with past experiences, with fused memory and processing. |
| Hybrid Temporal Computing for Lower Power Hardware Accelerators | Maliha Tasnim, Sachin Sachdeva, Yibo Liu, Sheldon X. -D. Tan | 2024-07-12 | Download | In this paper, we propose a new hybrid temporal computing (HTC) framework that leverages both pulse rate and temporal data encoding to design ultra-low energy hardware accelerators. |
| TensorTEE: Unifying Heterogeneous TEE Granularity for Efficient Secure Collaborative Tensor Computing | Husheng Han, Xinyao Zheng, Yuanbo Wen, Yifan Hao, Erhu Feng, Ling Liang, Jianan Mu, Xiaqing Li, Tianyun Ma, Pengwei Jin, Xinkai Song, Zidong Du, Qi Guo, Xing Hu | 2024-07-12 | Download | Heterogeneous collaborative computing with NPU and CPU has received widespread attention due to its substantial performance benefits. To ensure data confidentiality and integrity during computing, Tru... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | | Abstract |
|---|---|---|---|---|
| Memory Lower Bounds and Impossibility Results for Anonymous Dynamic Broadcast | Garrett Parzych, Joshua J. Daymude | 2024-07-12 | Download | Broadcast is a ubiquitous distributed computing problem that underpins many other system tasks. In static, connected networks, it was recently shown that broadcast is solvable without any node memory ... |
| Securing Confidential Data For Distributed Software Development Teams: Encrypted Container File | Tobias J. Bauer, Andreas Aßmuth | 2024-07-12 | Download | In the context of modern software engineering, there is a trend towards Cloud-native software development involving international teams with members from all over the world. |
| Mapping Large Memory-constrained Workflows onto Heterogeneous Platforms | Svetlana Kulagina, Henning Meyerhenke, Anne Benoit | 2024-07-12 | Download | Scientific workflows are often represented as directed acyclic graphs (DAGs), where vertices correspond to tasks and edges represent the dependencies between them. |
| Ktirio Urban Building: A Computational Framework for City Energy Simulations Enhanced by CI/CD Innovations on EuroHPC Systems | Christophe Prud'Homme, Vincent Chabannes, Luca Berti, Maryam Maslek, Philippe Pincon, Javier Cladellas, Abdoulaye Diallo | 2024-07-12 | Download | The building sector in the European Union significantly impacts energy consumption and greenhouse gas emissions. The EU's Horizon 2050 initiative sets ambitious goals to reduce these impacts through e... |
| Enabling Elastic Model Serving with MultiWorld | Myungjin Lee, Akshay Jajoo, Ramana Rao Kompella | 2024-07-12 | Download | Machine learning models have been exponentially growing in terms of their parameter size over the past few years. We are now seeing the rise of trillion-parameter models. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | | Abstract |
|---|---|---|---|---|
| Managing O-RAN Networks: xApp Development from Zero to Hero | Joao F. Santos, Alexandre Huff, Daniel Campos, Kleber V. Cardoso, Cristiano B. Both, Luiz A. DaSilva | 2024-07-12 | Download | The Open Radio Access Network (O-RAN) Alliance proposes an open architecture that disaggregates the RAN and supports executing custom control logic in near-real time from third-party applications, the... |
| FedsLLM: Federated Split Learning for Large Language Models over Communication Networks | Kai Zhao, Zhaohui Yang, Chongwen Huang, Xiaoming Chen, Zhaoyang Zhang | 2024-07-12 | Download | Addressing the challenges of deploying large language models in wireless communication networks, this paper combines low-rank adaptation technology (LoRA) with the splitfed learning framework to propo... |
| Physical Layer Aspects of Quantum Communications: A Survey | Seid Koudia, Leonardo Oleynik, Mert Bayraktar, Junaid ur Rehman, Symeon Chatzinotas | 2024-07-12 | Download | Quantum communication systems support unique applications in the form of distributed quantum computing, distributed quantum sensing, and several cryptographic protocols. |
| An Adaptive Indoor Localization Approach Using WiFi RSSI Fingerprinting with SLAM-Enabled Robotic Platform and Deep Neural Networks | Seyed Alireza Rahimi Azghadi, Atah Nuh Mih, Asfia Kawnine, Monica Wachowicz, Francis Palma, Hung Cao | 2024-07-12 | Download | Indoor localization plays a vital role in the era of the IoT and robotics, with WiFi technology being a prominent choice due to its ubiquity. We present a method for creating WiFi fingerprinting datas... |
| Optimized Federated Multitask Learning in Mobile Edge Networks: A Hybrid Client Selection and Model Aggregation Approach | Moqbel Hamood, Abdullatif Albaseer, Mohamed Abdallah, Ala Al-Fuqaha, Amr Mohamed | 2024-07-12 | Download | We propose clustered federated multitask learning to address statistical challenges in non-independent and identically distributed data across clients. |
| Assessing the Efficacy of IoT-based Forest Fire Detection: a Practical Use Case | Belcher Anthony, Esteva Miguel A., Lam Anthea, Ramadhani Rizki, Rayhan Achmad, Xu Wangkun, Tuncer Daphne | 2024-07-12 | Download | The implementation of early warning mechanisms that can be used to detect forest fires in rural areas is essential to mitigate their deleterious effects, in particular by notifying local fire authorit... |
| A Bistatic ISAC Framework for LEO Satellite Systems: A Rate-Splitting Approach | Juha Park, Jaehyup Seong, Jaehak Ryu, Yijie Mao, Wonjae Shin | 2024-07-12 | Download | Aiming to achieve ubiquitous global connectivity and target detection on the same platform with improved spectral/energy efficiency and reduced onboard hardware cost, low Earth orbit (LEO) satellite s... |
| Redefinition of Digital Twin and its Situation Awareness Framework Designing Towards Fourth Paradigm for Energy Internet of Things | Xing He, Yuezhong Tang, Shuyan Ma, Qian Ai, Fei Tao, Robert Qiu | 2024-07-12 | Download | Traditional knowledge-based situation awareness (SA) modes struggle to adapt to the escalating complexity of today's Energy Internet of Things (EIoT), necessitating a pivotal paradigm shift. |
| Multi-objective Aerial Collaborative Secure Communication Optimization via Generative Diffusion Model-enabled Deep Reinforcement Learning | Chuang Zhang, Geng Sun, Jiahui Li, Qingqing Wu, Jiacheng Wang, Dusit Niyato, Yuanwei Liu | 2024-07-12 | Download | Due to flexibility and low-cost, unmanned aerial vehicles (UAVs) are increasingly crucial for enhancing coverage and functionality of wireless networks. |
cs.PF - Performance
| Title | Authors | Published | | Abstract |
|---|---|---|---|---|
| Acceleration of Tensor-Product Operations with Tensor Cores | Cu Cui | 2024-07-12 | Download | In this paper, we explore the acceleration of tensor product operations in finite element methods, leveraging the computational power of the NVIDIA A100 GPU Tensor Cores. |