Appearance
2024-12-23
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| TNNGen: Automated Design of Neuromorphic Sensory Processing Units for Time-Series Clustering | Prabhu Vellaisamy, Harideep Nair, Vamsikrishna Ratnakaram, Dhruv Gupta, John Paul Shen | 2024-12-23 | 下载 | Temporal Neural Networks (TNNs), a special class of spiking neural networks, draw inspiration from the neocortex in utilizing spike-timings for information processing. |
| tuGEMM: Area-Power-Efficient Temporal Unary GEMM Architecture for Low-Precision Edge AI | Harideep Nair, Prabhu Vellaisamy, Albert Chen, Joseph Finn, Anna Li, Manav Trivedi, John Paul Shen | 2024-12-23 | 下载 | General matrix multiplication (GEMM) is a ubiquitous computing kernel/algorithm for data processing in diverse applications, including artificial intelligence (AI) and deep learning (DL). |
| tubGEMM: Energy-Efficient and Sparsity-Effective Temporal-Unary-Binary Based Matrix Multiply Unit | Prabhu Vellaisamy, Harideep Nair, Joseph Finn, Manav Trivedi, Albert Chen, Anna Li, Tsung-Han Lin, Perry Wang, Shawn Blanton, John Paul Shen | 2024-12-23 | 下载 | General Matrix Multiplication (GEMM) is a ubiquitous compute kernel in deep learning (DL). To support energy-efficient edge-native processing, new GEMM hardware units have been proposed that operate o... |
| Robust and Reconfigurable On-Board Data Handling Subsystem for Present and Future Brazilian CubeSat Missions | Victor O. Costa, Mauren D'Ávila, Douglas Arena, Vinicius Schreiner, Renan Menezes, Cleber Hoffmann, Edson Pereira, Lidia Shibuya Sato, Felipe Tavares, Luis Loures, Fernanda L. Kastensmidt | 2024-12-23 | 下载 | CubeSats require robust OBDH solutions in harsh environments. The Demoiselle OBC, featuring a radiation-tolerant APSoC and layered FSW, supports reuse, in-orbit updates, and secure operations. |
| HPCNeuroNet: A Neuromorphic Approach Merging SNN Temporal Dynamics with Transformer Attention for FPGA-based Particle Physics | Murat Isik, Hiruna Vishwamith, Jonathan Naoukin, I. Can Dikmen | 2024-12-23 | 下载 | This paper presents the innovative HPCNeuroNet model, a pioneering fusion of Spiking Neural Networks (SNNs), Transformers, and high-performance computing tailored for particle physics, particularly in... |
| Highly Optimized Kernels and Fine-Grained Codebooks for LLM Inference on Arm CPUs | Dibakar Gope, David Mansell, Danny Loh, Ian Bratt | 2024-12-23 | 下载 | Large language models (LLMs) have transformed the way we think about language understanding and generation, enthralling both researchers and developers. |
| Agile TLB Prefetching and Prediction Replacement Policy | Melkamu Mersha, Tsion Abay, Mingziem Bitewa, Gedare Bloom | 2024-12-23 | 下载 | Virtual-to-physical address translation is a critical performance bottleneck in paging-based virtual memory systems. The Translation Lookaside Buffer (TLB) accelerates address translation by caching f... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Parallel Contraction Hierarchies Can Be Efficient and Scalable | Zijin Wan, Xiaojun Dong, Letong Wang, Enzuo Zhu, Yan Gu, Yihan Sun | 2024-12-23 | 下载 | Contraction Hierarchies (CH) (Geisberger et al., 2008) is one of the most widely used algorithms for shortest-path queries on road networks. Compared to Dijkstra's algorithm, CH enables orders of magn... |
| Enhanced Quantum Circuit Cutting Framework for Sampling Overhead Reduction | Po-Hung Chen, Dah-Wei Chiou, Bo-Hung Chen, Jie-Hong Roland Jiang | 2024-12-23 | 下载 | The recently developed quantum circuit cutting technique greatly extends the capabilities of current noisy intermediate-scale quantum (NISQ) hardware. |
| FedTLU: Federated Learning with Targeted Layer Updates | Jong-Ik Park, Carlee Joe-Wong | 2024-12-23 | 下载 | Federated learning (FL) addresses privacy concerns in training language models by enabling multiple clients to contribute to the training, without sending their data to others. |
| Heat: Satellite's meat is GPU's poison | Zhehu Yuan, Jinyang Liu, Guanqun Song, Ting Zhu | 2024-12-23 | 下载 | In satellite applications, managing thermal conditions is a significant challenge due to the extreme fluctuations in temperature during orbital cycles. |
| Synergistic Integration of Blockchain and Software-Defined Networking in the Internet of Energy Systems | Vahideh Hayyolalam, Abdulrezzak Zekiye, Hamza Abuzahra, Oznur Ozkasap, Murat Karakus, Evrim Guler, Suleyman Uludag | 2024-12-23 | 下载 | Peer-to-peer (P2P) energy trading, Smart Grids (SG), and electric vehicle energy management are integral components of the Internet of Energy (IoE) field. |
| Power- and Fragmentation-aware Online Scheduling for GPU Datacenters | Francesco Lettich, Emanuele Carlini, Franco Maria Nardini, Raffaele Perego, Salvatore Trani | 2024-12-23 | 下载 | The rise of Artificial Intelligence and Large Language Models is driving increased GPU usage in data centers for complex training and inference tasks, impacting operational costs, energy demands, and ... |
| Data-Juicer 2.0: Cloud-Scale Adaptive Data Processing for and with Foundation Models | Daoyuan Chen, Yilun Huang, Xuchen Pan, Nana Jiang, Haibin Wang, Yilei Zhang, Ce Ge, Yushuo Chen, Wenhao Zhang, Zhijian Ma, Jun Huang, Wei Lin, Yaliang Li, Bolin Ding, Jingren Zhou | 2024-12-23 | 下载 | Foundation models demand advanced data processing for their vast, multimodal datasets. However, traditional frameworks struggle with the unique complexities of multimodal data. |
| Quantum Approximate Optimisation Applied to Graph Similarity | Nicholas J. Pritchard | 2024-12-23 | 下载 | Quantum computing promises solutions to classically difficult and new-found problems through controlling the subtleties of quantum computing. The Quantum Approximate Optimisation Algorithm (QAOA) is a... |
| Dynamic Scheduling Strategies for Resource Optimization in Computing Environments | Xiaoye Wang | 2024-12-23 | 下载 | The rapid development of cloud-native architecture has promoted the widespread application of container technology, but the optimization problems in container scheduling and resource management still ... |
| BLITZSCALE: Fast and Live Large Model Autoscaling with O(1) Host Caching | Dingyan Zhang, Haotian Wang, Yang Liu, Xingda Wei, Yizhou Shan, Rong Chen, Haibo Chen | 2024-12-23 | 下载 | Model autoscaling is the key mechanism to achieve serverless model-as-a-service, but it faces a fundamental trade-off between scaling speed and storage/memory usage to cache parameters, and cannot mee... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Towards Cognitive Service Delivery on B5G through AIaaS Architecture | Larissa F. Rodrigues Moreira, Rodrigo Moreira, Flávio de Oliveira Silva, André R. Backes | 2024-12-23 | 下载 | Artificial Intelligence (AI) is pivotal in advancing mobile network systems by facilitating smart capabilities and automation. The transition from 4G to 5G has substantial implications for AI in conso... |
| UAV Communications: Impact of Obstacles on Channel Characteristics | Kamal Shayegan | 2024-12-23 | 下载 | In recent years, Unmanned Aerial Vehicles (UAVs) have been utilized as effective platforms for carrying Wi-Fi Access Points (APs) and cellular Base Stations (BSs), enabling low-cost, agile, and flexib... |
| Hierarchical Blockchain Radio Access Networks: Architecture, Modelling, and Performance Assessment | Vasileios Kouvakis, Stylianos E. Trevlakis, Alexandros-Apostolos A. Boulogeorgos, Hongwu Liu, Waqas Khalid, Theodoros Tsiftsis, Octavia A. Dobre | 2024-12-23 | 下载 | Demands for secure, ubiquitous, and always-available connectivity have been identified as the pillar design parameters of the next generation radio access networks (RANs). |
| Outage Probability Analysis of Uplink Heterogeneous Non-terrestrial Networks: A Novel Stochastic Geometry Model | Wen-Yu Dong, Shaoshi Yang, Wei Lin, Wei Zhao, Jia-Xing Gui, Sheng Chen | 2024-12-23 | 下载 | In harsh environments such as mountainous terrain, dense vegetation areas, or urban landscapes, a single type of unmanned aerial vehicles (UAVs) may encounter challenges like flight restrictions, diff... |
| Efficacy of Full-Packet Encryption in Mitigating Protocol Detection for Evasive Virtual Private Networks | Amy Iris Parker | 2024-12-23 | 下载 | Full-packet encryption is a technique used by modern evasive Virtual Private Networks (VPNs) to avoid protocol-based flagging from censorship models by disguising their traffic as random noise on the ... |
| SoK: The Design Paradigm of Safe and Secure Defaults | Jukka Ruohonen | 2024-12-23 | 下载 | In security engineering, including software security engineering, there is a well-known design paradigm telling to prefer safe and secure defaults. |
| Taming Imbalance and Complexity in WAN Traffic Engineering | Yufeng Xin, Sajith Sasidharam, Cong Wang, Mert Cevik | 2024-12-23 | 下载 | The rapid expansion of global cloud infrastructures, coupled with the growing volume and complexity of network traffic, has fueled active research into scalable and resilient Traffic Engineering (TE) ... |
| FedMeld: A Model-dispersal Federated Learning Framework for Space-ground Integrated Networks | Qian Chen, Xianhao Chen, Kaibin Huang | 2024-12-23 | 下载 | To bridge the digital divide, space-ground integrated networks (SGINs) are expected to deliver artificial intelligence (AI) services to every corner of the world. |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| BLITZSCALE: Fast and Live Large Model Autoscaling with O(1) Host Caching | Dingyan Zhang, Haotian Wang, Yang Liu, Xingda Wei, Yizhou Shan, Rong Chen, Haibo Chen | 2024-12-23 | 下载 | Model autoscaling is the key mechanism to achieve serverless model-as-a-service, but it faces a fundamental trade-off between scaling speed and storage/memory usage to cache parameters, and cannot mee... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| On the Optimization of Singular Spectrum Analyses: A Pragmatic Approach | Fernando Lopes, Dominique Gibert, Vincent Courtillot, Jean-Louis Le Mouël, Jean-Baptiste Boulé | 2024-12-23 | 下载 | Singular Spectrum Analysis (SSA) occupies a prominent place in the real signal analysis toolkit alongside Fourier and Wavelet analysis. In addition to the two aforementioned analyses, SSA allows the s... |
| Performance evaluation of accelerated real and complex multiple-precision sparse matrix-vector multiplication | Tomonori Kouya | 2024-12-23 | 下载 | Sparse matrices have recently played a significant and impactful role in scientific computing, including artificial intelligence-related fields. |