Skip to content

2024-12-23

cs.AR - Architecture

标题作者发布日期PDF摘要
TNNGen: Automated Design of Neuromorphic Sensory Processing Units for Time-Series ClusteringPrabhu Vellaisamy, Harideep Nair, Vamsikrishna Ratnakaram, Dhruv Gupta, John Paul Shen2024-12-23下载Temporal Neural Networks (TNNs), a special class of spiking neural networks, draw inspiration from the neocortex in utilizing spike-timings for information processing.
tuGEMM: Area-Power-Efficient Temporal Unary GEMM Architecture for Low-Precision Edge AIHarideep Nair, Prabhu Vellaisamy, Albert Chen, Joseph Finn, Anna Li, Manav Trivedi, John Paul Shen2024-12-23下载General matrix multiplication (GEMM) is a ubiquitous computing kernel/algorithm for data processing in diverse applications, including artificial intelligence (AI) and deep learning (DL).
tubGEMM: Energy-Efficient and Sparsity-Effective Temporal-Unary-Binary Based Matrix Multiply UnitPrabhu Vellaisamy, Harideep Nair, Joseph Finn, Manav Trivedi, Albert Chen, Anna Li, Tsung-Han Lin, Perry Wang, Shawn Blanton, John Paul Shen2024-12-23下载General Matrix Multiplication (GEMM) is a ubiquitous compute kernel in deep learning (DL). To support energy-efficient edge-native processing, new GEMM hardware units have been proposed that operate o...
Robust and Reconfigurable On-Board Data Handling Subsystem for Present and Future Brazilian CubeSat MissionsVictor O. Costa, Mauren D'Ávila, Douglas Arena, Vinicius Schreiner, Renan Menezes, Cleber Hoffmann, Edson Pereira, Lidia Shibuya Sato, Felipe Tavares, Luis Loures, Fernanda L. Kastensmidt2024-12-23下载CubeSats require robust OBDH solutions in harsh environments. The Demoiselle OBC, featuring a radiation-tolerant APSoC and layered FSW, supports reuse, in-orbit updates, and secure operations.
HPCNeuroNet: A Neuromorphic Approach Merging SNN Temporal Dynamics with Transformer Attention for FPGA-based Particle PhysicsMurat Isik, Hiruna Vishwamith, Jonathan Naoukin, I. Can Dikmen2024-12-23下载This paper presents the innovative HPCNeuroNet model, a pioneering fusion of Spiking Neural Networks (SNNs), Transformers, and high-performance computing tailored for particle physics, particularly in...
Highly Optimized Kernels and Fine-Grained Codebooks for LLM Inference on Arm CPUsDibakar Gope, David Mansell, Danny Loh, Ian Bratt2024-12-23下载Large language models (LLMs) have transformed the way we think about language understanding and generation, enthralling both researchers and developers.
Agile TLB Prefetching and Prediction Replacement PolicyMelkamu Mersha, Tsion Abay, Mingziem Bitewa, Gedare Bloom2024-12-23下载Virtual-to-physical address translation is a critical performance bottleneck in paging-based virtual memory systems. The Translation Lookaside Buffer (TLB) accelerates address translation by caching f...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Parallel Contraction Hierarchies Can Be Efficient and ScalableZijin Wan, Xiaojun Dong, Letong Wang, Enzuo Zhu, Yan Gu, Yihan Sun2024-12-23下载Contraction Hierarchies (CH) (Geisberger et al., 2008) is one of the most widely used algorithms for shortest-path queries on road networks. Compared to Dijkstra's algorithm, CH enables orders of magn...
Enhanced Quantum Circuit Cutting Framework for Sampling Overhead ReductionPo-Hung Chen, Dah-Wei Chiou, Bo-Hung Chen, Jie-Hong Roland Jiang2024-12-23下载The recently developed quantum circuit cutting technique greatly extends the capabilities of current noisy intermediate-scale quantum (NISQ) hardware.
FedTLU: Federated Learning with Targeted Layer UpdatesJong-Ik Park, Carlee Joe-Wong2024-12-23下载Federated learning (FL) addresses privacy concerns in training language models by enabling multiple clients to contribute to the training, without sending their data to others.
Heat: Satellite's meat is GPU's poisonZhehu Yuan, Jinyang Liu, Guanqun Song, Ting Zhu2024-12-23下载In satellite applications, managing thermal conditions is a significant challenge due to the extreme fluctuations in temperature during orbital cycles.
Synergistic Integration of Blockchain and Software-Defined Networking in the Internet of Energy SystemsVahideh Hayyolalam, Abdulrezzak Zekiye, Hamza Abuzahra, Oznur Ozkasap, Murat Karakus, Evrim Guler, Suleyman Uludag2024-12-23下载Peer-to-peer (P2P) energy trading, Smart Grids (SG), and electric vehicle energy management are integral components of the Internet of Energy (IoE) field.
Power- and Fragmentation-aware Online Scheduling for GPU DatacentersFrancesco Lettich, Emanuele Carlini, Franco Maria Nardini, Raffaele Perego, Salvatore Trani2024-12-23下载The rise of Artificial Intelligence and Large Language Models is driving increased GPU usage in data centers for complex training and inference tasks, impacting operational costs, energy demands, and ...
Data-Juicer 2.0: Cloud-Scale Adaptive Data Processing for and with Foundation ModelsDaoyuan Chen, Yilun Huang, Xuchen Pan, Nana Jiang, Haibin Wang, Yilei Zhang, Ce Ge, Yushuo Chen, Wenhao Zhang, Zhijian Ma, Jun Huang, Wei Lin, Yaliang Li, Bolin Ding, Jingren Zhou2024-12-23下载Foundation models demand advanced data processing for their vast, multimodal datasets. However, traditional frameworks struggle with the unique complexities of multimodal data.
Quantum Approximate Optimisation Applied to Graph SimilarityNicholas J. Pritchard2024-12-23下载Quantum computing promises solutions to classically difficult and new-found problems through controlling the subtleties of quantum computing. The Quantum Approximate Optimisation Algorithm (QAOA) is a...
Dynamic Scheduling Strategies for Resource Optimization in Computing EnvironmentsXiaoye Wang2024-12-23下载The rapid development of cloud-native architecture has promoted the widespread application of container technology, but the optimization problems in container scheduling and resource management still ...
BLITZSCALE: Fast and Live Large Model Autoscaling with O(1) Host CachingDingyan Zhang, Haotian Wang, Yang Liu, Xingda Wei, Yizhou Shan, Rong Chen, Haibo Chen2024-12-23下载Model autoscaling is the key mechanism to achieve serverless model-as-a-service, but it faces a fundamental trade-off between scaling speed and storage/memory usage to cache parameters, and cannot mee...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Towards Cognitive Service Delivery on B5G through AIaaS ArchitectureLarissa F. Rodrigues Moreira, Rodrigo Moreira, Flávio de Oliveira Silva, André R. Backes2024-12-23下载Artificial Intelligence (AI) is pivotal in advancing mobile network systems by facilitating smart capabilities and automation. The transition from 4G to 5G has substantial implications for AI in conso...
UAV Communications: Impact of Obstacles on Channel CharacteristicsKamal Shayegan2024-12-23下载In recent years, Unmanned Aerial Vehicles (UAVs) have been utilized as effective platforms for carrying Wi-Fi Access Points (APs) and cellular Base Stations (BSs), enabling low-cost, agile, and flexib...
Hierarchical Blockchain Radio Access Networks: Architecture, Modelling, and Performance AssessmentVasileios Kouvakis, Stylianos E. Trevlakis, Alexandros-Apostolos A. Boulogeorgos, Hongwu Liu, Waqas Khalid, Theodoros Tsiftsis, Octavia A. Dobre2024-12-23下载Demands for secure, ubiquitous, and always-available connectivity have been identified as the pillar design parameters of the next generation radio access networks (RANs).
Outage Probability Analysis of Uplink Heterogeneous Non-terrestrial Networks: A Novel Stochastic Geometry ModelWen-Yu Dong, Shaoshi Yang, Wei Lin, Wei Zhao, Jia-Xing Gui, Sheng Chen2024-12-23下载In harsh environments such as mountainous terrain, dense vegetation areas, or urban landscapes, a single type of unmanned aerial vehicles (UAVs) may encounter challenges like flight restrictions, diff...
Efficacy of Full-Packet Encryption in Mitigating Protocol Detection for Evasive Virtual Private NetworksAmy Iris Parker2024-12-23下载Full-packet encryption is a technique used by modern evasive Virtual Private Networks (VPNs) to avoid protocol-based flagging from censorship models by disguising their traffic as random noise on the ...
SoK: The Design Paradigm of Safe and Secure DefaultsJukka Ruohonen2024-12-23下载In security engineering, including software security engineering, there is a well-known design paradigm telling to prefer safe and secure defaults.
Taming Imbalance and Complexity in WAN Traffic EngineeringYufeng Xin, Sajith Sasidharam, Cong Wang, Mert Cevik2024-12-23下载The rapid expansion of global cloud infrastructures, coupled with the growing volume and complexity of network traffic, has fueled active research into scalable and resilient Traffic Engineering (TE) ...
FedMeld: A Model-dispersal Federated Learning Framework for Space-ground Integrated NetworksQian Chen, Xianhao Chen, Kaibin Huang2024-12-23下载To bridge the digital divide, space-ground integrated networks (SGINs) are expected to deliver artificial intelligence (AI) services to every corner of the world.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
BLITZSCALE: Fast and Live Large Model Autoscaling with O(1) Host CachingDingyan Zhang, Haotian Wang, Yang Liu, Xingda Wei, Yizhou Shan, Rong Chen, Haibo Chen2024-12-23下载Model autoscaling is the key mechanism to achieve serverless model-as-a-service, but it faces a fundamental trade-off between scaling speed and storage/memory usage to cache parameters, and cannot mee...

cs.PF - Performance

标题作者发布日期PDF摘要
On the Optimization of Singular Spectrum Analyses: A Pragmatic ApproachFernando Lopes, Dominique Gibert, Vincent Courtillot, Jean-Louis Le Mouël, Jean-Baptiste Boulé2024-12-23下载Singular Spectrum Analysis (SSA) occupies a prominent place in the real signal analysis toolkit alongside Fourier and Wavelet analysis. In addition to the two aforementioned analyses, SSA allows the s...
Performance evaluation of accelerated real and complex multiple-precision sparse matrix-vector multiplicationTomonori Kouya2024-12-23下载Sparse matrices have recently played a significant and impactful role in scientific computing, including artificial intelligence-related fields.

基于 VitePress 构建