Skip to content

2024-09-20

cs.AR - Architecture

标题作者发布日期PDF摘要
LoopTree: Exploring the Fused-layer Dataflow Accelerator Design SpaceMichael Gilbert, Yannan Nellie Wu, Joel S. Emer, Vivienne Sze2024-09-20下载Latency and energy consumption are key metrics in the performance of deep neural network (DNN) accelerators. A significant factor contributing to latency and energy is data transfers.
RapidOMS: FPGA-based Open Modification Spectral Library Searching with HD ComputingSumukh Pinge, Weihong Xu, Wout Bittremieux, Niema Moshiri, Sang-Woo Jun, Tajana Rosing2024-09-20下载Mass spectrometry (MS) is essential for protein analysis but faces significant challenges with large datasets and complex post-translational modifications, resulting in difficulties in spectral identi...
Towards Efficient Neuro-Symbolic AI: From Workload Characterization to Hardware ArchitectureZishen Wan, Che-Kai Liu, Hanchen Yang, Ritik Raj, Chaojian Li, Haoran You, Yonggan Fu, Cheng Wan, Sixu Li, Youbin Kim, Ananda Samajdar, Yingyan Celine Lin, Mohamed Ibrahim, Jan M. Rabaey, Tushar Krishna, Arijit Raychowdhury2024-09-20下载The remarkable advancements in artificial intelligence (AI), primarily driven by deep neural networks, are facing challenges surrounding unsustainable computational trajectories, limited robustness, a...
Learning to Compare Hardware Designs for High-Level SynthesisYunsheng Bai, Atefeh Sohrabizadeh, Zijian Ding, Rongjian Liang, Weikai Li, Ding Wang, Haoxing Ren, Yizhou Sun, Jason Cong2024-09-20下载High-level synthesis (HLS) is an automated design process that transforms high-level code into hardware designs, enabling the rapid development of hardware accelerators.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
DP2^2-FedSAM: Enhancing Differentially Private Federated Learning Through Personalized Sharpness-Aware MinimizationZhenxiao Zhang, Yuanxiong Guo, Yanmin Gong2024-09-20下载Federated learning (FL) is a distributed machine learning approach that allows multiple clients to collaboratively train a model without sharing their raw data.
End-Cloud Collaboration Framework for Advanced AI Customer Service in E-commerceLiangyu Teng, Yang Liu, Jing Liu, Liang Song2024-09-20下载In recent years, the e-commerce industry has seen a rapid increase in the demand for advanced AI-driven customer service solutions. Traditional cloud-based models face limitations in terms of latency,...
SatFed: A Resource-Efficient LEO Satellite-Assisted Heterogeneous Federated Learning FrameworkYuxin Zhang, Zheng Lin, Zhe Chen, Zihan Fang, Wenjun Zhu, Xianhao Chen, Jin Zhao, Yue Gao2024-09-20下载Traditional federated learning (FL) frameworks rely heavily on terrestrial networks, where coverage limitations and increasing bandwidth congestion significantly hinder model convergence.
Local problems in trees across a wide range of distributed modelsAnubhav Dhar, Eli Kujawa, Henrik Lievonen, Augusto Modanese, Mikail Muftuoglu, Jan Studený, Jukka Suomela2024-09-20下载The randomized online-LOCAL model captures a number of models of computing; it is at least as strong as all of these models: - the classical LOCAL model of distributed graph algorithms, - the quan...
Noise-Robust and Resource-Efficient ADMM-based Federated LearningEhsan Lari, Reza Arablouei, Vinay Chakravarthi Gogineni, Stefan Werner2024-09-20下载Federated learning (FL) leverages client-server communications to train global models on decentralized data. However, communication noise or errors can impair model accuracy.
RapidOMS: FPGA-based Open Modification Spectral Library Searching with HD ComputingSumukh Pinge, Weihong Xu, Wout Bittremieux, Niema Moshiri, Sang-Woo Jun, Tajana Rosing2024-09-20下载Mass spectrometry (MS) is essential for protein analysis but faces significant challenges with large datasets and complex post-translational modifications, resulting in difficulties in spectral identi...
Flexible Swapping for the CloudMilan Pandurov, Lukas Humbel, Dmitry Sepp, Adamos Ttofari, Leon Thomm, Do Le Quoc, Siddharth Chandrasekaran, Sharan Santhanam, Chuan Ye, Shai Bergman, Wei Wang, Sven Lundgren, Konstantinos Sagonas, Alberto Ros2024-09-20下载Memory has become the primary cost driver in cloud data centers. Yet, a significant portion of memory allocated to VMs in public clouds remains unused.
Performance Enhancement of the Ozaki Scheme on Integer Matrix Multiplication UnitYuki Uchino, Katsuhisa Ozaki, Toshiyuki Imamura2024-09-20下载This study was aimed at simultaneously achieving sufficient accuracy and high performance for general matrix multiplications. Recent architectures, such as NVIDIA GPUs, feature high-performance units ...
Optimizing RLHF Training for Large Language Models with Stage FusionYinmin Zhong, Zili Zhang, Bingyang Wu, Shengyu Liu, Yukun Chen, Changyi Wan, Hanpeng Hu, Lei Xia, Ranchen Ming, Yibo Zhu, Xin Jin2024-09-20下载We present RLHFuse, an efficient training system with stage fusion for Reinforcement Learning from Human Feedback (RLHF). Due to the intrinsic nature of RLHF training, i.e.
Stabl: Blockchain Fault ToleranceVincent Gramoli, Rachid Guerraoui, Andrei Lebedev, Gauthier Voron2024-09-20下载Blockchain promises to make online services more fault tolerant due to their inherent distributed nature. Their ability to execute arbitrary programs in different geo-distributed regions and on divers...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
A Centrality Approach to Select Offloading Data Aggregation Points in Vehicular Sensor NetworksDouglas Moura, Geymerson S. Ramos, Andre L. L. Aquino, Antonio Loureiro2024-09-20下载This work proposes a centrality-based approach to identify data offloading points in a VSN. The solution presents a scheme to select vehicles used as aggregation points to collect and aggregate other ...
Efficient Entanglement Routing for Satellite-Aerial-Terrestrial Quantum NetworksYu Zhang, Yanmin Gong, Lei Fan, Yu Wang, Zhu Han, Yuanxiong Guo2024-09-20下载In the era of 6G and beyond, space-aerial-terrestrial quantum networks (SATQNs) are shaping the future of the global-scale quantum Internet. This paper investigates the collaboration among satellite, ...
Quantum-Assisted Joint Virtual Network Function Deployment and Maximum Flow Routing for Space Information NetworksYu Zhang, Yanmin Gong, Lei Fan, Yu Wang, Zhu Han, Yuanxiong Guo2024-09-20下载Network function virtualization (NFV)-enabled space information network (SIN) has emerged as a promising method to facilitate global coverage and seamless service.
Integrating Deterministic Networking with 5GYash Deshpande, Philip Diederich, Muhamad Luthfi, Laura Becker, José Fontalvo-Hernández, Wolfgang Kellerer2024-09-20下载The rising prevalence of real-time applications that require deterministic communication over mobile networks necessitates the joint operation of both mobile and fixed network components.
Post-Quantum Cryptography Anonymous Scheme -- PQCWC: Post-Quantum Cryptography Winternitz-ChenAbel C. H. Chen2024-09-20下载As quantum computing technology matures, it poses a threat to the security of mainstream asymmetric cryptographic methods. In response, the National Institute of Standards and Technology released the ...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
Flexible Swapping for the CloudMilan Pandurov, Lukas Humbel, Dmitry Sepp, Adamos Ttofari, Leon Thomm, Do Le Quoc, Siddharth Chandrasekaran, Sharan Santhanam, Chuan Ye, Shai Bergman, Wei Wang, Sven Lundgren, Konstantinos Sagonas, Alberto Ros2024-09-20下载Memory has become the primary cost driver in cloud data centers. Yet, a significant portion of memory allocated to VMs in public clouds remains unused.

cs.PF - Performance

标题作者发布日期PDF摘要
RAVE: RISC-V Analyzer of Vector Executions, a QEMU tracing pluginPablo Vizcaino, Filippo Mantovani, Jesus Labarta, Roger Ferrer2024-09-20下载Simulators are crucial during the development of a chip, like the RISC-V accelerator designed in the European Processor Initiative project. In this paper, we showcase the limitations of the current si...
Stabl: Blockchain Fault ToleranceVincent Gramoli, Rachid Guerraoui, Andrei Lebedev, Gauthier Voron2024-09-20下载Blockchain promises to make online services more fault tolerant due to their inherent distributed nature. Their ability to execute arbitrary programs in different geo-distributed regions and on divers...

基于 VitePress 构建