Skip to content

2024-09-04

cs.AR - Architecture

标题作者发布日期PDF摘要
Register Aggregation for Hardware DecompilationVarun Rao, Zachary D. Sisco2024-09-04下载Hardware decompilation reverses logic synthesis, converting a gate-level digital electronic design, or netlist, back up to hardware description language (HDL) code.
RTLRewriter: Methodologies for Large Models aided RTL Code OptimizationXufeng Yao, Yiwen Wang, Xing Li, Yingzhao Lian, Ran Chen, Lei Chen, Mingxuan Yuan, Hong Xu, Bei Yu2024-09-04下载Register Transfer Level (RTL) code optimization is crucial for enhancing the efficiency and performance of digital circuits during early synthesis stages.
ResiLogic: Leveraging Composability and Diversity to Design Fault and Intrusion Resilient ChipsAhmad T. Sheikh, Ali Shoker, Suhaib A. Fahmy, Paulo Esteves-Verissimo2024-09-04下载A long-standing challenge is the design of chips resilient to faults and glitches. Both fine-grained gate diversity and coarse-grained modular redundancy have been used in the past.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
On Advanced Monte Carlo Methods for Linear Algebra on Advanced Accelerator ArchitecturesAnton Lebedev, Vassil Alexandrov2024-09-04下载In this paper we present computational experiments with the Markov Chain Monte Carlo Matrix Inversion ((MC)2MI(\text{MC})^2\text{MI}) on several accelerator architectures and investigate their impact on pe...
A Bayesian Optimization through Sequential Monte Carlo and Statistical Physics-Inspired TechniquesAnton Lebedev, Thomas Warford, M. Emre Şahin2024-09-04下载In this paper, we propose an approach for an application of Bayesian optimization using Sequential Monte Carlo (SMC) and concepts from the statistical physics of classical systems.
GreenWhisk: Emission-Aware Computing for Serverless PlatformJayden Serenari, Sreekanth Sreekumar, Kaiwen Zhao, Saurabh Sarkar, Stephen Lee2024-09-04下载Serverless computing is an emerging cloud computing abstraction wherein the cloud platform transparently manages all resources, including explicitly provisioning resources and geographical load balanc...
Towards a Scalable and Efficient PGAS-based Distributed OpenMPBaodi Shan, Mauricio Araya-Polo, Barbara Chapman2024-09-04下载MPI+X has been the de facto standard for distributed memory parallel programming. It is widely used primarily as an explicit two-sided communication model, which often leads to complex and error-prone...
TS-EoH: An Edge Server Task Scheduling Algorithm Based on Evolution of HeuristicWang Yatong, Pei Yuchen, Zhao Yuqi2024-09-04下载With the widespread adoption of 5G and Internet of Things (IoT) technologies, the low latency provided by edge computing has great importance for real-time processing.
A Joint Time and Energy-Efficient Federated Learning-based Computation Offloading Method for Mobile Edge ComputingAnwesha Mukherjee, Rajkumar Buyya2024-09-04下载Computation offloading at lower time and lower energy consumption is crucial for resource limited mobile devices. This paper proposes an offloading decision-making model using federated learning.
ISO: Overlap of Computation and Communication within Seqenence For LLM InferenceBin Xiao, Lei Su2024-09-04下载In the realm of Large Language Model (LLM) inference, the inherent structure of transformer models coupled with the multi-GPU tensor parallelism strategy leads to a sequential execution of computation...
Accelerating Large Language Model Training with Hybrid GPU-based CompressionLang Xu, Quentin Anthony, Qinghua Zhou, Nawras Alnaasan, Radha R. Gulhane, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda2024-09-04下载Data Parallelism (DP), Tensor Parallelism (TP), and Pipeline Parallelism (PP) are the three strategies widely adopted to enable fast and efficient Large Language Model (LLM) training.
Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRAShuangyi Chen, Yue Ju, Hardik Dalal, Zhongwen Zhu, Ashish Khisti2024-09-04下载Parameter-Efficient Fine-Tuning (PEFT) has risen as an innovative training strategy that updates only a select few model parameters, significantly lowering both computational and memory demands.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
What is Normal? A Big Data Observational Science Model of Anonymized Internet TrafficJeremy Kepner, Hayden Jananthan, Michael Jones, William Arcand, David Bestor, William Bergeron, Daniel Burrill, Aydin Buluc, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Piotr Luszczek, Lauren Milechin, Chasen Milner, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Gabriel Wachman, Charles Yee, Peter Michaleas2024-09-04下载Understanding what is normal is a key aspect of protecting a domain. Other domains invest heavily in observational science to develop models of normal behavior to better detect anomalies.
VECA: Reliable and Confidential Resource Clustering for Volunteer Edge-Cloud ComputingHemanth Sai Yeddulapalli, Mauro Lemus Alarcon, Upasana Roy, Roshan Lal Neupane, Durbek Gafurov, Motahare Mounesan, Saptarshi Debroy, Prasad Calyam2024-09-04下载Volunteer Edge-Cloud (VEC) computing has a significant potential to support scientific workflows in user communities contributing volunteer edge nodes.
Anomaly Detection in Offshore Open Radio Access Network Using Long Short-Term Memory Models on a Novel Artificial Intelligence-Driven Cloud-Native Data PlatformAbdelrahim Ahmad, Peizheng Li, Robert Piechocki, Rui Inacio2024-09-04下载The Radio Access Network (RAN) is a critical component of modern telecommunications infrastructure, currently evolving towards disaggregated and open architectures.
Knowledge Transfer for Collaborative Misbehavior Detection in Untrusted Vehicular EnvironmentsRoshan Sedar, Charalampos Kalalas, Paolo Dini, Francisco Vazquez-Gallego, Jesus Alonso-Zarate, Luis Alonso2024-09-04下载Vehicular mobility underscores the need for collaborative misbehavior detection at the vehicular edge. However, locally trained misbehavior detection models are susceptible to adversarial attacks that...
Jäger: Automated Telephone Call TracebackDavid Adei, Varun Madathil, Sathvik Prasad, Bradley Reaves, Alessandra Scafuro2024-09-04下载Unsolicited telephone calls that facilitate fraud or unlawful telemarketing continue to overwhelm network users and the regulators who prosecute them.
Towards Edge-Based Data Lake Architecture for Intelligent Transportation SystemDanilo Fernandes, Douglas L. L. Moura, Gean Santos, Geymerson S. Ramos, Fabiane Queiroz, Andre L. L. Aquino2024-09-04下载The rapid urbanization growth has underscored the need for innovative solutions to enhance transportation efficiency and safety. Intelligent Transportation Systems (ITS) have emerged as a promising so...
Enhancing 5G Performance: Reducing Service Time and Research Directions for 6G StandardsLaura Landon, Vipindev Adat Vasudevan, Jaeweon Kim, Junmo Sung, Jeffery Tony Masters, Muriel Médard2024-09-04下载This paper presents several methods for minimizing packet service time in networks using 5G and beyond. We propose leveraging network coding alongside Hybrid Automatic Repeat reQuest (HARQ) to reduce ...
Security Implications and Mitigation Strategies in MPLS NetworksAyush Thakur2024-09-04下载Multiprotocol Label Switching (MPLS) is a high-performance telecommunications technology that directs data from one network node to another based on short path labels rather than long network addresse...
AirFogSim: A Light-Weight and Modular Simulator for UAV-Integrated Vehicular Fog ComputingZhiwei Wei, Chenran Huang, Bing Li, Yiting Zhao, Xiang Cheng, Liuqing Yang, Rongqing Zhang2024-09-04下载Vehicular Fog Computing (VFC) is significantly enhancing the efficiency, safety, and computational capabilities of Intelligent Transportation Systems (ITS), and the integration of Unmanned Aerial Vehi...
A Dynamic Resource Scheduling Algorithm Based on Traffic Prediction for Coexistence of eMBB and Random Arrival URLLCYizhou Jiang, Xiujun Zhang, Xiaofeng Zhong, Shidong Zhou2024-09-04下载In this paper, we propose a joint design for the coexistence of enhanced mobile broadband (eMBB) and ultra-reliable and random low-latency communication (URLLC) with different transmission time interv...
FlexBSO: Flexible Block Storage Offload for DatacentersVojtech Aschenbrenner, John Shawger, Sadman Sakib2024-09-04下载Efficient virtualization of CPU and memory is standardized and mature. Capabilities such as Intel VT-x [3] have been added by manufacturers for efficient hypervisor support.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
FlexBSO: Flexible Block Storage Offload for DatacentersVojtech Aschenbrenner, John Shawger, Sadman Sakib2024-09-04下载Efficient virtualization of CPU and memory is standardized and mature. Capabilities such as Intel VT-x [3] have been added by manufacturers for efficient hypervisor support.

cs.PF - Performance

标题作者发布日期PDF摘要
Towards a Scalable and Efficient PGAS-based Distributed OpenMPBaodi Shan, Mauricio Araya-Polo, Barbara Chapman2024-09-04下载MPI+X has been the de facto standard for distributed memory parallel programming. It is widely used primarily as an explicit two-sided communication model, which often leads to complex and error-prone...
ISO: Overlap of Computation and Communication within Seqenence For LLM InferenceBin Xiao, Lei Su2024-09-04下载In the realm of Large Language Model (LLM) inference, the inherent structure of transformer models coupled with the multi-GPU tensor parallelism strategy leads to a sequential execution of computation...

基于 VitePress 构建