2024-09-04

cs.AR - Architecture

标题	作者	发布日期	PDF	摘要
Register Aggregation for Hardware Decompilation	Varun Rao, Zachary D. Sisco	2024-09-04	下载	Hardware decompilation reverses logic synthesis, converting a gate-level digital electronic design, or netlist, back up to hardware description language (HDL) code.
RTLRewriter: Methodologies for Large Models aided RTL Code Optimization	Xufeng Yao, Yiwen Wang, Xing Li, Yingzhao Lian, Ran Chen, Lei Chen, Mingxuan Yuan, Hong Xu, Bei Yu	2024-09-04	下载	Register Transfer Level (RTL) code optimization is crucial for enhancing the efficiency and performance of digital circuits during early synthesis stages.
ResiLogic: Leveraging Composability and Diversity to Design Fault and Intrusion Resilient Chips	Ahmad T. Sheikh, Ali Shoker, Suhaib A. Fahmy, Paulo Esteves-Verissimo	2024-09-04	下载	A long-standing challenge is the design of chips resilient to faults and glitches. Both fine-grained gate diversity and coarse-grained modular redundancy have been used in the past.

cs.DC - Distributed, Parallel, and Cluster Computing

标题	作者	发布日期	PDF	摘要
On Advanced Monte Carlo Methods for Linear Algebra on Advanced Accelerator Architectures	Anton Lebedev, Vassil Alexandrov	2024-09-04	下载	In this paper we present computational experiments with the Markov Chain Monte Carlo Matrix Inversion ( $(\text{MC})^2\text{MI}$ ) on several accelerator architectures and investigate their impact on pe...
A Bayesian Optimization through Sequential Monte Carlo and Statistical Physics-Inspired Techniques	Anton Lebedev, Thomas Warford, M. Emre Şahin	2024-09-04	下载	In this paper, we propose an approach for an application of Bayesian optimization using Sequential Monte Carlo (SMC) and concepts from the statistical physics of classical systems.
GreenWhisk: Emission-Aware Computing for Serverless Platform	Jayden Serenari, Sreekanth Sreekumar, Kaiwen Zhao, Saurabh Sarkar, Stephen Lee	2024-09-04	下载	Serverless computing is an emerging cloud computing abstraction wherein the cloud platform transparently manages all resources, including explicitly provisioning resources and geographical load balanc...
Towards a Scalable and Efficient PGAS-based Distributed OpenMP	Baodi Shan, Mauricio Araya-Polo, Barbara Chapman	2024-09-04	下载	MPI+X has been the de facto standard for distributed memory parallel programming. It is widely used primarily as an explicit two-sided communication model, which often leads to complex and error-prone...
TS-EoH: An Edge Server Task Scheduling Algorithm Based on Evolution of Heuristic	Wang Yatong, Pei Yuchen, Zhao Yuqi	2024-09-04	下载	With the widespread adoption of 5G and Internet of Things (IoT) technologies, the low latency provided by edge computing has great importance for real-time processing.
A Joint Time and Energy-Efficient Federated Learning-based Computation Offloading Method for Mobile Edge Computing	Anwesha Mukherjee, Rajkumar Buyya	2024-09-04	下载	Computation offloading at lower time and lower energy consumption is crucial for resource limited mobile devices. This paper proposes an offloading decision-making model using federated learning.
ISO: Overlap of Computation and Communication within Seqenence For LLM Inference	Bin Xiao, Lei Su	2024-09-04	下载	In the realm of Large Language Model (LLM) inference, the inherent structure of transformer models coupled with the multi-GPU tensor parallelism strategy leads to a sequential execution of computation...
Accelerating Large Language Model Training with Hybrid GPU-based Compression	Lang Xu, Quentin Anthony, Qinghua Zhou, Nawras Alnaasan, Radha R. Gulhane, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda	2024-09-04	下载	Data Parallelism (DP), Tensor Parallelism (TP), and Pipeline Parallelism (PP) are the three strategies widely adopted to enable fast and efficient Large Language Model (LLM) training.
Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA	Shuangyi Chen, Yue Ju, Hardik Dalal, Zhongwen Zhu, Ashish Khisti	2024-09-04	下载	Parameter-Efficient Fine-Tuning (PEFT) has risen as an innovative training strategy that updates only a select few model parameters, significantly lowering both computational and memory demands.

cs.NI - Networking and Internet Architecture

标题	作者	发布日期	PDF	摘要
What is Normal? A Big Data Observational Science Model of Anonymized Internet Traffic	Jeremy Kepner, Hayden Jananthan, Michael Jones, William Arcand, David Bestor, William Bergeron, Daniel Burrill, Aydin Buluc, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Piotr Luszczek, Lauren Milechin, Chasen Milner, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Gabriel Wachman, Charles Yee, Peter Michaleas	2024-09-04	下载	Understanding what is normal is a key aspect of protecting a domain. Other domains invest heavily in observational science to develop models of normal behavior to better detect anomalies.
VECA: Reliable and Confidential Resource Clustering for Volunteer Edge-Cloud Computing	Hemanth Sai Yeddulapalli, Mauro Lemus Alarcon, Upasana Roy, Roshan Lal Neupane, Durbek Gafurov, Motahare Mounesan, Saptarshi Debroy, Prasad Calyam	2024-09-04	下载	Volunteer Edge-Cloud (VEC) computing has a significant potential to support scientific workflows in user communities contributing volunteer edge nodes.
Anomaly Detection in Offshore Open Radio Access Network Using Long Short-Term Memory Models on a Novel Artificial Intelligence-Driven Cloud-Native Data Platform	Abdelrahim Ahmad, Peizheng Li, Robert Piechocki, Rui Inacio	2024-09-04	下载	The Radio Access Network (RAN) is a critical component of modern telecommunications infrastructure, currently evolving towards disaggregated and open architectures.
Knowledge Transfer for Collaborative Misbehavior Detection in Untrusted Vehicular Environments	Roshan Sedar, Charalampos Kalalas, Paolo Dini, Francisco Vazquez-Gallego, Jesus Alonso-Zarate, Luis Alonso	2024-09-04	下载	Vehicular mobility underscores the need for collaborative misbehavior detection at the vehicular edge. However, locally trained misbehavior detection models are susceptible to adversarial attacks that...
Jäger: Automated Telephone Call Traceback	David Adei, Varun Madathil, Sathvik Prasad, Bradley Reaves, Alessandra Scafuro	2024-09-04	下载	Unsolicited telephone calls that facilitate fraud or unlawful telemarketing continue to overwhelm network users and the regulators who prosecute them.
Towards Edge-Based Data Lake Architecture for Intelligent Transportation System	Danilo Fernandes, Douglas L. L. Moura, Gean Santos, Geymerson S. Ramos, Fabiane Queiroz, Andre L. L. Aquino	2024-09-04	下载	The rapid urbanization growth has underscored the need for innovative solutions to enhance transportation efficiency and safety. Intelligent Transportation Systems (ITS) have emerged as a promising so...
Enhancing 5G Performance: Reducing Service Time and Research Directions for 6G Standards	Laura Landon, Vipindev Adat Vasudevan, Jaeweon Kim, Junmo Sung, Jeffery Tony Masters, Muriel Médard	2024-09-04	下载	This paper presents several methods for minimizing packet service time in networks using 5G and beyond. We propose leveraging network coding alongside Hybrid Automatic Repeat reQuest (HARQ) to reduce ...
Security Implications and Mitigation Strategies in MPLS Networks	Ayush Thakur	2024-09-04	下载	Multiprotocol Label Switching (MPLS) is a high-performance telecommunications technology that directs data from one network node to another based on short path labels rather than long network addresse...
AirFogSim: A Light-Weight and Modular Simulator for UAV-Integrated Vehicular Fog Computing	Zhiwei Wei, Chenran Huang, Bing Li, Yiting Zhao, Xiang Cheng, Liuqing Yang, Rongqing Zhang	2024-09-04	下载	Vehicular Fog Computing (VFC) is significantly enhancing the efficiency, safety, and computational capabilities of Intelligent Transportation Systems (ITS), and the integration of Unmanned Aerial Vehi...
A Dynamic Resource Scheduling Algorithm Based on Traffic Prediction for Coexistence of eMBB and Random Arrival URLLC	Yizhou Jiang, Xiujun Zhang, Xiaofeng Zhong, Shidong Zhou	2024-09-04	下载	In this paper, we propose a joint design for the coexistence of enhanced mobile broadband (eMBB) and ultra-reliable and random low-latency communication (URLLC) with different transmission time interv...
FlexBSO: Flexible Block Storage Offload for Datacenters	Vojtech Aschenbrenner, John Shawger, Sadman Sakib	2024-09-04	下载	Efficient virtualization of CPU and memory is standardized and mature. Capabilities such as Intel VT-x [3] have been added by manufacturers for efficient hypervisor support.

cs.OS - Operating Systems

标题	作者	发布日期	PDF	摘要
FlexBSO: Flexible Block Storage Offload for Datacenters	Vojtech Aschenbrenner, John Shawger, Sadman Sakib	2024-09-04	下载	Efficient virtualization of CPU and memory is standardized and mature. Capabilities such as Intel VT-x [3] have been added by manufacturers for efficient hypervisor support.

cs.PF - Performance

标题	作者	发布日期	PDF	摘要
Towards a Scalable and Efficient PGAS-based Distributed OpenMP	Baodi Shan, Mauricio Araya-Polo, Barbara Chapman	2024-09-04	下载	MPI+X has been the de facto standard for distributed memory parallel programming. It is widely used primarily as an explicit two-sided communication model, which often leads to complex and error-prone...
ISO: Overlap of Computation and Communication within Seqenence For LLM Inference	Bin Xiao, Lei Su	2024-09-04	下载	In the realm of Large Language Model (LLM) inference, the inherent structure of transformer models coupled with the multi-GPU tensor parallelism strategy leads to a sequential execution of computation...