2024-09-04
cs.AR - Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Register Aggregation for Hardware Decompilation | Varun Rao, Zachary D. Sisco | 2024-09-04 | Download | Hardware decompilation reverses logic synthesis, converting a gate-level digital electronic design, or netlist, back up to hardware description language (HDL) code. |
| RTLRewriter: Methodologies for Large Models aided RTL Code Optimization | Xufeng Yao, Yiwen Wang, Xing Li, Yingzhao Lian, Ran Chen, Lei Chen, Mingxuan Yuan, Hong Xu, Bei Yu | 2024-09-04 | Download | Register Transfer Level (RTL) code optimization is crucial for enhancing the efficiency and performance of digital circuits during early synthesis stages. |
| ResiLogic: Leveraging Composability and Diversity to Design Fault and Intrusion Resilient Chips | Ahmad T. Sheikh, Ali Shoker, Suhaib A. Fahmy, Paulo Esteves-Verissimo | 2024-09-04 | Download | A long-standing challenge is the design of chips resilient to faults and glitches. Both fine-grained gate diversity and coarse-grained modular redundancy have been used in the past. |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| On Advanced Monte Carlo Methods for Linear Algebra on Advanced Accelerator Architectures | Anton Lebedev, Vassil Alexandrov | 2024-09-04 | Download | In this paper we present computational experiments with the Markov Chain Monte Carlo Matrix Inversion on several accelerator architectures and investigate their impact on pe... |
| A Bayesian Optimization through Sequential Monte Carlo and Statistical Physics-Inspired Techniques | Anton Lebedev, Thomas Warford, M. Emre Şahin | 2024-09-04 | Download | In this paper, we propose an approach for an application of Bayesian optimization using Sequential Monte Carlo (SMC) and concepts from the statistical physics of classical systems. |
| GreenWhisk: Emission-Aware Computing for Serverless Platform | Jayden Serenari, Sreekanth Sreekumar, Kaiwen Zhao, Saurabh Sarkar, Stephen Lee | 2024-09-04 | Download | Serverless computing is an emerging cloud computing abstraction wherein the cloud platform transparently manages all resources, including explicitly provisioning resources and geographical load balanc... |
| Towards a Scalable and Efficient PGAS-based Distributed OpenMP | Baodi Shan, Mauricio Araya-Polo, Barbara Chapman | 2024-09-04 | Download | MPI+X has been the de facto standard for distributed memory parallel programming. It is widely used primarily as an explicit two-sided communication model, which often leads to complex and error-prone... |
| TS-EoH: An Edge Server Task Scheduling Algorithm Based on Evolution of Heuristic | Wang Yatong, Pei Yuchen, Zhao Yuqi | 2024-09-04 | Download | With the widespread adoption of 5G and Internet of Things (IoT) technologies, the low latency provided by edge computing has great importance for real-time processing. |
| A Joint Time and Energy-Efficient Federated Learning-based Computation Offloading Method for Mobile Edge Computing | Anwesha Mukherjee, Rajkumar Buyya | 2024-09-04 | Download | Computation offloading at lower time and lower energy consumption is crucial for resource limited mobile devices. This paper proposes an offloading decision-making model using federated learning. |
| ISO: Overlap of Computation and Communication within Seqenence For LLM Inference | Bin Xiao, Lei Su | 2024-09-04 | Download | In the realm of Large Language Model (LLM) inference, the inherent structure of transformer models coupled with the multi-GPU tensor parallelism strategy leads to a sequential execution of computation... |
| Accelerating Large Language Model Training with Hybrid GPU-based Compression | Lang Xu, Quentin Anthony, Qinghua Zhou, Nawras Alnaasan, Radha R. Gulhane, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda | 2024-09-04 | Download | Data Parallelism (DP), Tensor Parallelism (TP), and Pipeline Parallelism (PP) are the three strategies widely adopted to enable fast and efficient Large Language Model (LLM) training. |
| Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA | Shuangyi Chen, Yue Ju, Hardik Dalal, Zhongwen Zhu, Ashish Khisti | 2024-09-04 | Download | Parameter-Efficient Fine-Tuning (PEFT) has risen as an innovative training strategy that updates only a select few model parameters, significantly lowering both computational and memory demands. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| What is Normal? A Big Data Observational Science Model of Anonymized Internet Traffic | Jeremy Kepner, Hayden Jananthan, Michael Jones, William Arcand, David Bestor, William Bergeron, Daniel Burrill, Aydin Buluc, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Piotr Luszczek, Lauren Milechin, Chasen Milner, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Gabriel Wachman, Charles Yee, Peter Michaleas | 2024-09-04 | Download | Understanding what is normal is a key aspect of protecting a domain. Other domains invest heavily in observational science to develop models of normal behavior to better detect anomalies. |
| VECA: Reliable and Confidential Resource Clustering for Volunteer Edge-Cloud Computing | Hemanth Sai Yeddulapalli, Mauro Lemus Alarcon, Upasana Roy, Roshan Lal Neupane, Durbek Gafurov, Motahare Mounesan, Saptarshi Debroy, Prasad Calyam | 2024-09-04 | Download | Volunteer Edge-Cloud (VEC) computing has a significant potential to support scientific workflows in user communities contributing volunteer edge nodes. |
| Anomaly Detection in Offshore Open Radio Access Network Using Long Short-Term Memory Models on a Novel Artificial Intelligence-Driven Cloud-Native Data Platform | Abdelrahim Ahmad, Peizheng Li, Robert Piechocki, Rui Inacio | 2024-09-04 | Download | The Radio Access Network (RAN) is a critical component of modern telecommunications infrastructure, currently evolving towards disaggregated and open architectures. |
| Knowledge Transfer for Collaborative Misbehavior Detection in Untrusted Vehicular Environments | Roshan Sedar, Charalampos Kalalas, Paolo Dini, Francisco Vazquez-Gallego, Jesus Alonso-Zarate, Luis Alonso | 2024-09-04 | Download | Vehicular mobility underscores the need for collaborative misbehavior detection at the vehicular edge. However, locally trained misbehavior detection models are susceptible to adversarial attacks that... |
| Jäger: Automated Telephone Call Traceback | David Adei, Varun Madathil, Sathvik Prasad, Bradley Reaves, Alessandra Scafuro | 2024-09-04 | Download | Unsolicited telephone calls that facilitate fraud or unlawful telemarketing continue to overwhelm network users and the regulators who prosecute them. |
| Towards Edge-Based Data Lake Architecture for Intelligent Transportation System | Danilo Fernandes, Douglas L. L. Moura, Gean Santos, Geymerson S. Ramos, Fabiane Queiroz, Andre L. L. Aquino | 2024-09-04 | Download | The rapid urbanization growth has underscored the need for innovative solutions to enhance transportation efficiency and safety. Intelligent Transportation Systems (ITS) have emerged as a promising so... |
| Enhancing 5G Performance: Reducing Service Time and Research Directions for 6G Standards | Laura Landon, Vipindev Adat Vasudevan, Jaeweon Kim, Junmo Sung, Jeffery Tony Masters, Muriel Médard | 2024-09-04 | Download | This paper presents several methods for minimizing packet service time in networks using 5G and beyond. We propose leveraging network coding alongside Hybrid Automatic Repeat reQuest (HARQ) to reduce ... |
| Security Implications and Mitigation Strategies in MPLS Networks | Ayush Thakur | 2024-09-04 | Download | Multiprotocol Label Switching (MPLS) is a high-performance telecommunications technology that directs data from one network node to another based on short path labels rather than long network addresse... |
| AirFogSim: A Light-Weight and Modular Simulator for UAV-Integrated Vehicular Fog Computing | Zhiwei Wei, Chenran Huang, Bing Li, Yiting Zhao, Xiang Cheng, Liuqing Yang, Rongqing Zhang | 2024-09-04 | Download | Vehicular Fog Computing (VFC) is significantly enhancing the efficiency, safety, and computational capabilities of Intelligent Transportation Systems (ITS), and the integration of Unmanned Aerial Vehi... |
| A Dynamic Resource Scheduling Algorithm Based on Traffic Prediction for Coexistence of eMBB and Random Arrival URLLC | Yizhou Jiang, Xiujun Zhang, Xiaofeng Zhong, Shidong Zhou | 2024-09-04 | Download | In this paper, we propose a joint design for the coexistence of enhanced mobile broadband (eMBB) and ultra-reliable and random low-latency communication (URLLC) with different transmission time interv... |
| FlexBSO: Flexible Block Storage Offload for Datacenters | Vojtech Aschenbrenner, John Shawger, Sadman Sakib | 2024-09-04 | Download | Efficient virtualization of CPU and memory is standardized and mature. Capabilities such as Intel VT-x [3] have been added by manufacturers for efficient hypervisor support. |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| FlexBSO: Flexible Block Storage Offload for Datacenters | Vojtech Aschenbrenner, John Shawger, Sadman Sakib | 2024-09-04 | Download | Efficient virtualization of CPU and memory is standardized and mature. Capabilities such as Intel VT-x [3] have been added by manufacturers for efficient hypervisor support. |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Towards a Scalable and Efficient PGAS-based Distributed OpenMP | Baodi Shan, Mauricio Araya-Polo, Barbara Chapman | 2024-09-04 | Download | MPI+X has been the de facto standard for distributed memory parallel programming. It is widely used primarily as an explicit two-sided communication model, which often leads to complex and error-prone... |
| ISO: Overlap of Computation and Communication within Seqenence For LLM Inference | Bin Xiao, Lei Su | 2024-09-04 | Download | In the realm of Large Language Model (LLM) inference, the inherent structure of transformer models coupled with the multi-GPU tensor parallelism strategy leads to a sequential execution of computation... |