Skip to content

2024-07-01

cs.AR - Architecture

标题作者发布日期PDF摘要
Exploring FPGA designs for MX and beyondEbby Samson, Naveen Mellempudi, Wayne Luk, George A. Constantinides2024-07-01下载A number of companies recently worked together to release the new Open Compute Project MX standard for low-precision computation, aimed at efficient neural network implementation.
A Novel HDL Code Generator for Effectively Testing FPGA Logic Synthesis CompilersZhihao Xu, Shikai Guo, Guilin Zhao, Peiyu Zou, Xiaochen Li, He Jiang2024-07-01下载Field Programmable Gate Array (FPGA) logic synthesis compilers (e.g., Vivado, Iverilog, Yosys, and Quartus) are widely applied in Electronic Design Automation (EDA), such as the development of FPGA pr...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Introducing SWIRL: An Intermediate Representation Language for Scientific WorkflowsIacopo Colonnelli, Doriana Medić, Alberto Mulone, Viviana Bono, Luca Padovani, Marco Aldinucci2024-07-01下载In the ever-evolving landscape of scientific computing, properly supporting the modularity and complexity of modern scientific applications requires new approaches to workflow execution, like seamless...
Accelerate Intermittent Deep InferenceZiliang Zhang2024-07-01下载Emerging research in edge devices and micro-controller units (MCU) enables on-device computation of Deep Learning Training and Inferencing tasks.
Object Proxy Patterns for Accelerating Distributed ApplicationsJ. Gregory Pauloski, Valerie Hayot-Sasson, Logan Ward, Alexander Brace, André Bauer, Kyle Chard, Ian Foster2024-07-01下载Workflow and serverless frameworks have empowered new approaches to distributed application design by abstracting compute resources. However, their typically limited or one-size-fits-all support for a...
Linearizability and State-Machine Replication: Is it a match?Franz J. Hauck, Alexander Heß2024-07-01下载Linearizability is a well-known correctness property for concurrent and distributed systems. In the past, it was also used to prove the design and implementation of replicated state-machines correct.
Scaling on Frontier: Uncertainty Quantification Workflow Applications using ExaWorks to Enable Full System UtilizationMikhail Titov, Robert Carson, Matthew Rolchigo, John Coleman, James Belak, Matthew Bement, Daniel Laney, Matteo Turilli, Shantenu Jha2024-07-01下载When running at scale, modern scientific workflows require middleware to handle allocated resources, distribute computing payloads and guarantee a resilient execution.
Enabling MPI communication within Numba/LLVM JIT-compiled Python code using numba-mpi v1.0Kacper Derlatka, Maciej Manna, Oleksii Bulenok, David Zwicker, Sylwester Arabas2024-07-01下载The numba-mpi package offers access to the Message Passing Interface (MPI) routines from Python code that uses the Numba just-in-time (JIT) compiler.
LLload: Simplifying Real-Time Job Monitoring for HPC UsersChansup Byun, Julia Mullen, Albert Reuther, William Arcand, William Bergeron, David Bestor, Daniel Burrill, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Peter Michaleas, Guillermo Morales, Andrew Prout, Antonio Rosa, Charles Yee, Jeremy Kepner, Lauren Milechin2024-07-01下载One of the more complex tasks for researchers using HPC systems is performance monitoring and tuning of their applications. Developing a practice of continuous performance improvement, both for speed-...
Reinforcement Learning-driven Data-intensive Workflow Scheduling for Volunteer Edge-CloudMotahare Mounesan, Mauro Lemus, Hemanth Yeddulapalli, Prasad Calyam, Saptarshi Debroy2024-07-01下载In recent times, Volunteer Edge-Cloud (VEC) has gained traction as a cost-effective, community computing paradigm to support data-intensive scientific workflows.
Maximizing Blockchain Performance: Mitigating Conflicting Transactions through Parallelism and Dependency ManagementFaisal Haque Bappy, Tarannum Shaila Zaman, Md Sajidul Islam Sajid, Mir Mehedi Ahsan Pritom, Tariqul Islam2024-07-01下载While blockchains initially gained popularity in the realm of cryptocurrencies, their widespread adoption is expanding beyond conventional applications, driven by the imperative need for enhanced data...
Optimizing Age of Information in Vehicular Edge Computing with Federated Graph Neural Network Multi-Agent Reinforcement LearningWenhua Wang, Qiong Wu, Pingyi Fan, Nan Cheng, Wen Chen, Jiangzhou Wang, Khaled B. Letaief2024-07-01下载With the rapid development of intelligent vehicles and Intelligent Transport Systems (ITS), the sensors such as cameras and LiDAR installed on intelligent vehicles provides higher capacity of executin...
Energy-Aware Decentralized Learning with Intermittent Model TrainingAkash Dhasade, Paolo Dini, Elia Guerra, Anne-Marie Kermarrec, Marco Miozzo, Rafael Pires, Rishi Sharma, Martijn de Vos2024-07-01下载Decentralized learning (DL) offers a powerful framework where nodes collaboratively train models without sharing raw data and without the coordination of a central server.
I've Got 99 Problems But FLOPS Ain't OneAlexandru M. Gherghescu, Vlad-Andrei Bădoiu, Alexandru Agache, Mihai-Valentin Dumitru, Iuliu Vasilescu, Radu Mantu, Costin Raiciu2024-07-01下载Hyperscalers dominate the landscape of large network deployments, yet they rarely share data or insights about the challenges they face. In light of this supremacy, what problems can we find to solve ...
SplitLoRA: A Split Parameter-Efficient Fine-Tuning Framework for Large Language ModelsZheng Lin, Xuanjie Hu, Yuxin Zhang, Zhe Chen, Zihan Fang, Xianhao Chen, Ang Li, Praneeth Vepakomma, Yue Gao2024-07-01下载The scalability of large language models (LLMs) in handling high-complexity models and large-scale datasets has led to tremendous successes in pivotal domains.
FedEx: Expediting Federated Learning over Heterogeneous Mobile Devices by Overlapping and Participant SelectionJiaxiang Geng, Boyu Li, Xiaoqi Qin, Yixuan Li, Liang Li, Yanzhao Hou, Miao Pan2024-07-01下载Training latency is critical for the success of numerous intrigued applications ignited by federated learning (FL) over heterogeneous mobile devices.
Reconfigurable Intelligent Computational Surfaces for MEC-Assisted Autonomous Driving Networks: Design Optimization and AnalysisXueyao Zhang, Bo Yang, Zhiwen Yu, Xuelin Cao, George C. Alexandropoulos, Yan Zhang, Merouane Debbah, Chau Yuen2024-07-01下载This paper investigates autonomous driving safety improvement via task offloading from cellular vehicles (CVs) to a multi-access edge computing (MEC) server using vehicle-to-infrastructure (V2I) links...
Privacy-First Crowdsourcing: Blockchain and Local Differential Privacy in Crowdsourced Drone ServicesJunaid Akram, Ali Anaissi2024-07-01下载We introduce a privacy-preserving framework for integrating consumer-grade drones into bushfire management. This system creates a marketplace where bushfire management authorities obtain essential dat...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
An infinite server system with packing constraints and ranked serversAlexander Stolyar2024-07-01下载A service system with multiple types of customers, arriving as Poisson processes, is considered. The system has infinite number of servers, ranked by 1,2,3,1,2,3, \ldots; a server rank is its ``location.
Science DMZ Networks: How Different are They Really?Emily Mutter, Susmit Shannigrahi2024-07-01下载The Science Demilitarized Zone (Science DMZ) is a network environment optimized for scientific applications. A Science DMZ provides an environment mostly free from competing traffic flows and complex ...
SONIC: Connect the Unconnected via FM Radio & SMSAyush Pandey, Rohail Asim, Khalid Mengal, Matteo Varvello, Yasir Zaki2024-07-01下载As of 2022, about 2.78 billion people in developing countries do not have access to the Internet. Lack of Internet access hinders economic growth, educational opportunities, and access to information ...
Optimizing Age of Information in Vehicular Edge Computing with Federated Graph Neural Network Multi-Agent Reinforcement LearningWenhua Wang, Qiong Wu, Pingyi Fan, Nan Cheng, Wen Chen, Jiangzhou Wang, Khaled B. Letaief2024-07-01下载With the rapid development of intelligent vehicles and Intelligent Transport Systems (ITS), the sensors such as cameras and LiDAR installed on intelligent vehicles provides higher capacity of executin...
Beyond Throughput and Compression Ratios: Towards High End-to-end Utility of Gradient CompressionWenchen Han, Shay Vargaftik, Michael Mitzenmacher, Brad Karp, Ran Ben Basat2024-07-01下载Gradient aggregation has long been identified as a major bottleneck in today's large-scale distributed machine learning training systems. One promising solution to mitigate such bottlenecks is gradien...
PCAPVision: PCAP-Based High-Velocity and Large-Volume Network Failure DetectionLukasz Tulczyjew, Ihor Biruk, Murat Bilgic, Charles Abondo, Nathanael Weill2024-07-01下载Detecting failures via analysis of Packet Capture (PCAP) files is crucial for maintaining network reliability and performance, especially in large-scale telecommunications networks.
Deploying AI-Based Applications with Serverless Computing in 6G Networks: An Experimental StudyMarc Michalke, Chukwuemeka Muonagor, Admela Jukan2024-07-01下载Future 6G networks are expected to heavily utilize machine learning capabilities in a wide variety of applications with features and benefits for both, the end user and the provider.
I've Got 99 Problems But FLOPS Ain't OneAlexandru M. Gherghescu, Vlad-Andrei Bădoiu, Alexandru Agache, Mihai-Valentin Dumitru, Iuliu Vasilescu, Radu Mantu, Costin Raiciu2024-07-01下载Hyperscalers dominate the landscape of large network deployments, yet they rarely share data or insights about the challenges they face. In light of this supremacy, what problems can we find to solve ...
Neuro-Symbolic Fusion of Wi-Fi Sensing Data for Passive Radar with Inter-Modal Knowledge TransferMarco Cominelli, Francesco Gringoli, Lance M. Kaplan, Mani B. Srivastava, Trevor Bihl, Erik P. Blasch, Nandini Iyer, Federico Cerutti2024-07-01下载Wi-Fi devices, akin to passive radars, can discern human activities within indoor settings due to the human body's interaction with electromagnetic signals.
Accurate Passive Radar via an Uncertainty-Aware Fusion of Wi-Fi Sensing DataMarco Cominelli, Francesco Gringoli, Lance M. Kaplan, Mani B. Srivastava, Federico Cerutti2024-07-01下载Wi-Fi devices can effectively be used as passive radar systems that sense what happens in the surroundings and can even discern human activity.
Exploiting Dependency-Aware Priority Adjustment for Mixed-Criticality TSN Flow SchedulingMiao Guo, Yifei Sun, Chaojie Gu, Shibo He, Zhiguo Shi2024-07-01下载Time-Sensitive Networking (TSN) serves as a one-size-fits-all solution for mixed-criticality communication, in which flow scheduling is vital to guarantee real-time transmissions.
Enhancing Vehicular Networks with Generative AI: Opportunities and ChallengesTeef David, Kassi Muhammad, Kevin Nassisid, Bronny Farus2024-07-01下载In the burgeoning field of intelligent transportation systems, the integration of Generative Artificial Intelligence (AI) into vehicular networks presents a transformative potential for the automotive...
The Future of QKD NetworksAlin-Bogdan Popa, Pantelimon George Popescu2024-07-01下载With the recent advancements in quantum technologies, the QKD market exploded. World players are scrambling to win the race towards global QKD networks, even before the rules and policies required by ...

cs.PF - Performance

标题作者发布日期PDF摘要
LLload: Simplifying Real-Time Job Monitoring for HPC UsersChansup Byun, Julia Mullen, Albert Reuther, William Arcand, William Bergeron, David Bestor, Daniel Burrill, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Peter Michaleas, Guillermo Morales, Andrew Prout, Antonio Rosa, Charles Yee, Jeremy Kepner, Lauren Milechin2024-07-01下载One of the more complex tasks for researchers using HPC systems is performance monitoring and tuning of their applications. Developing a practice of continuous performance improvement, both for speed-...

基于 VitePress 构建