
2024-06-26

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Constable: Improving Performance and Power Efficiency by Safely Eliminating Load Instruction Execution | Rahul Bera, Adithya Ranganathan, Joydeep Rakshit, Sujit Mahto, Anant V. Nori, Jayesh Gaur, Ataberk Olgun, Konstantinos Kanellopoulos, Mohammad Sadrosadati, Sreenivas Subramoney, Onur Mutlu | 2024-06-26 | Download | Load instructions often limit instruction-level parallelism (ILP) in modern processors due to data and resource dependences they cause. Prior techniques like Load Value Prediction (LVP) and Memory Ren... |
| On Approximate 8-bit Floating-Point Operations Using Integer Operations | Theodor Lindberg, Oscar Gustafsson | 2024-06-26 | Download | In this work, approximate eight-bit floating-point operations performed using simple integer operations is discussed. For two-bit mantissa formats, faithful rounding can always be obtained for the con... |
| A Lightweight Algorithm for Classifying Ex Vivo Tissues Samples | Tzu-Hao Li, Ethan Murphy, Allaire Doussan, Ryan Halter, Kofi Odame | 2024-06-26 | Download | In this paper, we present a novel algorithm for classifying ex vivo tissue that comprises multi-channel bioimpedance analysis and a hardware neural network. |
| A Jammer-Mitigating 267 Mb/s 3.78 mm$^2$ 583 mW 32$\times$8 Multi-User MIMO Receiver in 22FDX | Florian Bucheli, Oscar Castañeda, Gian Marti, Christoph Studer | 2024-06-26 | Download | We present the first multi-user (MU) multiple-input multiple-output (MIMO) receiver ASIC that mitigates jamming attacks. The ASIC implements a recent nonlinear algorithm that performs joint jammer mit... |
| Resilient and Secure Programmable System-on-Chip Accelerator Offload | Inês Pinto Gouveia, Ahmad T. Sheikh, Ali Shoker, Suhaib A. Fahmy, Paulo Esteves-Verissimo | 2024-06-26 | Download | Computational offload to hardware accelerators is gaining traction due to increasing computational demands and efficiency challenges. Programmable hardware, like FPGAs, offers a promising platform in ... |
| Managing Classical Processing Requirements for Quantum Error Correction | Satvik Maurya, Abtin Molavi, Aws Albarghouthi, Swamit Tannu | 2024-06-26 | Download | Large-scale quantum computers promise transformative speedups, but their viability hinges on fast and reliable quantum error correction (QEC). |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Enhancing Federated Learning with Adaptive Differential Privacy and Priority-Based Aggregation | Mahtab Talaei, Iman Izadi | 2024-06-26 | Download | Federated learning (FL), a novel branch of distributed machine learning (ML), develops global models through a private procedure without direct access to local datasets. |
| LoongTrain: Efficient Training of Long-Sequence LLMs with Head-Context Parallelism | Diandian Gu, Peng Sun, Qinghao Hu, Ting Huang, Xun Chen, Yingtong Xiong, Guoteng Wang, Qiaoling Chen, Shangchun Zhao, Jiarui Fang, Yonggang Wen, Tianwei Zhang, Xin Jin, Xuanzhe Liu | 2024-06-26 | Download | Efficiently training LLMs with long sequences is important yet challenged by the massive computation and memory requirements. Sequence parallelism has been proposed to tackle these problems, but exist... |
| Integrating Power-to-Heat Services in Geographically Distributed Multi-Energy Systems: A Case Study from the ERIGrid 2.0 Project | Giuseppe Silano, Evangelos Rikos, Vetrivel Rajkumar, Oliver Gehrke, Tesfaye Amare Zerihun, Carmine Rodio, Riccardo Lazzari | 2024-06-26 | Download | This paper investigates the integration and validation of multi-energy systems within the H2020 ERIGrid 2.0 project, focusing on the deployment of the JaNDER software middleware and universal API (uAP... |
| FedAQ: Communication-Efficient Federated Edge Learning via Joint Uplink and Downlink Adaptive Quantization | Linping Qu, Shenghui Song, Chi-Ying Tsui | 2024-06-26 | Download | Federated learning (FL) is a powerful machine learning paradigm which leverages the data as well as the computational resources of clients, while protecting clients' data privacy. |
| In Situ In Transit Hybrid Analysis with Catalyst-ADIOS2 | François Mazen, Louis Gombert, Lucas Givord, Charles Gueunet | 2024-06-26 | Download | In this short paper, we present an innovative approach to limit the required bandwidth when transferring data during in transit analysis. This approach is called hybrid because it combines existing in... |
| Automatic Tracing in Task-Based Runtime Systems | Rohan Yadav, Michael Bauer, David Broman, Michael Garland, Alex Aiken, Fredrik Kjolstad | 2024-06-26 | Download | Implicitly parallel task-based runtime systems often perform dynamic analysis to discover dependencies in and extract parallelism from sequential programs. |
| Composing Distributed Computations Through Task and Kernel Fusion | Rohan Yadav, Shiv Sundram, Wonchan Lee, Michael Garland, Michael Bauer, Alex Aiken, Fredrik Kjolstad | 2024-06-26 | Download | We introduce Diffuse, a system that dynamically performs task and kernel fusion in distributed, task-based runtime systems. The key component of Diffuse is an intermediate representation of distribute... |
| The Blockchain Risk Parity Line: Moving From The Efficient Frontier To The Final Frontier Of Investments | Ravi Kashyap | 2024-06-26 | Download | We engineer blockchain based risk managed portfolios by creating three funds with distinct risk and return profiles: 1) Alpha - high risk portfolio; 2) Beta - mimics the wider market; and 3) Gamma - r... |
| A Communication Satellite Servises Based Decentralized Network Protocol | Xiao Yan, Bernie Gao | 2024-06-26 | Download | In this paper, we present a decentralized network protocol, Space Network Protocol, based on Communication Satellite Services. The protocol outlines a method for distributing information about the sta... |
| Scalable Dual Coordinate Descent for Kernel Methods | Zishan Shao, Aditya Devarakonda | 2024-06-26 | Download | Dual Coordinate Descent (DCD) and Block Dual Coordinate Descent (BDCD) are important iterative methods for solving convex optimization problems. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks | Emanuel Figetakis, Yahuza Bello, Ahmed Refaey, Abdallah Shami | 2024-06-26 | Download | Autonomous Vehicles (AVs), furnished with sensors capable of capturing essential vehicle dynamics such as speed, acceleration, and precise location, possess the capacity to execute intelligent maneuve... |
| System for Measurement of Electric Energy Using Beacons with Optical Sensors and LoRaWAN Transmission | Łukasz Marcul, Mateusz Brzozowski, Artur Janicki | 2024-06-26 | Download | In this article, we present the results of experiments with finding an efficient radio transmission method for an electric energy measurement system called OneMeter 2.0. |
| Exploiting Data Significance in Remote Estimation of Discrete-State Markov Sources | Jiping Luo, Nikolaos Pappas | 2024-06-26 | Download | We consider semantics-aware remote estimation of a discrete-state Markov source with both normal (low-priority) and alarm (high-priority) states. |
| CloudCap (C2app) : A Cloud-Based Platform for Packet Analysis On The Edge | Kyriazis Kokkinos, Ioannis Polymenidis, Ilias Siniosoglou, Athanasios Liatifis, Panagiotis Sarigiannidis | 2024-06-26 | Download | Data exchange through mobile devices is rapidly increasing due to the high information demands of today's applications. The need for monitoring the exchanged traffic becomes important in order to cont... |
| Analysis of Channel Uncertainty in Trusted Wireless Services via Repeated Interactions | Bingwen Chen, Xintong Ling, Weihang Cao, Jiaheng Wang, Zhi Ding | 2024-06-26 | Download | The coexistence of heterogeneous sub-networks in 6G poses new security and trust concerns and thus calls for a perimeterless-security model. Blockchain radio access network (B-RAN) provides a trust-bu... |
| FedAQ: Communication-Efficient Federated Edge Learning via Joint Uplink and Downlink Adaptive Quantization | Linping Qu, Shenghui Song, Chi-Ying Tsui | 2024-06-26 | Download | Federated learning (FL) is a powerful machine learning paradigm which leverages the data as well as the computational resources of clients, while protecting clients' data privacy. |
| A Study on the Situation of Connected Car Patent Portfolios | Abel C. H. Chen, Chia-Shen Chang | 2024-06-26 | Download | In recent years, the countries of the world have drafted the specifications of connected cars; for instance, the Security Credential Management System (SCMS) has been proposed by United States Departm... |
| A Communication Satellite Servises Based Decentralized Network Protocol | Xiao Yan, Bernie Gao | 2024-06-26 | Download | In this paper, we present a decentralized network protocol, Space Network Protocol, based on Communication Satellite Services. The protocol outlines a method for distributing information about the sta... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| An Autotuning-based Optimization Framework for Mixed-kernel SVM Classifications in Smart Pixel Datasets and Heterojunction Transistors | Xingfu Wu, Tupendra Oli, Justin H. Qian, Valerie Taylor, Mark C. Hersam, Vinod K. Sangwan | 2024-06-26 | Download | Support Vector Machine (SVM) is a state-of-the-art classification method widely used in science and engineering due to its high accuracy, its ability to deal with high dimensional data, and its flexib... |
