2024-01-19

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Low-Complexity Integer Divider Architecture for Homomorphic Encryption | Sajjad Akherati, Jiaxuan Cai, Xinmiao Zhang | 2024-01-19 | Download | Homomorphic encryption (HE) allows computations to be directly carried out on ciphertexts and enables privacy-preserving cloud computing. The computations on the coefficients of the polynomials involv... |
| Quantised Neural Network Accelerators for Low-Power IDS in Automotive Networks | Shashwat Khandelwal, Anneliese Walsh, Shanker Shreejith | 2024-01-19 | Download | In this paper, we explore low-power custom quantised Multi-Layer Perceptrons (MLPs) as an Intrusion Detection System (IDS) for automotive controller area network (CAN). |
| Exploring Highly Quantised Neural Networks for Intrusion Detection in Automotive CAN | Shashwat Khandelwal, Shreejith Shanker | 2024-01-19 | Download | Vehicles today comprise intelligent systems like connected autonomous driving and advanced driving assistance systems (ADAS) to enhance the driving experience, which is enabled through increased conne... |
| BoolGebra: Attributed Graph-learning for Boolean Algebraic Manipulation | Yingjie Li, Anthony Agnesina, Yanqing Zhang, Haoxing Ren, Cunxi Yu | 2024-01-19 | Download | Boolean algebraic manipulation is at the core of logic synthesis in Electronic Design Automation (EDA) design flow. Existing methods struggle to fully exploit optimization opportunities, and often suf... |
| A Lightweight FPGA-based IDS-ECU Architecture for Automotive CAN | Shashwat Khandelwal, Shreejith Shanker | 2024-01-19 | Download | Recent years have seen an exponential rise in complex software-driven functionality in vehicles, leading to a rising number of electronic control units (ECUs), network capabilities, and interfaces. |
| Unraveling codes: fast, robust, beyond-bound error correction for DRAM | Mike Hamburg, Eric Linstadt, Danny Moore, Thomas Vogelsang | 2024-01-19 | Download | Generalized Reed-Solomon (RS) codes are a common choice for efficient, reliable error correction in memory and communications systems. These codes add 2t extra parity symbols to a block of memory, a... |
| FARe: Fault-Aware GNN Training on ReRAM-based PIM Accelerators | Pratyush Dhingra, Chukwufumnanya Ogbogu, Biresh Kumar Joardar, Janardhan Rao Doppa, Ananth Kalyanaraman, Partha Pratim Pande | 2024-01-19 | Download | Resistive random-access memory (ReRAM)-based processing-in-memory (PIM) architecture is an attractive solution for training Graph Neural Networks (GNNs) on edge platforms. |
| Enhancing Scalability in Recommender Systems through Lottery Ticket Hypothesis and Knowledge Distillation-based Neural Network Pruning | Rajaram R, Manoj Bharadhwaj, Vasan VS, Nargis Pervin | 2024-01-19 | Download | This study introduces an innovative approach aimed at the efficient pruning of neural networks, with a particular focus on their deployment on edge devices. |
| A2Q+: Improving Accumulator-Aware Weight Quantization | Ian Colbert, Alessandro Pappalardo, Jakoba Petri-Koenig, Yaman Umuroglu | 2024-01-19 | Download | Quantization techniques commonly reduce the inference costs of neural networks by restricting the precision of weights and activations. Recent studies show that also reducing the precision of the accu... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Communication Efficient and Provable Federated Unlearning | Youming Tao, Cheng-Long Wang, Miao Pan, Dongxiao Yu, Xiuzhen Cheng, Di Wang | 2024-01-19 | Download | We study federated unlearning, a novel problem to eliminate the impact of specific clients or data points on the global model learned via federated learning (FL). |
| Software Resource Disaggregation for HPC with Serverless Computing | Marcin Copik, Marcin Chrapek, Larissa Schmid, Alexandru Calotoiu, Torsten Hoefler | 2024-01-19 | Download | Aggregated HPC resources have rigid allocation systems and programming models which struggle to adapt to diverse and changing workloads. Consequently, HPC systems fail to efficiently use the large poo... |
| Distributed Genetic Algorithm for Feature Selection | Michael Potter, Ayberk Yarkın Yıldız, Nishanth Marer Prabhu, Cameron Gordon | 2024-01-19 | Download | We empirically show that process-based Parallelism speeds up the Genetic Algorithm (GA) for Feature Selection (FS) 2x to 25x, while additionally increasing the Machine Learning (ML) model performance ... |
| Cppless: Single-Source and High-Performance Serverless Programming in C++ | Marcin Copik, Lukas Möller, Alexandru Calotoiu, Torsten Hoefler | 2024-01-19 | Download | The rise of serverless computing introduced a new class of scalable, elastic and widely available parallel workers in the cloud. Many systems and applications benefit from offloading computations and ... |
| Self-healing Nodes with Adaptive Data-Sharding | Ayush Thakur, Sanskar Chauhan, Ilisha Tomar, Vaibhavi Paul, Deepak Gupta | 2024-01-19 | Download | Data sharding, a technique for partitioning and distributing data among multiple servers or nodes, offers enhancements in the scalability, performance, and fault tolerance of extensive distributed sys... |
| AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference | Xuanlei Zhao, Shenggan Cheng, Guangyang Lu, Jiarui Fang, Haotian Zhou, Bin Jia, Ziming Liu, Yang You | 2024-01-19 | Download | Large deep learning models have achieved impressive performance across a range of applications. However, their large memory requirements, including parameter memory and activation memory, have become ... |
| I-SplitEE: Image classification in Split Computing DNNs with Early Exits | Divya Jyoti Bajpai, Aastha Jaiswal, Manjesh Kumar Hanawal | 2024-01-19 | Download | The recent advances in Deep Neural Networks (DNNs) stem from their exceptional performance across various domains. However, their inherent large size hinders deploying these networks on resource-const... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Reconfigurable Intelligent Surface (RIS)-Assisted Entanglement Distribution in FSO Quantum Networks | Mahdi Chehimi, Mohamed Elhattab, Walid Saad, Gayane Vardoyan, Nitish K. Panigrahy, Chadi Assi, Don Towsley | 2024-01-19 | Download | Quantum networks (QNs) relying on free-space optical (FSO) quantum channels can support quantum applications in environments wherein establishing an optical fiber infrastructure is challenging and cos... |
| Data Augmentation for Traffic Classification | Chao Wang, Alessandro Finamore, Pietro Michiardi, Massimo Gallo, Dario Rossi | 2024-01-19 | Download | Data Augmentation (DA) -- enriching training data by adding synthetic samples -- is a technique widely adopted in Computer Vision (CV) and Natural Language Processing (NLP) tasks to improve models per... |
| Demonstration of Cooperative Transport Interface using open-source 5G OpenRAN and virtualised PON network | Frank Slyne, Kevin O Sullivan, Merim Dzaferagic, Bruce Richardson, Marcin Wrzeszcz, Brendan Ryan, Niall Power, Robin Giller, Marco Ruffini | 2024-01-19 | Download | We demonstrate a real-time, converged 5G-PON through the Cooperative Transport Interface, synchronising 5G and PON-DBA upstream schedulers. This innovative approach, implemented using 5G and PON open ... |
| Maximizing Real-Time Video QoE via Bandwidth Sharing under Markovian setting | Sushi Anna George, Vinay Joseph | 2024-01-19 | Download | We consider the problem of optimizing Quality of Experience (QoE) of clients streaming real-time video, served by networks managed by different operators that can share bandwidth with each other. |
| Time synchronization for deterministic communication | Mahin K. Atiq, Raheeb Muzaffar | 2024-01-19 | Download | Deterministic communication is required for applications of several industry verticals including manufacturing, automotive, financial, and health care, etc. |
| PTPsec: Securing the Precision Time Protocol Against Time Delay Attacks Using Cyclic Path Asymmetry Analysis | Andreas Finkenzeller, Oliver Butowski, Emanuel Regnath, Mohammad Hamad, Sebastian Steinhorst | 2024-01-19 | Download | High-precision time synchronization is a vital prerequisite for many modern applications and technologies, including Smart Grids, Time-Sensitive Networking (TSN), and 5G networks. |
| Empowering HWNs with Efficient Data Labeling: A Clustered Federated Semi-Supervised Learning Approach | Moqbel Hamood, Abdullatif Albaseer, Mohamed Abdallah, Ala Al-Fuqaha | 2024-01-19 | Download | Clustered Federated Multitask Learning (CFL) has gained considerable attention as an effective strategy for overcoming statistical challenges, particularly when dealing with non independent and identi... |
| Goal-Oriented Multiple Access Connectivity for Networked Intelligent Systems | Pouya Agheli, Nikolaos Pappas, Marios Kountouris | 2024-01-19 | Download | We design a self-decision goal-oriented multiple access scheme, where sensing agents observe a common event and individually decide to communicate the event's attributes as updates to the monitoring a... |
| A Thorough Analysis of Radio Resource Assignment for UAV-Enhanced Vehicular Sidelink Communications | Francesca Conserva, Francesco Linsalata, Marouan Mizmizi, Maurizio Magarini, Umberto Spagnolini, Roberto Verdone, Chiara Buratti | 2024-01-19 | Download | The rapid expansion of connected and autonomous vehicles (CAVs) and the shift towards millimeter-wave (mmWave) frequencies offer unprecedented opportunities to enhance road safety and traffic efficien... |
| Resource-efficient In-orbit Detection of Earth Objects | Qiyang Zhang, Xin Yuan, Ruolin Xing, Yiran Zhang, Zimu Zheng, Xiao Ma, Mengwei Xu, Schahram Dustdar, Shangguang Wang | 2024-01-19 | Download | With the rapid proliferation of large Low Earth Orbit (LEO) satellite constellations, a huge amount of in-orbit data is generated and needs to be transmitted to the ground for processing. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference | Xuanlei Zhao, Shenggan Cheng, Guangyang Lu, Jiarui Fang, Haotian Zhou, Bin Jia, Ziming Liu, Yang You | 2024-01-19 | Download | Large deep learning models have achieved impressive performance across a range of applications. However, their large memory requirements, including parameter memory and activation memory, have become ... |
| A2Q+: Improving Accumulator-Aware Weight Quantization | Ian Colbert, Alessandro Pappalardo, Jakoba Petri-Koenig, Yaman Umuroglu | 2024-01-19 | Download | Quantization techniques commonly reduce the inference costs of neural networks by restricting the precision of weights and activations. Recent studies show that also reducing the precision of the accu... |