2024-02-12

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| IR-Aware ECO Timing Optimization Using Reinforcement Learning | Wenjing Jiang, Vidya A. Chhabria, Sachin S. Sapatnekar | 2024-02-12 | Download | Engineering change orders (ECOs) in late stages make minimal design fixes to recover from timing shifts due to excessive IR drops. This paper integrates IR-drop-aware timing analysis and ECO timing op... |
| LFOC+: A Fair OS-level Cache-Clustering Policy for Commodity Multicore Systems | Juan Carlos Saez, Fernando Castro, Graziano Fanizzi, Manuel Prieto-Matias | 2024-02-12 | Download | Commodity multicore systems are increasingly adopting hardware support that enables the system software to partition the last-level cache (LLC). |
| LFOC: A Lightweight Fairness-Oriented Cache Clustering Policy for Commodity Multicores | Adrián García-García, Juan Carlos Sáez, Fernando Castro, Manuel Prieto-Matías | 2024-02-12 | Download | Multicore processors constitute the main architecture choice for modern computing systems in different market segments. Despite their benefits, the contention that naturally appears when multiple appl... |
| A Precision-Optimized Fixed-Point Near-Memory Digital Processing Unit for Analog In-Memory Computing | Elena Ferro, Athanasios Vasilopoulos, Corey Lammie, Manuel Le Gallo, Luca Benini, Irem Boybat, Abu Sebastian | 2024-02-12 | Download | Analog In-Memory Computing (AIMC) is an emerging technology for fast and energy-efficient Deep Learning (DL) inference. However, a certain amount of digital post-processing is required to deal with ci... |
| TransAxx: Efficient Transformers with Approximate Computing | Dimitrios Danopoulos, Georgios Zervakis, Dimitrios Soudris, Jörg Henkel | 2024-02-12 | Download | Vision Transformer (ViT) models which were recently introduced by the transformer architecture have shown to be very competitive and often become a popular alternative to Convolutional Neural Networks... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| From Data to Decisions: The Transformational Power of Machine Learning in Business Recommendations | Kapilya Gangadharan, K. Malathi, Anoop Purandaran, Barathi Subramanian, Rathinaraja Jeyaraj, Soon Ki Jung | 2024-02-12 | Download | This research aims to explore the impact of Machine Learning (ML) on the evolution and efficacy of Recommendation Systems (RS), particularly in the context of their growing significance in commercial ... |
| The Blocklace: A Byzantine-repelling and Universal Conflict-free Replicated Data Type | Paulo Sérgio Almeida, Ehud Shapiro | 2024-02-12 | Download | Conflict-free Replicated Data Types (CRDTs) are designed for replica convergence without global coordination or consensus. Recent work has achieved the same in a Byzantine environment, through DAG-lik... |
| A Quantum Algorithm Based Heuristic to Hide Sensitive Itemsets | Abhijeet Ghoshal, Yan Li, Syam Menon, Sumit Sarkar | 2024-02-12 | Download | Quantum devices use qubits to represent information, which allows them to exploit important properties from quantum physics, specifically superposition and entanglement. |
| Queuing dynamics of asynchronous Federated Learning | Louis Leconte, Matthieu Jonckheere, Sergey Samsonov, Eric Moulines | 2024-02-12 | Download | We study asynchronous federated learning mechanisms with nodes having potentially different computational speeds. In such an environment, each node is allowed to work on models with potential delays a... |
| Empowering Federated Learning for Massive Models with NVIDIA FLARE | Holger R. Roth, Ziyue Xu, Yuan-Ting Hsieh, Adithya Renduchintala, Isaac Yang, Zhihong Zhang, Yuhong Wen, Sean Yang, Kevin Lu, Kristopher Kersten, Camir Ricketts, Daguang Xu, Chester Chen, Yan Cheng, Andrew Feng | 2024-02-12 | Download | In the ever-evolving landscape of artificial intelligence (AI) and large language models (LLMs), handling and leveraging data effectively has become a critical challenge. |
| Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors | Juan Carlos Saez, Fernando Castro, Manuel Prieto-Matias | 2024-02-12 | Download | Asymmetric multicore processors (AMPs) couple high-performance big cores and low-power small cores with the same instruction-set architecture but different features, such as clock frequency or microar... |
| LFOC: A Lightweight Fairness-Oriented Cache Clustering Policy for Commodity Multicores | Adrián García-García, Juan Carlos Sáez, Fernando Castro, Manuel Prieto-Matías | 2024-02-12 | Download | Multicore processors constitute the main architecture choice for modern computing systems in different market segments. Despite their benefits, the contention that naturally appears when multiple appl... |
| Accelerating Distributed Deep Learning using Lossless Homomorphic Compression | Haoyu Li, Yuchen Xu, Jiayi Chen, Rohit Dwivedula, Wenfei Wu, Keqiang He, Aditya Akella, Daehyeok Kim | 2024-02-12 | Download | As deep neural networks (DNNs) grow in complexity and size, the resultant increase in communication overhead during distributed training has become a significant bottleneck, challenging the scalabilit... |
| Fortran... ok, and what's next? | Vincent Magnin, José Alves, Antoine Arnoud, Arjen Markus, Michele Esposito Marzino | 2024-02-12 | Download | Modern Fortran is a standardized language that includes object-oriented and parallel programming paradigms. The Fortran-lang community, created at the end of 2019, is actively working to modernize its... |
| Logical Synchrony Networks: A formal model for deterministic distribution | Logan Kenwright, Partha Roop, Nathan Allen, Sanjay Lall, Calin Cascaval, Tammo Spalink, Martin Izzard | 2024-02-12 | Download | Kahn Process Networks (KPNs) are a deterministic Model of Computation (MoC) for distributed systems. KPNs support non-blocking writes and blocking reads, with the consequent assumption of unbounded b... |
| HPX with Spack and Singularity Containers: Evaluating Overheads for HPX/Kokkos using an astrophysics application | Patrick Diehl, Steven R. Brandt, Gregor Daiß, Hartmut Kaiser | 2024-02-12 | Download | Cloud computing for high performance computing resources is an emerging topic. This service is of interest to researchers who care about reproducible computing, for software packages with complex inst... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Locality Sensitive Hashing for Network Traffic Fingerprinting | Nowfel Mashnoor, Jay Thom, Abdur Rouf, Shamik Sengupta, Batyr Charyyev | 2024-02-12 | Download | The advent of the Internet of Things (IoT) has brought forth additional intricacies and difficulties to computer networks. These gadgets are particularly susceptible to cyber-attacks because of their ... |
| Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks | Talha Bozkus, Urbashi Mitra | 2024-02-12 | Download | Optimizing large-scale wireless networks, including optimal resource management, power allocation, and throughput maximization, is inherently challenging due to their non-observable system dynamics an... |
| A Multi-Tenant System for 5/6G Testbed as-a-Service | Raffaele Bolla, Roberto Bruschi, Chiara Lombardo, Sergio Mangialardi, Alireza Mohammadpour, Ramin Rabbani, Beatrice Siccardi | 2024-02-12 | Download | In order to fulfill the stringent requirements and fast advancements of 5G and beyond applications, it is inevitable to develop research/industrial testbeds to examine the different proposed innovativ... |
| Optical Routing with Binary Optimisation and Quantum Annealing | Ethan Davies, Darren Banfield, Vlad Carare, Ben Weaver, Catherine White, Nigel Walker | 2024-02-12 | Download | A challenge for scalability of demand-responsive, elastic optical Dense Wavelength Division Multiplexing (DWDM) and Flexgrid networks is the computational complexity of allocating many optical routes ... |
| Accelerating Distributed Deep Learning using Lossless Homomorphic Compression | Haoyu Li, Yuchen Xu, Jiayi Chen, Rohit Dwivedula, Wenfei Wu, Keqiang He, Aditya Akella, Daehyeok Kim | 2024-02-12 | Download | As deep neural networks (DNNs) grow in complexity and size, the resultant increase in communication overhead during distributed training has become a significant bottleneck, challenging the scalabilit... |
| Battery-Less LoRaWAN Communications using Energy Harvesting: Modeling and Characterization | Carmen Delgado, José María Sanz, Chris Blondia, Jeroen Famaey | 2024-02-12 | Download | Billions of IoT devices are deployed worldwide and batteries are their main power source. However, these batteries are bulky, short-lived and full of hazardous chemicals that damage our environment. |
| An Efficient Wireless Channel Estimation Model for Environment Sensing | Zainab Zaidi, Tansu Alpcan, Christopher Leckie, Sarah Efrain | 2024-02-12 | Download | This paper presents a novel and efficient wireless channel estimation scheme based on a tapped delay line (TDL) model of wireless signal propagation, where a data-driven machine learning approach is u... |

cs.OS - Operating Systems

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors | Juan Carlos Saez, Fernando Castro, Manuel Prieto-Matias | 2024-02-12 | Download | Asymmetric multicore processors (AMPs) couple high-performance big cores and low-power small cores with the same instruction-set architecture but different features, such as clock frequency or microar... |