Appearance
2024-02-12
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| IR-Aware ECO Timing Optimization Using Reinforcement Learning | Wenjing Jiang, Vidya A. Chhabria, Sachin S. Sapatnekar | 2024-02-12 | 下载 | Engineering change orders (ECOs) in late stages make minimal design fixes to recover from timing shifts due to excessive IR drops. This paper integrates IR-drop-aware timing analysis and ECO timing op... |
| LFOC+: A Fair OS-level Cache-Clustering Policy for Commodity Multicore Systems | Juan Carlos Saez, Fernando Castro, Graziano Fanizzi, Manuel Prieto-Matias | 2024-02-12 | 下载 | Commodity multicore systems are increasingly adopting hardware support that enables the system software to partition the last-level cache (LLC). |
| LFOC: A Lightweight Fairness-Oriented Cache Clustering Policy for Commodity Multicores | Adrián García-García, Juan Carlos Sáez, Fernando Castro, Manuel Prieto-Matías | 2024-02-12 | 下载 | Multicore processors constitute the main architecture choice for modern computing systems in different market segments. Despite their benefits, the contention that naturally appears when multiple appl... |
| A Precision-Optimized Fixed-Point Near-Memory Digital Processing Unit for Analog In-Memory Computing | Elena Ferro, Athanasios Vasilopoulos, Corey Lammie, Manuel Le Gallo, Luca Benini, Irem Boybat, Abu Sebastian | 2024-02-12 | 下载 | Analog In-Memory Computing (AIMC) is an emerging technology for fast and energy-efficient Deep Learning (DL) inference. However, a certain amount of digital post-processing is required to deal with ci... |
| TransAxx: Efficient Transformers with Approximate Computing | Dimitrios Danopoulos, Georgios Zervakis, Dimitrios Soudris, Jörg Henkel | 2024-02-12 | 下载 | Vision Transformer (ViT) models which were recently introduced by the transformer architecture have shown to be very competitive and often become a popular alternative to Convolutional Neural Networks... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| From Data to Decisions: The Transformational Power of Machine Learning in Business Recommendations | Kapilya Gangadharan, K. Malathi, Anoop Purandaran, Barathi Subramanian, Rathinaraja Jeyaraj, Soon Ki Jung | 2024-02-12 | 下载 | This research aims to explore the impact of Machine Learning (ML) on the evolution and efficacy of Recommendation Systems (RS), particularly in the context of their growing significance in commercial ... |
| The Blocklace: A Byzantine-repelling and Universal Conflict-free Replicated Data Type | Paulo Sérgio Almeida, Ehud Shapiro | 2024-02-12 | 下载 | Conflict-free Replicated Data Types (CRDTs) are designed for replica convergence without global coordination or consensus. Recent work has achieved the same in a Byzantine environment, through DAG-lik... |
| A Quantum Algorithm Based Heuristic to Hide Sensitive Itemsets | Abhijeet Ghoshal, Yan Li, Syam Menon, Sumit Sarkar | 2024-02-12 | 下载 | Quantum devices use qubits to represent information, which allows them to exploit important properties from quantum physics, specifically superposition and entanglement. |
| Queuing dynamics of asynchronous Federated Learning | Louis Leconte, Matthieu Jonckheere, Sergey Samsonov, Eric Moulines | 2024-02-12 | 下载 | We study asynchronous federated learning mechanisms with nodes having potentially different computational speeds. In such an environment, each node is allowed to work on models with potential delays a... |
| Empowering Federated Learning for Massive Models with NVIDIA FLARE | Holger R. Roth, Ziyue Xu, Yuan-Ting Hsieh, Adithya Renduchintala, Isaac Yang, Zhihong Zhang, Yuhong Wen, Sean Yang, Kevin Lu, Kristopher Kersten, Camir Ricketts, Daguang Xu, Chester Chen, Yan Cheng, Andrew Feng | 2024-02-12 | 下载 | In the ever-evolving landscape of artificial intelligence (AI) and large language models (LLMs), handling and leveraging data effectively has become a critical challenge. |
| Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors | Juan Carlos Saez, Fernando Castro, Manuel Prieto-Matias | 2024-02-12 | 下载 | Asymmetric multicore processors (AMPs) couple high-performance big cores and low-power small cores with the same instruction-set architecture but different features, such as clock frequency or microar... |
| LFOC: A Lightweight Fairness-Oriented Cache Clustering Policy for Commodity Multicores | Adrián García-García, Juan Carlos Sáez, Fernando Castro, Manuel Prieto-Matías | 2024-02-12 | 下载 | Multicore processors constitute the main architecture choice for modern computing systems in different market segments. Despite their benefits, the contention that naturally appears when multiple appl... |
| Accelerating Distributed Deep Learning using Lossless Homomorphic Compression | Haoyu Li, Yuchen Xu, Jiayi Chen, Rohit Dwivedula, Wenfei Wu, Keqiang He, Aditya Akella, Daehyeok Kim | 2024-02-12 | 下载 | As deep neural networks (DNNs) grow in complexity and size, the resultant increase in communication overhead during distributed training has become a significant bottleneck, challenging the scalabilit... |
| Fortran... ok, and what's next? | Vincent Magnin, José Alves, Antoine Arnoud, Arjen Markus, Michele Esposito Marzino | 2024-02-12 | 下载 | Modern Fortran is a standardized language that includes object-oriented and parallel programming paradigms. The Fortran-lang community, created at the end of 2019, is actively working to modernize its... |
| Logical Synchrony Networks: A formal model for deterministic distribution | Logan Kenwright, Partha Roop, Nathan Allen, Sanjay Lall, Calin Cascaval, Tammo Spalink, Martin Izzard | 2024-02-12 | 下载 | Kahn Process Networks (KPNs) are a deterministic Model of Computation (MoC) for distributed systems. KPNs supports non-blocking writes and blocking reads, with the consequent assumption of unbounded b... |
| HPX with Spack and Singularity Containers: Evaluating Overheads for HPX/Kokkos using an astrophysics application | Patrick Diehl, Steven R. Brandt, Gregor Daiß, Hartmut Kaiser | 2024-02-12 | 下载 | Cloud computing for high performance computing resources is an emerging topic. This service is of interest to researchers who care about reproducible computing, for software packages with complex inst... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Locality Sensitive Hashing for Network Traffic Fingerprinting | Nowfel Mashnoor, Jay Thom, Abdur Rouf, Shamik Sengupta, Batyr Charyyev | 2024-02-12 | 下载 | The advent of the Internet of Things (IoT) has brought forth additional intricacies and difficulties to computer networks. These gadgets are particularly susceptible to cyber-attacks because of their ... |
| Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks | Talha Bozkus, Urbashi Mitra | 2024-02-12 | 下载 | Optimizing large-scale wireless networks, including optimal resource management, power allocation, and throughput maximization, is inherently challenging due to their non-observable system dynamics an... |
| A Multi-Tenant System for 5/6G Testbed as-a-Service | Raffaele Bolla, Roberto Bruschi, Chiara Lombardo, Sergio Mangialardi, Alireza Mohammadpour, Ramin Rabbani, Beatrice Siccardi | 2024-02-12 | 下载 | In order to fulfill the stringent requirements and fast advancements of 5G and beyond applications, it is inevitable to develop research/industrial testbeds to examine the different proposed innovativ... |
| Optical Routing with Binary Optimisation and Quantum Annealing | Ethan Davies, Darren Banfield, Vlad Carare, Ben Weaver, Catherine White, Nigel Walker | 2024-02-12 | 下载 | A challenge for scalability of demand-responsive, elastic optical Dense Wavelength Division Multiplexing (DWDM) and Flexgrid networks is the computational complexity of allocating many optical routes ... |
| Accelerating Distributed Deep Learning using Lossless Homomorphic Compression | Haoyu Li, Yuchen Xu, Jiayi Chen, Rohit Dwivedula, Wenfei Wu, Keqiang He, Aditya Akella, Daehyeok Kim | 2024-02-12 | 下载 | As deep neural networks (DNNs) grow in complexity and size, the resultant increase in communication overhead during distributed training has become a significant bottleneck, challenging the scalabilit... |
| Battery-Less LoRaWAN Communications using Energy Harvesting: Modeling and Characterization | Carmen Delgado, José María Sanz, Chris Blondia, Jeroen Famaey | 2024-02-12 | 下载 | Billions of IoT devices are deployed worldwide and batteries are their main power source. However, these batteries are bulky, short-lived and full of hazardous chemicals that damage our environment. |
| An Efficient Wireless Channel Estimation Model for Environment Sensing | Zainab Zaidi, Tansu Alpcan, Christopher Leckie, Sarah Efrain | 2024-02-12 | 下载 | This paper presents a novel and efficient wireless channel estimation scheme based on a tapped delay line (TDL) model of wireless signal propagation, where a data-driven machine learning approach is u... |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors | Juan Carlos Saez, Fernando Castro, Manuel Prieto-Matias | 2024-02-12 | 下载 | Asymmetric multicore processors (AMPs) couple high-performance big cores and low-power small cores with the same instruction-set architecture but different features, such as clock frequency or microar... |