Appearance
2024-08-22
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| When In-memory Computing Meets Spiking Neural Networks -- A Perspective on Device-Circuit-System-and-Algorithm Co-design | Abhishek Moitra, Abhiroop Bhattacharjee, Yuhang Li, Youngeun Kim, Priyadarshini Panda | 2024-08-22 | 下载 | This review explores the intersection of bio-plausible artificial intelligence in the form of Spiking Neural Networks (SNNs) with the analog In-Memory Computing (IMC) domain, highlighting their collec... |
| TReX- Reusing Vision Transformer's Attention for Efficient Xbar-based Computing | Abhishek Moitra, Abhiroop Bhattacharjee, Youngeun Kim, Priyadarshini Panda | 2024-08-22 | 下载 | Due to the high computation overhead of Vision Transformers (ViTs), In-memory Computing architectures are being researched towards energy-efficient deployment in edge-computing scenarios. |
| Exposing Shadow Branches | Chrysanthos Pepi, Bhargav Reddy Godala, Krishnam Tibrewala, Gino Chacon, Paul V. Gratz, Daniel A. Jiménez, Gilles A. Pokam, David I. August | 2024-08-22 | 下载 | Modern processors implement a decoupled front-end in the form of Fetch Directed Instruction Prefetching (FDIP) to avoid front-end stalls. FDIP is driven by the Branch Prediction Unit (BPU), relying on... |
| Virgo: Cluster-level Matrix Unit Integration in GPUs for Scalability and Energy Efficiency | Hansung Kim, Ruohan Richard Yan, Joshua You, Tieliang Vamber Yang, Yakun Sophia Shao | 2024-08-22 | 下载 | Modern GPUs incorporate specialized matrix units such as Tensor Cores to accelerate GEMM operations, which are central to deep learning workloads. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| NanoFlow: Towards Optimal Large Language Model Serving Throughput | Kan Zhu, Yufei Gao, Yilong Zhao, Liangyu Zhao, Gefei Zuo, Yile Gu, Dedong Xie, Tian Tang, Qinyu Xu, Zihao Ye, Keisuke Kamahori, Chien-Yu Lin, Ziren Wang, Stephanie Wang, Arvind Krishnamurthy, Baris Kasikci | 2024-08-22 | 下载 | Large Language Models (LLMs) have resulted in a surging demand for planet-scale serving systems, where tens of thousands of GPUs continuously serve hundreds of millions of users. |
| Research on Improved U-net Based Remote Sensing Image Segmentation Algorithm | Qiming Yang, Zixin Wang, Shinan Liu, Zizheng Li | 2024-08-22 | 下载 | In recent years, although U-Net network has made significant progress in the field of image segmentation, it still faces performance bottlenecks in remote sensing image segmentation. |
| Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters | WenZheng Zhang, Yang Hu, Jing Shi, Xiaoying Bai | 2024-08-22 | 下载 | Scaling Deep Neural Networks (DNNs) requires significant computational resources in terms of GPU quantity and compute capacity. In practice, there usually exists a large number of heterogeneous GPU de... |
| Real-Time Video Generation with Pyramid Attention Broadcast | Xuanlei Zhao, Xiaolong Jin, Kai Wang, Yang You | 2024-08-22 | 下载 | We present Pyramid Attention Broadcast (PAB), a real-time, high quality and training-free approach for DiT-based video generation. Our method is founded on the observation that attention difference in... |
| Verifiable Homomorphic Linear Combinations in Multi-Instance Time-Lock Puzzles | Aydin Abadi | 2024-08-22 | 下载 | Time-Lock Puzzles (TLPs) have been developed to securely transmit sensitive information into the future without relying on a trusted third party. |
| Stream parallel skeleton optimization | Marco Aldinucci, Marco Danelutto | 2024-08-22 | 下载 | We discuss the properties of the composition of stream parallel skeletons such as pipelines and farms. By looking at the ideal performance figures assumed to hold for these skeletons, we show that any... |
| KS+: Predicting Workflow Task Memory Usage Over Time | Jonathan Bader, Ansgar Lößer, Lauritz Thamsen, Björn Scheuermann, Odej Kao | 2024-08-22 | 下载 | Scientific workflow management systems enable the reproducible execution of data analysis pipelines on cluster infrastructures managed by resource managers such as Kubernetes, Slurm, or HTCondor. |
| Fair Combinatorial Auction for Blockchain Trade Intents: Being Fair without Knowing What is Fair | Andrea Canidio, Felix Henneke | 2024-08-22 | 下载 | We study blockchain trade-intent auctions, which currently intermediate about USD 10 billion in trades each month. These auctions are combinatorial because executing multiple trade intents jointly gen... |
| Time Optimal Distance--Dispersion on Dynamic Ring | Brati Mondal, Pritam Goswami, Buddhadeb Sau | 2024-08-22 | 下载 | Dispersion by mobile agents is a well studied problem in the literature on computing by mobile robots. In this problem, robots placed arbitrarily on nodes of a network having nodes are asked t... |
| Two Pareto Optimum-based Heuristic Algorithms for Minimizing Tardiness and Late Jobs in the Single Machine Flowshop Problem | Matthew Gradwohl, Guidio Sewa, Oke Blessing Oghojafor, Richard Wilouwou, Muminu Adamu, Christopher Thron | 2024-08-22 | 下载 | Flowshop problems play a prominent role in operations research, and have considerable practical significance. The single-machine flowshop problem is of particular theoretical interest. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Ant Backpressure Routing for Wireless Multi-hop Networks with Mixed Traffic Patterns | Negar Erfaniantaghvayi, Zhongyuan Zhao, Kevin Chan, Gunjan Verma, Ananthram Swami, Santiago Segarra | 2024-08-22 | 下载 | A mixture of streaming and short-lived traffic presents a common yet challenging scenario for Backpressure routing in wireless multi-hop networks. |
| Age and Value of Information Optimization for Systems with Multi-Class Updates | Ahmed Arafa, Roy D. Yates | 2024-08-22 | 下载 | Received samples of a stochastic process are processed by a server for delivery as updates to a monitor. Each sample belongs to a class that specifies a distribution for its processing time and a func... |
| Looking AT the Blue Skies of Bluesky | Leonhard Balduf, Saidu Sokoto, Onur Ascigil, Gareth Tyson, Björn Scheuermann, Maciej Korczyński, Ignacio Castro, Michał Król | 2024-08-22 | 下载 | The pitfalls of centralized social networks, such as Facebook and Twitter/X, have led to concerns about control, transparency, and accountability. |
| A Deadline-Aware Scheduler for Smart Factory using WiFi 6 | Mohit Jain, Anis Mishra, Syamantak Das, Andreas Wiese, Arani Bhattacharya, Mukulika Maity | 2024-08-22 | 下载 | A key strategy for making production in factories more efficient is to collect data about the functioning of machines, and dynamically adapt their working. |
| Empowering Wireless Network Applications with Deep Learning-based Radio Propagation Models | Stefanos Bakirtzis, Cagkan Yapar, Marco Fiore, Jie Zhang, Ian Wassell | 2024-08-22 | 下载 | The efficient deployment and operation of any wireless communication ecosystem rely on knowledge of the received signal quality over the target coverage area. |
| Exploring the Feasibility of Automated Data Standardization using Large Language Models for Seamless Positioning | Max J. L. Lee, Ju Lin, Li-Ta Hsu | 2024-08-22 | 下载 | We propose a feasibility study for real-time automated data standardization leveraging Large Language Models (LLMs) to enhance seamless positioning systems in IoT environments. |
| Distributed Noncoherent Joint Transmission Based on Multi-Agent Reinforcement Learning for Dense Small Cell MISO Systems | Shaozhuang Bai, Zhenzhen Gao, Xuewen Liao | 2024-08-22 | 下载 | We consider a dense small cell (DSC) network where multi-antenna small cell base stations (SBSs) transmit data to single-antenna users over a shared frequency band. |
| MAC protocol classification in the ISM band using machine learning methods | Hanieh Rashidpour, Hossein Bahramgiri | 2024-08-22 | 下载 | With the emergence of new technologies and a growing number of wireless networks, we face the problem of radio spectrum shortages. As a result, identifying the wireless channel spectrum to exploit the... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Smartphone-based Eye Tracking System using Edge Intelligence and Model Optimisation | Nishan Gunawardena, Gough Yumu Lui, Jeewani Anupama Ginige, Bahman Javadi | 2024-08-22 | 下载 | A significant limitation of current smartphone-based eye-tracking algorithms is their low accuracy when applied to video-type visual stimuli, as they are typically trained on static images. |
| Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments | Maciej Besta, Robert Gerstenberger, Patrick Iff, Pournima Sonawane, Juan Gómez Luna, Raghavendra Kanakagiri, Rui Min, Grzegorz Kwaśniewski, Onur Mutlu, Torsten Hoefler, Raja Appuswamy, Aidan O Mahony | 2024-08-22 | 下载 | Knowledge graphs (KGs) have achieved significant attention in recent years, particularly in the area of the Semantic Web as well as gaining popularity in other application domains such as data mining ... |