
2024-08-22

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| When In-memory Computing Meets Spiking Neural Networks -- A Perspective on Device-Circuit-System-and-Algorithm Co-design | Abhishek Moitra, Abhiroop Bhattacharjee, Yuhang Li, Youngeun Kim, Priyadarshini Panda | 2024-08-22 | Download | This review explores the intersection of bio-plausible artificial intelligence in the form of Spiking Neural Networks (SNNs) with the analog In-Memory Computing (IMC) domain, highlighting their collec... |
| TReX- Reusing Vision Transformer's Attention for Efficient Xbar-based Computing | Abhishek Moitra, Abhiroop Bhattacharjee, Youngeun Kim, Priyadarshini Panda | 2024-08-22 | Download | Due to the high computation overhead of Vision Transformers (ViTs), In-memory Computing architectures are being researched towards energy-efficient deployment in edge-computing scenarios. |
| Exposing Shadow Branches | Chrysanthos Pepi, Bhargav Reddy Godala, Krishnam Tibrewala, Gino Chacon, Paul V. Gratz, Daniel A. Jiménez, Gilles A. Pokam, David I. August | 2024-08-22 | Download | Modern processors implement a decoupled front-end in the form of Fetch Directed Instruction Prefetching (FDIP) to avoid front-end stalls. FDIP is driven by the Branch Prediction Unit (BPU), relying on... |
| Virgo: Cluster-level Matrix Unit Integration in GPUs for Scalability and Energy Efficiency | Hansung Kim, Ruohan Richard Yan, Joshua You, Tieliang Vamber Yang, Yakun Sophia Shao | 2024-08-22 | Download | Modern GPUs incorporate specialized matrix units such as Tensor Cores to accelerate GEMM operations, which are central to deep learning workloads. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| NanoFlow: Towards Optimal Large Language Model Serving Throughput | Kan Zhu, Yufei Gao, Yilong Zhao, Liangyu Zhao, Gefei Zuo, Yile Gu, Dedong Xie, Tian Tang, Qinyu Xu, Zihao Ye, Keisuke Kamahori, Chien-Yu Lin, Ziren Wang, Stephanie Wang, Arvind Krishnamurthy, Baris Kasikci | 2024-08-22 | Download | Large Language Models (LLMs) have resulted in a surging demand for planet-scale serving systems, where tens of thousands of GPUs continuously serve hundreds of millions of users. |
| Research on Improved U-net Based Remote Sensing Image Segmentation Algorithm | Qiming Yang, Zixin Wang, Shinan Liu, Zizheng Li | 2024-08-22 | Download | In recent years, although U-Net network has made significant progress in the field of image segmentation, it still faces performance bottlenecks in remote sensing image segmentation. |
| Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters | WenZheng Zhang, Yang Hu, Jing Shi, Xiaoying Bai | 2024-08-22 | Download | Scaling Deep Neural Networks (DNNs) requires significant computational resources in terms of GPU quantity and compute capacity. In practice, there usually exists a large number of heterogeneous GPU de... |
| Real-Time Video Generation with Pyramid Attention Broadcast | Xuanlei Zhao, Xiaolong Jin, Kai Wang, Yang You | 2024-08-22 | Download | We present Pyramid Attention Broadcast (PAB), a real-time, high quality and training-free approach for DiT-based video generation. Our method is founded on the observation that attention difference in... |
| Verifiable Homomorphic Linear Combinations in Multi-Instance Time-Lock Puzzles | Aydin Abadi | 2024-08-22 | Download | Time-Lock Puzzles (TLPs) have been developed to securely transmit sensitive information into the future without relying on a trusted third party. |
| Stream parallel skeleton optimization | Marco Aldinucci, Marco Danelutto | 2024-08-22 | Download | We discuss the properties of the composition of stream parallel skeletons such as pipelines and farms. By looking at the ideal performance figures assumed to hold for these skeletons, we show that any... |
| KS+: Predicting Workflow Task Memory Usage Over Time | Jonathan Bader, Ansgar Lößer, Lauritz Thamsen, Björn Scheuermann, Odej Kao | 2024-08-22 | Download | Scientific workflow management systems enable the reproducible execution of data analysis pipelines on cluster infrastructures managed by resource managers such as Kubernetes, Slurm, or HTCondor. |
| Fair Combinatorial Auction for Blockchain Trade Intents: Being Fair without Knowing What is Fair | Andrea Canidio, Felix Henneke | 2024-08-22 | Download | We study blockchain trade-intent auctions, which currently intermediate about USD 10 billion in trades each month. These auctions are combinatorial because executing multiple trade intents jointly gen... |
| Time Optimal Distance-$k$-Dispersion on Dynamic Ring | Brati Mondal, Pritam Goswami, Buddhadeb Sau | 2024-08-22 | Download | Dispersion by mobile agents is a well studied problem in the literature on computing by mobile robots. In this problem, $l$ robots placed arbitrarily on nodes of a network having $n$ nodes are asked t... |
| Two Pareto Optimum-based Heuristic Algorithms for Minimizing Tardiness and Late Jobs in the Single Machine Flowshop Problem | Matthew Gradwohl, Guidio Sewa, Oke Blessing Oghojafor, Richard Wilouwou, Muminu Adamu, Christopher Thron | 2024-08-22 | Download | Flowshop problems play a prominent role in operations research, and have considerable practical significance. The single-machine flowshop problem is of particular theoretical interest. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Ant Backpressure Routing for Wireless Multi-hop Networks with Mixed Traffic Patterns | Negar Erfaniantaghvayi, Zhongyuan Zhao, Kevin Chan, Gunjan Verma, Ananthram Swami, Santiago Segarra | 2024-08-22 | Download | A mixture of streaming and short-lived traffic presents a common yet challenging scenario for Backpressure routing in wireless multi-hop networks. |
| Age and Value of Information Optimization for Systems with Multi-Class Updates | Ahmed Arafa, Roy D. Yates | 2024-08-22 | Download | Received samples of a stochastic process are processed by a server for delivery as updates to a monitor. Each sample belongs to a class that specifies a distribution for its processing time and a func... |
| Looking AT the Blue Skies of Bluesky | Leonhard Balduf, Saidu Sokoto, Onur Ascigil, Gareth Tyson, Björn Scheuermann, Maciej Korczyński, Ignacio Castro, Michał Król | 2024-08-22 | Download | The pitfalls of centralized social networks, such as Facebook and Twitter/X, have led to concerns about control, transparency, and accountability. |
| A Deadline-Aware Scheduler for Smart Factory using WiFi 6 | Mohit Jain, Anis Mishra, Syamantak Das, Andreas Wiese, Arani Bhattacharya, Mukulika Maity | 2024-08-22 | Download | A key strategy for making production in factories more efficient is to collect data about the functioning of machines, and dynamically adapt their working. |
| Empowering Wireless Network Applications with Deep Learning-based Radio Propagation Models | Stefanos Bakirtzis, Cagkan Yapar, Marco Fiore, Jie Zhang, Ian Wassell | 2024-08-22 | Download | The efficient deployment and operation of any wireless communication ecosystem rely on knowledge of the received signal quality over the target coverage area. |
| Exploring the Feasibility of Automated Data Standardization using Large Language Models for Seamless Positioning | Max J. L. Lee, Ju Lin, Li-Ta Hsu | 2024-08-22 | Download | We propose a feasibility study for real-time automated data standardization leveraging Large Language Models (LLMs) to enhance seamless positioning systems in IoT environments. |
| Distributed Noncoherent Joint Transmission Based on Multi-Agent Reinforcement Learning for Dense Small Cell MISO Systems | Shaozhuang Bai, Zhenzhen Gao, Xuewen Liao | 2024-08-22 | Download | We consider a dense small cell (DSC) network where multi-antenna small cell base stations (SBSs) transmit data to single-antenna users over a shared frequency band. |
| MAC protocol classification in the ISM band using machine learning methods | Hanieh Rashidpour, Hossein Bahramgiri | 2024-08-22 | Download | With the emergence of new technologies and a growing number of wireless networks, we face the problem of radio spectrum shortages. As a result, identifying the wireless channel spectrum to exploit the... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Smartphone-based Eye Tracking System using Edge Intelligence and Model Optimisation | Nishan Gunawardena, Gough Yumu Lui, Jeewani Anupama Ginige, Bahman Javadi | 2024-08-22 | Download | A significant limitation of current smartphone-based eye-tracking algorithms is their low accuracy when applied to video-type visual stimuli, as they are typically trained on static images. |
| Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments | Maciej Besta, Robert Gerstenberger, Patrick Iff, Pournima Sonawane, Juan Gómez Luna, Raghavendra Kanakagiri, Rui Min, Grzegorz Kwaśniewski, Onur Mutlu, Torsten Hoefler, Raja Appuswamy, Aidan O Mahony | 2024-08-22 | Download | Knowledge graphs (KGs) have achieved significant attention in recent years, particularly in the area of the Semantic Web as well as gaining popularity in other application domains such as data mining ... |
