2024-10-10

cs.AR - Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| MENAGE: Mixed-Signal Event-Driven Neuromorphic Accelerator for Edge Applications | Armin Abdollahi, Mehdi Kamal, Massoud Pedram | 2024-10-10 | Download | This paper presents a mixed-signal neuromorphic accelerator architecture designed for accelerating inference with event-based neural network models. |
| RISC-V V Vector Extension (RVV) with reduced number of vector registers | Eino Jacobs, Dmitry Utyansky, Muhammad Hassan, Thomas Roecker | 2024-10-10 | Download | To reduce the area of RISC-V Vector extension (RVV) in small processors, the authors are considering one simple modification: reduce the number of registers in the vector register file. |
| Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices | Yiwei Zhao, Ziyun Li, Win-San Khwa, Xiaoyu Sun, Sai Qian Zhang, Syed Shakib Sarwar, Kleber Hugo Stangherlin, Yi-Lun Lu, Jorge Tomas Gomez, Jae-Sun Seo, Phillip B. Gibbons, Barbara De Salvo, Chiao Liu | 2024-10-10 | Download | Low-Latency and Low-Power Edge AI is essential for Virtual Reality and Augmented Reality applications. Recent advances show that hybrid models, combining convolution layers (CNN) and transformers (ViT... |
| M^2-ViT: Accelerating Hybrid Vision Transformers with Two-Level Mixed Quantization | Yanbiao Liang, Huihong Shi, Zhongfeng Wang | 2024-10-10 | Download | Although Vision Transformers (ViTs) have achieved significant success, their intensive computations and substantial memory overheads challenge their deployment on edge devices. |
| vCLIC: Towards Fast Interrupt Handling in Virtualized RISC-V Mixed-criticality Systems | Enrico Zelioli, Alessandro Ottaviano, Robert Balas, Nils Wistoff, Angelo Garofalo, Luca Benini | 2024-10-10 | Download | The widespread diffusion of compute-intensive edge-AI workloads and the stringent demands of modern autonomous systems require advanced heterogeneous embedded architectures. |
| GUST: Graph Edge-Coloring Utilization for Accelerating Sparse Matrix Vector Multiplication | Armin Gerami, Bahar Asgari | 2024-10-10 | Download | Sparse matrix-vector multiplication (SpMV) plays a vital role in various scientific and engineering fields, from scientific computing to machine learning. |
| The BRAM is the Limit: Shattering Myths, Shaping Standards, and Building Scalable PIM Accelerators | MD Arafat Kabir, Tendayi Kamucheka, Nathaniel Fredricks, Joel Mandebi, Jason Bakos, Miaoqing Huang, David Andrews | 2024-10-10 | Download | Many recent FPGA-based Processor-in-Memory (PIM) architectures have appeared with promises of impressive levels of parallelism but with performance that falls short of expectations due to reduced maxi... |
| Reducing the Cost of Dropout in Flash-Attention by Hiding RNG with GEMM | Haiyue Ma, Jian Liu, Ronny Krashinsky | 2024-10-10 | Download | Dropout, a network operator, when enabled is likely to dramatically impact the performance of Flash-Attention, which in turn increases the end-to-end training time of Large-Language-Models (LLMs). |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A Simulated Annealing Approach to Identical Parallel Machine Scheduling | Jiaxing Li, David Perkins | 2024-10-10 | Download | This paper studies the application of the simulated annealing metaheuristic on the identical parallel machine scheduling problem, a variant of the broader optimal job scheduling problem. |
| POSEIDON: Efficient Function Placement at the Edge using Deep Reinforcement Learning | Prakhar Jain, Prakhar Singhal, Divyansh Pandey, Giovanni Quattrocchi, Karthik Vaidhyanathan | 2024-10-10 | Download | Edge computing allows for reduced latency and operational costs compared to centralized cloud systems. In this context, serverless functions are emerging as a lightweight and effective paradigm for ma... |
| Agent-based modeling for realistic reproduction of human mobility and contact behavior to evaluate test and isolation strategies in epidemic infectious disease spread | David Kerkmann, Sascha Korf, Khoa Nguyen, Daniel Abele, Alain Schengen, Carlotta Gerstein, Jens Henrik Göbbert, Achim Basermann, Martin J. Kühn, Michael Meyer-Hermann | 2024-10-10 | Download | Agent-based models have proven to be useful tools in supporting decision-making processes in different application domains. The advent of modern computers and supercomputers has enabled these bottom-u... |
| NLP-Guided Synthesis: Transitioning from Sequential Programs to Distributed Programs | Arun Sanjel, Bikram Khanal, Greg Speegle, Pablo Rivas | 2024-10-10 | Download | As the need for large-scale data processing grows, distributed programming frameworks like PySpark have become increasingly popular. However, the task of converting traditional, sequential code to dis... |
| AI Surrogate Model for Distributed Computing Workloads | David K. Park, Yihui Ren, Ozgur O. Kilic, Tatiana Korchuganova, Sairam Sri Vatsavai, Joseph Boudreau, Tasnuva Chowdhury, Shengyu Feng, Raees Khan, Jaehyung Kim, Scott Klasky, Tadashi Maeno, Paul Nilsson, Verena Ingrid Martinez Outschoorn, Norbert Podhorszki, Frederic Suter, Wei Yang, Yiming Yang, Shinjae Yoo, Alexei Klimentov, Adolfy Hoisie | 2024-10-10 | Download | Large-scale international scientific collaborations, such as ATLAS, Belle II, CMS, and DUNE, generate vast volumes of data. These experiments necessitate substantial computational power for varied tas... |
| A Cloud in the Sky: Geo-Aware On-board Data Services for LEO Satellites | Thomas Sandholm, Sayandev Mukherjee, Bernardo A Huberman | 2024-10-10 | Download | We propose an architecture with accompanying protocol for on-board satellite data infrastructure designed for Low Earth Orbit (LEO) constellations offering communication services, such as direct-to-ce... |
| Exploring the Landscape of Distributed Graph Sketching | David Tench, Evan T. West, Kenny Zhang, Michael Bender, Daniel DeLayo, Martin Farach-Colton, Gilvir Gill, Tyler Seip, Victor Zhang | 2024-10-10 | Download | Recent work has initiated the study of dense graph processing using graph sketching methods, which drastically reduce space costs by lossily compressing information about the input graph. |
| SALINA: Towards Sustainable Live Sonar Analytics in Wild Ecosystems | Chi Xu, Rongsheng Qian, Hao Fang, Xiaoqiang Ma, William I. Atlas, Jiangchuan Liu, Mark A. Spoljaric | 2024-10-10 | Download | Sonar radar captures visual representations of underwater objects and structures using sound wave reflections, making it essential for exploration, mapping, and continuous surveillance in wild ecosyst... |
| Efficient Adaptive Federated Optimization | Su Hyeong Lee, Sidharth Sharma, Manzil Zaheer, Tian Li | 2024-10-10 | Download | Adaptive optimization is critical in federated learning, where enabling adaptivity on both the server and client sides has proven essential for achieving optimal performance. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| "It's Your Turn": A Novel Channel Contention Mechanism for Improving Wi-Fi's Reliability | Francesc Wilhelmi, Lorenzo Galati-Giordano, Gianluca Fontanesi | 2024-10-10 | Download | The next generation of Wi-Fi, i.e., the IEEE 802.11bn (aka Wi-Fi 8), is not only expected to increase its performance and provide extended capabilities but also aims to offer a reliable service. |

cs.PF - Performance

| Title | Authors | Date | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Neural Architecture Search of Hybrid Models for NPU-CIM Heterogeneous AR/VR Devices | Yiwei Zhao, Ziyun Li, Win-San Khwa, Xiaoyu Sun, Sai Qian Zhang, Syed Shakib Sarwar, Kleber Hugo Stangherlin, Yi-Lun Lu, Jorge Tomas Gomez, Jae-Sun Seo, Phillip B. Gibbons, Barbara De Salvo, Chiao Liu | 2024-10-10 | Download | Low-Latency and Low-Power Edge AI is essential for Virtual Reality and Augmented Reality applications. Recent advances show that hybrid models, combining convolution layers (CNN) and transformers (ViT... |
| Catastrophic Cyber Capabilities Benchmark (3CB): Robustly Evaluating LLM Agent Cyber Offense Capabilities | Andrey Anurin, Jonathan Ng, Kibo Schaffer, Jason Schreiber, Esben Kran | 2024-10-10 | Download | LLM agents have the potential to revolutionize defensive cyber operations, but their offensive capabilities are not yet fully understood. To prepare for emerging threats, model developers and governme... |
| Plug-and-Play Performance Estimation for LLM Services without Relying on Labeled Data | Can Wang, Dianbo Sui, Hongliang Sun, Hao Ding, Bolin Zhang, Zhiying Tu | 2024-10-10 | Download | Large Language Model (LLM) services exhibit impressive capability on unlearned tasks leveraging only a few examples by in-context learning (ICL). |
| An Analysis of XML Compression Efficiency | Christopher James Augeri, Barry E. Mullins, Leemon C. Baird, Dursun A. Bulutoglu, Rusty O. Baldwin | 2024-10-10 | Download | XML simplifies data exchange among heterogeneous computers, but it is notoriously verbose and has spawned the development of many XML-specific compressors and binary formats. |
