Skip to content

2024-07-15

cs.AR - Architecture

标题作者发布日期PDF摘要
FabGPT: An Efficient Large Multimodal Model for Complex Wafer Defect Knowledge QueriesYuqi Jiang, Xudong Lu, Qian Jin, Qi Sun, Hanming Wu, Cheng Zhuo2024-07-15下载Intelligence is key to advancing integrated circuit (IC) fabrication. Recent breakthroughs in Large Multimodal Models (LMMs) have unlocked extraditionary abilities in understanding images and text, fo...
Assessing the Performance of Stateful Logic in 1-Selector-1-RRAM Crossbar ArraysArjun Tyagi, Shahar Kvatinsky2024-07-15下载Resistive Random Access Memory (RRAM) crossbar arrays are an attractive memory structure for emerging nonvolatile memory due to their high density and excellent scalability.
SOFA: A Compute-Memory Optimized Sparsity Accelerator via Cross-Stage Coordinated TilingHuizheng Wang, Jiahao Fang, Xinru Tang, Zhiheng Yue, Jinxi Li, Yubin Qin, Sihan Guan, Qize Yang, Yang Wang, Chao Li, Yang Hu, Shouyi Yin2024-07-15下载Benefiting from the self-attention mechanism, Transformer models have attained impressive contextual comprehension capabilities for lengthy texts.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Quality Scalable Quantization Methodology for Deep Learning on EdgeSalman Abdul Khaliq, Rehan Hafiz2024-07-15下载Deep Learning Architectures employ heavy computations and bulk of the computational energy is taken up by the convolution operations in the Convolutional Neural Networks.
Fast Matrix Multiplications for Lookup Table-Quantized LLMsHan Guo, William Brandon, Radostin Cholakov, Jonathan Ragan-Kelley, Eric P. Xing, Yoon Kim2024-07-15下载The deployment of large language models (LLMs) is often constrained by memory bandwidth, where the primary bottleneck is the cost of transferring model parameters from the GPU's global memory to its r...
Error Bounds for the Network Scale-Up MethodSergio Díaz-Aranda, Juan Marcos Ramírez, Mohit Daga, Jaya Prakash Champati, José Aguilar, Rosa Elvira Lillo, Antonio Fernández Anta2024-07-15下载Epidemiologists and social scientists have used the Network Scale-Up Method (NSUM) for over thirty years to estimate the size of a hidden sub-population within a social network.
Comprehensive Review of Performance Optimization Strategies for Serverless Applications on AWS LambdaMohamed Lemine El Bechir, Cheikh Sad Bouh, Abobakr Shuwail2024-07-15下载This review paper synthesizes the latest research on performance optimization strategies for serverless applications deployed on AWS Lambda. By examining recent studies, we highlight the challenges, s...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Joint Optimization of Completion Ratio and Latency of Offloaded Tasks with Multiple Priority Levels in 5G EdgeParisa Fard Moshiri, Murat Simsek, Burak Kantarci2024-07-15下载Multi-Access Edge Computing (MEC) is widely recognized as an essential enabler for applications that necessitate minimal latency. However, the dropped task ratio metric has not been studied thoroughly...
Friedkin-Johnsen Model for Opinion Dynamics on Signed GraphsXiaotian Zhou, Haoxin Sun, Wanyue Xu, Wei Li, Zhongzhi Zhang2024-07-15下载A signed graph offers richer information than an unsigned graph, since it describes both collaborative and competitive relationships in social networks.
Distributed Scheduling for Throughput Maximization under Deadline Constraint in Wireless Mesh NetworksXin Wang, Xudong Wang2024-07-15下载This paper studies the distributed scheduling of traffic flows with arbitrary deadlines that arrive at their source nodes and are transmitted to different destination nodes via multiple intermediate n...
E-Commerce Product Recommendation System based on ML AlgorithmsMd. Zahurul Haque2024-07-15下载Algorithms are used in eCommerce product recommendation systems. These systems just recently began utilizing machine learning algorithms due to the development and growth of the artificial intelligenc...

cs.PF - Performance

标题作者发布日期PDF摘要
ConvBench: A Comprehensive Benchmark for 2D Convolution Primitive EvaluationLucas Alvarenga, Victor Ferrari, Rafael Souza, Marcio Pereira, Guido Araujo2024-07-15下载Convolution is a compute-intensive operation placed at the heart of Convolution Neural Networks (CNNs). It has led to the development of many high-performance algorithms, such as Im2col-GEMM, Winograd...
Assessing the Impact of Network Quality-of-Service on Metaverse Virtual Reality User ExperienceRahul Dev Tripathi, Minzhao Lyu, Vijay Sivaraman2024-07-15下载Metaverse virtual reality (VR) applications enable users to socialise, work, entertain, and study online with immersive experiences beyond the classic PC-based interactions.

基于 VitePress 构建