Appearance
2024-07-15
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| FabGPT: An Efficient Large Multimodal Model for Complex Wafer Defect Knowledge Queries | Yuqi Jiang, Xudong Lu, Qian Jin, Qi Sun, Hanming Wu, Cheng Zhuo | 2024-07-15 | 下载 | Intelligence is key to advancing integrated circuit (IC) fabrication. Recent breakthroughs in Large Multimodal Models (LMMs) have unlocked extraditionary abilities in understanding images and text, fo... |
| Assessing the Performance of Stateful Logic in 1-Selector-1-RRAM Crossbar Arrays | Arjun Tyagi, Shahar Kvatinsky | 2024-07-15 | 下载 | Resistive Random Access Memory (RRAM) crossbar arrays are an attractive memory structure for emerging nonvolatile memory due to their high density and excellent scalability. |
| SOFA: A Compute-Memory Optimized Sparsity Accelerator via Cross-Stage Coordinated Tiling | Huizheng Wang, Jiahao Fang, Xinru Tang, Zhiheng Yue, Jinxi Li, Yubin Qin, Sihan Guan, Qize Yang, Yang Wang, Chao Li, Yang Hu, Shouyi Yin | 2024-07-15 | 下载 | Benefiting from the self-attention mechanism, Transformer models have attained impressive contextual comprehension capabilities for lengthy texts. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Quality Scalable Quantization Methodology for Deep Learning on Edge | Salman Abdul Khaliq, Rehan Hafiz | 2024-07-15 | 下载 | Deep Learning Architectures employ heavy computations and bulk of the computational energy is taken up by the convolution operations in the Convolutional Neural Networks. |
| Fast Matrix Multiplications for Lookup Table-Quantized LLMs | Han Guo, William Brandon, Radostin Cholakov, Jonathan Ragan-Kelley, Eric P. Xing, Yoon Kim | 2024-07-15 | 下载 | The deployment of large language models (LLMs) is often constrained by memory bandwidth, where the primary bottleneck is the cost of transferring model parameters from the GPU's global memory to its r... |
| Error Bounds for the Network Scale-Up Method | Sergio Díaz-Aranda, Juan Marcos Ramírez, Mohit Daga, Jaya Prakash Champati, José Aguilar, Rosa Elvira Lillo, Antonio Fernández Anta | 2024-07-15 | 下载 | Epidemiologists and social scientists have used the Network Scale-Up Method (NSUM) for over thirty years to estimate the size of a hidden sub-population within a social network. |
| Comprehensive Review of Performance Optimization Strategies for Serverless Applications on AWS Lambda | Mohamed Lemine El Bechir, Cheikh Sad Bouh, Abobakr Shuwail | 2024-07-15 | 下载 | This review paper synthesizes the latest research on performance optimization strategies for serverless applications deployed on AWS Lambda. By examining recent studies, we highlight the challenges, s... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Joint Optimization of Completion Ratio and Latency of Offloaded Tasks with Multiple Priority Levels in 5G Edge | Parisa Fard Moshiri, Murat Simsek, Burak Kantarci | 2024-07-15 | 下载 | Multi-Access Edge Computing (MEC) is widely recognized as an essential enabler for applications that necessitate minimal latency. However, the dropped task ratio metric has not been studied thoroughly... |
| Friedkin-Johnsen Model for Opinion Dynamics on Signed Graphs | Xiaotian Zhou, Haoxin Sun, Wanyue Xu, Wei Li, Zhongzhi Zhang | 2024-07-15 | 下载 | A signed graph offers richer information than an unsigned graph, since it describes both collaborative and competitive relationships in social networks. |
| Distributed Scheduling for Throughput Maximization under Deadline Constraint in Wireless Mesh Networks | Xin Wang, Xudong Wang | 2024-07-15 | 下载 | This paper studies the distributed scheduling of traffic flows with arbitrary deadlines that arrive at their source nodes and are transmitted to different destination nodes via multiple intermediate n... |
| E-Commerce Product Recommendation System based on ML Algorithms | Md. Zahurul Haque | 2024-07-15 | 下载 | Algorithms are used in eCommerce product recommendation systems. These systems just recently began utilizing machine learning algorithms due to the development and growth of the artificial intelligenc... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| ConvBench: A Comprehensive Benchmark for 2D Convolution Primitive Evaluation | Lucas Alvarenga, Victor Ferrari, Rafael Souza, Marcio Pereira, Guido Araujo | 2024-07-15 | 下载 | Convolution is a compute-intensive operation placed at the heart of Convolution Neural Networks (CNNs). It has led to the development of many high-performance algorithms, such as Im2col-GEMM, Winograd... |
| Assessing the Impact of Network Quality-of-Service on Metaverse Virtual Reality User Experience | Rahul Dev Tripathi, Minzhao Lyu, Vijay Sivaraman | 2024-07-15 | 下载 | Metaverse virtual reality (VR) applications enable users to socialise, work, entertain, and study online with immersive experiences beyond the classic PC-based interactions. |