Skip to content

2024-12-15

cs.AR - Architecture

标题作者发布日期PDF摘要
Nanoscaling Floating-Point (NxFP): NanoMantissa, Adaptive Microexponents, and Code Recycling for Direct-Cast Compression of Large Language ModelsYun-Chen Lo, Gu-Yeon Wei, David Brooks2024-12-15下载As cutting-edge large language models (LLMs) continue to transform various industries, their fast-growing model size and sequence length have led to memory traffic and capacity challenges.
ChipAlign: Instruction Alignment in Large Language Models for Chip Design via Geodesic InterpolationChenhui Deng, Yunsheng Bai, Haoxing Ren2024-12-15下载Recent advancements in large language models (LLMs) have expanded their application across various domains, including chip design, where domain-adapted chip models like ChipNeMo have emerged.
CoopetitiveV: Leveraging LLM-powered Coopetitive Multi-Agent Prompting for High-quality Verilog GenerationZhendong Mi, Renming Zheng, Haowen Zhong, Yue Sun, Seth Kneeland, Sayan Moitra, Ken Kutzer, Zhaozhuo Xu Shaoyi Huang2024-12-15下载Recent advances in agentic LLMs have demonstrated great capabilities in Verilog code generation. However, existing approaches either use LLM-assisted single-agent prompting or cooperation-only multi-a...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Nanoscaling Floating-Point (NxFP): NanoMantissa, Adaptive Microexponents, and Code Recycling for Direct-Cast Compression of Large Language ModelsYun-Chen Lo, Gu-Yeon Wei, David Brooks2024-12-15下载As cutting-edge large language models (LLMs) continue to transform various industries, their fast-growing model size and sequence length have led to memory traffic and capacity challenges.
GAP: Game Theory-Based Approach for Reliability and Power Management in Emerging Fog ComputingAbolfazl Younesi, Mohsen Ansari, Alireza Ejlali, Mohammad Amin Fazli, Muhammad Shafique, Jörg Henkel2024-12-15下载Fog computing brings about a transformative shift in data management, presenting unprecedented opportunities for enhanced performance and reduced latency.
ProFe: Communication-Efficient Decentralized Federated Learning via Distillation and PrototypesPedro Miguel Sánchez Sánchez, Enrique Tomás Martínez Beltrán, Miguel Fernández Llamas, Gérôme Bovet, Gregorio Martínez Pérez, Alberto Huertas Celdrán2024-12-15下载Decentralized Federated Learning (DFL) trains models in a collaborative and privacy-preserving manner while removing model centralization risks and improving communication bottlenecks.
Deterministic Even-Cycle Detection in Broadcast CONGESTPierre Fraigniaud, Maël Luce, Frédéric Magniez, Ioan Todinca2024-12-15下载We show that, for every k2k\geq 2, C2kC_{2k}-freeness can be decided in O(n11/k)O(n^{1-1/k}) rounds in the Broadcast CONGEST model, by a deterministic algorithm.
MAP-UOT: A Memory-Efficient Approach to Unbalanced Optimal Transport ImplementationChengyu Sun, Jinyu Hu, Hong Jiang2024-12-15下载Unbalanced optimal transport (UOT) has been widely used as a fundamental tool in many application domains, where it often dominates the application running time.
SparseMap: Loop Mapping for Sparse CNNs on Streaming Coarse-grained Reconfigurable ArrayXiaobing Ni, Mengke Ge, Jiaheng Ruan, Song Chen, Yi Kang2024-12-15下载Streaming coarse-grained reconfgurable array (CGRA) is a promising architecture for data/computing-intensive applications because of its fexibility, high throughput and efcient memory system.
FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor CoresJinliang Shi, Shigang Li, Youxuan Xu, Rongtian Fu, Xueying Wang, Tong Wu2024-12-15下载Sparse Matrix-matrix Multiplication (SpMM) and Sampled Dense-dense Matrix Multiplication (SDDMM) are important sparse operators in scientific computing and deep learning.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Grey Wolf-Based Task Scheduling in Vehicular Fog Computing SystemsMaryam Taghizadeh, Mahmood Ahmadi2024-12-15下载Vehicular fog computing (VFC) can be considered as an important alternative to address the existing challenges in intelligent transportation systems (ITS).
ProFe: Communication-Efficient Decentralized Federated Learning via Distillation and PrototypesPedro Miguel Sánchez Sánchez, Enrique Tomás Martínez Beltrán, Miguel Fernández Llamas, Gérôme Bovet, Gregorio Martínez Pérez, Alberto Huertas Celdrán2024-12-15下载Decentralized Federated Learning (DFL) trains models in a collaborative and privacy-preserving manner while removing model centralization risks and improving communication bottlenecks.
Interference in Wireless Networks -- A Power Allocation ApproachTzalik Maimon, Shirley Alus, Gil Kedar2024-12-15下载Co-Channel Interference (CCI) is a fundamental problem in wireless communication networks. It is a well-studied problem in the field. As channels use the same frequency, interference in the radio wave...
Communications over Unlicensed sub-8 GHz Spectrum: Opportunities and ChallengesKarim Saifullin, Hussein Al-Shatri, Mohamed-Slim Alouini2024-12-15下载The utilization of unlicensed spectrum presents a promising solution to the issue of spectrum scarcity in densely populated areas, while also offering a cost-effective means to connect underserved reg...

基于 VitePress 构建