
2024-08-24

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| SiTe CiM: Signed Ternary Computing-in-Memory for Ultra-Low Precision Deep Neural Networks | Niharika Thakuria, Akul Malhotra, Sandeep K. Thirumala, Reena Elangovan, Anand Raghunathan, Sumeet K. Gupta | 2024-08-24 | Download | Ternary Deep Neural Networks (DNN) have shown a large potential for highly energy-constrained systems by virtue of their low power operation (due to ultra-low precision) with only a mild degradation i... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Unleashing Collaborative Computing for Adaptive Video Streaming with Multi-objective Optimization in Satellite Terrestrial Networks | Zhishu Shen, Qiushi Zheng, Ziqi Rong, Jiong Jin, Atsushi Tagami, Wei Xiang | 2024-08-24 | Download | Satellite-terrestrial networks (STNs) are anticipated to deliver seamless IoT services across expansive regions. Given the constrained resources available for offloading computationally intensive task... |
| Intelligent Router for LLM Workloads: Improving Performance Through Workload-Aware Load Balancing | Kunal Jain, Anjaly Parayil, Ankur Mallick, Esha Choukse, Xiaoting Qin, Jue Zhang, Íñigo Goiri, Rujia Wang, Chetan Bansal, Victor Rühle, Anoop Kulkarni, Steve Kofsky, Saravan Rajmohan | 2024-08-24 | Download | Large Language Model (LLM) workloads have distinct prefill and decode phases with different compute and memory requirements which should ideally be accounted for when scheduling input queries across d... |
| LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs | Chansung Park, Juyong Jiang, Fan Wang, Sayak Paul, Jing Tang | 2024-08-24 | Download | The widespread adoption of cloud-based proprietary large language models (LLMs) has introduced significant challenges, including operational dependencies, privacy concerns, and the necessity of contin... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| SGP-RI: A Real-Time-Trainable and Decentralized IoT Indoor Localization Model Based on Sparse Gaussian Process with Reduced-Dimensional Inputs | Zhe Tang, Sihao Li, Zichen Huang, Guandong Yang, Kyeong Soo Kim, Jeremy S. Smith | 2024-08-24 | Download | With Internet of Things (IoT) devices deployed in the field, there is an enormous amount of untapped potential in local computing on those devices. |
