Skip to content

2025-12-20

cs.AR - Architecture

标题作者发布日期PDF摘要
Weight Transformations in Bit-Sliced Crossbar Arrays for Fault Tolerant Computing-in-Memory: Design Techniques and Evaluation FrameworkAkul Malhotra, Sumeet Kumar Gupta2025-12-20下载The deployment of deep neural networks (DNNs) on compute-in-memory (CiM) accelerators offers significant energy savings and speed-up by reducing data movement during inference.
Theodosian: A Deep Dive into Memory-Hierarchy-Centric FHE AccelerationWonseok Choi, Hyunah Yu, Jongmin Kim, Hyesung Ji, Jaiyoung Park, Jung Ho Ahn2025-12-20下载Fully homomorphic encryption (FHE) enables secure computation on encrypted data, mitigating privacy concerns in cloud and edge environments. However, due to its high compute and memory demands, extens...
BARD: Reducing Write Latency of DDR5 Memory by Exploiting Bank-ParallelismSuhas Vittal, Moinuddin Qureshi2025-12-20下载This paper studies the impact of DRAM writes on DDR5-based system. To efficiently perform DRAM writes, modern systems buffer write requests and try to complete multiple write operations whenever the D...
PIM-FW: Hardware-Software Co-Design of All-pairs Shortest Paths in DRAMTsung-Han Lu, Zheyu Li, Minxuan Zhou, Tajana Rosing2025-12-20下载All-pairs shortest paths (APSP) is a fundamental algorithm used for routing, logistics, and network analysis, but the cubic time complexity and heavy data movement of the canonical Floyd-Warshall (FW)...
Making Strong Error-Correcting Codes Work Effectively for HBM in AI InferenceRui Xie, Yunhua Fang, Asad Ul Haq, Linsen Ma, Sanchari Sen, Swagath Venkataramani, Liu Liu, Tong Zhang2025-12-20下载LLM inference is increasingly memory bound, and HBM cost per GB dominates system cost. Current HBM stacks include short on-die ECC that tightens binning, raises price, and fixes reliability policy ins...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Snowveil: A Framework for Decentralised Preference DiscoveryGrammateia Kotsialou2025-12-20下载Aggregating subjective preferences of a large group is a fundamental challenge in computational social choice, traditionally reliant on central authorities.
MatKV: Trading Compute for Flash Storage in LLM InferenceKun-Woo Shin, Jay H. Park, Moonwook Oh, Yohan Jo, Jaeyoung Do, Sang-Won Lee2025-12-20下载We observe two major trends in LLM-based generative AI: (1) inference is becoming the dominant factor in terms of cost and power consumption, surpassing training, and (2) retrieval augmented generatio...
Faster Vertex Cover Algorithms on GPUs with Component-Aware Parallel BranchingHussein Amro, Basel Fakhri, Amer E. Mouawad, Izzat El Hajj2025-12-20下载Algorithms for finding minimum or bounded vertex covers in graphs use a branch-and-reduce strategy, which involves exploring a highly imbalanced search tree.
Asynchronous Pipeline Parallelism for Real-Time Multilingual Lip Synchronization in Video Communication SystemsEren Caglar, Amirkia Rafiei Oskooei, Mehmet Kutanoglu, Mustafa Keles, Mehmet S. Aktas2025-12-20下载This paper introduces a parallel and asynchronous Transformer framework designed for efficient and accurate multilingual lip synchronization in real-time video conferencing systems.
TraCT: Disaggregated LLM Serving with CXL Shared Memory KV Cache at Rack-ScaleDongha Yoon, Younghoon Min, Hoshik Kim, Sam H. Noh, Jongryool Kim2025-12-20下载Disaggregated LLM serving improves resource efficiency by separating the compute-intensive prefill phase from the latency-critical decode phase.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Protecting Human Activity Signatures in Compressed IEEE 802.11 CSI FeedbackMohamed Seif, Atsutse Kludze, Yasaman Ghasempour, H. Vincent Poor, Doru Calin, Andrea J. Goldsmith2025-12-20下载Explicit channel state information (CSI) feedback in IEEE~802.11 conveys \emph{transmit beamforming directions} by reporting quantized Givens rotation and phase angles that parametrize the right-singu...
Implementing Transport Coding in OMNeT++ for Message Delay ReductionIlya Petrovanov, Anton Sergeev2025-12-20下载Transport coding reduces message delay in packet-switched networks by introducing controlled redundancy at the transport layer: kk original packets are encoded into nkn\ge k coded packets, and the me...
Asynchronous Pipeline Parallelism for Real-Time Multilingual Lip Synchronization in Video Communication SystemsEren Caglar, Amirkia Rafiei Oskooei, Mehmet Kutanoglu, Mustafa Keles, Mehmet S. Aktas2025-12-20下载This paper introduces a parallel and asynchronous Transformer framework designed for efficient and accurate multilingual lip synchronization in real-time video conferencing systems.
TCP BBR Performance over Wi-Fi~6: AQM Impacts and Cross-Layer InsightsShyam Kumar Shrestha, Shiva Raj Pokhrel, Jonathan Kua2025-12-20下载We evaluate TCP BBRv3 on Wi-Fi 6 home networks under modern AQM schemes using a fully wireless testbed and a simple cross-layer model linking Wi-Fi scheduling, router queueing, and BBRv3's pacing dyna...
Performance Guarantees for Data Freshness in Resource-Constrained Adversarial IoT SystemsAresh Dadlani, Muthukrishnan Senthil Kumar, Omid Ardakanian, Ioanis Nikolaidis2025-12-20下载Timely updates are critical for real-time monitoring and control applications powered by the Internet of Things (IoT). As these systems scale, they become increasingly vulnerable to adversarial attack...

cs.OS - Operating Systems

标题作者发布日期PDF摘要
VeruSAGE: A Study of Agent-Based Verification for Rust SystemsChenyuan Yang, Natalie Neamtu, Chris Hawblitzel, Jacob R. Lorch, Shan Lu2025-12-20下载Large language models (LLMs) have shown impressive capability to understand and develop code. However, their capability to rigorously reason about and prove code correctness remains in question.

cs.PF - Performance

标题作者发布日期PDF摘要
Age of Information with Age-Dependent Server SelectionNail Akar, Ismail Cosandal, Sennur Ulukus2025-12-20下载In this paper, we consider a single-source multi-server generate-at-will discrete-time non-preemptive status update system where update packets are transmitted using {\em only one} of the available se...
Performance Guarantees for Data Freshness in Resource-Constrained Adversarial IoT SystemsAresh Dadlani, Muthukrishnan Senthil Kumar, Omid Ardakanian, Ioanis Nikolaidis2025-12-20下载Timely updates are critical for real-time monitoring and control applications powered by the Internet of Things (IoT). As these systems scale, they become increasingly vulnerable to adversarial attack...

基于 VitePress 构建