Skip to content

2025-08-10

cs.AR - Architecture

标题作者发布日期PDF摘要
The Monte Carlo Method and New Device and Architectural Techniques for Accelerating ItJanith Petangoda, Chatura Samarakoon, James Meech, Divya Thekke Kanapram, Hamid Toshani, Nathaniel Tye, Vasileios Tsoutsouras, Phillip Stanley-Marbell2025-08-10下载Computing systems interacting with real-world processes must safely and reliably process uncertain data. The Monte Carlo method is a popular approach for computing with such uncertain values.
Tasa: Thermal-aware 3D-Stacked Architecture Design with Bandwidth Sharing for LLM InferenceSiyuan He, Peiran Yan, Yandong He, Youwei Zhuo, Tianyu Jia2025-08-10下载The autoregressive decoding in LLMs is the major inference bottleneck due to the memory-intensive operations and limited hardware bandwidth. 3D-stacked architecture is a promising solution with signif...
LP-Spec: Leveraging LPDDR PIM for Efficient LLM Mobile Speculative Inference with Architecture-Dataflow Co-OptimizationSiyuan He, Zhantong Zhu, Yandong He, Tianyu Jia2025-08-10下载LLM inference on mobile devices faces extraneous challenges due to limited memory bandwidth and computational resources. To address these issues, speculative inference and processing-in-memory (PIM) t...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Enhancing Privacy in Decentralized Min-Max Optimization: A Differentially Private ApproachYueyang Quan, Chang Wang, Shengjie Zhai, Minghong Fang, Zhuqing Liu2025-08-10下载Decentralized min-max optimization allows multi-agent systems to collaboratively solve global min-max optimization problems by facilitating the exchange of model updates among neighboring agents, elim...
On the Efficiency of Dynamic Transaction Scheduling in Blockchain ShardingRamesh Adhikari, Costas Busch, Miroslav Popovic2025-08-10下载Sharding is a technique to speed up transaction processing in blockchains, where the nn processing nodes in the blockchain are divided into ss disjoint groups (shards) that can process transactions ...
Real-Time Analysis of Unstructured Data with Machine Learning on Heterogeneous ArchitecturesFotis I. Giasemis2025-08-10下载As the particle physics community needs higher and higher precisions in order to test our current model of the subatomic world, larger and larger datasets are necessary.
An Experimental Exploration of In-Memory Computing for Multi-Layer PerceptronsPedro Carrinho, Hamid Moghadaspour, Oscar Ferraz, João Dinis Ferreira, Yann Falevoz, Vitor Silva, Gabriel Falcao2025-08-10下载In modern computer architectures, the performance of many memory-bound workloads (e.g., machine learning, graph processing, databases) is limited by the data movement bottleneck that emerges when tran...
FlashMP: Fast Discrete Transform-Based Solver for Preconditioning Maxwell's Equations on GPUsHaoyuan Zhang, Yaqian Gao, Xinxin Zhang, Jialin Li, Runfeng Jin, Yidong Chen, Feng Zhang, Wu Yuan, Wenpeng Ma, Shan Liang, Jian Zhang, Zhonghua Lu2025-08-10下载Efficiently solving large-scale linear systems is a critical challenge in electromagnetic simulations, particularly when using the Crank-Nicolson Finite-Difference Time-Domain (CN-FDTD) method.
AerialDB: A Federated Peer-to-Peer Spatio-temporal Edge Datastore for Drone FleetsShashwat Jaiswal, Suman Raj, Subhajit Sidhanta, Yogesh Simmhan2025-08-10下载Recent years have seen an unprecedented growth in research that leverages the newest computing paradigm of Internet of Drones, comprising a fleet of connected Unmanned Aerial Vehicles (UAVs) used for ...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Unveiling IPv6 Scanning Dynamics: A Longitudinal Study Using Large Scale Proactive and Passive IPv6 TelescopesHammas Bin Tanveer, Wai Sun Chan, Ricky K. P. Mok, Sebastian Kappes, Philipp Richter, Oliver Gasser, John Ronan, Arthur Berger, kc Claffy2025-08-10下载We introduce new tools and vantage points to develop and integrate proactive techniques to attract IPv6 scan traffic, thus enabling its analysis.
The Search for Relevance: A Context-Aware Paradigm Shift in Semantic and Task-Oriented V2X CommunicationsLuca Lusvarghi, Javier Gozalvez, Baldomero Coll-Perales, Mohammad Irfan Khan, Miguel Sepulcre, Seyhan Ucar, Onur Altintas2025-08-10下载The design of communication systems has traditionally focused on the reliable and timely delivery of data. However, the scalability challenges faced by the evolution to a 6G-driven society demand new ...
CoMoE: Collaborative Optimization of Expert Aggregation and Offloading for MoE-based LLMs at EdgeMuqing Li, Ning Li, Xin Yuan, Wenchao Xu, Quan Chen, Song Guo, Haijun Zhang2025-08-10下载The proliferation of large language models (LLMs) has driven the adoption of Mixture-of-Experts (MoE) architectures as a promising solution to scale model capacity while controlling computational cost...
Mind the IP Gap: Measuring the impact of IPv6 on DNS censorshipIan Martiny, Hammas Bin Tanveer, Jack Wampler, Rishab Nithyanand, Eric Wustrow2025-08-10下载Internet censorship impacts large segments of the Internet, but so far, prior work has focused almost exclusively on performing measurements using IPv4.
ProtoScan: Measuring censorship in IPv6Jack Wampler, Hammas Bin Tanveer, Rishab Nithyanand, Eric Wustrow2025-08-10下载Internet censorship continues to impact billions of people worldwide, and measurement of it remains an important focus of research. However, most Internet censorship measurements have focused solely o...

基于 VitePress 构建