Appearance
2025-08-10
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| The Monte Carlo Method and New Device and Architectural Techniques for Accelerating It | Janith Petangoda, Chatura Samarakoon, James Meech, Divya Thekke Kanapram, Hamid Toshani, Nathaniel Tye, Vasileios Tsoutsouras, Phillip Stanley-Marbell | 2025-08-10 | 下载 | Computing systems interacting with real-world processes must safely and reliably process uncertain data. The Monte Carlo method is a popular approach for computing with such uncertain values. |
| Tasa: Thermal-aware 3D-Stacked Architecture Design with Bandwidth Sharing for LLM Inference | Siyuan He, Peiran Yan, Yandong He, Youwei Zhuo, Tianyu Jia | 2025-08-10 | 下载 | The autoregressive decoding in LLMs is the major inference bottleneck due to the memory-intensive operations and limited hardware bandwidth. 3D-stacked architecture is a promising solution with signif... |
| LP-Spec: Leveraging LPDDR PIM for Efficient LLM Mobile Speculative Inference with Architecture-Dataflow Co-Optimization | Siyuan He, Zhantong Zhu, Yandong He, Tianyu Jia | 2025-08-10 | 下载 | LLM inference on mobile devices faces extraneous challenges due to limited memory bandwidth and computational resources. To address these issues, speculative inference and processing-in-memory (PIM) t... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Enhancing Privacy in Decentralized Min-Max Optimization: A Differentially Private Approach | Yueyang Quan, Chang Wang, Shengjie Zhai, Minghong Fang, Zhuqing Liu | 2025-08-10 | 下载 | Decentralized min-max optimization allows multi-agent systems to collaboratively solve global min-max optimization problems by facilitating the exchange of model updates among neighboring agents, elim... |
| On the Efficiency of Dynamic Transaction Scheduling in Blockchain Sharding | Ramesh Adhikari, Costas Busch, Miroslav Popovic | 2025-08-10 | 下载 | Sharding is a technique to speed up transaction processing in blockchains, where the processing nodes in the blockchain are divided into disjoint groups (shards) that can process transactions ... |
| Real-Time Analysis of Unstructured Data with Machine Learning on Heterogeneous Architectures | Fotis I. Giasemis | 2025-08-10 | 下载 | As the particle physics community needs higher and higher precisions in order to test our current model of the subatomic world, larger and larger datasets are necessary. |
| An Experimental Exploration of In-Memory Computing for Multi-Layer Perceptrons | Pedro Carrinho, Hamid Moghadaspour, Oscar Ferraz, João Dinis Ferreira, Yann Falevoz, Vitor Silva, Gabriel Falcao | 2025-08-10 | 下载 | In modern computer architectures, the performance of many memory-bound workloads (e.g., machine learning, graph processing, databases) is limited by the data movement bottleneck that emerges when tran... |
| FlashMP: Fast Discrete Transform-Based Solver for Preconditioning Maxwell's Equations on GPUs | Haoyuan Zhang, Yaqian Gao, Xinxin Zhang, Jialin Li, Runfeng Jin, Yidong Chen, Feng Zhang, Wu Yuan, Wenpeng Ma, Shan Liang, Jian Zhang, Zhonghua Lu | 2025-08-10 | 下载 | Efficiently solving large-scale linear systems is a critical challenge in electromagnetic simulations, particularly when using the Crank-Nicolson Finite-Difference Time-Domain (CN-FDTD) method. |
| AerialDB: A Federated Peer-to-Peer Spatio-temporal Edge Datastore for Drone Fleets | Shashwat Jaiswal, Suman Raj, Subhajit Sidhanta, Yogesh Simmhan | 2025-08-10 | 下载 | Recent years have seen an unprecedented growth in research that leverages the newest computing paradigm of Internet of Drones, comprising a fleet of connected Unmanned Aerial Vehicles (UAVs) used for ... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Unveiling IPv6 Scanning Dynamics: A Longitudinal Study Using Large Scale Proactive and Passive IPv6 Telescopes | Hammas Bin Tanveer, Wai Sun Chan, Ricky K. P. Mok, Sebastian Kappes, Philipp Richter, Oliver Gasser, John Ronan, Arthur Berger, kc Claffy | 2025-08-10 | 下载 | We introduce new tools and vantage points to develop and integrate proactive techniques to attract IPv6 scan traffic, thus enabling its analysis. |
| The Search for Relevance: A Context-Aware Paradigm Shift in Semantic and Task-Oriented V2X Communications | Luca Lusvarghi, Javier Gozalvez, Baldomero Coll-Perales, Mohammad Irfan Khan, Miguel Sepulcre, Seyhan Ucar, Onur Altintas | 2025-08-10 | 下载 | The design of communication systems has traditionally focused on the reliable and timely delivery of data. However, the scalability challenges faced by the evolution to a 6G-driven society demand new ... |
| CoMoE: Collaborative Optimization of Expert Aggregation and Offloading for MoE-based LLMs at Edge | Muqing Li, Ning Li, Xin Yuan, Wenchao Xu, Quan Chen, Song Guo, Haijun Zhang | 2025-08-10 | 下载 | The proliferation of large language models (LLMs) has driven the adoption of Mixture-of-Experts (MoE) architectures as a promising solution to scale model capacity while controlling computational cost... |
| Mind the IP Gap: Measuring the impact of IPv6 on DNS censorship | Ian Martiny, Hammas Bin Tanveer, Jack Wampler, Rishab Nithyanand, Eric Wustrow | 2025-08-10 | 下载 | Internet censorship impacts large segments of the Internet, but so far, prior work has focused almost exclusively on performing measurements using IPv4. |
| ProtoScan: Measuring censorship in IPv6 | Jack Wampler, Hammas Bin Tanveer, Rishab Nithyanand, Eric Wustrow | 2025-08-10 | 下载 | Internet censorship continues to impact billions of people worldwide, and measurement of it remains an important focus of research. However, most Internet censorship measurements have focused solely o... |