Skip to content

2024-10-11

cs.AR - Architecture

标题作者发布日期PDF摘要
MFIT: Multi-Fidelity Thermal Modeling for 2.5D and 3D Multi-Chiplet ArchitecturesLukas Pfromm, Alish Kanani, Harsh Sharma, Parth Solanki, Eric Tervo, Jaehyun Park, Janardhan Rao Doppa, Partha Pratim Pande, Umit Y. Ogras2024-10-11下载Rapidly evolving artificial intelligence and machine learning applications require ever-increasing computational capabilities, while monolithic 2D design technologies approach their limits.
Energy-efficient SNN Architecture using 3nm FinFET Multiport SRAM-based CIM with Online LearningLucas Huijbregts, Liu Hsiao-Hsuan, Paul Detterer, Said Hamdioui, Amirreza Yousefzadeh, Rajendra Bishnoi2024-10-11下载Current Artificial Intelligence (AI) computation systems face challenges, primarily from the memory-wall issue, limiting overall system-level performance, especially for Edge devices with constrained ...
Quantum Operating System Support for Quantum Trusted Execution EnvironmentsTheodoros Trochatos, Jakub Szefer2024-10-11下载With the growing reliance on cloud-based quantum computing, ensuring the confidentiality and integrity of quantum computations is paramount. Quantum Trusted Execution Environments (QTEEs) have been pr...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Parallel Watershed Partitioning: GPU-Based Hierarchical Image SegmentationVarduhi Yeghiazaryan, Yeva Gabrielyan, Irina Voiculescu2024-10-11下载Many image processing applications rely on partitioning an image into disjoint regions whose pixels are 'similar.' The watershed and waterfall transforms are established mathematical morphology pixel ...
Understanding the Statistical Accuracy-Communication Trade-off in Personalized Federated Learning with Minimax GuaranteesXin Yu, Zelin He, Ying Sun, Lingzhou Xue, Runze Li2024-10-11下载Personalized federated learning (PFL) offers a flexible framework for aggregating information across distributed clients with heterogeneous data.
A Scored Non-Deterministic Finite Automata Processor for Sequence AlignmentRyan Karbowniczak Rasha Karakchi2024-10-11下载The rapid growth of symbolic data in areas like internet, biological, and financial data has increased the demand for efficient pattern matching and regular expression processing.
MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge DevicesMohamed Amine Hamdi, Francesco Daghero, Giuseppe Maria Sarda, Josse Van Delm, Arne Symons, Luca Benini, Marian Verhelst, Daniele Jahier Pagliari, Alessio Burrello2024-10-11下载Streamlining the deployment of Deep Neural Networks (DNNs) on heterogeneous edge platforms, coupling within the same micro-controller unit (MCU) instruction processors and hardware accelerators for te...
Obelia: Scaling DAG-Based Blockchains to Hundreds of ValidatorsGeorge Danezis, Lefteris Kokoris-Kogias, Alberto Sonnino, Mingwei Tian2024-10-11下载Obelia improves upon structured DAG-based consensus protocols used in proof-of-stake systems, allowing them to effectively scale to accommodate hundreds of validators.
Mahi-Mahi: Low-Latency Asynchronous BFT DAG-Based ConsensusPhilipp Jovanovic, Lefteris Kokoris Kogias, Bryan Kumara, Alberto Sonnino, Pasindu Tennage, Igor Zablotchi2024-10-11下载We present Mahi-Mahi, the first asynchronous BFT consensus protocol that achieves sub-second latency in the WAN while processing over 100,000 transactions per second.
Edge AI Collaborative Learning: Bayesian Approaches to Uncertainty EstimationGleb Radchenko, Victoria Andrea Fill2024-10-11下载Recent advancements in edge computing have significantly enhanced the AI capabilities of Internet of Things (IoT) devices. However, these advancements introduce new challenges in knowledge exchange an...
To Repair or Not to Repair: Assessing Fault Resilience in MPI Stencil ApplicationsRoberto Rocco, Elisabetta Boella, Daniele Gregori, Gianluca Palermo2024-10-11下载With the increasing size of HPC computations, faults are becoming more and more relevant in the HPC field. The MPI standard does not define the application behaviour after a fault, leaving the burden ...
SwitchFS: Asynchronous Metadata Updates for Distributed Filesystems with In-Network CoordinationJingwei Xu, Mingkai Dong, Qiulin Tian, Ziyi Tian, Tong Xin, Haibo Chen2024-10-11下载Distributed filesystem metadata updates are typically synchronous. This creates inherent challenges for access efficiency, load balancing, and directory contention, especially under dynamic and skewed...
Unity is Power: Semi-Asynchronous Collaborative Training of Large-Scale Models with Structured Pruning in Resource-Limited ClientsYan Li, Xiao Zhang, Mingyi Li, Guangwei Xu, Feng Chen, Yuan Yuan, Yifei Zou, Mengying Zhao, Jianbo Lu, Dongxiao Yu2024-10-11下载In this work, we study to release the potential of massive heterogeneous weak computing power to collaboratively train large-scale models on dispersed datasets.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
DAWN: Designing Distributed Agents in a Worldwide NetworkZahra Aminiranjbar, Jianan Tang, Qiudan Wang, Shubha Pant, Mahesh Viswanathan2024-10-11下载The rapid evolution of Large Language Models (LLMs) has transformed them from basic conversational tools into sophisticated entities capable of complex reasoning and decision-making.
Leveraging Internet Principles to Build a Quantum NetworkLeonardo Bacciottini, Matheus Guedes De Andrade, Shahrooz Pouryousef, Emily A. Van Milligen, Aparimit Chandra, Nitish K. Panigrahy, Nageswara S. V. Rao, Gayane Vardoyan, Don Towsley2024-10-11下载Designing an operational architecture for the Quantum Internet is challenging in light of both fundamental limits imposed by physics laws and technological constraints.
Hybrid LLM-DDQN based Joint Optimization of V2I Communication and Autonomous DrivingZijiang Yan, Hao Zhou, Hina Tabassum, Xue Liu2024-10-11下载Large language models (LLMs) have received considerable interest recently due to their outstanding reasoning and comprehension capabilities. This work explores applying LLMs to vehicular networks, aim...
Online Learning for Intelligent Thermal Management of Interference-coupled and Passively Cooled Base StationsZhanwei Yu, Yi Zhao, Xiaoli Chu, Di Yuan2024-10-11下载Passively cooled base stations (PCBSs) have emerged to deliver better cost and energy efficiency. However, passive cooling necessitates intelligent thermal control via traffic management, i.e.
Cross-chain Sharing of Personal Health Records: Heterogeneous and Interoperable BlockchainsYongyang Lv, Xiaohong Li, Yingwenbo Wang, Kui Chen, Zhe Hou, Ruitao Feng2024-10-11下载With the widespread adoption of medical informatics, a wealth of valuable personal health records (PHR) has been generated. Concurrently, blockchain technology has enhanced the security of medical ins...
OpenWiFiSync: Open Source Implementation of a Clock Synchronization Algorithm using Wi-FiM. Gundall, H. D. Schotten2024-10-11下载Precise clock synchronization is an important requirement for distributed and networked industrial use cases. As more and more use cases contain mobile devices, clock synchronization has to be perform...
Bad Neighbors: On Understanding VPN Provider NetworksTeemu Rytilahti, Thorsten Holz2024-10-11下载Virtual Private Network (VPN) solutions are used to connect private networks securely over the Internet. Besides their benefits in corporate environments, VPNs are also marketed to privacy-minded user...
Smart PRACH Jamming: A Serious Threat for 5G Campus NetworksJ. R. Stegmann, M. Gundall, H. D. Schotten2024-10-11下载Smart jamming attacks on cellular campus networks represent an enormous potential threat, especially in the industrial environment. In complex production processes, the disruption of a single wireless...
5G as Enabler for Industrie 4.0 Use Cases: Challenges and ConceptsM. Gundall, J. Schneider, H. D. Schotten, M. Aleksy, D. Schulz, N. Franchi, N. Schwarzenberg, C. Markwart, R. Halfmann, P. Rost, D. Wübben, A. Neumann, M. Düngen, T. Neugebauer, R. Blunk, M. Kus, J. Grießbach2024-10-11下载The increasing demand for highly customized products, as well as flexible production lines, can be seen as trigger for the "fourth industrial revolution", referred to as "Industrie 4.0".
Goal-Oriented Status Updating for Real-time Remote Inference over Networks with Two-Way DelayCagri Ari, Md Kamran Chowdhury Shisher, Yin Sun, Elif Uysal2024-10-11下载We study a setting where an intelligent model (e.g., a pre-trained neural network) infers the real-time value of a target signal using data samples transmitted from a remote source.
Progressive Pruning: Analyzing the Impact of Intersection AttacksChristoph Döpmann, Maximilian Weisenseel, Florian Tschorsch2024-10-11下载Stream-based communication dominates today's Internet, posing unique challenges for anonymous communication networks (ACNs). Traditionally designed for independent messages, ACNs struggle to account f...
Red is Sus: Automated Identification of Low-Quality Service Availability Claims in the US National Broadband MapSyed Tauhidun Nabi, Zhuowei Wen, Brooke Ritter, Shaddi Hasan2024-10-11下载The FCC's National Broadband Map aspires to provide an unprecedented view into broadband availability in the US. However, this map, which also determines eligibility for public grant funding, relies o...
JingZhao: A Framework for Rapid NIC Prototyping in the Domain-Specific-Network EraFan Yang, Zhan Wang, Ning Kang, Zhenlong Ma, Jianxiong Li, Guojun Yuan, Guangming Tan2024-10-11下载The network is becoming domain-specific, which requires on-demand design of the network protocols, as well as the microarchitecture of the NIC. However, to develop such a NIC is not that easy.
Beamforming Design for Intelligent Reffecting Surface Aided Near-Field THz CommunicationsChi Qiu, Qingqing Wu, Wen Chen, Meng Hua, Wanming Hao, Mengnan Jian, Fen Hou2024-10-11下载Intelligent reflecting surface (IRS) operating in the terahertz (THz) band has recently gained considerable interest due to its high spectrum bandwidth.

cs.OS - Operating Systems

标题作者发布日期PDF摘要
SwitchFS: Asynchronous Metadata Updates for Distributed Filesystems with In-Network CoordinationJingwei Xu, Mingkai Dong, Qiulin Tian, Ziyi Tian, Tong Xin, Haibo Chen2024-10-11下载Distributed filesystem metadata updates are typically synchronous. This creates inherent challenges for access efficiency, load balancing, and directory contention, especially under dynamic and skewed...
SoK: Software CompartmentalizationHugo Lefeuvre, Nathan Dautenhahn, David Chisnall, Pierre Olivier2024-10-11下载Decomposing large systems into smaller components with limited privileges has long been recognized as an effective means to minimize the impact of exploits.

cs.PF - Performance

标题作者发布日期PDF摘要
Testing the Unknown: A Framework for OpenMP Testing via Random Program GenerationIgnacio Laguna, Patrick Chapman, Konstantinos Parasyris, Giorgis Georgakoudis, Cindy Rubio-González2024-10-11下载We present a randomized differential testing approach to test OpenMP implementations. In contrast to previous work that manually creates dozens of verification and validation tests, our approach is ab...
Unlocking FedNL: Self-Contained Compute-Optimized ImplementationKonstantin Burlachenko, Peter Richtárik2024-10-11下载Federated Learning (FL) is an emerging paradigm that enables intelligent agents to collaboratively train Machine Learning (ML) models in a distributed manner, eliminating the need for sharing their lo...
SwitchFS: Asynchronous Metadata Updates for Distributed Filesystems with In-Network CoordinationJingwei Xu, Mingkai Dong, Qiulin Tian, Ziyi Tian, Tong Xin, Haibo Chen2024-10-11下载Distributed filesystem metadata updates are typically synchronous. This creates inherent challenges for access efficiency, load balancing, and directory contention, especially under dynamic and skewed...

基于 VitePress 构建