Skip to content

2025-05-05

cs.AR - Architecture

标题作者发布日期PDF摘要
Rapid yet accurate Tile-circuit and device modeling for Analog In-Memory ComputingJ. Luquin, C. Mackin, S. Ambrogio, A. Chen, F. Baldi, G. Miralles, M. J. Rasch, J. Büchel, M. Lalwani, W. Ponghiran, P. Solomon, H. Tsai, G. W. Burr, P. Narayanan2025-05-05下载Analog In-Memory Compute (AIMC) can improve the energy efficiency of Deep Learning by orders of magnitude. Yet analog-domain device and circuit non-idealities -- within the analog ``Tiles'' performing...
Open Challenges for a Production-ready Cloud Environment on top of RISC-V hardwareAaron Call, Ramon Nou, Guillem Senabre2025-05-05下载As part of the Vitamin-V European project, we have built a prototype of a RISC-V cluster managed by OpenStack, with the goal of realizing a functional RISC-V cloud ecosystem.
End-to-end fully-binarized network design: from Generic Learned Thermometer to Block PruningThien Nguyen, William Guicquero2025-05-05下载Existing works on Binary Neural Network (BNN) mainly focus on model's weights and activations while discarding considerations on the input raw data.
Machine-Learning-Powered Neural Interfaces for Smart Prosthetics and DiagnosticsMohammadAli Shaeri, Jinhan Liu, Mahsa Shoaran2025-05-05下载Advanced neural interfaces are transforming applications ranging from neuroscience research to diagnostic tools (for mental state recognition, tremor and seizure detection) as well as prosthetic devic...
NeuroSim V1.5: Improved Software Backbone for Benchmarking Compute-in-Memory Accelerators with Device and Circuit-level Non-idealitiesJames Read, Ming-Yen Lee, Wei-Hsing Huang, Yuan-Chun Luo, Anni Lu, Shimeng Yu2025-05-05下载The exponential growth of artificial intelligence (AI) applications has exposed the inefficiency of conventional von Neumann architectures, where frequent data transfers between compute units and memo...

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Multiscale Parallel Simulation of Malignant Pleural Mesothelioma via Adaptive Domain Partitioning -- an Efficiency Analysis StudyAnton Dolganov, Valeria Krzhizhanovskaya, Stefano Trebeschi, Vivek M. Sheraton2025-05-05下载A novel parallel efficiency analysis on a framework for simulating the growth of Malignant Pleural Mesothelioma (MPM) tumours is presented. Proliferation of MPM tumours in the pleural space is simulat...
"Two-Stagification": Job Dispatching in Large-Scale Clusters via a Two-Stage ArchitectureMert Yildiz, Alexey Rolich, Andrea Baiocchi2025-05-05下载A continuing effort is devoted to devising effective dispatching policies for clusters of First Come First Served servers. Although the optimal solution for dispatchers aware of both job size and serv...
Parallel GPU-Accelerated Randomized Construction of Approximate Cholesky PreconditionersTianyu Liang, Chao Chen, Yotam Yaniv, Hengrui Luo, David Tench, Xiaoye S. Li, Aydin Buluc, James Demmel2025-05-05下载We introduce a parallel algorithm to construct a preconditioner for solving a large, sparse linear system where the coefficient matrix is a Laplacian matrix (a.k.a., graph Laplacian).
ARC-V: Vertical Resource Adaptivity for HPC Workloads in Containerized EnvironmentsDaniel Medeiros, Jeremy J. Williams, Jacob Wahlgren, Leonardo Saud Maia Leite, Ivy Peng2025-05-05下载Existing state-of-the-art vertical autoscalers for containerized environments are traditionally built for cloud applications, which might behave differently than HPC workloads with their dynamic resou...
HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language ModelsZheng Lin, Yuxin Zhang, Zhe Chen, Zihan Fang, Xianhao Chen, Praneeth Vepakomma, Wei Ni, Jun Luo, Yue Gao2025-05-05下载Recently, large language models (LLMs) have achieved remarkable breakthroughs, revolutionizing the natural language processing domain and beyond.
Recolorable Graph Exploration by an Oblivious Agent with Fewer ColorsShota Takahashi, Haruki Kanaya, Shoma Hiraoka, Ryota Eguchi, Yuichi Sudo2025-05-05下载Recently, Böckenhauer, Frei, Unger, and Wehner (SIROCCO 2023) introduced a novel variant of the graph exploration problem in which a single memoryless agent must visit all nodes of an unknown, undirec...
Brief Announcement: Minimizing Energy Solves Relative Majority with a Cubic Number of States in Population ProtocolsTom-Lukas Breitkopf, Julien Dallot, Antoine El-Hayek, Stefan Schmid2025-05-05下载This paper revisits a fundamental distributed computing problem in the population protocol model. Provided nn agents each starting with an input color in [k][k], the relative majority problem asks t...
An Almost Tight Lower Bound for Plurality Consensus with Undecided State Dynamics in the Population Protocol ModelAntoine El-Hayek, Robert Elsässer, Stefan Schmid2025-05-05下载We revisit the majority problem in the population protocol communication model, as first studied by Angluin et al. (Distributed Computing 2008).
Optimistic, Signature-Free Reliable Broadcast and Its ApplicationsNibesh Shrestha, Qianyu Yu, Aniket Kate, Giuliano Losa, Kartik Nayak, Xuechao Wang2025-05-05下载Reliable broadcast (RBC) is a key primitive in fault-tolerant distributed systems, and improving its efficiency can benefit a wide range of applications.
A Unifying Framework to Enable Artificial Intelligence in High Performance Computing WorkflowsJens Domke, Mohamed Wahib, Anshu Dubey, Tal Ben-Nun, Erik W. Draeger2025-05-05下载Current trends point to a future where large-scale scientific applications are tightly-coupled HPC/AI hybrids. Hence, we urgently need to invest in creating a seamless, scalable framework where HPC an...
Open Challenges for a Production-ready Cloud Environment on top of RISC-V hardwareAaron Call, Ramon Nou, Guillem Senabre2025-05-05下载As part of the Vitamin-V European project, we have built a prototype of a RISC-V cluster managed by OpenStack, with the goal of realizing a functional RISC-V cloud ecosystem.
Tight Bounds on Channel Reliability via Generalized Quorum Systems (Extended Version)Alejandro Naser-Pastoriza, Gregory Chockler, Alexey Gotsman, Fedor Ryabinin2025-05-05下载Communication channel failures are a major concern for the developers of modern fault-tolerant systems. However, while tight bounds for process failures are well-established, extending them to include...
Large Language Model Partitioning for Low-Latency Inference at the EdgeDimitrios Kafetzis, Ramin Khalili, Iordanis Koutsopoulos2025-05-05下载Large Language Models (LLMs) based on autoregressive, decoder-only Transformers generate text one token at a time, where a token represents a discrete unit of text.
Moving From Monolithic To Microservices Architecture for Multi-Agent SystemsMuskaan Goyal, Pranav Bhasin2025-05-05下载The transition from monolithic to microservices architecture revolutionized software development by improving scalability and maintainability.
Towards One-shot Federated Learning: Advances, Challenges, and Future DirectionsFlora Amato, Lingyu Qiu, Mohammad Tanveer, Salvatore Cuomo, Fabio Giampaolo, Francesco Piccialli2025-05-05下载One-shot FL enables collaborative training in a single round, eliminating the need for iterative communication, making it particularly suitable for use in resource-constrained and privacy-sensitive ap...
Model Checking and Synthesis for Optimal Use of Knowledge in Consensus ProtocolsKaya Alpturer, Gerald Huang, Ron van der Meyden2025-05-05下载Logics of knowledge and knowledge-based programs provide a way to give abstract descriptions of solutions to problems in fault-tolerant distributed computing, and have been used to derive optimal prot...
Opt-GPTQ: An Optimized GPTQ Combining Sparse Attention and Quantization TechniquesJie Kong, Junxiang Zhang, Jiheng Xu, Yalong Li, Shouhua Zhang, Jiehan Zhou, Yuhai Liu, Peng Liang, Quan Zhang, Luohan Jiang2025-05-05下载In the field of deep learning, traditional attention mechanisms face significant challenges related to high computational complexity and large memory consumption when processing long sequence data.

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
Energy, Scalability, Data and Security in Massive IoT: Current Landscape and Future DirectionsImane Cheikh, Sébastien Roy, Essaid Sabir, Rachid Aouami2025-05-05下载The Massive Internet of Things (MIoT) envisions an interconnected ecosystem of billions of devices, fundamentally transforming diverse sectors such as healthcare, smart cities, transportation, agricul...
Computing in Integrated Terrestrial and Non-Terrestrial Networks: A Comprehensive SurveyHoe Ziet Wong, Insaf Rzig, Safwan Alfattani, Wael Jaafar2025-05-05下载The rapid growth of Internet-of-things (IoT) devices, smart vehicles, and other connected objects is driving demand for ubiquitous connectivity and intensive computing capacity.
Adaptive Budgeted Multi-Armed Bandits for IoT with Dynamic Resource ConstraintsShubham Vaishnav, Praveen Kumar Donta, Sindri Magnússon2025-05-05下载Internet of Things (IoT) systems increasingly operate in environments where devices must respond in real time while managing fluctuating resource constraints, including energy and bandwidth.
A Cross-Layer Analysis of Network Antifragility with RIS-assisted Links under Jamming AttacksMounir Bensalem, Thomas Röthig, Admela Jukan2025-05-05下载Antifragility is an economics term defined as measure of (monetary) benefits gained from the adverse events and variability of the markets. This paper integrates for the first time the antifragility i...
Energy Efficiency Maximization for CR-NOMA based Smart Grid Communication NetworkMubashar Sarfraz, Sheraz Alam, Sajjad A. Ghauri, Asad Mahmood2025-05-05下载Managing massive data flows effectively and resolving spectrum shortages are two challenges that Smart Grid Communication Networks (SGCN) must overcome.
Trustworthy Inter-Provider Agreements in 6G Using a Privacy-Enabled Hybrid Blockchain FrameworkFarhana Javed, Josep Mangues-Bafalluy2025-05-05下载Inter-provider agreements are central to 6G networks, where administrative domains must securely and dynamically share services. To address the dual need for transparency and confidentiality, we propo...
ML-Enabled Eavesdropper Detection in Beyond 5G IIoT NetworksMaria-Lamprini A. Bartsioka, Ioannis A. Bartsiokas, Panagiotis K. Gkonis, Dimitra I. Kaklamani, Iakovos S. Venieris2025-05-05下载Advanced fifth generation (5G) and beyond (B5G) communication networks have revolutionized wireless technologies, supporting ultra-high data rates, low latency, and massive connectivity.

cs.PF - Performance

标题作者发布日期PDF摘要
"Two-Stagification": Job Dispatching in Large-Scale Clusters via a Two-Stage ArchitectureMert Yildiz, Alexey Rolich, Andrea Baiocchi2025-05-05下载A continuing effort is devoted to devising effective dispatching policies for clusters of First Come First Served servers. Although the optimal solution for dispatchers aware of both job size and serv...
Automotive Middleware Performance: Comparison of FastDDS, Zenoh and vSomeIPDavid Philipp Klüner, Lucas Hegerath, Amin Dieter Hatib, Stefan Kowalewski, Bassam Alrifaee, Alexandru Kampmann2025-05-05下载In this study, we evaluate the performance of current automotive communication middlewares under various operating conditions. Specifically, we examine FastDDS, a widely used open-source middleware, t...
Spatiotemporal Non-Uniformity-Aware Online Task Scheduling in Collaborative Edge Computing for Industrial Internet of ThingsYang Li, Xing Zhang, Yukun Sun, Wenbo Wang, Bo Lei2025-05-05下载Mobile edge computing mitigates the shortcomings of cloud computing caused by unpredictable wide-area network latency and serves as a critical enabling technology for the Industrial Internet of Things...
An Empirical Study on the Performance and Energy Usage of Compiled Python CodeVincenzo Stoico, Andrei Calin Dragomir, Patricia Lago2025-05-05下载Python is a popular programming language known for its ease of learning and extensive libraries. However, concerns about performance and energy consumption have led to the development of compilers to ...

基于 VitePress 构建