2024-11-12

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A low-rank balanced truncation approach for large-scale RLCk model order reduction based on extended Krylov subspace and a frequency-aware convergence criterion | Christos Giamouzis, Dimitrios Garyfallou, Nestor Evmorfopoulos, George Stamoulis | 2024-11-12 | Download | Model order reduction (MOR) is essential in integrated circuit design, particularly when dealing with large-scale electromagnetic models extracted from complex designs. |
| MANTIS: A Mixed-Signal Near-Sensor Convolutional Imager SoC Using Charge-Domain 4b-Weighted 5-to-84-TOPS/W MAC Operations for Feature Extraction and Region-of-Interest Detection | Martin Lefebvre, David Bol | 2024-11-12 | Download | Recent advances in artificial intelligence have prompted the search for enhanced algorithms and hardware to support the deployment of machine learning at the edge. |
| Bayes2IMC: In-Memory Computing for Bayesian Binary Neural Networks | Prabodh Katti, Clement Ruah, Osvaldo Simeone, Bashir M. Al-Hashimi, Bipin Rajendran | 2024-11-12 | Download | Bayesian Neural Networks (BNNs) provide superior estimates of uncertainty by generating an ensemble of predictive distributions. However, inference via ensembling is resource-intensive, requiring addi... |
| Web-Based Simulator of Superscalar RISC-V Processors | Jiri Jaros, Michal Majer, Jakub Horky, Jan Vavra | 2024-11-12 | Download | Mastering computational architectures is essential for developing fast and power-efficient programs. Our advanced simulator empowers both IT students and professionals to grasp the fundamentals of sup... |
| RPCAcc: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator | Jie Zhang, Hongjing Huang, Xuzheng Xu, Xiang Li, Jieru Zhao, Ming Liu, Zeke Wang | 2024-11-12 | Download | The emerging microservice/serverless-based cloud programming paradigm and the rising networking speeds leave the RPC stack as the predominant data center tax. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| On the Convergence of Continual Federated Learning Using Incrementally Aggregated Gradients | Satish Kumar Keshri, Nazreen Shah, Ranjitha Prasad | 2024-11-12 | Download | The holy grail of machine learning is to enable Continual Federated Learning (CFL) to enhance the efficiency, privacy, and scalability of AI systems while learning from streaming data. |
| Efficient Federated Finetuning of Tiny Transformers with Resource-Constrained Devices | Kilian Pfeiffer, Mohamed Aboelenien Ahmed, Ramin Khalili, Jörg Henkel | 2024-11-12 | Download | In recent years, Large Language Models (LLMs) through Transformer structures have dominated many machine learning tasks, especially text processing. |
| ALANINE: A Novel Decentralized Personalized Federated Learning For Heterogeneous LEO Satellite Constellation | Liang Zhao, Shenglin Geng, Xiongyan Tang, Ammar Hawbani, Yunhe Sun, Lexi Xu, Daniele Tarchi | 2024-11-12 | Download | Low Earth Orbit (LEO) satellite constellations have seen significant growth and functional enhancement in recent years, which integrates various capabilities like communication, navigation, and remote... |
| A Framework for Carbon-aware Real-Time Workload Management in Clouds using Renewables-driven Cores | Tharindu B. Hewage, Shashikant Ilager, Maria A. Rodriguez, Rajkumar Buyya | 2024-11-12 | Download | Cloud platforms commonly exploit workload temporal flexibility to reduce their carbon emissions. They suspend/resume workload execution for when and where the energy is greenest. |
| A Performance Analysis of BFT Consensus for Blockchains | J. D. Chan, Y. C. Tay, Brian R. Z. Yen | 2024-11-12 | Download | Distributed ledgers are common in the industry. Some of them can use blockchains as their underlying infrastructure. A blockchain requires participants to agree on its contents. |
| Decentralized Network Topology Design for Task Offloading in Mobile Edge Computing | Ke Ma, Junfei Xie | 2024-11-12 | Download | The rise of delay-sensitive yet computing-intensive Internet of Things (IoT) applications poses challenges due to the limited processing power of IoT devices. |
| Input-Based Ensemble-Learning Method for Dynamic Memory Configuration of Serverless Computing Functions | Siddharth Agarwal, Maria A. Rodriguez, Rajkumar Buyya | 2024-11-12 | Download | In today's Function-as-a-Service offerings, a programmer is usually responsible for configuring function memory for its successful execution, which allocates proportional function resources such as CP... |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| A Call to Reconsider Certification Authority Authorization (CAA) | Pouyan Fotouhi Tehrani, Raphael Hiesgen, Thomas C. Schmidt, Matthias Wählisch | 2024-11-12 | Download | Certification Authority Authorization (CAA) is a safeguard against illegitimate certificate issuance. We show how shortcomings in CAA concepts and operational aspects undermine its effectiveness in p... |
| Optimizing Service Function Chain Mapping in Network Function Virtualization through Simultaneous NF Decomposition and VNF Placement | Asghar Asgharian-Sardroud, Mohammad Hossein Izanlou, Amin Jabbari, Sepehr Mahmoodian Hamedani | 2024-11-12 | Download | Network function virtualization enables network operators to implement new services through a process called service function chain mapping. The concept of Service Function Chain (SFC) is introduced t... |
| Trust-Aware Sybil Attack Detection for Resilient Vehicular Communication | Mortan Thomas, Abinash Borah, Anirudh Paranjothi | 2024-11-12 | Download | Connected autonomous vehicles, or Vehicular Ad hoc Networks (VANETs), hold great promise, but concerns persist regarding safety, privacy, and security, particularly in the face of Sybil attacks, where... |
| Decentralized Network Topology Design for Task Offloading in Mobile Edge Computing | Ke Ma, Junfei Xie | 2024-11-12 | Download | The rise of delay-sensitive yet computing-intensive Internet of Things (IoT) applications poses challenges due to the limited processing power of IoT devices. |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| OSCAR-P and aMLLibrary: Profiling and Predicting the Performance of FaaS-based Applications in Computing Continua | Roberto Sala, Bruno Guindani, Enrico Galimberti, Federica Filippini, Hamta Sedghani, Danilo Ardagna, Sebastián Risco, Germán Moltó, Miguel Caballer | 2024-11-12 | Download | This paper proposes an automated framework for efficient application profiling and training of Machine Learning (ML) performance models, composed of two parts: OSCAR-P and aMLLibrary. |
| A Performance Analysis of BFT Consensus for Blockchains | J. D. Chan, Y. C. Tay, Brian R. Z. Yen | 2024-11-12 | Download | Distributed ledgers are common in the industry. Some of them can use blockchains as their underlying infrastructure. A blockchain requires participants to agree on its contents. |
| Faster LLM Inference using DBMS-Inspired Preemption and Cache Replacement Policies | Kyoungmin Kim, Jiacheng Li, Kijae Hong, Anastasia Ailamaki | 2024-11-12 | Download | LLMs are increasingly used world-wide from daily tasks to agentic systems and data analytics, requiring significant GPU resources. LLM inference systems, however, are slow compared to database systems... |
