Appearance
2024-12-03
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Massimult: A Novel Parallel CPU Architecture Based on Combinator Reduction | Jurgen Nicklisch-Franken, Ruslan Feizerakhmanov | 2024-12-03 | 下载 | The Massimult project aims to design and implement an innovative CPU architecture based on combinator reduction with a novel combinator base and a new abstract machine. |
| The Tiny Median Filter: A Small Size, Flexible Arbitrary Percentile Finder Scheme Suitable for FPGA Implementation | Jinyuan Wu | 2024-12-03 | 下载 | This document reports the design, implementation and testing of a small silicon resource usage, very flexible arbitrary percentile finding scheme called the Tiny Median Filter. |
| PrefixLLM: LLM-aided Prefix Circuit Design | Weihua Xiao, Venkata Sai Charan Putrevu, Raghu Vamshi Hemadri, Siddharth Garg, Ramesh Karri | 2024-12-03 | 下载 | Prefix circuits are fundamental components in digital adders, widely used in digital systems due to their efficiency in calculating carry signals. |
| ML-based AIG Timing Prediction to Enhance Logic Optimization | Wenjing Jiang, Jin Yan, Sachin S. Sapatnekar | 2024-12-03 | 下载 | As circuit designs become more intricate, obtaining accurate performance estimation in early stages, for effective design space exploration, becomes more time-consuming. |
| MASIM: An Efficient Multi-Array Scheduler for In-Memory SIMD Computation | Xingyue Qian, Chen Nie, Zhezhi He, Weikang Qian | 2024-12-03 | 下载 | Single instruction, multiple data (SIMD) is a popular design style of in-memory computing (IMC) architectures, which enables memory arrays to perform logic operations to achieve low energy consumption... |
| Compromising the Intelligence of Modern DNNs: On the Effectiveness of Targeted RowPress | Ranyang Zhou, Jacqueline T. Liu, Sabbir Ahmed, Shaahin Angizi, Adnan Siraj Rakin | 2024-12-03 | 下载 | Recent advancements in side-channel attacks have revealed the vulnerability of modern Deep Neural Networks (DNNs) to malicious adversarial weight attacks. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| GoldFish: Serverless Actors with Short-Term Memory State for the Edge-Cloud Continuum | Cynthia Marcelino, Jack Shahhoud, Stefan Nastic | 2024-12-03 | 下载 | Serverless Computing is a computing paradigm that provides efficient infrastructure management and elastic scalability. Serverless functions scale up or down based on demand, which means that function... |
| QPET: A Versatile and Portable Quantity-of-Interest-Preservation Framework for Error-Bounded Lossy Compression | Jinyang Liu, Pu Jiao, Kai Zhao, Xin Liang, Sheng Di, Franck Cappello | 2024-12-03 | 下载 | Error-bounded lossy compression has been widely adopted in many scientific domains because it can address the challenges in storing, transferring, and analyzing unprecedented amounts of scientific dat... |
| Taurus Database: How to be Fast, Available, and Frugal in the Cloud | Alex Depoutovitch, Chong Chen, Jin Chen, Paul Larson, Shu Lin, Jack Ng, Wenlin Cui, Qiang Liu, Wei Huang, Yong Xiao, Yongjun He | 2024-12-03 | 下载 | Using cloud Database as a Service (DBaaS) offerings instead of on-premise deployments is increasingly common. Key advantages include improved availability and scalability at a lower cost than on-premi... |
| Massimult: A Novel Parallel CPU Architecture Based on Combinator Reduction | Jurgen Nicklisch-Franken, Ruslan Feizerakhmanov | 2024-12-03 | 下载 | The Massimult project aims to design and implement an innovative CPU architecture based on combinator reduction with a novel combinator base and a new abstract machine. |
| Performance Debugging through Microarchitectural Sensitivity and Causality Analysis | Alban Dutilleul, Hugo Pompougnac, Nicolas Derumigny, Gabriel Rodriguez, Valentin Trophime, Christophe Guillon, Fabrice Rastello | 2024-12-03 | 下载 | Modern Out-of-Order (OoO) CPUs are complex systems with many components interleaved in non-trivial ways. Pinpointing performance bottlenecks and understanding the underlying causes of program performa... |
| Resource-Adaptive Successive Doubling for Hyperparameter Optimization with Large Datasets on High-Performance Computing Systems | Marcel Aach, Rakesh Sarma, Helmut Neukirchen, Morris Riedel, Andreas Lintermann | 2024-12-03 | 下载 | On High-Performance Computing (HPC) systems, several hyperparameter configurations can be evaluated in parallel to speed up the Hyperparameter Optimization (HPO) process. |
| Scalable Analysis of Urban Scaling Laws: Leveraging Cloud Computing to Analyze 21,280 Global Cities | Zhenhui Li, Hongwei Zhang, Kan Wu | 2024-12-03 | 下载 | Cities play a pivotal role in human development and sustainability, yet studying them presents significant challenges due to the vast scale and complexity of spatial-temporal data. |
| Learn More by Using Less: Distributed Learning with Energy-Constrained Devices | Roberto Pereira, Cristian J. Vaca-Rubio, Luis Blanco | 2024-12-03 | 下载 | Federated Learning (FL) has emerged as a solution for distributed model training across decentralized, privacy-preserving devices, but the different energy capacities of participating devices (system ... |
| Technical Report on Reinforcement Learning Control on the Lucas-Nülle Inverted Pendulum | Maximilian Schenke, Shalbus Bukarov | 2024-12-03 | 下载 | The discipline of automatic control is making increased use of concepts that originate from the domain of machine learning. Herein, reinforcement learning (RL) takes an elevated role, as it is inheren... |
| Connecting Large Language Models with Blockchain: Advancing the Evolution of Smart Contracts from Automation to Intelligence | Youquan Xian, Xueying Zeng, Duancheng Xuan, Danping Yang, Chunpei Li, Peng Fan, Peng Liu | 2024-12-03 | 下载 | Blockchain smart contracts have catalyzed the development of decentralized applications across various domains, including decentralized finance. |
| Matryoshka: Optimization of Dynamic Diverse Quantum Chemistry Systems via Elastic Parallelism Transformation | Tuowei Wang, Kun Li, Donglin Bai, Fusong Ju, Leo Xia, Ting Cao, Ju Ren, Yaoxue Zhang, Mao Yang | 2024-12-03 | 下载 | AI infrastructures, predominantly GPUs, have delivered remarkable performance gains for deep learning. Conversely, scientific computing, exemplified by quantum chemistry systems, suffers from dynamic ... |
| Thallus: An RDMA-based Columnar Data Transport Protocol | Jayjeet Chakraborty, Matthieu Dorier, Philip Carns, Robert Ross, Carlos Maltzahn, Heiner Litz | 2024-12-03 | 下载 | The volume of data generated and stored in contemporary global data centers is experiencing exponential growth. This rapid data growth necessitates efficient processing and analysis to extract valuabl... |
| Towards the efficacy of federated prediction for epidemics on networks | Chengpeng Fu, Tong Li, Hao Chen, Wen Du, Zhidong He | 2024-12-03 | 下载 | Epidemic prediction is of practical significance in public health, enabling early intervention, resource allocation, and strategic planning. However, privacy concerns often hinder the sharing of healt... |
| Multi-Bin Batching for Increasing LLM Inference Throughput | Ozgur Guldogan, Jackson Kunde, Kangwook Lee, Ramtin Pedarsani | 2024-12-03 | 下载 | As large language models (LLMs) grow in popularity for their diverse capabilities, improving the efficiency of their inference systems has become increasingly critical. |
| Simplifying HPC resource selection: A tool for optimizing execution time and cost on Azure | Marco A. S. Netto, Wolfgang De Savador, Davide Vanzo | 2024-12-03 | 下载 | Azure Cloud offers a wide range of resources for running HPC workloads, requiring users to configure their deployment by selecting VM types, number of VMs, and processes per VM. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| The GAIUS Experience: Powering a Hyperlocal Mobile Web for Communities in Emerging Regions | Rohail Asim, Arjuna Sathiaseelan, Arko Chatterjee, Mukund Lal, Yasir Zaki, Lakshmi Subramanian | 2024-12-03 | 下载 | Despite increasing mobile Internet penetration in developing regions, mobile users continue to experience a poor web experience due to two key factors: (i) lack of locally relevant content; (ii) poor ... |
| Revolutionizing QoE-Driven Network Management with Digital Agents in 6G | Xuemin Shen, Xinyu Huang, Jianzhe Xue, Conghao Zhou, Xiufang Shi, Weihua Zhuang | 2024-12-03 | 下载 | In this article, we present a digital agent (DA)-assisted network management framework for future sixth generation (6G) networks considering user quality of experience (QoE). |
| Wall-Proximity Matters: Understanding the Effect of Device Placement with Respect to the Wall for Indoor Wi-Fi Sensing | He Wang, Yunpeng Ge, Ivan Wang-Hei Ho | 2024-12-03 | 下载 | Wi-Fi sensing has been extensively explored for various applications, including vital sign monitoring, human activity recognition, indoor localization, and tracking. |
| Hamiltonian Monte Carlo-Based Near-Optimal MIMO Signal Detection | Junichiro Hagiwara, Toshihiko Nishimura, Takanori Sato, Yasutaka Ogawa, Takeo Ohgane | 2024-12-03 | 下载 | Multiple-input multiple-output (MIMO) technology is essential for the optimal functioning of next-generation wireless networks; however, enhancing its signal-detection performance for improved spectra... |
| Exploring Evolutionary Spectral Clustering for Temporal-Smoothed Clustered Cell-Free Networking | Junyuan Wang, Tianyao Wu, Ouyang Zhou, Yaping Zhu | 2024-12-03 | 下载 | Clustered cell-free networking, which dynamically partitions the whole network into nonoverlapping subnetworks, has been recently proposed to mitigate the cell-edge problem in cellular networks. |
| Optimizing Age of Information in Internet of Vehicles Over Error-Prone Channels | Cui Zhang, Maoxin Ji, Qiong Wu, Pingyi Fan, Qiang Fan | 2024-12-03 | 下载 | In the Internet of Vehicles (IoV), Age of Information (AoI) has become a vital performance metric for evaluating the freshness of information in communication systems. |
cs.OS - Operating Systems
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Thallus: An RDMA-based Columnar Data Transport Protocol | Jayjeet Chakraborty, Matthieu Dorier, Philip Carns, Robert Ross, Carlos Maltzahn, Heiner Litz | 2024-12-03 | 下载 | The volume of data generated and stored in contemporary global data centers is experiencing exponential growth. This rapid data growth necessitates efficient processing and analysis to extract valuabl... |
| Retrofitting XoM for Stripped Binaries without Embedded Data Relocation | Chenke Luo, Jiang Ming, Mengfei Xie, Guojun Peng, Jianming Fu | 2024-12-03 | 下载 | In this paper, we present PXoM, a practical technique to seamlessly retrofit XoM into stripped binaries on the x86-64 platform. As handling the mixture of code and data is a well-known challenge for X... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Massimult: A Novel Parallel CPU Architecture Based on Combinator Reduction | Jurgen Nicklisch-Franken, Ruslan Feizerakhmanov | 2024-12-03 | 下载 | The Massimult project aims to design and implement an innovative CPU architecture based on combinator reduction with a novel combinator base and a new abstract machine. |
| AI-Driven Resource Allocation Framework for Microservices in Hybrid Cloud Platforms | Biman Barua, M. Shamim Kaiser | 2024-12-03 | 下载 | The increasing demand for scalable, efficient resource management in hybrid cloud environments has led to the exploration of AI-driven approaches for dynamic resource allocation. |
| Performance Debugging through Microarchitectural Sensitivity and Causality Analysis | Alban Dutilleul, Hugo Pompougnac, Nicolas Derumigny, Gabriel Rodriguez, Valentin Trophime, Christophe Guillon, Fabrice Rastello | 2024-12-03 | 下载 | Modern Out-of-Order (OoO) CPUs are complex systems with many components interleaved in non-trivial ways. Pinpointing performance bottlenecks and understanding the underlying causes of program performa... |
| Matryoshka: Optimization of Dynamic Diverse Quantum Chemistry Systems via Elastic Parallelism Transformation | Tuowei Wang, Kun Li, Donglin Bai, Fusong Ju, Leo Xia, Ting Cao, Ju Ren, Yaoxue Zhang, Mao Yang | 2024-12-03 | 下载 | AI infrastructures, predominantly GPUs, have delivered remarkable performance gains for deep learning. Conversely, scientific computing, exemplified by quantum chemistry systems, suffers from dynamic ... |