2024-09-12
cs.AR - Hardware Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| LlamaF: An Efficient Llama2 Architecture Accelerator on Embedded FPGAs | Han Xu, Yutong Li, Shihao Ji | 2024-09-12 | Download | Large language models (LLMs) have demonstrated remarkable abilities in natural language processing. However, their deployment on resource-constrained embedded devices remains difficult due to memory a... |
| Photonic Quantum Computers | M. AbuGhanem | 2024-09-12 | Download | In the pursuit of scalable and fault-tolerant quantum computing architectures, photonic-based quantum computers have emerged as a leading frontier. |
| Rethinking Programmed I/O for Fast Devices, Cheap Cores, and Coherent Interconnects | Anastasiia Ruzhanskaia, Pengcheng Xu, David Cock, Timothy Roscoe | 2024-09-12 | Download | Conventional wisdom holds that an efficient interface between an OS running on a CPU and a high-bandwidth I/O device should use Direct Memory Access (DMA) to offload data transfer, descriptor rings fo... |
| Dynamic Simultaneous Multithreaded Architecture | Daniel Ortiz-Arroyo, Ben Lee | 2024-09-12 | Download | This paper presents the Dynamic Simultaneous Multi-threaded Architecture (DSMT). DSMT efficiently executes multiple threads from a single program on a SMT processor core. |
| C3-VQA: Cryogenic Counter-based Co-processor for Variational Quantum Algorithms | Yosuke Ueno, Satoshi Imamura, Yuna Tomida, Teruo Tanimoto, Masamitsu Tanaka, Yutaka Tabuchi, Koji Inoue, Hiroshi Nakamura | 2024-09-12 | Download | Cryogenic quantum computers play a leading role in demonstrating quantum advantage. Given the severe constraints on the cooling capacity in cryogenic environments, thermal design is crucial for the sc... |
| Efficient and Reliable Vector Similarity Search Using Asymmetric Encoding with NAND-Flash for Many-Class Few-Shot Learning | Hao-Wei Chiang, Chi-Tse Huang, Hsiang-Yun Cheng, Po-Hao Tseng, Ming-Hsiu Lee, An-Yeu Wu | 2024-09-12 | Download | While memory-augmented neural networks (MANNs) offer an effective solution for few-shot learning (FSL) by integrating deep neural networks with external memory, the capacity requirements and energy ov... |
cs.DC - Distributed, Parallel, and Cluster Computing
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Self-Supervised Inference of Agents in Trustless Environments | Vladyslav Larin, Ivan Nikitin, Alexander Firsov | 2024-09-12 | Download | In this paper, we propose a novel approach where agents can form swarms to produce high-quality responses effectively. This is accomplished by utilizing agents capable of data inference and ranking, w... |
| E-QUARTIC: Energy Efficient Edge Ensemble of Convolutional Neural Networks for Resource-Optimized Learning | Le Zhang, Onat Gungor, Flavio Ponzina, Tajana Rosing | 2024-09-12 | Download | Ensemble learning is a meta-learning approach that combines the predictions of multiple learners, demonstrating improved accuracy and robustness. |
| Validated Strong Consensus Protocol for Asynchronous Vote-based Blockchains | Yibin Xu, Jianhua Shao, Tijs Slaats, Boris Düdder, Yongluan Zhou | 2024-09-12 | Download | Vote-based blockchains construct a state machine replication (SMR) system among participating nodes, using Byzantine Fault Tolerance (BFT) consensus protocols to transition from one state to another. |
| Microarchitectural comparison and in-core modeling of state-of-the-art CPUs: Grace, Sapphire Rapids, and Genoa | Jan Laukemann, Georg Hager, Gerhard Wellein | 2024-09-12 | Download | With Nvidia's release of the Grace Superchip, all three big semiconductor companies in HPC (AMD, Intel, Nvidia) are currently competing in the race for the best CPU. |
| Instance Configuration for Sustainable Job Shop Scheduling | Christian Perez, Carlos March, Miguel A. Salido | 2024-09-12 | Download | The Job Shop Scheduling Problem (JSP) is a pivotal challenge in operations research and is essential for evaluating the effectiveness and performance of scheduling algorithms. |
| Dynamic Simultaneous Multithreaded Architecture | Daniel Ortiz-Arroyo, Ben Lee | 2024-09-12 | Download | This paper presents the Dynamic Simultaneous Multi-threaded Architecture (DSMT). DSMT efficiently executes multiple threads from a single program on a SMT processor core. |
| DiReDi: Distillation and Reverse Distillation for AIoT Applications | Chen Sun, Qing Tong, Wenshuang Yang, Wenqi Zhang | 2024-09-12 | Download | Typically, the significant efficiency can be achieved by deploying different edge AI models in various real world scenarios while a few large models manage those edge AI models remotely from cloud ser... |
| DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning | Kangyang Luo, Shuai Wang, Yexuan Fu, Renrong Shao, Xiang Li, Yunshi Lan, Ming Gao, Jinlong Shu | 2024-09-12 | Download | Federated Learning (FL) is a distributed machine learning scheme in which clients jointly participate in the collaborative training of a global model by sharing model information rather than their pri... |
| Cooperative Inference with Interleaved Operator Partitioning for CNNs | Zhibang Liu, Chaonong Xu, Zhizhuo Liu, Lekai Huang, Jiachen Wei, Chao Li | 2024-09-12 | Download | Deploying deep learning models on Internet of Things (IoT) devices often faces challenges due to limited memory resources and computing capabilities. |
cs.NI - Networking and Internet Architecture
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Towards Scalable Quantum Networks | Connor Howe, Mohsin Aziz, Ali Anwar | 2024-09-12 | Download | This paper presents a comprehensive study on the scalability challenges and opportunities in quantum communication networks, with the goal of determining parameters that impact networks most as well a... |
| Multi-Model based Federated Learning Against Model Poisoning Attack: A Deep Learning Based Model Selection for MEC Systems | Somayeh Kianpisheh, Chafika Benzaid, Tarik Taleb | 2024-09-12 | Download | Federated Learning (FL) enables training of a global model from distributed data, while preserving data privacy. However, the singular-model based operation of FL is open with uploading poisoned model... |
| LLM Honeypot: Leveraging Large Language Models as Advanced Interactive Honeypot Systems | Hakan T. Otal, M. Abdullah Canbaz | 2024-09-12 | Download | The rapid evolution of cyber threats necessitates innovative solutions for detecting and analyzing malicious activity. Honeypots, which are decoy systems designed to lure and interact with attackers, ... |
| Anonymized Network Sensing Graph Challenge | Hayden Jananthan, Michael Jones, William Arcand, David Bestor, William Bergeron, Daniel Burrill, Aydin Buluc, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Piotr Luszczek, Peter Michaleas, Lauren Milechin, Chasen Milner, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Gabriel Wachman, Charles Yee, Jeremy Kepner | 2024-09-12 | Download | The MIT/IEEE/Amazon GraphChallenge encourages community approaches to developing new solutions for analyzing graphs and sparse data derived from social media, sensor feeds, and scientific data to disc... |
| Towards a graph-based foundation model for network traffic analysis | Louis Van Langendonck, Ismael Castell-Uroz, Pere Barlet-Ros | 2024-09-12 | Download | Foundation models have shown great promise in various fields of study. A potential application of such models is in computer network traffic analysis, where these models can grasp the complexities of ... |
| External Memories of PDP Switches for In-Network Implementable Functions Placement: Deep Learning Based Reconfiguration of SFCs | Somayeh Kianpisheh, Tarik Taleb | 2024-09-12 | Download | Network function virtualization leverages programmable data plane switches to deploy in-network implementable functions, to improve QoS. The memories of switches can be extended through remote direct ... |
| Directional WPT Charging for Routing-Asymmetric WRSNs with a Mobile Charger | Zhenguo Gao, Qi Zhang, Qingyu Gao, Yunlong Zhao, Hsiao-Chun Wu | 2024-09-12 | Download | Mobile Charge Scheduling for wirelessly charging nodes in Wireless Rechargeable Sensor Networks (WRSNs) is a promising but still evolving research area. |
| WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks | Jingwen Tong, Jiawei Shao, Qiong Wu, Wei Guo, Zijian Li, Zehong Lin, Jun Zhang | 2024-09-12 | Download | Wireless networks are increasingly facing challenges due to their expanding scale and complexity. These challenges underscore the need for advanced AI-driven strategies, particularly in the upcoming 6... |
cs.OS - Operating Systems
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| Rethinking Programmed I/O for Fast Devices, Cheap Cores, and Coherent Interconnects | Anastasiia Ruzhanskaia, Pengcheng Xu, David Cock, Timothy Roscoe | 2024-09-12 | Download | Conventional wisdom holds that an efficient interface between an OS running on a CPU and a high-bandwidth I/O device should use Direct Memory Access (DMA) to offload data transfer, descriptor rings fo... |
cs.PF - Performance
| Title | Authors | Published | Download | Abstract |
|---|---|---|---|---|
| E-QUARTIC: Energy Efficient Edge Ensemble of Convolutional Neural Networks for Resource-Optimized Learning | Le Zhang, Onat Gungor, Flavio Ponzina, Tajana Rosing | 2024-09-12 | Download | Ensemble learning is a meta-learning approach that combines the predictions of multiple learners, demonstrating improved accuracy and robustness. |
| Anonymized Network Sensing Graph Challenge | Hayden Jananthan, Michael Jones, William Arcand, David Bestor, William Bergeron, Daniel Burrill, Aydin Buluc, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Piotr Luszczek, Peter Michaleas, Lauren Milechin, Chasen Milner, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Gabriel Wachman, Charles Yee, Jeremy Kepner | 2024-09-12 | Download | The MIT/IEEE/Amazon GraphChallenge encourages community approaches to developing new solutions for analyzing graphs and sparse data derived from social media, sensor feeds, and scientific data to disc... |
| Microarchitectural comparison and in-core modeling of state-of-the-art CPUs: Grace, Sapphire Rapids, and Genoa | Jan Laukemann, Georg Hager, Gerhard Wellein | 2024-09-12 | Download | With Nvidia's release of the Grace Superchip, all three big semiconductor companies in HPC (AMD, Intel, Nvidia) are currently competing in the race for the best CPU. |
| Computational Algorithms for the Product Form Solution of Closed Queuing Networks with Finite Buffers and Skip-Over Policy | Gianfranco Balbo, Andrea Marin, Diletta Olliaro, Matteo Sereno | 2024-09-12 | Download | Closed queuing networks with finite capacity buffers and skip-over policies are fundamental models in the performance evaluation of computer and communication systems. |
| Repr Types: One Abstraction to Rule Them All | Viktor Palmkvist, Anders Ågren Thuné, Elias Castegren, David Broman | 2024-09-12 | Download | The choice of how to represent an abstract type can have a major impact on the performance of a program, yet mainstream compilers cannot perform optimizations at such a high level. |