Appearance
2024-06-07
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Residue Number System (RNS) based Distributed Quantum Addition | Bhaskar Gaur, Travis S. Humble, Himanshu Thapliyal | 2024-06-07 | 下载 | Quantum Arithmetic faces limitations such as noise and resource constraints in the current Noisy Intermediate Scale Quantum (NISQ) era quantum computers. |
| Look-Up Table based Neural Network Hardware | Ovishake Sen, Chukwufumnanya Ogbogu, Peyman Dehghanzadeh, Janardhan Rao Doppa, Swarup Bhunia, Partha Pratim Pande, Baibhab Chatterjee | 2024-06-07 | 下载 | Traditional digital implementations of neural accelerators are limited by high power and area overheads, while analog and non-CMOS implementations suffer from noise, device mismatch, and reliability i... |
| LLM-Enhanced Bayesian Optimization for Efficient Analog Layout Constraint Generation | Guojin Chen, Keren Zhu, Seunggeun Kim, Hanqing Zhu, Yao Lai, Bei Yu, David Z. Pan | 2024-06-07 | 下载 | Analog layout synthesis faces significant challenges due to its dependence on manual processes, considerable time requirements, and performance instability. |
| Mexican Computers: A Brief Technical and Historical Overview | Daniel Ortiz-Arroyo | 2024-06-07 | 下载 | The emergence of the microprocessor in the early 1970s allowed the design of computers that did not require the substantial economic resources of large computer companies of that era. |
| PolyLUT-Add: FPGA-based LUT Inference with Wide Inputs | Binglei Lou, Richard Rademacher, David Boland, Philip H. W. Leong | 2024-06-07 | 下载 | FPGAs have distinct advantages as a technology for deploying deep neural networks (DNNs) at the edge. Lookup Table (LUT) based networks, where neurons are directly modeled using LUTs, help maximize th... |
| A 2.5-nA Area-Efficient Temperature-Independent 176-/82-ppm/°C CMOS-Only Current Reference in 0.11-μm Bulk and 22-nm FD-SOI | Martin Lefebvre, David Bol | 2024-06-07 | 下载 | Internet-of-Things (IoT) applications require nW-power current references that are robust to process, voltage and temperature (PVT) variations, to maintain the performance of IoT sensor nodes in a wid... |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Residue Number System (RNS) based Distributed Quantum Addition | Bhaskar Gaur, Travis S. Humble, Himanshu Thapliyal | 2024-06-07 | 下载 | Quantum Arithmetic faces limitations such as noise and resource constraints in the current Noisy Intermediate Scale Quantum (NISQ) era quantum computers. |
| Federated LoRA with Sparse Communication | Kevin Kuo, Arian Raje, Kousik Rajesh, Virginia Smith | 2024-06-07 | 下载 | Low-rank adaptation (LoRA) is a natural method for finetuning in communication-constrained machine learning settings such as cross-device federated learning. |
| GCAPS: GPU Context-Aware Preemptive Priority-based Scheduling for Real-Time Tasks | Yidi Wang, Cong Liu, Daniel Wong, Hyoseung Kim | 2024-06-07 | 下载 | Scheduling real-time tasks that utilize GPUs with analyzable guarantees poses a significant challenge due to the intricate interaction between CPU and GPU resources, as well as the complex GPU hardwar... |
| FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models | Rui Ye, Rui Ge, Xinyu Zhu, Jingyi Chai, Yaxin Du, Yang Liu, Yanfeng Wang, Siheng Chen | 2024-06-07 | 下载 | Federated learning has enabled multiple parties to collaboratively train large language models without directly sharing their data (FedLLM). Following this training paradigm, the community has put mas... |
| Enabling Efficient Batch Serving for LMaaS via Generation Length Prediction | Ke Cheng, Wen Hu, Zhi Wang, Peng Du, Jianguo Li, Sheng Zhang | 2024-06-07 | 下载 | Nowadays, large language models (LLMs) are published as a service and can be accessed by various applications via APIs, also known as language-model-as-a-service (LMaaS). |
| Software Engineering for Collective Cyber-Physical Ecosystems | Roberto Casadei, Gianluca Aguzzi, Giorgio Audrito, Ferruccio Damiani, Danilo Pianini, Giordano Scarso, Gianluca Torta, Mirko Viroli | 2024-06-07 | 下载 | Today's distributed and pervasive computing addresses large-scale cyber-physical ecosystems, characterised by dense and large networks of devices capable of computation, communication and interaction ... |
| Approximated Coded Computing: Towards Fast, Private and Secure Distributed Machine Learning | Houming Qiu, Kun Zhu, Nguyen Cong Luong, Dusit Niyato | 2024-06-07 | 下载 | In a large-scale distributed machine learning system, coded computing has attracted wide-spread attention since it can effectively alleviate the impact of stragglers. |
| When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain | Lei Xu, Yulong Chen, Yuntian Chen, Longfeng Nie, Xuetao Wei, Liang Xue, Dongxiao Zhang | 2024-06-07 | 下载 | Machine learning models offer the capability to forecast future energy production or consumption and infer essential unknown variables from existing data. |
| Ensemble Method for System Failure Detection Using Large-Scale Telemetry Data | Priyanka Mudgal, Rita H. Wouhaybi | 2024-06-07 | 下载 | The growing reliance on computer systems, particularly personal computers (PCs), necessitates heightened reliability to uphold user satisfaction. |
| Enhancing Large-Scale AI Training Efficiency: The C4 Solution for Real-Time Anomaly Detection and Communication Optimization | Jianbo Dong, Bin Luo, Jun Zhang, Pengcheng Zhang, Fei Feng, Yikai Zhu, Ang Liu, Zian Chen, Yi Shi, Hairong Jiao, Gang Lu, Yu Guan, Ennan Zhai, Wencong Xiao, Hanyu Zhao, Man Yuan, Siran Yang, Xiang Li, Jiamang Wang, Rui Men, Jianwei Zhang, Chang Zhou, Dennis Cai, Yuan Xie, Binzhang Fu | 2024-06-07 | 下载 | The emergence of Large Language Models (LLMs) has necessitated the adoption of distributed training techniques, involving the deployment of thousands of GPUs to train a single model. |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Individual Packet Features are a Risk to Model Generalisation in ML-Based Intrusion Detection | Kahraman Kostas, Mike Just, Michael A. Lones | 2024-06-07 | 下载 | Machine learning is increasingly used for intrusion detection in IoT networks. This paper explores the effectiveness of using individual packet features (IPF), which are attributes extracted from a si... |
| Online Frequency Scheduling by Learning Parallel Actions | Anastasios Giovanidis, Mathieu Leconte, Sabrine Aroua, Tor Kvernvik, David Sandberg | 2024-06-07 | 下载 | Radio Resource Management is a challenging topic in future 6G networks where novel applications create strong competition among the users for the available resources. |
| Mobile Network Configuration Recommendation using Deep Generative Graph Neural Network | Shirwan Piroti, Ashima Chawla, Tahar Zanouda | 2024-06-07 | 下载 | There are vast number of configurable parameters in a Radio Access Telecom Network. A significant amount of these parameters is configured by Radio Node or cell based on their deployment setting. |
| Statistical QoS Provisioning Architecture for 6G Satellite-Terrestrial Integrated Networks | Jingqing Wang, Wenchi Cheng, Wei Zhang, Hui Liang | 2024-06-07 | 下载 | The emergence of massive ultra-reliable and low latency communications (mURLLC) as a category of time/reliability-sensitive service over 6G networks has received considerable research attention, which... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| LLM-Vectorizer: LLM-based Verified Loop Vectorizer | Jubi Taneja, Avery Laird, Cong Yan, Madan Musuvathi, Shuvendu K. Lahiri | 2024-06-07 | 下载 | Vectorization is a powerful optimization technique that significantly boosts the performance of high performance computing applications operating on large data arrays. |