2024-06-07

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Residue Number System (RNS) based Distributed Quantum Addition | Bhaskar Gaur, Travis S. Humble, Himanshu Thapliyal | 2024-06-07 | Download | Quantum Arithmetic faces limitations such as noise and resource constraints in the current Noisy Intermediate Scale Quantum (NISQ) era quantum computers. |
| Look-Up Table based Neural Network Hardware | Ovishake Sen, Chukwufumnanya Ogbogu, Peyman Dehghanzadeh, Janardhan Rao Doppa, Swarup Bhunia, Partha Pratim Pande, Baibhab Chatterjee | 2024-06-07 | Download | Traditional digital implementations of neural accelerators are limited by high power and area overheads, while analog and non-CMOS implementations suffer from noise, device mismatch, and reliability i... |
| LLM-Enhanced Bayesian Optimization for Efficient Analog Layout Constraint Generation | Guojin Chen, Keren Zhu, Seunggeun Kim, Hanqing Zhu, Yao Lai, Bei Yu, David Z. Pan | 2024-06-07 | Download | Analog layout synthesis faces significant challenges due to its dependence on manual processes, considerable time requirements, and performance instability. |
| Mexican Computers: A Brief Technical and Historical Overview | Daniel Ortiz-Arroyo | 2024-06-07 | Download | The emergence of the microprocessor in the early 1970s allowed the design of computers that did not require the substantial economic resources of large computer companies of that era. |
| PolyLUT-Add: FPGA-based LUT Inference with Wide Inputs | Binglei Lou, Richard Rademacher, David Boland, Philip H. W. Leong | 2024-06-07 | Download | FPGAs have distinct advantages as a technology for deploying deep neural networks (DNNs) at the edge. Lookup Table (LUT) based networks, where neurons are directly modeled using LUTs, help maximize th... |
| A 2.5-nA Area-Efficient Temperature-Independent 176-/82-ppm/°C CMOS-Only Current Reference in 0.11-μm Bulk and 22-nm FD-SOI | Martin Lefebvre, David Bol | 2024-06-07 | Download | Internet-of-Things (IoT) applications require nW-power current references that are robust to process, voltage and temperature (PVT) variations, to maintain the performance of IoT sensor nodes in a wid... |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Residue Number System (RNS) based Distributed Quantum Addition | Bhaskar Gaur, Travis S. Humble, Himanshu Thapliyal | 2024-06-07 | Download | Quantum Arithmetic faces limitations such as noise and resource constraints in the current Noisy Intermediate Scale Quantum (NISQ) era quantum computers. |
| Federated LoRA with Sparse Communication | Kevin Kuo, Arian Raje, Kousik Rajesh, Virginia Smith | 2024-06-07 | Download | Low-rank adaptation (LoRA) is a natural method for finetuning in communication-constrained machine learning settings such as cross-device federated learning. |
| GCAPS: GPU Context-Aware Preemptive Priority-based Scheduling for Real-Time Tasks | Yidi Wang, Cong Liu, Daniel Wong, Hyoseung Kim | 2024-06-07 | Download | Scheduling real-time tasks that utilize GPUs with analyzable guarantees poses a significant challenge due to the intricate interaction between CPU and GPU resources, as well as the complex GPU hardwar... |
| FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models | Rui Ye, Rui Ge, Xinyu Zhu, Jingyi Chai, Yaxin Du, Yang Liu, Yanfeng Wang, Siheng Chen | 2024-06-07 | Download | Federated learning has enabled multiple parties to collaboratively train large language models without directly sharing their data (FedLLM). Following this training paradigm, the community has put mas... |
| Enabling Efficient Batch Serving for LMaaS via Generation Length Prediction | Ke Cheng, Wen Hu, Zhi Wang, Peng Du, Jianguo Li, Sheng Zhang | 2024-06-07 | Download | Nowadays, large language models (LLMs) are published as a service and can be accessed by various applications via APIs, also known as language-model-as-a-service (LMaaS). |
| Software Engineering for Collective Cyber-Physical Ecosystems | Roberto Casadei, Gianluca Aguzzi, Giorgio Audrito, Ferruccio Damiani, Danilo Pianini, Giordano Scarso, Gianluca Torta, Mirko Viroli | 2024-06-07 | Download | Today's distributed and pervasive computing addresses large-scale cyber-physical ecosystems, characterised by dense and large networks of devices capable of computation, communication and interaction ... |
| Approximated Coded Computing: Towards Fast, Private and Secure Distributed Machine Learning | Houming Qiu, Kun Zhu, Nguyen Cong Luong, Dusit Niyato | 2024-06-07 | Download | In a large-scale distributed machine learning system, coded computing has attracted wide-spread attention since it can effectively alleviate the impact of stragglers. |
| When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain | Lei Xu, Yulong Chen, Yuntian Chen, Longfeng Nie, Xuetao Wei, Liang Xue, Dongxiao Zhang | 2024-06-07 | Download | Machine learning models offer the capability to forecast future energy production or consumption and infer essential unknown variables from existing data. |
| Ensemble Method for System Failure Detection Using Large-Scale Telemetry Data | Priyanka Mudgal, Rita H. Wouhaybi | 2024-06-07 | Download | The growing reliance on computer systems, particularly personal computers (PCs), necessitates heightened reliability to uphold user satisfaction. |
| Enhancing Large-Scale AI Training Efficiency: The C4 Solution for Real-Time Anomaly Detection and Communication Optimization | Jianbo Dong, Bin Luo, Jun Zhang, Pengcheng Zhang, Fei Feng, Yikai Zhu, Ang Liu, Zian Chen, Yi Shi, Hairong Jiao, Gang Lu, Yu Guan, Ennan Zhai, Wencong Xiao, Hanyu Zhao, Man Yuan, Siran Yang, Xiang Li, Jiamang Wang, Rui Men, Jianwei Zhang, Chang Zhou, Dennis Cai, Yuan Xie, Binzhang Fu | 2024-06-07 | Download | The emergence of Large Language Models (LLMs) has necessitated the adoption of distributed training techniques, involving the deployment of thousands of GPUs to train a single model. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Individual Packet Features are a Risk to Model Generalisation in ML-Based Intrusion Detection | Kahraman Kostas, Mike Just, Michael A. Lones | 2024-06-07 | Download | Machine learning is increasingly used for intrusion detection in IoT networks. This paper explores the effectiveness of using individual packet features (IPF), which are attributes extracted from a si... |
| Online Frequency Scheduling by Learning Parallel Actions | Anastasios Giovanidis, Mathieu Leconte, Sabrine Aroua, Tor Kvernvik, David Sandberg | 2024-06-07 | Download | Radio Resource Management is a challenging topic in future 6G networks where novel applications create strong competition among the users for the available resources. |
| Mobile Network Configuration Recommendation using Deep Generative Graph Neural Network | Shirwan Piroti, Ashima Chawla, Tahar Zanouda | 2024-06-07 | Download | There are vast number of configurable parameters in a Radio Access Telecom Network. A significant amount of these parameters is configured by Radio Node or cell based on their deployment setting. |
| Statistical QoS Provisioning Architecture for 6G Satellite-Terrestrial Integrated Networks | Jingqing Wang, Wenchi Cheng, Wei Zhang, Hui Liang | 2024-06-07 | Download | The emergence of massive ultra-reliable and low latency communications (mURLLC) as a category of time/reliability-sensitive service over 6G networks has received considerable research attention, which... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| LLM-Vectorizer: LLM-based Verified Loop Vectorizer | Jubi Taneja, Avery Laird, Cong Yan, Madan Musuvathi, Shuvendu K. Lahiri | 2024-06-07 | Download | Vectorization is a powerful optimization technique that significantly boosts the performance of high performance computing applications operating on large data arrays. |
