
2024-07-02

cs.AR - Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| RISC-V R-Extension: Advancing Efficiency with Rented-Pipeline for Edge DNN Processing | Won Hyeok Kim, Hyeong Jin Kim, Tae Hee Han | 2024-07-02 | Download | The proliferation of edge devices necessitates efficient computational architectures for lightweight tasks, particularly deep neural network (DNN) inference. |
| Fast, Scalable, Energy-Efficient Non-element-wise Matrix Multiplication on FPGA | Xuqi Zhu, Huaizhi Zhang, JunKyu Lee, Jiacheng Zhu, Chandrajit Pal, Sangeet Saha, Klaus D. McDonald-Maier, Xiaojun Zhai | 2024-07-02 | Download | Modern Neural Network (NN) architectures heavily rely on vast numbers of multiply-accumulate arithmetic operations, constituting the predominant computational cost. |
| Roadmap to Neuromorphic Computing with Emerging Technologies | Adnan Mehonic, Daniele Ielmini, Kaushik Roy, Onur Mutlu, Shahar Kvatinsky, Teresa Serrano-Gotarredona, Bernabe Linares-Barranco, Sabina Spiga, Sergey Savelev, Alexander G Balanov, Nitin Chawla, Giuseppe Desoli, Gerardo Malavena, Christian Monzio Compagnoni, Zhongrui Wang, J Joshua Yang, Ghazi Sarwat Syed, Abu Sebastian, Thomas Mikolajick, Beatriz Noheda, Stefan Slesazeck, Bernard Dieny, Tuo-Hung Hou, Akhil Varri, Frank Bruckerhoff-Pluckelmann, Wolfram Pernice, Xixiang Zhang, Sebastian Pazos, Mario Lanza, Stefan Wiefels, Regina Dittmann, Wing H Ng, Mark Buckwell, Horatio RJ Cox, Daniel J Mannion, Anthony J Kenyon, Yingming Lu, Yuchao Yang, Damien Querlioz, Louis Hutin, Elisa Vianello, Sayeed Shafayet Chowdhury, Piergiulio Mannocci, Yimao Cai, Zhong Sun, Giacomo Pedretti, John Paul Strachan, Dmitri Strukov, Manuel Le Gallo, Stefano Ambrogio, Ilia Valov, Rainer Waser | 2024-07-02 | Download | The roadmap is organized into several thematic sections, outlining current computing challenges, discussing the neuromorphic computing approach, analyzing mature and currently utilized technologies, p... |
| Theseus: Exploring Efficient Wafer-Scale Chip Design for Large Language Models | Jingchen Zhu, Chenhao Xue, Yiqi Chen, Zhao Wang, Chen Zhang, Yu Shen, Yifan Chen, Zekang Cheng, Yu Jiang, Tianqi Wang, Yibo Lin, Wei Hu, Bin Cui, Runsheng Wang, Yun Liang, Guangyu Sun | 2024-07-02 | Download | The emergence of the large language model (LLM) poses an exponential growth of demand for computation throughput, memory capacity, and communication bandwidth. |
| MG-Verilog: Multi-grained Dataset Towards Enhanced LLM-assisted Verilog Generation | Yongan Zhang, Zhongzhi Yu, Yonggan Fu, Cheng Wan, Yingyan Celine Lin | 2024-07-02 | Download | Large Language Models (LLMs) have recently shown promise in streamlining hardware design processes by encapsulating vast amounts of domain-specific data. |

cs.DC - Distributed, Parallel, and Cluster Computing

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Accelerating Distributed Optimization: A Primal-Dual Perspective on Local Steps | Junchi Yang, Murat Yildirim, Qiu Feng | 2024-07-02 | Download | In distributed machine learning, efficient training across multiple agents with different data distributions poses significant challenges. Even with a centralized coordinator, current algorithms that ... |
| Towards Federated Learning with On-device Training and Communication in 8-bit Floating Point | Bokun Wang, Axel Berg, Durmus Alp Emre Acar, Chuteng Zhou | 2024-07-02 | Download | Recent work has shown that 8-bit floating point (FP8) can be used for efficiently training neural networks with reduced computational cost compared to training in FP32/FP16. |
| Decentralized Intelligence Network (DIN) | Abraham Nash | 2024-07-02 | Download | Decentralized Intelligence Network (DIN) is a theoretical framework designed to address challenges in AI development, particularly focusing on data fragmentation and siloing issues. |
| Uncertainty-Aware Decarbonization for Datacenters | Amy Li, Sihang Liu, Yi Ding | 2024-07-02 | Download | This paper represents the first effort to quantify uncertainty in carbon intensity forecasting for datacenter decarbonization. We identify and analyze two types of uncertainty -- temporal and spatial ... |
| QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices | Juntao Zhao, Borui Wan, Yanghua Peng, Haibin Lin, Yibo Zhu, Chuan Wu | 2024-07-02 | Download | A number of production deep learning clusters have attempted to explore inference hardware for DNN training, at the off-peak serving hours with many inference GPUs idling. |
| MIREncoder: Multi-modal IR-based Pretrained Embeddings for Performance Optimizations | Akash Dutta, Ali Jannesari | 2024-07-02 | Download | One of the primary areas of interest in High Performance Computing is the improvement of performance of parallel workloads. Nowadays, compilable source code-based optimization tasks that employ deep l... |
| RollupTheCrowd: Leveraging ZkRollups for a Scalable and Privacy-Preserving Reputation-based Crowdsourcing Platform | Ahmed Mounsf Rafik Bendada, Mouhamed Amine Bouchiha, Mourad Rabah, Yacine Ghamri-Doudane | 2024-07-02 | Download | Current blockchain-based reputation solutions for crowdsourcing fail to tackle the challenge of ensuring both efficiency and privacy without compromising the scalability of the blockchain. |
| Reusable Formal Verification of DAG-based Consensus Protocols | Nathalie Bertrand, Pranav Ghorpade, Sasha Rubin, Bernhard Scholz, Pavle Subotic | 2024-07-02 | Download | Blockchains use consensus protocols to reach agreement, e.g., on the ordering of transactions. DAG-based consensus protocols are increasingly adopted by blockchain companies to reduce energy consumpti... |
| On the Performance and Memory Footprint of Distributed Training: An Empirical Study on Transformers | Zhengxian Lu, Fangyu Wang, Zhiwei Xu, Fei Yang, Tao Li | 2024-07-02 | Download | Transformer models have emerged as potent solutions to a wide array of multidisciplinary challenges. The deployment of Transformer architectures is significantly hindered by their extensive computatio... |
| SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules | Suyi Li, Lingyun Yang, Xiaoxiao Jiang, Hanfeng Lu, Dakai An, Zhipeng Di, Weiyi Lu, Jiawei Chen, Kan Liu, Yinghao Yu, Tao Lan, Guodong Yang, Lin Qu, Liping Zhang, Wei Wang | 2024-07-02 | Download | Text-to-image (T2I) generation using diffusion models has become a blockbuster service in today's AI cloud. A production T2I service typically involves a serving workflow where a base diffusion model ... |
| Securing Distributed Network Digital Twin Systems Against Model Poisoning Attacks | Zifan Zhang, Minghong Fang, Mingzhe Chen, Gaolei Li, Xi Lin, Yuchen Liu | 2024-07-02 | Download | In the era of 5G and beyond, the increasing complexity of wireless networks necessitates innovative frameworks for efficient management and deployment. |

cs.NI - Networking and Internet Architecture

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| Navigating Connected Car Cybersecurity: Location Anomaly Detection with RAN Data | Feng Wang, Yaron Koral, Kenichi Futamura | 2024-07-02 | Download | The cybersecurity of connected cars, integral to the broader Internet of Things (IoT) landscape, has become of paramount concern. Cyber-attacks, including hijacking and spoofing, pose significant thre... |
| Impact of Network Deployment on the Performance of NCR-assisted Networks | Gabriel C. M. da Silva, Diego A. Sousa, Victor F. Monteiro, Darlan C. Moreira, Tarcisio F. Maciel, Fco. Rafael M. Lima, Behrooz Makki | 2024-07-02 | Download | To address the need of coverage enhancement in the fifth generation (5G) of wireless cellular telecommunications, while taking into account possible bottlenecks related to deploying fiber based backha... |
| Revolutionizing Networking Paradigms: A Comprehensive Exploration of Information-Centric Networking (ICN), Content-Centric Networking(CCNx) and Named Data Networking (NDN) | Kamorudeen Amuda, Wakili Almustapha, Binkam Deepak, Ciana Hoggard, Pranay Tiruveedula | 2024-07-02 | Download | The evolution of networking paradigms has led to the emergence of Information-Centric Networking (ICN), Content-centric networking (CCNx), and Named Data Networking (NDN). |
| Shared-Protected Backup Paths Assignment with Mode Group Division Multiplexing in Optical Networks | Jiaheng Xiong, Qiaolun Zhang, Ruikun Wang, Alberto Gatto, Francesco Musumeci, Massimo Tornatore | 2024-07-02 | Download | We evaluate the resource efficiency of Mode Group Division Multiplexing (MGDM) with shared path protection (SPP) in optical networks. On our case studies, SPP with MGDM obtains significant savings in ... |
| Performance Analysis and Comparison of Full-Fledged 5G Standalone Experimental TDD Testbeds in Single & Multi-UE Scenarios | Maryam Amini, Catherine Rosenberg | 2024-07-02 | Download | Open-source software and Commercial Off-The-Shelf hardware are finally paving their way into the 5G world, resulting in a proliferation of experimental 5G testbeds. |
| Strategic Demand-Planning in Wireless Networks: Can Generative-AI Save Spectrum and Energy? | Berk Çiloğlu, Görkem Berkay Koç, Afsoon Alidadi Shamsabadi, Metin Ozturk, Halim Yanikomeroglu | 2024-07-02 | Download | Generative-AI (GenAI), a novel technology capable of producing various types of outputs, including text, images, and videos, offers significant potential for wireless communications. |
| Do CAA, CT, and DANE Interlink in Certificate Deployments? A Web PKI Measurement Study | Pouyan Fotouhi Tehrani, Raphael Hiesgen, Teresa Lübeck, Thomas C. Schmidt, Matthias Wählisch | 2024-07-02 | Download | Integrity and trust on the web build on X.509 certificates. Misuse or misissuance of these certificates threaten the Web PKI security model, which led to the development of several guarding techniques... |
| Non-Terrestrial Networks for 6G: Integrated, Intelligent and Ubiquitous Connectivity | Muhammad Ali Jamshed, Aryan Kaushik, Miguel Dajer, Alessandro Guidotti, Fanny Parzysz, Eva Lagunas, Marco Di Renzo, Symeon Chatzinotas, Octavia A. Dobre | 2024-07-02 | Download | Universal connectivity has been part of past and current generations of wireless systems, but as we approach 6G, the subject of social responsibility is being built as a core component. |
| Saving Private WAN: Using Internet Paths to Offload WAN Traffic in Conferencing Services | Bhaskar Kataria, Palak LNU, Rahul Bothra, Rohan Gandhi, Debopam Bhattacherjee, Venkata N. Padmanabhan, Irena Atov, Sriraam Ramakrishnan, Somesh Chaturmohta, Chakri Kotipalli, Rui Liang, Ken Sueda, Xin He, Kevin Hinton | 2024-07-02 | Download | Large-scale video conferencing services incur significant network cost while serving surging global demands. Our work systematically explores the opportunity to offload a fraction of this traffic to t... |
| Securing Distributed Network Digital Twin Systems Against Model Poisoning Attacks | Zifan Zhang, Minghong Fang, Mingzhe Chen, Gaolei Li, Xi Lin, Yuchen Liu | 2024-07-02 | Download | In the era of 5G and beyond, the increasing complexity of wireless networks necessitates innovative frameworks for efficient management and deployment. |
| Maximizing Uplink and Downlink Transmissions in Wirelessly Powered IoT Networks | Xiaoyu Song, Kwan-Wu Chin | 2024-07-02 | Download | This paper considers the problem of scheduling uplinks and downlinks transmissions in an Internet of Things (IoT) network that uses a mode-based time structure and Rate Splitting Multiple Access (RSMA... |

cs.PF - Performance

| Title | Authors | Published | PDF | Abstract |
| --- | --- | --- | --- | --- |
| MIREncoder: Multi-modal IR-based Pretrained Embeddings for Performance Optimizations | Akash Dutta, Ali Jannesari | 2024-07-02 | Download | One of the primary areas of interest in High Performance Computing is the improvement of performance of parallel workloads. Nowadays, compilable source code-based optimization tasks that employ deep l... |
