Skip to content

2024-12-19

cs.AR - Architecture

标题作者发布日期PDF摘要
Relaxed exception semantics for Arm-A (extended version)Ben Simner, Alasdair Armstrong, Thomas Bauereiss, Brian Campbell, Ohad Kammar, Jean Pichon-Pharabod, and Peter Sewell2024-12-19下载To manage exceptions, software relies on a key architectural guarantee, precision: that exceptions appear to execute between instructions. However, this definition, dating back over 60 years, fundamen...
Event-based backpropagation on the neuromorphic platform SpiNNaker2Gabriel Béna, Timo Wunderlich, Mahmoud Akl, Bernhard Vogginger, Christian Mayr, Hector Andres Gonzalez2024-12-19下载Neuromorphic computing aims to replicate the brain's capabilities for energy efficient and parallel information processing, promising a solution to the increasing demand for faster and more efficient ...
GFormer: Accelerating Large Language Models with Optimized Transformers on Gaudi ProcessorsChengming Zhang, Xinheng Ding, Baixi Sun, Xiaodong Yu, Weijian Zheng, Zhen Xie, Dingwen Tao2024-12-19下载Heterogeneous hardware like Gaudi processor has been developed to enhance computations, especially matrix operations for Transformer-based large language models (LLMs) for generative AI tasks.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
Sparse Checkpointing for Fast and Reliable MoE TrainingSwapnil Gandhi, Christos Kozyrakis2024-12-19下载As large language models scale, training them requires thousands of GPUs over extended durations--making frequent failures an inevitable reality.
Joint Task Offloading and Routing in Wireless Multi-hop Networks Using Biased Backpressure AlgorithmZhongyuan Zhao, Jake Perazzone, Gunjan Verma, Kevin Chan, Ananthram Swami, Santiago Segarra2024-12-19下载A significant challenge for computation offloading in wireless multi-hop networks is the complex interaction among traffic flows in the presence of interference.
HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel LanguagesAman Chaturvedi, Daniel Nichols, Siddharth Singh, Abhinav Bhatele2024-12-19下载Large Language Model (LLM) based coding tools have been tremendously successful as software development assistants, yet they are often designed for general purpose programming tasks and perform poorly...
Minimizing speculation overhead in a parallel recognizer for regular textsAngelo Borsotti, Luca Breveglieri, Stefano Crespi Reghizzi, Angelo Morzenti2024-12-19下载Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often deriv...
TinyLLM: A Framework for Training and Deploying Language Models at the Edge ComputersSavitha Viswanadh Kandala, Pramuka Medaranga, Ambuj Varshney2024-12-19下载Language models have gained significant interest due to their general-purpose capabilities, which appear to emerge as models are scaled to increasingly larger parameter sizes.
A Comprehensive Forecasting Framework based on Multi-Stage Hierarchical Forecasting Reconciliation and AdjustmentZhengchao Yang, Mithun Ghosh, Anish Saha, Dong Xu, Konstantin Shmakov, Kuang-chih Lee2024-12-19下载Ads demand forecasting for Walmart's ad products plays a critical role in enabling effective resource planning, allocation, and management of ads performance.
Creation of AI-driven Smart Spaces for Enhanced Indoor Environments -- A SurveyAygün Varol, Naser Hossein Motlagh, Mirka Leino, Sasu Tarkoma, Johanna Virkki2024-12-19下载Smart spaces are ubiquitous computing environments that integrate diverse sensing and communication technologies to enhance space functionality, optimize energy utilization, and improve user comfort a...
Taming the Memory Beast: Strategies for Reliable ML Training on KubernetesJaideep Ray2024-12-19下载Kubernetes offers a powerful orchestration platform for machine learning training, but memory management can be challenging due to specialized needs and resource constraints.
AIArena: A Blockchain-Based Decentralized AI Training PlatformZhipeng Wang, Rui Sun, Elizabeth Lui, Tuo Zhou, Yizhe Wen, Jiahao Sun2024-12-19下载The rapid advancement of AI has underscored critical challenges in its development and implementation, largely due to centralized control by a few major corporations.
Single-Loop Federated Actor-Critic across Heterogeneous EnvironmentsYe Zhu, Xiaowen Gong2024-12-19下载Federated reinforcement learning (FRL) has emerged as a promising paradigm, enabling multiple agents to collaborate and learn a shared policy adaptable across heterogeneous environments.
Frenzy: A Memory-Aware Serverless LLM Training System for Heterogeneous GPU ClustersZihan Chang, Sheng Xiao, Shuibing He, Siling Yang, Zhe Pan, Dong Li2024-12-19下载Existing work only effective on a given number of GPUs, often neglecting the complexities involved in manually determining the specific types and quantities of GPUs needed, which can be a significant ...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
A Unified Framework for Context-Aware IoT Management and State-of-the-Art IoT Traffic Anomaly DetectionDaniel Adu Worae, Athar Sheikh, Spyridon Mastorakis2024-12-19下载The rapid expansion of Internet of Things (IoT) ecosystems has introduced growing complexities in device management and network security. To address these challenges, we present a unified framework th...
Joint Task Offloading and Routing in Wireless Multi-hop Networks Using Biased Backpressure AlgorithmZhongyuan Zhao, Jake Perazzone, Gunjan Verma, Kevin Chan, Ananthram Swami, Santiago Segarra2024-12-19下载A significant challenge for computation offloading in wireless multi-hop networks is the complex interaction among traffic flows in the presence of interference.
Cruise Control: Dynamic Model Selection for ML-Based Network Traffic AnalysisJohann Hugon, Paul Schmitt, Anthony Busson, Francesco Bronzino2024-12-19下载Modern networks increasingly rely on machine learning models for real-time insights, including traffic classification, application quality of experience inference, and intrusion detection.
6GENABLERS-DLT: DLT-based Marketplace for Decentralized Trading of 6G Telco resourcesAdriana Fernández-Fernández, Angel Martin, Guillermo Gomez2024-12-19下载The 6GENABLERS-DLT project addresses critical challenges in fostering multi-party collaboration within dynamic 6G environments. As operators and service providers increasingly depend on third-party re...
TinyLLM: A Framework for Training and Deploying Language Models at the Edge ComputersSavitha Viswanadh Kandala, Pramuka Medaranga, Ambuj Varshney2024-12-19下载Language models have gained significant interest due to their general-purpose capabilities, which appear to emerge as models are scaled to increasingly larger parameter sizes.
Space-time Peer-to-Peer Distribution of Multi-party Entanglement for Any Quantum NetworkYuexun Huang, Xiangyu Ren, Bikun Li, Yat Wong, Zhiding Liang, Liang Jiang2024-12-19下载Graph states are a class of important multiparty entangled states, of which bell pairs are the special case. Realizing a robust and fast distribution of arbitrary graph states in the downstream layer ...
LoLaFL: Low-Latency Federated Learning via Forward-only PropagationJierui Zhang, Jianhao Huang, Kaibin Huang2024-12-19下载Federated learning (FL) has emerged as a widely adopted paradigm for enabling edge learning with distributed data while ensuring data privacy.
Robustness Evaluation of a Physical Internet-based Intermodal Logistic NetworkFederico Gallo, Alireza Shahedi, Angela Di Febbraro, Mahnam Saeednia, Nicola Sacco2024-12-19下载The Physical Internet (PI) paradigm, which has gained attention in research and academia in recent years, leverages advanced logistics and interconnected networks to revolutionize the way goods are tr...
Overview of AI and Communication for 6G Network: Fundamentals, Challenges, and Future Research OpportunitiesQimei Cui, Xiaohu You, Ni Wei, Guoshun Nan, Xuefei Zhang, Jianhua Zhang, Xinchen Lyu, Ming Ai, Xiaofeng Tao, Zhiyong Feng, Ping Zhang, Qingqing Wu, Meixia Tao, Yongming Huang, Chongwen Huang, Guangyi Liu, Chenghui Peng, Zhiwen Pan, Tao Sun, Dusit Niyato, Tao Chen, Muhammad Khurram Khan, Abbas Jamalipour, Mohsen Guizani, Chau Yuen2024-12-19下载With the growing demand for seamless connectivity and intelligent communication, the integration of artificial intelligence (AI) and sixth-generation (6G) communication networks has emerged as a trans...
WiFi CSI Based Temporal Activity Detection via Dual Pyramid NetworkZhendong Liu, Le Zhang, Bing Li, Yingjie Zhou, Zhenghua Chen, Ce Zhu2024-12-19下载We address the challenge of WiFi-based temporal activity detection and propose an efficient Dual Pyramid Network that integrates Temporal Signal Semantic Encoders and Local Sensitive Response Encoders...

cs.PF - Performance

标题作者发布日期PDF摘要
MMLU-CF: A Contamination-free Multi-task Language Understanding BenchmarkQihao Zhao, Yangyu Huang, Tengchao Lv, Lei Cui, Qinzheng Sun, Shaoguang Mao, Xin Zhang, Ying Xin, Qiufeng Yin, Scarlett Li, Furu Wei2024-12-19下载Multiple-choice question (MCQ) datasets like Massive Multitask Language Understanding (MMLU) are widely used to evaluate the commonsense, understanding, and problem-solving abilities of large language...

基于 VitePress 构建