Appearance
2024-12-19
cs.AR - Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Relaxed exception semantics for Arm-A (extended version) | Ben Simner, Alasdair Armstrong, Thomas Bauereiss, Brian Campbell, Ohad Kammar, Jean Pichon-Pharabod, and Peter Sewell | 2024-12-19 | 下载 | To manage exceptions, software relies on a key architectural guarantee, precision: that exceptions appear to execute between instructions. However, this definition, dating back over 60 years, fundamen... |
| Event-based backpropagation on the neuromorphic platform SpiNNaker2 | Gabriel Béna, Timo Wunderlich, Mahmoud Akl, Bernhard Vogginger, Christian Mayr, Hector Andres Gonzalez | 2024-12-19 | 下载 | Neuromorphic computing aims to replicate the brain's capabilities for energy efficient and parallel information processing, promising a solution to the increasing demand for faster and more efficient ... |
| GFormer: Accelerating Large Language Models with Optimized Transformers on Gaudi Processors | Chengming Zhang, Xinheng Ding, Baixi Sun, Xiaodong Yu, Weijian Zheng, Zhen Xie, Dingwen Tao | 2024-12-19 | 下载 | Heterogeneous hardware like Gaudi processor has been developed to enhance computations, especially matrix operations for Transformer-based large language models (LLMs) for generative AI tasks. |
cs.DC - Distributed, Parallel, and Cluster Computing
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| Sparse Checkpointing for Fast and Reliable MoE Training | Swapnil Gandhi, Christos Kozyrakis | 2024-12-19 | 下载 | As large language models scale, training them requires thousands of GPUs over extended durations--making frequent failures an inevitable reality. |
| Joint Task Offloading and Routing in Wireless Multi-hop Networks Using Biased Backpressure Algorithm | Zhongyuan Zhao, Jake Perazzone, Gunjan Verma, Kevin Chan, Ananthram Swami, Santiago Segarra | 2024-12-19 | 下载 | A significant challenge for computation offloading in wireless multi-hop networks is the complex interaction among traffic flows in the presence of interference. |
| HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages | Aman Chaturvedi, Daniel Nichols, Siddharth Singh, Abhinav Bhatele | 2024-12-19 | 下载 | Large Language Model (LLM) based coding tools have been tremendously successful as software development assistants, yet they are often designed for general purpose programming tasks and perform poorly... |
| Minimizing speculation overhead in a parallel recognizer for regular texts | Angelo Borsotti, Luca Breveglieri, Stefano Crespi Reghizzi, Angelo Morzenti | 2024-12-19 | 下载 | Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often deriv... |
| TinyLLM: A Framework for Training and Deploying Language Models at the Edge Computers | Savitha Viswanadh Kandala, Pramuka Medaranga, Ambuj Varshney | 2024-12-19 | 下载 | Language models have gained significant interest due to their general-purpose capabilities, which appear to emerge as models are scaled to increasingly larger parameter sizes. |
| A Comprehensive Forecasting Framework based on Multi-Stage Hierarchical Forecasting Reconciliation and Adjustment | Zhengchao Yang, Mithun Ghosh, Anish Saha, Dong Xu, Konstantin Shmakov, Kuang-chih Lee | 2024-12-19 | 下载 | Ads demand forecasting for Walmart's ad products plays a critical role in enabling effective resource planning, allocation, and management of ads performance. |
| Creation of AI-driven Smart Spaces for Enhanced Indoor Environments -- A Survey | Aygün Varol, Naser Hossein Motlagh, Mirka Leino, Sasu Tarkoma, Johanna Virkki | 2024-12-19 | 下载 | Smart spaces are ubiquitous computing environments that integrate diverse sensing and communication technologies to enhance space functionality, optimize energy utilization, and improve user comfort a... |
| Taming the Memory Beast: Strategies for Reliable ML Training on Kubernetes | Jaideep Ray | 2024-12-19 | 下载 | Kubernetes offers a powerful orchestration platform for machine learning training, but memory management can be challenging due to specialized needs and resource constraints. |
| AIArena: A Blockchain-Based Decentralized AI Training Platform | Zhipeng Wang, Rui Sun, Elizabeth Lui, Tuo Zhou, Yizhe Wen, Jiahao Sun | 2024-12-19 | 下载 | The rapid advancement of AI has underscored critical challenges in its development and implementation, largely due to centralized control by a few major corporations. |
| Single-Loop Federated Actor-Critic across Heterogeneous Environments | Ye Zhu, Xiaowen Gong | 2024-12-19 | 下载 | Federated reinforcement learning (FRL) has emerged as a promising paradigm, enabling multiple agents to collaborate and learn a shared policy adaptable across heterogeneous environments. |
| Frenzy: A Memory-Aware Serverless LLM Training System for Heterogeneous GPU Clusters | Zihan Chang, Sheng Xiao, Shuibing He, Siling Yang, Zhe Pan, Dong Li | 2024-12-19 | 下载 | Existing work only effective on a given number of GPUs, often neglecting the complexities involved in manually determining the specific types and quantities of GPUs needed, which can be a significant ... |
cs.NI - Networking and Internet Architecture
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| A Unified Framework for Context-Aware IoT Management and State-of-the-Art IoT Traffic Anomaly Detection | Daniel Adu Worae, Athar Sheikh, Spyridon Mastorakis | 2024-12-19 | 下载 | The rapid expansion of Internet of Things (IoT) ecosystems has introduced growing complexities in device management and network security. To address these challenges, we present a unified framework th... |
| Joint Task Offloading and Routing in Wireless Multi-hop Networks Using Biased Backpressure Algorithm | Zhongyuan Zhao, Jake Perazzone, Gunjan Verma, Kevin Chan, Ananthram Swami, Santiago Segarra | 2024-12-19 | 下载 | A significant challenge for computation offloading in wireless multi-hop networks is the complex interaction among traffic flows in the presence of interference. |
| Cruise Control: Dynamic Model Selection for ML-Based Network Traffic Analysis | Johann Hugon, Paul Schmitt, Anthony Busson, Francesco Bronzino | 2024-12-19 | 下载 | Modern networks increasingly rely on machine learning models for real-time insights, including traffic classification, application quality of experience inference, and intrusion detection. |
| 6GENABLERS-DLT: DLT-based Marketplace for Decentralized Trading of 6G Telco resources | Adriana Fernández-Fernández, Angel Martin, Guillermo Gomez | 2024-12-19 | 下载 | The 6GENABLERS-DLT project addresses critical challenges in fostering multi-party collaboration within dynamic 6G environments. As operators and service providers increasingly depend on third-party re... |
| TinyLLM: A Framework for Training and Deploying Language Models at the Edge Computers | Savitha Viswanadh Kandala, Pramuka Medaranga, Ambuj Varshney | 2024-12-19 | 下载 | Language models have gained significant interest due to their general-purpose capabilities, which appear to emerge as models are scaled to increasingly larger parameter sizes. |
| Space-time Peer-to-Peer Distribution of Multi-party Entanglement for Any Quantum Network | Yuexun Huang, Xiangyu Ren, Bikun Li, Yat Wong, Zhiding Liang, Liang Jiang | 2024-12-19 | 下载 | Graph states are a class of important multiparty entangled states, of which bell pairs are the special case. Realizing a robust and fast distribution of arbitrary graph states in the downstream layer ... |
| LoLaFL: Low-Latency Federated Learning via Forward-only Propagation | Jierui Zhang, Jianhao Huang, Kaibin Huang | 2024-12-19 | 下载 | Federated learning (FL) has emerged as a widely adopted paradigm for enabling edge learning with distributed data while ensuring data privacy. |
| Robustness Evaluation of a Physical Internet-based Intermodal Logistic Network | Federico Gallo, Alireza Shahedi, Angela Di Febbraro, Mahnam Saeednia, Nicola Sacco | 2024-12-19 | 下载 | The Physical Internet (PI) paradigm, which has gained attention in research and academia in recent years, leverages advanced logistics and interconnected networks to revolutionize the way goods are tr... |
| Overview of AI and Communication for 6G Network: Fundamentals, Challenges, and Future Research Opportunities | Qimei Cui, Xiaohu You, Ni Wei, Guoshun Nan, Xuefei Zhang, Jianhua Zhang, Xinchen Lyu, Ming Ai, Xiaofeng Tao, Zhiyong Feng, Ping Zhang, Qingqing Wu, Meixia Tao, Yongming Huang, Chongwen Huang, Guangyi Liu, Chenghui Peng, Zhiwen Pan, Tao Sun, Dusit Niyato, Tao Chen, Muhammad Khurram Khan, Abbas Jamalipour, Mohsen Guizani, Chau Yuen | 2024-12-19 | 下载 | With the growing demand for seamless connectivity and intelligent communication, the integration of artificial intelligence (AI) and sixth-generation (6G) communication networks has emerged as a trans... |
| WiFi CSI Based Temporal Activity Detection via Dual Pyramid Network | Zhendong Liu, Le Zhang, Bing Li, Yingjie Zhou, Zhenghua Chen, Ce Zhu | 2024-12-19 | 下载 | We address the challenge of WiFi-based temporal activity detection and propose an efficient Dual Pyramid Network that integrates Temporal Signal Semantic Encoders and Local Sensitive Response Encoders... |
cs.PF - Performance
| 标题 | 作者 | 发布日期 | 摘要 | |
|---|---|---|---|---|
| MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark | Qihao Zhao, Yangyu Huang, Tengchao Lv, Lei Cui, Qinzheng Sun, Shaoguang Mao, Xin Zhang, Ying Xin, Qiufeng Yin, Scarlett Li, Furu Wei | 2024-12-19 | 下载 | Multiple-choice question (MCQ) datasets like Massive Multitask Language Understanding (MMLU) are widely used to evaluate the commonsense, understanding, and problem-solving abilities of large language... |