Skip to content

2024-10-14

cs.AR - Architecture

标题作者发布日期PDF摘要
Voltage-Controlled Magnetic Tunnel Junction based ADC-less Global Shutter Processing-in-Pixel for Extreme-Edge IntelligenceMd Abdullah-Al Kaiser, Gourav Datta, Jordan Athas, Christian Duffee, Ajey P. Jacob, Pedram Khalili Amiri, Peter A. Beerel, Akhilesh R. Jaiswal2024-10-14下载The vast amount of data generated by camera sensors has prompted the exploration of energy-efficient processing solutions for deploying computer vision tasks on edge devices.
Dynamic Power Control in a Hardware Neural Network with Error-Configurable MAC UnitsMaedeh Ghaderi, Arvin Delavari, Faraz Ghoreishy, Sattar Mirzakuchaki2024-10-14下载Multi-Layer Perceptrons (MLP) are powerful tools for representing complex, non-linear relationships, making them essential for diverse machine learning and AI applications.
Work-in-Progress: Real-Time Neural Network Inference on a Custom RISC-V Multicore Vector ProcessorMaximilian Kirschner, Konstantin Dudzik, Jürgen Becker2024-10-14下载Neural networks are increasingly used in real-time systems, such as automated driving applications. This requires high-performance hardware with predictable timing behavior.
Tracing Human Stress from Physiological Signals using UWB RadarJia Xu, Teng Xiao, Pin Lv, Zhe Chen, Chao Cai, Yang Zhang, Zehui Xiong2024-10-14下载Stress tracing is an important research domain that supports many applications, such as health care and stress management; and its closest related works are derived from stress detection.

cs.DC - Distributed, Parallel, and Cluster Computing

标题作者发布日期PDF摘要
DGRO: Diameter-Guided Ring Optimization for Integrated Research Infrastructure MembershipShixun Wu, Krishnan Raghavan, Sheng Di, Zizhong Chen, Franck Cappello2024-10-14下载Logical ring is a core component in membership protocol. However, the logic ring fails to consider the underlying physical latency, resulting in a high diameter.
Liger Kernel: Efficient Triton Kernels for LLM TrainingPin-Lun Hsu, Yun Dai, Vignesh Kothapalli, Qingquan Song, Shao Tang, Siyu Zhu, Steven Shimizu, Shivam Sahni, Haowen Ning, Yanning Chen2024-10-14下载Training Large Language Models (LLMs) efficiently at scale presents a formidable challenge, driven by their ever-increasing computational demands and the need for enhanced performance.
MEV Capture Through Time-Advantaged ArbitrageRobin Fritsch, Maria Inês Silva, Akaki Mamageishvili, Benjamin Livshits, Edward W. Felten2024-10-14下载As blockchains begin processing significant economic activity, the ability to include and order transactions inevitably becomes highly valuable, a concept known as Maximal Extractable Value (MEV).
Producer vs. Rapper: Who Dominates the Hip Hop Sound? A Case StudyTim Ziemer, Nikita Kudakov, Christoph Reuter2024-10-14下载In hip-hop music, rappers and producers play important, but rather different roles. However, both contribute to the overall sound, as rappers bring in their voice, while producers are responsible for ...
SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput OptimizationAkrit Mudvari, Yuang Jiang, Leandros Tassiulas2024-10-14下载Large language models (LLMs) have been a disruptive innovation in recent years, and they play a crucial role in our daily lives due to their ability to understand and generate human-like text.
Kub: Enabling Elastic HPC Workloads on Containerized EnvironmentsDaniel Medeiros, Jacob Wahlgren, Gabin Schieffer, Ivy Peng2024-10-14下载The conventional model of resource allocation in HPC systems is static. Thus, a job cannot leverage newly available resources in the system or release underutilized resources during the execution.
A GPU-accelerated Molecular Docking Workflow with Kubernetes and Apache AirflowDaniel Medeiros, Gabin Schieffer, Jacob Wahlgren, Ivy Peng2024-10-14下载Complex workflows play a critical role in accelerating scientific discovery. In many scientific domains, efficient workflow management can lead to faster scientific output and broader user groups.
Accelerating Drug Discovery in AutoDock-GPU with Tensor CoresGabin Schieffer, Ivy Peng2024-10-14下载In drug discovery, molecular docking aims at characterizing the binding of a drug-like molecule to a macromolecule. AutoDock-GPU, a state-of-the-art docking software, estimates the geometrical conform...
OpenCUBE: Building an Open Source Cloud Blueprint with EPI SystemsIvy Peng, Martin Schulz, Utz-Uwe Haus, Craig Prunty, Pedro Marcuello, Emanuele Danovaro, Gabin Schieffer, Jacob Wahlgren, Daniel Medeiros, Philipp Friese, Stefano Markidis2024-10-14下载OpenCUBE aims to develop an open-source full software stack for Cloud computing blueprint deployed on EPI hardware, adaptable to emerging workloads across the computing continuum.
Fed-pilot: Optimizing LoRA Allocation for Efficient Federated Fine-Tuning with Heterogeneous ClientsZikai Zhang, Rui Hu, Ping Liu, Jiahao Xu2024-10-14下载Federated Learning enables the fine-tuning of foundation models (FMs) across distributed clients for specific tasks; however, its scalability is limited by the heterogeneity of client memory capacitie...

cs.NI - Networking and Internet Architecture

标题作者发布日期PDF摘要
SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput OptimizationAkrit Mudvari, Yuang Jiang, Leandros Tassiulas2024-10-14下载Large language models (LLMs) have been a disruptive innovation in recent years, and they play a crucial role in our daily lives due to their ability to understand and generate human-like text.
Fast Reroute with Highly Connected Routes Based on Maximum Flow EvaluationLeon Okida, Maverson E. Schuze-Rosa, Elias P. Duarte2024-10-14下载Fault-tolerant routing allows the selection of alternative routes to the destination after the route being used fails. Fast Reroute (FRR) is a proactive strategy through which the protocol pre-configu...
Continual Deep Reinforcement Learning to Prevent Catastrophic Forgetting in Jamming MitigationKemal Davaslioglu, Sastry Kompella, Tugba Erpek, Yalin E. Sagduyu2024-10-14下载Deep Reinforcement Learning (DRL) has been highly effective in learning from and adapting to RF environments and thus detecting and mitigating jamming effects to facilitate reliable wireless communica...
Learning Sub-Second Routing Optimization in Computer Networks requires Packet-Level DynamicsAndreas Boltres, Niklas Freymuth, Patrick Jahnke, Holger Karl, Gerhard Neumann2024-10-14下载Finding efficient routes for data packets is an essential task in computer networking. The optimal routes depend greatly on the current network topology, state and traffic demand, and they can change ...
On Efficient Topology Management in Service-Oriented 6G Networks: An Edge Video Distribution Case StudyZied Ennaceur, Mounir Bensalem, Admela Jukan, Claus Keuker, Huanzhuo Wu, Rastin Pries2024-10-14下载An efficient topology management in future 6G networks is one of the fundamental challenges for a dynamic network creation based on location services, whereby each autonomous network entity, i.e.
WT-CFormer: High-Performance Web Traffic Anomaly Detection Based on Spatiotemporal AnalysisYundi He, Runhua Shi, Boyan Wang2024-10-14下载Web traffic (WT) refers to time-series data that captures the volume of data transmitted to and from a web server during a user's visit to a website.
A Survey on Performance, Current and Future Usage of Vehicle-To-Everything Communication StandardsFalk Dettinger, Matthias Weiß, Daniel Dittler, Johannes Stümpfle, Maurice Artelt, Michael Weyrich2024-10-14下载Wireless communication between road users is essential for environmental perception, reasoning, and mission planning to enable fully autonomous vehicles, and thus improve road safety and transport eff...
VNF Migration with Fast Defragmentation: A GAT-Based Deep Learning MethodFangyu Zhang, Yuang Chen, Hancheng Lu, Chengdi Lu2024-10-14下载Network function virtualization (NFV) enhances service flexibility by decoupling network functions from dedicated hardware. To handle time-varying traffic in NFV network, virtualized network function ...
Burst-Mode Digital Signal Processing for Coherent Optical Time-Division Multiple AccessJi Zhou, Cheng Li, Haide Wang, Zhiyang Liu, Weiping Liu, Changyuan Yu2024-10-14下载As the 50G optical access gradually matures, it is time to discuss Beyond 50G optical access. According to the evolution rules of optical access standards, Beyond 50G optical access data rate may achi...

cs.PF - Performance

标题作者发布日期PDF摘要
On Efficient Topology Management in Service-Oriented 6G Networks: An Edge Video Distribution Case StudyZied Ennaceur, Mounir Bensalem, Admela Jukan, Claus Keuker, Huanzhuo Wu, Rastin Pries2024-10-14下载An efficient topology management in future 6G networks is one of the fundamental challenges for a dynamic network creation based on location services, whereby each autonomous network entity, i.e.

基于 VitePress 构建