Evolving challenges and strategies in AI/ML model deployment and hardware optimization significantly shape NPU architectures ...
“Efficient SLM Edge Inference via Outlier-Aware Quantization and Emergent Memories Co-Design” was published by researchers at ...
Abstract: This paper proposes a quantization-aware deep learning (DL)-based channel estimation algorithm for orthogonal frequency-division multiplexing (OFDM) systems under varying effective number of ...
LLM Compression Tool is a project designed to optimize machine learning inference processes. The framework combines multiple modules to provide a cohesive and efficient pipeline for model quantization ...
Quantization plays a crucial role in deploying Large Language Models (LLMs) in resource-constrained environments. However, the presence of outlier features significantly hinders low-bit quantization.
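To make the outlier problem concrete, below is a minimal NumPy sketch (not taken from any of the works listed here): a single high-magnitude feature column inflates the per-tensor int8 scale and wipes out resolution for the remaining small-magnitude features, whereas keeping the outlier columns in floating point and quantizing only the dense columns preserves most of the accuracy. The toy data, the magnitude threshold, and the column-split strategy are illustrative assumptions.

```python
import numpy as np

def quantize_int8(x, scale):
    """Symmetric int8 quantization with a shared scale."""
    return np.clip(np.round(x / scale), -127, 127).astype(np.int8)

def dequantize(q, scale):
    return q.astype(np.float32) * scale

# Toy activation matrix: mostly small values plus one outlier feature column.
rng = np.random.default_rng(0)
x = rng.normal(0.0, 0.1, size=(4, 8)).astype(np.float32)
x[:, 3] *= 100.0  # simulated outlier feature dimension

# Naive per-tensor quantization: the outlier column dictates the scale,
# so the small-magnitude columns lose nearly all of their resolution.
scale_naive = np.abs(x).max() / 127.0
err_naive = np.abs(dequantize(quantize_int8(x, scale_naive), scale_naive) - x).mean()

# Outlier-aware split (illustrative): columns whose max magnitude exceeds a
# threshold stay in floating point; only the remaining dense columns are
# quantized, with a much smaller scale.
threshold = 1.0  # assumed magnitude threshold for this toy example
outlier_cols = np.abs(x).max(axis=0) > threshold
dense = x[:, ~outlier_cols]
scale_dense = np.abs(dense).max() / 127.0
x_hat = x.copy()  # outlier columns kept in float32
x_hat[:, ~outlier_cols] = dequantize(quantize_int8(dense, scale_dense), scale_dense)
err_split = np.abs(x_hat - x).mean()

print(f"mean abs error, per-tensor int8:  {err_naive:.5f}")
print(f"mean abs error, outlier-aware:    {err_split:.5f}")
```

Running the sketch shows the outlier-aware split reducing the mean reconstruction error by orders of magnitude on the dense features, which is the intuition behind mixed-precision or outlier-separating quantization schemes for LLMs.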
Abstract: In distributed learning systems, efficient communication and privacy protection are two significant challenges. Although several existing works have attempted to address these ...