Quantization Levels - Search News

MBQ: Modality-Balanced Quantization for Large Vision-Language Models

Abstract: Vision-Language Models (VLMs) have enabled a variety of real-world applications. The large parameter size of VLMs brings large memory and computation overhead, which poses significant ...

IEEE

Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression

Abstract: Structured pruning and quantization are fundamental techniques used to reduce the size of deep neural networks (DNNs), and typically are applied independently. Applying these techniques ...
