Abstract: This paper presents Bottom-up Residual vector quantization for learned Image Compression (BRIC). This novel deep learning-based image compression method quantizes latent representations ...
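As a rough illustration of the residual idea only (a generic residual vector quantization sketch, not the BRIC method itself), each stage quantizes whatever error the previous stage left behind; the function names and NumPy setup below are assumptions for illustration.

```python
# Generic residual vector quantization (RVQ) sketch -- illustrative, not BRIC.
import numpy as np

def rvq_encode(x, codebooks):
    """Encode vector x with a list of codebooks; returns one index per stage."""
    indices = []
    residual = x.copy()
    for cb in codebooks:                      # cb: (K, D) array of centroids
        dists = np.linalg.norm(cb - residual, axis=1)
        k = int(np.argmin(dists))             # nearest centroid at this stage
        indices.append(k)
        residual = residual - cb[k]           # next stage quantizes the residual
    return indices

def rvq_decode(indices, codebooks):
    """Reconstruct by summing the selected centroid from every stage."""
    return sum(cb[k] for cb, k in zip(codebooks, indices))

# Toy usage with random codebooks (illustrative only).
rng = np.random.default_rng(0)
codebooks = [rng.normal(size=(256, 8)) for _ in range(4)]
x = rng.normal(size=8)
x_hat = rvq_decode(rvq_encode(x, codebooks), codebooks)
```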
Huawei’s Zurich Computing Systems Laboratory has released SINQ (Sinkhorn Normalization Quantization), an open-source quantization method that reduces the memory requirements of large language models ...
This project aims to integrate BBQ into the OpenSearch k-NN plugin to offer users a memory-efficient alternative, ideal for large-scale vector workloads in constrained compute environments. The ...
Abstract: In recent years, few-shot detection has become a popular research direction in industrial defect detection; it aims to perform defect detection accurately using a ...
Quantization is an essential technique in machine learning for compressing model data, which enables the efficient operation of large language models (LLMs). As the size and complexity of these models ...
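As a minimal illustration of what "compressing model data" means in practice, the sketch below applies symmetric per-tensor int8 quantization to a weight matrix. It is a toy under stated assumptions, not how any particular LLM quantizer works, and the function names are hypothetical.

```python
# Symmetric per-tensor int8 quantization: store 8-bit integers plus one scale.
import numpy as np

def quantize_int8(w):
    """Map float weights to int8 plus a single scale factor."""
    amax = float(np.abs(w).max())
    scale = amax / 127.0 if amax > 0 else 1.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q, scale):
    """Approximate reconstruction of the original float weights."""
    return q.astype(np.float32) * scale

w = np.random.default_rng(1).normal(size=(4, 4)).astype(np.float32)
q, s = quantize_int8(w)
w_hat = dequantize_int8(q, s)   # ~4x smaller storage, small reconstruction error
```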
Text-to-image diffusion models have made significant strides in generating complex and faithful images from input conditions. Among these, Diffusion Transformer models (DiTs) have emerged as ...
k-means clustering is a method of vector quantization, originally from signal processing, that is popular for cluster analysis in data mining. k-means clustering aims to partition n observations into ...
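A compact sketch of the standard Lloyd iteration behind k-means, assuming Euclidean distance and simple random initialization; library implementations (e.g. scikit-learn) add smarter seeding and convergence handling.

```python
# Lloyd's algorithm: alternate nearest-center assignment and center updates.
import numpy as np

def kmeans(X, k, iters=100, seed=0):
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]   # k initial centers
    labels = np.zeros(len(X), dtype=int)
    for _ in range(iters):
        # Assign each observation to its nearest center.
        labels = np.argmin(((X[:, None, :] - centers[None, :, :]) ** 2).sum(-1), axis=1)
        # Move each center to the mean of the observations assigned to it.
        new_centers = np.array([X[labels == j].mean(axis=0) if np.any(labels == j)
                                else centers[j] for j in range(k)])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return centers, labels

X = np.random.default_rng(2).normal(size=(200, 2))
centers, labels = kmeans(X, k=3)
```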
Product quantization (PQ) is an effective vector quantization method. A product quantizer can generate an exponentially large codebook at very low memory/time cost. The essence of PQ is to decompose ...
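A hedged sketch of the decomposition PQ relies on, assuming scikit-learn's KMeans for the per-subspace codebooks: splitting a D-dimensional vector into M sub-vectors with K centroids each yields an implied codebook of K^M entries while storing only M·K centroids.

```python
# Product quantization sketch: one small codebook per subspace, codes are
# concatenated sub-codebook indices.
import numpy as np
from sklearn.cluster import KMeans

def pq_train(X, M=4, K=256, seed=0):
    """Train one K-word codebook per subspace; X is (n, D) with D divisible by M."""
    d = X.shape[1] // M
    return [KMeans(n_clusters=K, n_init=4, random_state=seed)
            .fit(X[:, m * d:(m + 1) * d]) for m in range(M)]

def pq_encode(x, codebooks):
    """Encode x as M small indices; the implied codebook has K**M entries."""
    d = len(x) // len(codebooks)
    return [int(cb.predict(x[m * d:(m + 1) * d][None, :])[0])
            for m, cb in enumerate(codebooks)]

def pq_decode(codes, codebooks):
    """Reconstruct by concatenating the selected centroid from each subspace."""
    return np.concatenate([cb.cluster_centers_[c] for cb, c in zip(codebooks, codes)])

X = np.random.default_rng(3).normal(size=(1000, 16)).astype(np.float32)
codebooks = pq_train(X, M=4, K=16)      # 16**4 = 65536 implied codewords
codes = pq_encode(X[0], codebooks)
x_hat = pq_decode(codes, codebooks)
```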