Vector Quantized Generative Adversarial Network and Contrastive Language–Image Pre-training