Fully Local! How to Build an Auto-Captioning Tool for Image Datasets Using Qwen 3.5
An implementation guide for automating image dataset creation using Qwen 3.5 in a local environment. Ensure privacy and operate for free using a local VLM. This post includes a Python script for bulk-generating image captions. I’ve compiled practical tips ranging from Base64 encoding and controlling "Thinking" models to prompt engineering for excluding specific training targets.