Find objects in images using text prompts
Complex text label dection using SAM3 with VLM-FO1
VLM-FO1-3B-Demo