







The experts built a two-model pipeline that separates image understanding from query matching: GPT-4o Vision extracts structured style attributes from product images, then a second model interprets the natural-language sourcing query and returns ranked matches, trained on the business's historical sourcing data. Searches that took hours of manual Google review now return results in seconds.
The pipeline used the GPT-4o Vision API to extract materials, silhouette, and luxury-specific style markers from images, and Claude to interpret the sourcing query and apply matching logic, with ChatGPT also in the toolset. The approach combined computer vision and recommendation systems.
Three outcomes: hours of per-request manual search and image review were replaced by an AI pipeline returning structured results in seconds; historical sourcing data was converted into a proprietary, owned training dataset; and the full vision pipeline was delivered from kickoff to working MVP in five weeks.
The full pipeline — image recognition, product matching, and recommendation output — was delivered from kickoff to working MVP in about five weeks.
Luxury e-commerce and personal shopping businesses scaling research-intensive operations, and AI agencies building vision-enabled product matching tools.