About

How it works

FashionFind uses a fine-tuned CLIP model trained on a 48-category fashion dataset to understand your search in natural language and match it against 30,000 product images.

Text queries are encoded by a lightweight CPU-only service on Modal and matched via dot-product similarity against pre-computed image embeddings. Results are served from Cloudflare R2.