About
How it works
FashionFind uses a fine-tuned CLIP model trained on a 48-category fashion dataset to understand your search in natural language and match it against 30,000 product images.
Text queries are encoded by a lightweight CPU-only service on Modal and matched via dot-product similarity against pre-computed image embeddings. Results are served from Cloudflare R2.
More content coming soon
Team, model details, dataset credits…