CLIP · ViT-L/14 · Foundation Model

Text → Image Retrieval

Prompt-aware retrieval: tokenize → text encoder → semantic pre-filter → FAISS cosine similarity → diversity re-rank → XAI.

⌘K

GPU78%

Loss0.184

Throughput312/s

Execution pipeline

Top-K

Enter a prompt and run retrieval — each run produces a new ranking.