webSLM Demo —
Base + RAG
vs
Fine-tuned SLM
100% in-browser inference via WebGPU • No server required • Powered by WebLLM & MLC-LLM
Product demo
Fine-tuning proof
⚙ Settings
Option 1
Base Model — RAG panel
Load Base Model
Option 2
webSLM-Medical-0.5B — fine-tuned SLM panel
Custom compiled webSLM model — loads from HuggingFace (VishalMysore/WebSLM-Custom-MLC).
Load webSLM Model
💲 Insurance
➕ Medical
⚖ Legal
Send
Try:
Option 1
Base + RAG
General model • Documents retrieved and injected into context
Initialising…
Option 2
Fine-tuned webSLM
Domain-specialized model • No retrieval • Behaviour baked in during training
Initialising…