VridhiPROBuilt for builders

AI capabilities

Thirteen models. Built to act, not just chat.

7 CPU models on day one. 6 GPU models from Phase 3 Month 11. No tier gating, no per-call limits, no third-party AI in the critical path.

Strategy

Three rules behind every model decision.

The AI stack is opinionated on purpose. These are the opinions.

01

CPU first. GPU when it earns its rack.

Eight CPU models — scoring, forecasting, NER, dedupe, sentiment — work on day one on commodity infrastructure. The RTX 4070 Ti Super 16GB lights up in Phase 3 Month 11, once we have the client volume to amortise it.

02

Not chatbots. Assignments.

The AI Reminder Engine takes the buyer-risk score and turns it into a prioritised Action Queue with KPI math behind it. AI does not chat at your sales team — it tells them who, when, and what.

03

Open models. Indian deployment.

Qwen3, BGE-M3, RoBERTa, spaCy, Faster-Whisper — all open weights, all running on our infrastructure inside India. No data leaves the country. Groq API stays as a fallback for early-load smoothing.

The thirteen

One tile per model.

Each tile lists the model, the compute target, the latency budget, and the job it does inside the product.

XGBOOST · CPU · ~1MS

Lead Scoring

A 0–100 score on every lead, retrained weekly. Sales sees the hot list first, not the new list.

XGBOOST + PROPHET · CPU · MINUTES

Sales Forecast

Project-wise monthly bookings forecast with seasonality. The MD's dashboard answers next quarter.

XGBOOST · CPU · MINUTES

Collection Forecast

Per-demand probability of on-time payment. Finance knows what to actually expect this month.

XGBOOST · CPU · ~5MS

Buyer Risk Scoring

Risk score per active SO. The AI Reminder Engine prioritises the buyers most likely to slip.

RAPIDFUZZ · CPU · <10MS

Duplicate Detection

Fuzzy phone + email match at lead ingest. One buyer, one record — even when the call centre misspells.

SPACY · CPU · ~20MS

Entity Extraction

Pulls names, projects, unit types, budgets from inbound email and WhatsApp before they hit a human.

ROBERTA · CPU · ~50MS

Sentiment

Sentiment on every transcribed call and WhatsApp thread. Negative trends become tasks automatically.

BGE-M3 · CPU/GPU · ~80MS

Property Embeddings

pgvector-backed similarity search. "Find me units like this" works the way buyers actually ask.

FASTER-WHISPER · GPU · REAL-TIME

Call Transcription

Every CTI recording becomes searchable Telugu/Hindi/English text within seconds of the call ending.

QWEN3 4B · GPU · ~2S

Call Summary

Three bullets per call: what the buyer wants, what was promised, what the next step is.

QWEN3 8B · GPU · ~15S

Agreement Drafting

First-cut agreement generated from the SO using your lawyer's template. Review, redline, e-sign.

QWEN3 4B (THINKING) · GPU · ~5S

Loan Eligibility

Eligibility estimate from income + CIBIL + obligations. Buyers learn the answer in the first conversation.

QWEN3 4B + LLAMAINDEX · GPU · ~3S

ProX

RAG Chatbot

A 24×7 buyer assistant grounded on your agreements, brochures, and FAQs. ProX only — Customer Portal required.

GPU server

One RTX 4070 Ti Super 16GB. Bought once.

₹1,40,000 one-time at Month 11, when client count crosses fifteen. 16GB GDDR6X holds the full Qwen3 + BGE-M3 + Whisper stack with about 7.5GB of headroom. Cloud GPU costs more after three months, and pricing volatility is real. Owning the box wins.

VRAM
16 GB GDDR6X
One-time cost
₹1,40,000
Trigger
15+ paying clients
Fallback
Groq API, always on

Data residency

Your tenant. Inside India. Period.

Schema-per-tenant Postgres. Realm-per-tenant Keycloak. Per-tenant MinIO prefix. Models run on our hardware, in our colocation. No third-party AI processor in the call path, no buyer PII shipped to a foreign vendor. The audit-log triggers prove it on demand.

  • → AlmaLinux 10 hosts, CtrlS Hyderabad colocation roadmap
  • → PostgreSQL RLS plus FastAPI search_path — double-layer isolation
  • → M46 Data Loader handles full EXPORT on demand
  • → Audit-log triggers on every business object for RERA evidence

Ready when you are

See your sales workflow inside Vridhi Pro.

A live walk-through tailored to your project pipeline, your payment plans, and your team. Forty-five minutes.