VIVA AI Service — Trợ lý AI cho công nhân công nghiệp

Tính năng 01 · Ngôn ngữ tự nhiên

AI hiểu tiếng Việt công nhân — không cần từ chuyên ngành.

Gõ thế nào cũng hiểu.

Prompt engineering tối ưu cho Vietnamese informal — system prompt tune sát với ngôn ngữ công nhân. Hiểu được từ lóng ("đứng máy", "làm ca", "ở KTX"), tên KCN viết tắt, và context location Bắc Ninh.

Tiếng Việt informal Từ lóng công nhân Tên KCN viết tắt ✓ 500+ tests green

Em ở Từ Sơn, muốn tìm việc đứng máy gần 10tr, có KTX

Mình hiểu rồi. Ở Từ Sơn có 3 lựa chọn đứng máy CNC phù hợp: Samsung Yên Phong (11.5tr, KTX free), Goertek VSIP (12tr, KTX), Canon Quế Võ (9.8tr, xe đưa đón). Muốn mình so sánh chi tiết không?

Cái nào dễ vô nhất, không cần kinh nghiệm?

Samsung Yên Phong dễ nhất — không yêu cầu kinh nghiệm, đào tạo 5 ngày có lương. Tuyển gấp, còn 47 slot.

OK chốt. Phỏng vấn ngày nào?

Sáng thứ 2 tới (28/04), 9:00 tại cổng A Samsung Yên Phong. Mình gửi lịch + hướng dẫn đường đi vào Zalo nhé.

Tính năng 02 · Tool orchestration

Tools tra Laravel data — không cần wiki KB tĩnh.

Mọi data đều realtime từ Laravel.

AI gọi function tools đến api.xanhvina.com.vn qua M2M để lấy data trực tiếp — worker profile, jobs, lịch phỏng vấn. Jobs được scrape ngoài rồi đẩy vào Laravel, AI tự rank và explain match. Không cache stale, không hallucinate.

11 tools Generative UI cards Scrape ingest pipeline ✓ HMAC + JWT + Idempotency

worker_profile

Lấy CV, kỹ năng, vị trí worker từ Laravel realtime.

job_search + job_detail

Query live theo location/salary/skill, paginated, full filters.

job_compare

So sánh 2-3 việc — lương, KTX, khoảng cách, yêu cầu, side-by-side.

parse_search_intent

NL → structured filters (industry/location/salary/benefits).

prepare_job_application

Pre-fill form, render Generative UI confirm card, worker bấm 1 nút gửi.

escalate_to_human

Handoff sang ops khi AI không đủ thông tin — lưu context đầy đủ.

Tính năng 03 · Phase 02 robustness

Parallel tool dispatch · result compression · retry classify.

Gọi nhiều tools cùng lúc, gọn token, retry thông minh.

LLM có thể gọi nhiều tools song song qua asyncio.gather — giảm latency 40%+ khi cần nhiều thông tin. Result compressor chỉ giữ field essential per-tool (top-10 jobs với 8 fields thay vì 50 với 30 fields) — tiết kiệm 60-70% context. Retry classifier phân loại lỗi retryable / non-retryable theo provider để tránh đốt quota vô ích.

asyncio.gather Result compressor Retry classify ✓ Indirect injection guard

Worker: "so sánh việc Samsung & Goertek, em muốn KTX free"
    │
    ▼
LLM → 3 tool_calls SONG SONG:
    ├─ job_search(company="Samsung", benefits=["dorm"])
    ├─ job_search(company="Goertek", benefits=["dorm"])
    └─ worker_profile(worker_id=current)
    │
    ▼  (asyncio.gather — 1 round-trip)
Laravel /api/v1/...  ← 3 calls parallel
    │
    ▼  result_compressor: 50 fields → 8 essential
indirect_injection_guard: lọc model tokens / URL inject
    ▼
LLM compose so sánh:
    "Samsung 11.5tr KTX free · Goertek 12tr KTX..."
    │
    ▼
  Streamed to worker via SSE ✓

Tính năng 04 · Cost routing + resilience

OpenAI + DeepSeek — cost routing có chủ đích.

OpenAI gpt-4o-mini chính · DeepSeek dự phòng giá rẻ.

Gateway dùng OpenAI gpt-4o-mini làm primary (chất lượng tiếng Việt tốt + tool calling chuẩn xác), DeepSeek deepseek-chat làm fallback giá rẻ. Provider + model có thể đổi runtime qua admin Settings UI mà không cần redeploy. Circuit breaker mở 30s khi 5xx/timeout, traffic shift tự động. Mỗi request log cost USD vào ``llm_usage_log`` — admin xem realtime trên dashboard.

Circuit Breaker Retry + Backoff Cost tracker

OpenAI gpt-4o-mini PRIMARY

$0.15 / $0.60 / 1M tokens · chat + tool calling · latency p50 ~ 1.1s

● Healthy

↔ failover tự động khi 5xx / 429

DeepSeek deepseek-chat FALLBACK · CHEAP

$0.14 / $0.28 / 1M tokens · OpenAI-compatible · backup khi primary gặp sự cố

● Healthy

↗ embedding via OpenAI text-embedding-3-small (1536-dim)

text-embedding-3-small

$0.02 / 1M tokens · Wiki RAG semantic search · pgvector + tsvector hybrid

● Active

Tính năng 05 · Red Team hardened

Safety layer — VN-aware PII, direct + indirect injection guard.

Đa tầng phòng thủ, hiểu chuẩn Việt Nam.

Input pipeline: Unicode NFKC normalize → direct prompt-injection detector → PII masker VN-aware (CCCD 12 số, CMND 9 số, MST 10/13 số, STK ngân hàng, SĐT VN) → rate limit per-worker + per-IP. Trên đường về: indirect_injection_guard (Phase 02 B4) sanitize tool result trước khi feed lại LLM — chống prompt injection ngầm từ data ngoài.

VN PII (CCCD/MST/STK) Direct injection Indirect injection guard Output filter Rate limit SSRF guard

NFKC normalize

Homoglyph attack prevention · zero-width chars stripped

PII masker VN

CCCD 12 / CMND 9 / MST 10/13 / STK / SĐT VN · auto-redact trước khi log + LLM

Direct injection

"Ignore previous" + EN/VI imperative patterns · semantic model probe

⊕

Indirect guard

Lọc model role tokens + URL inject từ tool result trước khi feed lại LLM

⇄

Output filter

Re-mask PII có thể leak từ LLM response trước khi gửi worker

∫

Rate limit

Token bucket per-worker (30/min) + per-IP (120/min)

⎔

SSRF guard

URL allowlist · private-IP deny · 3s timeout

✖

Tenant scope

JWT sub === worker_id mọi request — Red Team C3 chặn cross-worker access

Tính năng 06 · V5 stack

Generative UI · Intelligence · Webhooks · Memory · Voice.

Generative UI cards

5 cards live: JobCarouselCard swipe ngang (2-8 jobs), JobPickerCard list dọc, ApplicationFormCard ứng tuyển inline, ApplicationSuccessCard, ApplicationConfirmCard. Tool emit tool_card hoặc lead_signal SSE — frontend dispatch theo whitelist.

Intelligence endpoints

4 admin AI: churn risk, draft-reply HR, anomaly detection, campaign suggestions theo KCN.

6 webhook events

profile.built · content.generated · safety.incident · match.found · churn.spike · anomaly.detected — fire-and-forget với daily idempotency keys.

Conversation memory (RLS)

Per-worker chat history với row-level security · auto-summarize mỗi 20 turns · TTL 90d worker / 365d staff / 7d guest (NEW). Guest sessions multi-turn từ 2026-04-29.

Phase 02 robustness

Parallel tool dispatch (asyncio.gather) + result compressor 60-70% gọn + retry classifier theo provider + indirect injection guard.

Profile building

State machine 8 câu hỏi · AI trích skill từ chat · Laravel sync on complete.

Anonymous → registered

Worker chat ngay không cần đăng ký · LLM tool should_capture_lead tự nhận biết khi nào hỏi SĐT (không cần regex hardcode) · auto-prefill vị trí + KCN từ context · ApplicationFormCard inline submit tạo Worker + applied_jobs[].

Worker AI endpoints

search/parse · feed/explain · content/summarize-job · suggestions · thinking events — gọn cho frontend SSE.

Scrape ingest pipeline

Jobs scraper EXTERNAL → Laravel /jobs/ingest → AI extract + rank · 100 jobs/day · <5% manual override.

Có gì mới

Cập nhật gần đây của VIVA AI.

2026-05-03

🚀 Hành động trực tiếp trong chat — Wave 1 chat-driven actions chat funnel

4 luồng mới không cần rời chat: (1) bấm "Ứng tuyển" → chip "Để lại SĐT" → OTP verify là tự nộp đơn ngay kèm Slack notify HR; (2) hỏi "đơn của tôi sao rồi?" → card list 10 đơn gần nhất + status badge tiếng Việt; (3) "lưu việc Goertek" → ConfirmCard, bấm Lưu là xong; (4) "báo Samsung Bắc Ninh" → tạo Saved Search, hệ thống tự alert khi có việc mới khớp (cap 3 alert/ngày, max 5 search active). Chống spam: trần 15 đơn/ngày/worker (atomic Redis counter).
2026-04-29

🧠 Lead-capture trigger LLM-driven, không còn regex hardcode chat

Tool mới should_capture_lead: AI đọc full ngữ cảnh hội thoại → tự quyết khi nào nên hỏi SĐT thay vì dò 12 keyword cứng. Auto-prefill vị trí + KCN từ chat → IdentifyForm không cần gõ lại.
2026-04-29

🔍 parse_search_intent gọi trước mọi job_search search

"Goer tek 12tr Bắc Ninh" → AI normalize typo + extract {location: 'Bắc Ninh', salary_min: 12000000} trước khi tra Laravel. Loại bỏ class lỗi "AI bịa keyword" và xử lý phrasing tự nhiên VN.
2026-04-29

💬 Chat đa lượt cho khách (multi-turn context) chat

Khách không đăng nhập giờ cũng nhớ ngữ cảnh xuyên các tin nhắn — RLS policy nới cho guest, conversation_id round-trip giữa client + server. Hỏi "Goertek" rồi hỏi "Bắc Ninh" — AI hiểu là Goertek tại Bắc Ninh.
2026-04-29

🗂️ Lịch sử chat lưu localStorage 24h, F5 không mất ux

Đóng tab / refresh / mở lại — đoạn hội thoại đang dở vẫn còn. Tự động cắt sau 100 lượt hoặc 24h. Reset rõ ràng qua nút "Bắt đầu chat mới".
2026-04-29

🔐 Token guest tự gia hạn không cần user thấy lỗi auth

JWT 1h hết hạn giữa cuộc chat → frontend tự re-mint trước 5 phút + retry mid-stream khi 401. User không bao giờ thấy "token expired" toast.
2026-04-29

🎠 JobCarouselCard — vuốt ngang chọn việc ui

2-8 việc trả về dạng carousel ngang (Embla, swipe-aware, dot indicator). 1 thẻ/lượt rõ ràng hơn list dọc, có CTA "Ứng tuyển ngay" + "Xem chi tiết" nội tại.
2026-04-29

📝 ApplicationFormCard — ứng tuyển ngay trong chat funnel

Bấm "Ứng tuyển" trên carousel → form họ tên + SĐT + năm sinh + kinh nghiệm hiện inline. Submit tạo Worker (Laravel lookup-or-create) + ghi applied_jobs[] vào meta của conversation. Không rời khung chat.
2026-04-29

💰 Bộ lọc lương dùng cột wage_min/wage_max chuẩn hoá search

"từ 15tr" giờ thực sự lọc theo overlap với khoảng lương — không còn match toàn bộ vì regex [0-9]. Artisan jobs:backfill-wage parse "13-18 triệu/tháng" → wage_min=13M, wage_max=18M.
2026-04-29

🎚️ max_tokens điều chỉnh runtime qua admin (R2) arch

Provider OpenAI + DeepSeek đọc ai.tasks.chat.max_tokens từ settings_client (Laravel admin → Redis → env). Clamp 120-4096. Marketing tune độ dài câu trả lời live, không cần redeploy.
2026-04-29

📐 Wiki RAG semantic-only — bỏ BM25 mặc định không hiểu tiếng Việt rag

PG default tsvector tokenize Latin, mọi query VN trả 0 lexical hit. Drop BM25 leg; pgvector cosine với 1536-dim embedding cover synonym + typo tốt cho corpus 10 bài hiện tại. Sẽ bật lại khi có pg_trgm Vietnamese index.
2026-04-28

⚙️ LLM stack OpenAI + DeepSeek · settings sync runtime arch

OpenAI gpt-4o-mini làm primary, DeepSeek deepseek-chat dự phòng. Provider/model + giá đổi runtime qua admin Settings UI, đẩy invalidate qua Redis pub/sub. Wiki RAG dùng text-embedding-3-small 1536-dim.
2026-04-28

📚 Wiki RAG live · 10 article seed KCN Bắc Ninh rag

Semantic search pgvector 1536-dim · citation [#W{id}] · admin CRUD đầy đủ qua Laravel UI. Nhân viên content cập nhật bài → AI dùng ngay không redeploy. (Lưu ý 2026-04-29: BM25 leg bỏ tạm vì tsvector không hiểu tiếng Việt — xem changelog.)
2026-04-28

📊 Token + cost tracking realtime obs

Mọi LLM/embed/STT call ghi cost USD vào ``llm_usage_log``. Admin xem theo ngày/provider/model/worker · top-N consumers · live error feed.
2026-04-27

📡 6 webhook events đẩy realtime sang Laravel infra

profile.built, content.generated, safety.incident, match.found, churn.spike, anomaly.detected — fire-and-forget với daily idempotency keys.
2026-04-27

🎯 Admin intelligence — 4 endpoints AI cho ops admin

Churn risk score (LLM heuristic), reply suggestions HR, anomaly detection, campaign suggestions theo KCN.
2026-04-27

🧮 Nén tool result tiết kiệm token infra

Chỉ giữ field essential per-tool (top-10 jobs với 8 fields thay vì 50 jobs với 30 fields) — context LLM gọn hơn 60-70%.
2026-04-27

🛡️ Sanitize tool result chống prompt injection ngầm safety

Lọc model role tokens, URL injection, các kiểu "ignore previous" tiếng Việt + Anh từ tool response trước khi feed lại LLM.
2026-04-27

⚙️ Tool dispatch song song infra

LLM có thể gọi nhiều tools cùng lúc (asyncio.gather) — giảm latency 40%+ khi cần lấy nhiều thông tin.
2026-04-26

✨ Trợ lý xác nhận thông minh chat

AI tự điền sẵn form ứng tuyển từ hồ sơ của bạn, bấm "Xác nhận & gửi" là HR nhận đơn ngay. Không cần copy-paste lại thông tin.
2026-04-26

💬 Câu hỏi gợi ý sau mỗi câu trả lời chat

VIVA AI đề xuất 3 câu hỏi tiếp theo dựa trên cuộc trò chuyện, bấm là hỏi luôn — không cần nghĩ phải gõ gì.

Theo dõi đầy đủ tại GitLab commits.

Sẵn sàng trò chuyện với VIVA AI?

Test ngay trong widget phía trên — hoặc đọc docs để biết cách nạp thêm knowledge.

Chat với AI Xem Docs

Trợ lý AI cho công nhân công nghiệp.

AI hiểu tiếng Việt công nhân — không cần từ chuyên ngành.

Gõ thế nào cũng hiểu.

Tools tra Laravel data — không cần wiki KB tĩnh.

Mọi data đều realtime từ Laravel.

worker_profile

job_search + job_detail

job_compare

parse_search_intent

prepare_job_application

escalate_to_human

Parallel tool dispatch · result compression · retry classify.

Gọi nhiều tools cùng lúc, gọn token, retry thông minh.

OpenAI + DeepSeek — cost routing có chủ đích.

OpenAI gpt-4o-mini chính · DeepSeek dự phòng giá rẻ.

Safety layer — VN-aware PII, direct + indirect injection guard.

Đa tầng phòng thủ, hiểu chuẩn Việt Nam.

Generative UI · Intelligence · Webhooks · Memory · Voice.

Generative UI cards

Intelligence endpoints

6 webhook events

Conversation memory (RLS)

Phase 02 robustness

Profile building

Anonymous → registered

Worker AI endpoints

Scrape ingest pipeline

Cập nhật gần đây của VIVA AI.

🚀 Hành động trực tiếp trong chat — Wave 1 chat-driven actions chat funnel

🧠 Lead-capture trigger LLM-driven, không còn regex hardcode chat

🔍 parse_search_intent gọi trước mọi job_search search

💬 Chat đa lượt cho khách (multi-turn context) chat

🗂️ Lịch sử chat lưu localStorage 24h, F5 không mất ux

🔐 Token guest tự gia hạn không cần user thấy lỗi auth

🎠 JobCarouselCard — vuốt ngang chọn việc ui

📝 ApplicationFormCard — ứng tuyển ngay trong chat funnel

💰 Bộ lọc lương dùng cột wage_min/wage_max chuẩn hoá search

🎚️ max_tokens điều chỉnh runtime qua admin (R2) arch

📐 Wiki RAG semantic-only — bỏ BM25 mặc định không hiểu tiếng Việt rag

⚙️ LLM stack OpenAI + DeepSeek · settings sync runtime arch

📚 Wiki RAG live · 10 article seed KCN Bắc Ninh rag

📊 Token + cost tracking realtime obs

📡 6 webhook events đẩy realtime sang Laravel infra

🎯 Admin intelligence — 4 endpoints AI cho ops admin

🧮 Nén tool result tiết kiệm token infra

🛡️ Sanitize tool result chống prompt injection ngầm safety

⚙️ Tool dispatch song song infra

✨ Trợ lý xác nhận thông minh chat

💬 Câu hỏi gợi ý sau mỗi câu trả lời chat

Sẵn sàng trò chuyện với VIVA AI?

Trợ lý AI cho
công nhân công nghiệp.