From 0be1fb8ebaa55d11c761b7fa654e438fe4755a56 Mon Sep 17 00:00:00 2001
From: happybell80
Date: Mon, 16 Mar 2026 22:49:27 +0900
Subject: [PATCH] Refine Gemini embedding phase-one closure docs

---
 ...딩_1차_로빙_Gemini2_전환_계획.md | 54 +++++++++++++------
 ...딩_1차_로빙_현황_SSOT_리서치.md | 33 ++++++++++--
 ...1차_로빙_Gemini2_전환_문제오픈.md | 28 ++++++++--
 3 files changed, 93 insertions(+), 22 deletions(-)

diff --git a/journey/plans/260316_임베딩_1차_로빙_Gemini2_전환_계획.md b/journey/plans/260316_임베딩_1차_로빙_Gemini2_전환_계획.md
index 329bd34..9efb921 100644
--- a/journey/plans/260316_임베딩_1차_로빙_Gemini2_전환_계획.md
+++ b/journey/plans/260316_임베딩_1차_로빙_Gemini2_전환_계획.md
@@ -17,6 +17,20 @@ tags: [plans, embedding, gemini, rag, robeing, 1차]
 
 - planned
 
+## Confirmed decisions
+
+- **Target repos**: `skill-embedding`, `skill-rag-file`, `rb8001`, `DOCS`
+- **Official embedding path**: `skill-embedding` as the single gateway
+- **Model/dimension**: `Gemini Embedding 2`, `768d`
+- **Input scope**: text + PDF + images
+- **Memory scope**: the `rb8001` memory collection drift is also included in the phase-one closure scope
+- **Mixing policy**: no `384/768` mixing within the phase-one target scope
+- **New data intake**: all new information/material arriving from now on must use `Gemini 2 768d`
+- **Deployment criterion**: not closed until the actual production deployment is complete
+- **Verification criteria**: automated tests + manual verification with a representative question set + production structured logs
+- **Fallback policy**: a temporary fallback is allowed during deployment. However, structured logs are mandatory, and the fallback must be removed before final closure
+- **Evidence document principle**: closure evidence is not concentrated in a single worklog but split into `worklog + test results + deployment/operations log records`
+
 ## Prerequisites
 
 - `EMBEDDING_SERVICE_URL`, `EMBEDDING_DIM`, and `EMBEDDING_MODEL` are reflected in `workspace-config/runtime.env` (SSOT)
@@ -36,12 +50,12 @@ tags: [plans, embedding, gemini, rag, robeing, 1차]
 - **Path decision**: replace skill-embedding (ONNX→Gemini 2). See [Research §7, Path design decision](../research/rag/260316_임베딩_1차_로빙_현황_SSOT_리서치.md).
 - Switch skill-embedding internals to Gemini 2; keep the /embed endpoint
 - PDF/image embedding path for NAS RAG and Company X RAG
+- Realign the Company X document collection and the `rb8001` memory collection to 768d
 - MRL 768d, ChromaDB/pgvector schema compatibility
 - **Chunking**: phase one keeps the existing Micro chunking. Macro (2,000~4,000) is reviewed in stage 2. See [Research §8, Chunking stages](../research/rag/260316_임베딩_1차_로빙_현황_SSOT_리서치.md).
 
 ### Excluded
 
-- rb8001 memory 768/384 dimension drift (separate issue)
 - StarsAndI, TheGooseCouncil (phase-two plan)
 
 ## env SSOT
@@ -58,8 +72,8 @@ tags: [plans, embedding, gemini, rag, robeing, 1차]
   - Verify text 768d (output_dimensionality=768)
   - One multimodal sample each for PDF and image
   - Confirm cost at or below $0.25 per 1M tokens
-  - Recall maintained or improved versus the existing model (optional: sample comparison against ko-sroberta)
-- **Completion criteria**: tests pass, cost and Recall criteria met
+  - Retrieval quality maintained or improved on the representative question set
+- **Completion criteria**: tests pass, cost criterion met, quality verification record created
 
 ### 2. skill-embedding switchover
 
@@ -77,50 +91,60 @@ tags: [plans, embedding, gemini, rag, robeing, 1차]
 - **Tasks**:
   - Create/migrate ChromaDB collections with output_dimensionality=768
   - Confirm intent_prototypes pgvector is 768d (no schema change if it is already 768d)
+  - Realign the Company X document collection and the `rb8001` memory collection first in phase one
   - Chunking: keep the existing Micro (300~500 words); no code changes
-- **Completion criteria**: ChromaDB and pgvector unified at 768d; migrate script runnable
+- **Completion criteria**: ChromaDB and pgvector unified at 768d within the phase-one target scope; removal of memory drift logs confirmed
 
 ### 4. Application and verification
 
 - **Target**: one path first, either Company X RAG or NAS RAG
 - **Tasks**:
   - skill-rag-file and rb8001 call the new skill-embedding through the existing `EMBEDDING_SERVICE_URL`/`SKILL_EMBEDDING_URL` (no URL change)
-  - Manually verify the RAG upload → embedding → search pipeline once
+  - Manually verify Company X document search and `rb8001` memory store/search with the representative question set
+  - When a fallback occurs, record the cause and the request unit in the structured log
   - Update DOCS `skills/companyx-rag/SKILL.md` and `330_*.md` as needed
-- **Completion criteria**: Company X RAG or NAS RAG confirmed working on the new path
+- **Completion criteria**: both the Company X path and the `rb8001` memory path confirmed working on the new path; fallback removal confirmed
 
 ### 5. ivada-infra deployment (server administrator)
 
 - **Target**: ivada-infra skill-embedding deployment, servers 23/24
-- **Tasks**: update `.env.deploy`, rebuild and redeploy the skill-embedding image
+- **Tasks**: update `.env.deploy`, rebuild and redeploy the skill-embedding image, roll out to related services sequentially
 - **Execution**: performed by the server administrator only
+- **Completion criteria**: after production deployment, structured logs confirm that the actual Gemini 2 path is in use
 
 ### 6. Declare closure after writing the worklog
 
-- Record completion of steps 1~5 in the worklog and declare that the 6 closure conditions of the [issue-open document](../troubleshooting/260316_임베딩_1차_로빙_Gemini2_전환_문제오픈.md) are met
+- Record completion of steps 1~5 in the worklog
+- Link the automated test/question set results in the test results document or section
+- Link the deployment/operations verification record document or log path
+- Declare that the 8 closure conditions of the [issue-open document](../troubleshooting/260316_임베딩_1차_로빙_Gemini2_전환_문제오픈.md) are met
 
 ## Rollback procedure
 
 - skill-embedding: revert to the previous ONNX ko-sroberta image/code
 - workspace-config: remove `EMBEDDING_MODEL` or restore the previous value
 - ChromaDB: restore from the pre-migrate backup if one exists. Otherwise delete only the new 768d collections
+- However, if a fallback path was used during rollback, keep the corresponding logs in the evidence documents
 
 ## Verification criteria (closure conditions)
 
-Same as the [6 closure conditions of the issue-open document](../troubleshooting/260316_임베딩_1차_로빙_Gemini2_전환_문제오픈.md#닫힘-조건):
+Same as the [8 closure conditions of the issue-open document](../troubleshooting/260316_임베딩_1차_로빙_Gemini2_전환_문제오픈.md#닫힘-조건):
 
 1. skill-embedding runs on Gemini 2 at 768d.
 2. rb8001 and skill-rag-file reach the new model through the existing /embed URL.
-3. The ChromaDB and pgvector schemas are unified at 768d.
-4. Company X RAG and NAS RAG work on the new path.
-5. Recall for direct PDF/image embedding is maintained or improved; cost is at or below $0.25 per 1M tokens.
-6. Closure is declared in the worklog.
+3. 384/768 mixing is removed from the `skill-rag-file` collections and the `rb8001` memory collection, and ChromaDB and pgvector are unified at 768d.
+4. Company X RAG and `rb8001` memory store/search work on the new path.
+5. Direct PDF/image embedding works, and retrieval quality on the representative question set is maintained or improved relative to the existing baseline.
+6. Automated tests, manual question-set verification, and production structured logs are all recorded.
+7. After the actual deployment, the production path stays on the single Gemini 2 path with no fallback.
+8. Closure is declared in the worklog and in the deployment/verification evidence documents.
 
 ## Closure declaration
 
-- Closure is declared only in the worklog.
+- The primary declaration is made in the worklog.
+- Links to the test results and the deployment/operations verification evidence are recorded alongside it.
 - Update this document's status to `completed` and add the worklog link.
-- Declare only after all 6 closure conditions of the [issue-open document](../troubleshooting/260316_임베딩_1차_로빙_Gemini2_전환_문제오픈.md) are met.
+- Declare only after all 8 closure conditions of the [issue-open document](../troubleshooting/260316_임베딩_1차_로빙_Gemini2_전환_문제오픈.md) are met.
 
 ## Related documents
 
diff --git a/journey/research/rag/260316_임베딩_1차_로빙_현황_SSOT_리서치.md b/journey/research/rag/260316_임베딩_1차_로빙_현황_SSOT_리서치.md
index 19c0671..78b4f84 100644
--- a/journey/research/rag/260316_임베딩_1차_로빙_현황_SSOT_리서치.md
+++ b/journey/research/rag/260316_임베딩_1차_로빙_현황_SSOT_리서치.md
@@ -65,13 +65,17 @@ tags: [research, embedding, ssot, robeing, 1차]
 - skill-rag-file calls skill-embedding via EMBEDDING_SERVICE_URL
 - Little existing data → low technical debt for a full replacement
 - 0_VALUE embedding policy: Gemini 2, 768d, multimodal
+- The `rb8001` memory dimension drift is open as a separate document, but it was decided not to exclude it from the phase-one closure context
+- Phase-one closure evidence must be kept as separate evidence covering tests, deployment, and operations logs, not as a single worklog
 
 ---
 
 ## 6. Interpretation
 
 - Once skill-embedding switches to Gemini 2, skill-rag-file and rb8001 keep the same URL and pick up the new model automatically
-- Calling the Gemini API directly from inside skill-rag-file is also possible. A path design choice is needed
+- Calling the Gemini API directly from inside skill-rag-file is also possible. For phase one, however, keeping a single gateway fits the SSOT better
+- To truly close phase one, not just `RAG only on Gemini 2` but also `rb8001` memory, existing data, and the new-data intake rule must be closed together
+- Fallback can be allowed only as a deployment safety mechanism and must not remain as part of the final closed state
 
 ---
 
@@ -90,14 +94,37 @@ tags: [research, embedding, ssot, robeing, 1차]
 
 ---
 
-## 9. ivada-infra integration
+## 9. Decisions fixed in this research
+
+- The phase-one scope is `skill-embedding`, `skill-rag-file`, `rb8001`, and `DOCS`.
+- The official embedding path is the single `skill-embedding` gateway.
+- The model/dimension is `Gemini Embedding 2`, `768d`.
+- Text, PDF, and images are all within the phase-one closure scope.
+- The `rb8001` memory drift is not excluded separately; it is included in the phase-one closure scope.
+- `384/768` mixing is not permitted within the phase-one target scope.
+- All new data intake must follow `Gemini 2 768d`; any other path is subject to failure handling and logging.
+- Actual production deployment, automated tests, question-set verification, and operations logs are all required as closure evidence.
+- Fallback may be allowed temporarily during deployment, but it is not part of the final closed state.
+
+---
+
+## 10. Remaining open items
+
+- How to serialize PDF/image input into the single `/embed` contract inside skill-embedding is an implementation detail that is still undecided.
+- The batch unit and order for re-embedding the Company X document collection and the `rb8001` memory collection are still undecided.
+- Which document set the representative question set is fixed to, and which judgment table records the retrieval-quality maintain/improve criterion, are still undecided.
+- Where structured logs are stored, and which services align on the same field names and format, are still undecided.
+
+---
+
+## 11. ivada-infra integration
 
 - skill-embedding is deployed to servers 23/24 from ivada-infra. `.env.deploy` and docker-compose paths exist.
 - For the phase-one switchover: switch the skill-embedding repo to Gemini 2 → update the ivada-infra skill-embedding deployment config → redeploy servers 23/24. Run by the server administrator.
 
 ---
 
-## 10. Related documents
+## 12. Related documents
 
 - [Embedding phase-one Robeing Gemini 2 switchover: issue open](../../troubleshooting/260316_임베딩_1차_로빙_Gemini2_전환_문제오픈.md)
 - [Embedding phase-one Robeing Gemini 2 switchover: plan](../../plans/260316_임베딩_1차_로빙_Gemini2_전환_계획.md)
diff --git a/journey/troubleshooting/260316_임베딩_1차_로빙_Gemini2_전환_문제오픈.md b/journey/troubleshooting/260316_임베딩_1차_로빙_Gemini2_전환_문제오픈.md
index 589b272..d2c3b6e 100644
--- a/journey/troubleshooting/260316_임베딩_1차_로빙_Gemini2_전환_문제오픈.md
+++ b/journey/troubleshooting/260316_임베딩_1차_로빙_Gemini2_전환_문제오픈.md
@@ -39,6 +39,20 @@ tags: [troubleshooting, embedding, gemini, rag, robeing, 1차]
 - rb8001 memory: ChromaDB 768/384 dimension drift
 - Existing data: small → almost no technical debt from a full replacement
 
+## Phase-one decisions locked
+
+- **Target repos**: `skill-embedding`, `skill-rag-file`, `rb8001`, `DOCS`
+- **Official path**: All phase-one target services use only `skill-embedding` as the official embedding gateway.
+- **Model/dimension**: `Gemini Embedding 2`, `768d`
+- **Input scope**: Text, PDF, and images are all included in the phase-one closure scope.
+- **Memory scope**: The `rb8001` memory dimension drift is also closed in phase one.
+- **Mixing policy**: Within the phase-one target scope, `384/768` mixing is not accepted as a closed state.
+- **New data policy**: All new information/material arriving from now on must follow the same path and dimension.
+- **Deployment criterion**: Docs, code, and local verification alone do not close the issue; it closes only once actual deployment and production verification are complete.
+- **Verification strength**: Automated tests + manual verification with the representative question set + production structured logs are all required as closure evidence.
+- **Fallback policy**: A temporary fallback to the legacy embedding during deployment may be allowed, but every fallback occurrence must be recorded in structured logs, and the fallback must be removed by the time of final closure.
+- **Re-embedding starting point**: Clean up the Company X document collection and the `rb8001` memory collection first, then extend to the existing data of the phase-one target repos.
+
 ## Expected state (phase-one closure criteria)
 
 | Item | Description |
@@ -50,6 +64,8 @@ tags: [troubleshooting, embedding, gemini, rag, robeing, 1차]
 | env | workspace-config SSOT; per-service overrides prohibited |
 | Multimodal | Direct PDF/image embedding |
 | Chunking | Phase one keeps Micro; Macro reviewed in stage 2 |
+| Production data | Within the phase-one target scope, both new and existing data converge on the single 768d path |
+| Memory | Even the `rb8001` memory collection is cleaned up to 768d, so dimension drift logs disappear |
 
 ## Impact scope (phase one)
 
@@ -66,10 +82,12 @@ tags: [troubleshooting, embedding, gemini, rag, robeing, 1차]
 
 1. skill-embedding runs on Gemini 2 at 768d. (Path: skill-embedding replacement confirmed)
 2. rb8001 and skill-rag-file reach the new model through the existing /embed URL.
-3. The ChromaDB and pgvector schemas are unified at 768d.
-4. Company X RAG and NAS RAG work on the new path.
-5. Recall for direct PDF/image embedding is maintained or improved; cost is at or below $0.25 per 1M tokens.
-6. Closure is declared in the worklog.
+3. 384/768 mixing is removed from the `skill-rag-file` collections and the `rb8001` memory collection, and ChromaDB and pgvector are unified at 768d.
+4. Company X RAG and `rb8001` memory store/search work on the new path.
+5. Direct PDF/image embedding works, and retrieval quality on the representative question set is maintained or improved relative to the existing baseline.
+6. Automated tests, manual question-set verification, and production structured logs are all recorded.
+7. After the actual deployment, the production path stays on the single Gemini 2 path with no fallback.
+8. Closure is declared in the worklog and in the deployment/verification evidence documents.
 
 ## Reproduction conditions
 
@@ -88,6 +106,8 @@ tags: [troubleshooting, embedding, gemini, rag, robeing, 1차]
 
 - **Path**: replace skill-embedding (ONNX→Gemini 2). [Research §7](../research/rag/260316_임베딩_1차_로빙_현황_SSOT_리서치.md)
 - **Chunking**: phase one keeps the existing Micro chunking. Macro is reviewed in stage 2. [Research §8](../research/rag/260316_임베딩_1차_로빙_현황_SSOT_리서치.md)
+- **Memory**: the `rb8001` memory dimension drift is also included in the phase-one closure scope
+- **Deployment/verification**: closure requires actual deployment, structured logs, and representative question-set verification
 
 ## Research opened by this document