Context Mode - AI कोडिंग एजेंट के context window को 98% बचाने वाला MCP सर्वर

xguru · 2026-05-03T09:31:02+09:00

MCP tool calls जो raw data context window में उंडेल देते हैं, उसे बचाता है और session duration को ~30 मिनट से ~3 घंटे तक बढ़ाता है Context बचत: raw data को sandbox में isolate करके 315KB → 5.4KB तक 98% कमी Session continuity: file edit, git work, task, error, user decision जैसे सभी events को SQLite+FTS5 में store और index करने के बाद BM25 से search करके session event tracking और compaction recovery कोड के ज़रिये सोचना: LLM डेटा को सीधे पढ़ने के बजाय script लिखता है और सिर्फ़ result लौटाता है (जैसे functions गिनते समय, context save किए बिना संबंधित code लिखकर वास्तविक result ही लाता है) Output compression: ~65-75% output tokens में कमी, तकनीकी शुद्धता बरकरार 14 platforms का समर्थन: Claude Code(plugin marketplace), Codex CLI, Gemini CLI, Cursor, OpenCode, OpenClaw, Antigravity आदि हर platform के hook स्तर(PreToolUse, PostToolUse, SessionStart, PreCompact) के अनुसार session continuity grade अलग-अलग समर्थन 11 MCP tools उपलब्ध: ctx_execute(11 language runtimes), ctx_batch_execute(multiple commands+search batch execution), ctx_execute_file(file sandbox processing), ctx_index/ctx_search(FTS5+BM25 आधारित knowledge base indexing·search), ctx_fetch_and_index(URL fetch·index·24 घंटे TTL cache), ctx_stats/ctx_doctor/ctx_upgrade/ctx_purge/ctx_insight Security: Claude Code की permissions setting(deny/allow patterns) को MCP sandbox पर भी वही लागू, और chained commands की भी अलग-अलग जाँच पूरी तरह local execution: telemetry, cloud sync, usage tracking नहीं; SQLite DB home directory में store होती है Microsoft, Google, Meta, Amazon, NVIDIA, Stripe, Datadog जैसी प्रमुख कंपनियों की teams द्वारा उपयोग में Elastic License 2.0(source available, hosted service उपलब्ध कराना निषिद्ध)

(github.com/mksglu)

23 पॉइंट द्वारा xguru 2026-05-03 | 8 टिप्पणियां | WhatsApp पर शेयर करें

MCP tool calls जो raw data context window में उंडेल देते हैं, उसे बचाता है और session duration को ~30 मिनट से ~3 घंटे तक बढ़ाता है
Context बचत: raw data को sandbox में isolate करके 315KB → 5.4KB तक 98% कमी
Session continuity: file edit, git work, task, error, user decision जैसे सभी events को SQLite+FTS5 में store और index करने के बाद BM25 से search करके session event tracking और compaction recovery
कोड के ज़रिये सोचना: LLM डेटा को सीधे पढ़ने के बजाय script लिखता है और सिर्फ़ result लौटाता है (जैसे functions गिनते समय, context save किए बिना संबंधित code लिखकर वास्तविक result ही लाता है)
Output compression: ~65-75% output tokens में कमी, तकनीकी शुद्धता बरकरार
14 platforms का समर्थन: Claude Code(plugin marketplace), Codex CLI, Gemini CLI, Cursor, OpenCode, OpenClaw, Antigravity आदि
- हर platform के hook स्तर(PreToolUse, PostToolUse, SessionStart, PreCompact) के अनुसार session continuity grade अलग-अलग समर्थन
11 MCP tools उपलब्ध: ctx_execute(11 language runtimes), ctx_batch_execute(multiple commands+search batch execution), ctx_execute_file(file sandbox processing), ctx_index/ctx_search(FTS5+BM25 आधारित knowledge base indexing·search), ctx_fetch_and_index(URL fetch·index·24 घंटे TTL cache), ctx_stats/ctx_doctor/ctx_upgrade/ctx_purge/ctx_insight
Security: Claude Code की permissions setting(deny/allow patterns) को MCP sandbox पर भी वही लागू, और chained commands की भी अलग-अलग जाँच
पूरी तरह local execution: telemetry, cloud sync, usage tracking नहीं; SQLite DB home directory में store होती है
Microsoft, Google, Meta, Amazon, NVIDIA, Stripe, Datadog जैसी प्रमुख कंपनियों की teams द्वारा उपयोग में
Elastic License 2.0(source available, hosted service उपलब्ध कराना निषिद्ध)

8 टिप्पणियां

kirkyoon 2026-05-06

मुझे लगता है कि इसका असर सच में महसूस हो रहा है। मैं सिर्फ MCP इस्तेमाल कर रहा हूँ और hooks को फिलहाल बाहर रख रहा हूँ।

kaydash 2026-05-04

काम करता है क्या..

duse0001 2026-05-03

यह Claude mem और caveman के मिश्रण जैसा लग रहा है।

recast7838 2026-05-03

लगता है आजकल टोकन बचाने वाली तरह-तरह की प्रोडक्ट्स और प्रॉम्प्ट्स काफ़ी चलन में हैं। यह वास्तव में कितना असरदार होगा, पता नहीं।

kurthong 2026-05-04

मेरा मानना है कि token savings और context window savings एक-दूसरे से जुड़े हुए हैं, लेकिन दोनों एक ही बात नहीं हैं.

Token savings लागत का मुद्दा है, और context window savings मॉडल की working memory की quality का मुद्दा है. अगर बड़े पैमाने के logs या DOM snapshots को ज्यों का त्यों डाल दिया जाए, तो लागत भी बढ़ती है, लेकिन उससे भी बड़ी समस्या यह है कि मॉडल के लिए noise के बीच महत्वपूर्ण संकेत छूट जाने या गलत संदर्भ लेने की संभावना बढ़ जाती है.

ऐसा लगता है कि context-mode को “कम इस्तेमाल करें” वाले टूल से ज़्यादा “context में क्या चढ़ाना है, इसे नियंत्रित करें” वाले टूल के रूप में समझना सही होगा.

ehlegeth 2026-05-04

आपको token बचत और context window बचत को अलग-अलग समझना चाहिए।
अक्सर इन्हें साथ में चर्चा किया जाता है, इसलिए भ्रम होना आसान है।

shakespeares 2026-05-03

सही कहा, पता नहीं यह वाकई कितना practical है.. लगता है एक-दो बार और इस्तेमाल करने पर शायद tokens और ज़्यादा खर्च होंगे..

kyg5474 2026-05-03

सच में?

Context Mode - AI कोडिंग एजेंट के context window को 98% बचाने वाला MCP सर्वर

संबंधित पढ़ाई

8 टिप्पणियां