Caveman - Claude/Codex टोकन बचाने के लिए गुफावासी बोलचाल

(github.com/JuliusBrussee)

30 पॉइंट द्वारा GN⁺ 24 일 전 | अभी कोई टिप्पणी नहीं है. | WhatsApp पर शेयर करें

गुफावासी बोलचाल में जवाब देने के लिए मजबूर करके औसतन 65~75% output tokens की बचत करने वाली स्किल
Lite·Full·Ultra तीन चरणों में compression intensity नियंत्रित करते हुए, तकनीकी सटीकता बनाए रखकर छोटे और efficient जवाब बनाती है
वास्तविक benchmark में React·PostgreSQL·Git से जुड़ी व्याख्याओं में भी token उपयोग आधे से कम हो गया
response speed लगभग 3 गुना बेहतर, readability में सुधार, cost reduction का प्रभाव एक साथ देती है
Claude Code और Codex में सरल command से install किया जा सकता है, और पूरे session में लगातार इस्तेमाल संभव है

Caveman अवलोकन

Claude Code और Codex के लिए plugin, जो LLM के जवाबों को ‘गुफावासी बोलचाल(caveman-speak)’ में बदलकर token उपयोग लगभग 75% कम करता है
तकनीकी सटीकता बनाए रखते हुए अनावश्यक शब्द हटाकर छोटे और efficient जवाब बनाता है
install एक लाइन के command से संभव है, और सभी session में लगातार इस्तेमाल किया जा सकता है
कमी केवल output tokens में होती है — thinking/reasoning tokens पर कोई असर नहीं
हटाए जाने वाले उदाहरण:
- greeting·opening: "Sure, I'd be happy to help" (8 tokens की बर्बादी)
- कारण बताने की शुरुआत: "The reason this is happening is because" (7 tokens)
- recommendation phrasing: "I would recommend that you consider" (7 tokens)
- फालतू introduction: "Sure, let me take a look at that for you" (10 tokens)
जिन चीजों को बनाए रखा जाता है: code blocks, technical terms (polymorphism आदि), error messages, git commit·PR messages

एक ही तकनीकी व्याख्या को छोटे वाक्यों में compress करके व्यक्त करता है
- React component rerendering के कारण की व्याख्या: 69 tokens → 19 tokens
- authentication middleware bug की व्याख्या: 75% से अधिक token बचत
Lite / Full / Ultra तीन चरणों में compression intensity नियंत्रित की जा सकती है
- Lite (/caveman lite): अनावश्यक expressions हटाता है, grammar बनाए रखता है — professional लेकिन बिना फालतू विस्तार
- Full (/caveman full): default caveman mode — articles छोड़े जाते हैं, छोटे और टूटे हुए वाक्य
- Ultra (/caveman ultra): अधिकतम compression — telegram style, हर चीज़ संक्षिप्त

Claude API के ज़रिए वास्तविक token उपयोग की तुलना में औसतन 65% reduction
reduction range: 22%~87%
- React rerendering bug की व्याख्या: 1,180 → 159 tokens (87% reduction)
- PostgreSQL connection pool settings: 2,347 → 380 tokens (84% reduction)
- Docker multi-stage build: 1,042 → 290 tokens (72% reduction)
- git rebase vs merge व्याख्या: 702 → 292 tokens (58% reduction)
- callback → async/await refactoring: 387 → 301 tokens (22% reduction, सबसे कम प्रभाव)
कमी केवल output tokens में, thinking·reasoning tokens वैसे ही रहते हैं
मुख्य फायदे हैं readability में सुधार और response speed में बढ़ोतरी, cost reduction एक अतिरिक्त प्रभाव है

मार्च 2026 के शोधपत्र "Brevity Constraints Reverse Performance Hierarchies in Language Models": बड़े models पर संक्षिप्त जवाब देने की बाध्यता लगाने पर कुछ benchmark में accuracy 26%p बढ़ी और performance ranking उलटने की पुष्टि हुई
"Verbose not always better. Sometimes less word = more correct"
- लंबे जवाबों की तुलना में कुछ मामलों में छोटे जवाब अधिक सटीक हो सकते हैं

एक-लाइन install: npx skills add JuliusBrussee/caveman
Claude Code plugin: claude plugin marketplace add JuliusBrussee/caveman
Codex: repository clone करने के बाद /plugins मेनू में Caveman खोजकर install करें
trigger: /caveman, "talk like caveman", "caveman mode", "less tokens please"
बंद करने के लिए: "stop caveman" या "normal mode"
एक बार install → उसके बाद पूरे session पर लागू

trigger commands: /caveman, $caveman, “talk like caveman”, “caveman mode”, “less tokens please”
बंद करने के commands: “stop caveman”, “normal mode”

intensity control

Level	Trigger	विशेषता
Lite	`/caveman lite`	grammar बनाए रखता है, अनावश्यक शब्द हटाता है
Full	`/caveman full`	default mode, articles·फालतू शब्द हटाता है
Ultra	`/caveman ultra`	अधिकतम compression, abbreviations-केंद्रित अभिव्यक्ति