mini-swe-agent - 100-लाइन AI एजेंट जो GitHub issues हल करता है और command line को सपोर्ट करता है

xguru · 2026-04-26T09:31:02+09:00

SWE-bench और SWE-agent बनाने वाली Princeton·Stanford टीम द्वारा विकसित मिनिमलिस्ट AI coding agent लगभग 100 लाइन Python code से बना एक बेहद हल्का AI software engineering agent, जो बड़े config files या monorepo के बिना काम करता है केवल bash को टूल के रूप में इस्तेमाल करता है, और LLM के tool-calling interface के बिना भी किसी भी मॉडल के साथ compatible है SWE-bench verified benchmark में 74% से अधिक हासिल, और Gemini 3 Pro के आधार पर भी वही स्तर दर्ज subprocess.run के जरिए हर action को independent execution में चलाता है, इसलिए stateful shell session की जरूरत नहीं होती, और Docker में बदलने पर बस docker exec से replace करना जितना सरल है पूरी तरह linear history structure के साथ, trajectory और LLM को भेजे जाने वाले messages एक जैसे रहते हैं, जिससे debugging और fine-tuning के लिए यह आदर्श है local, Docker/Podman, Singularity/Apptainer, Bubblewrap, Contree सहित कई sandbox environments में deployment सपोर्ट litellm, openrouter, portkey आदि के जरिए सभी मॉडल सपोर्ट, /completion और /response endpoints तथा interleaved thinking के साथ compatible Meta, NVIDIA, IBM, Essential AI, Nebius, Anyscale, Princeton·Stanford आदि में अपनाया गया Claude Code की तुलना में शुरुआत की गति काफी तेज, और agent scaffold overfitting के बिना LLM की अपनी performance का मूल्यांकन करने के लिए baseline के रूप में उपयोगी CLI(mini), batch inference, trajectory browser, और Python bindings (DefaultAgent, LitellmModel, LocalEnvironment) उपलब्ध MIT लाइसेंस

SWE-bench और SWE-agent बनाने वाली Princeton·Stanford टीम द्वारा विकसित मिनिमलिस्ट AI coding agent
लगभग 100 लाइन Python code से बना एक बेहद हल्का AI software engineering agent, जो बड़े config files या monorepo के बिना काम करता है
केवल bash को टूल के रूप में इस्तेमाल करता है, और LLM के tool-calling interface के बिना भी किसी भी मॉडल के साथ compatible है
SWE-bench verified benchmark में 74% से अधिक हासिल, और Gemini 3 Pro के आधार पर भी वही स्तर दर्ज
subprocess.run के जरिए हर action को independent execution में चलाता है, इसलिए stateful shell session की जरूरत नहीं होती, और Docker में बदलने पर बस docker exec से replace करना जितना सरल है
पूरी तरह linear history structure के साथ, trajectory और LLM को भेजे जाने वाले messages एक जैसे रहते हैं, जिससे debugging और fine-tuning के लिए यह आदर्श है
local, Docker/Podman, Singularity/Apptainer, Bubblewrap, Contree सहित कई sandbox environments में deployment सपोर्ट
litellm, openrouter, portkey आदि के जरिए सभी मॉडल सपोर्ट, /completion और /response endpoints तथा interleaved thinking के साथ compatible
Meta, NVIDIA, IBM, Essential AI, Nebius, Anyscale, Princeton·Stanford आदि में अपनाया गया
Claude Code की तुलना में शुरुआत की गति काफी तेज, और agent scaffold overfitting के बिना LLM की अपनी performance का मूल्यांकन करने के लिए baseline के रूप में उपयोगी
CLI(mini), batch inference, trajectory browser, और Python bindings (DefaultAgent, LitellmModel, LocalEnvironment) उपलब्ध
MIT लाइसेंस

mini-swe-agent - 100-लाइन AI एजेंट जो GitHub issues हल करता है और command line को सपोर्ट करता है

संबंधित पढ़ाई

1 टिप्पणियां