Andrew Ng: AI एजेंट्स की वर्तमान स्थिति [YouTube]

(youtube.com)

19 पॉइंट द्वारा GN⁺ 2025-06-16 | 2 टिप्पणियां | WhatsApp पर शेयर करें

Andrew Ng ने AI agents और agentic systems की अवधारणा के केंद्र में, एजेंट की परिभाषा पर बहस से अधिक autonomy के spectrum पर ध्यान देने का सुझाव दिया
इस समय वास्तविक business opportunities जटिल fully autonomous agents की तुलना में, सरल, linear या छोटे branching वाले workflows में अधिक हैं
Agentic systems के design और operation के लिए ज़रूरी व्यावहारिक skills (task decomposition, evaluation systems, data integration आदि) अभी भी दुर्लभ हैं, और विभिन्न tools का उपयोग करने की क्षमता महत्वपूर्ण है
Evals, voice stack, AI code assistants को ऐसे प्रमुख tools के रूप में बताया गया है जिन्हें अभी पर्याप्त ध्यान नहीं मिला है
Startup सफलता के मुख्य कारक execution speed और गहरी technical understanding हैं, और AI tools की प्रगति के साथ ज़्यादा non-developers के लिए भी बुनियादी coding क्षमता हासिल करना productivity बढ़ाने में मददगार है

Introduction

Andrew Ng ने LangChain सहित विभिन्न AI/agent projects और communities में योगदान दिया है, इसका परिचय
एजेंट की परिभाषा पर बहस के बजाय, agentic systems में autonomy के अलग-अलग स्तर हो सकते हैं, इस बात पर ज़ोर
"क्या यह सचमुच agent है" यह तय करने से बेहतर, autonomy को एक spectrum के रूप में देखकर व्यावहारिक ढंग से आगे बढ़ने का प्रस्ताव

Opportunities: वास्तविक business opportunities

वास्तव में कई business workflows में linear flow या केवल हल्की branching होती है
- उदाहरण: web form भरना, database lookup, simple search जैसे repetitive tasks का automation
Task decomposition और fine-tuning (micro-tasking), evaluation metrics का design, workflow improvement जैसी व्यावहारिक क्षमताएँ दुर्लभ हैं
Complex agentic workflows भी महत्वपूर्ण हैं, लेकिन अधिकांश value creation सरल repetitive संरचनाओं से आती है

Skills: agent builders के लिए आवश्यक क्षमताएँ

Business process को समझने के बाद, data collection/integration, prompts, process splitting आदि की systematic design क्षमता की आवश्यकता
Automated evaluation systems (system/component performance tracking, evaluation framework बनाना आदि) महत्वपूर्ण हैं
अनुभवी teams "अनावश्यक सुधार" पर अटकती नहीं हैं, बल्कि समस्याओं के कुशल workaround या alternatives ढूँढती हैं
विभिन्न AI tools और frameworks को वास्तव में इस्तेमाल करते हुए, decision-making, experimentation की गति और tool combination (Lego block शैली) तेज़ होती है

AI Tools & बदलाव

पिछले 2~3 वर्षों में, AI tools (जैसे Langgraph, RAG, chatbots, memory management, evals/guardrails आदि) का ecosystem अधिक विविध हुआ है
Tools को Lego blocks की तरह कई तरीकों से जोड़ा जा सकता है, इसलिए वास्तविक उपयोग का अनुभव बढ़ने पर तेज़ decision-making संभव होती है
LLM के context window बढ़ने से RAG जैसी कुछ techniques की व्यावहारिक भूमिका बदल रही है—hyperparameter tuning का महत्व घटा है

Underrated Tools: कम आंके गए मुख्य tools

Evals (automated evaluation): कई teams इसे ज़रूरत से ज़्यादा कठिन मानती हैं, लेकिन छोटे examples से जल्दी implement और improve करने की आदत महत्वपूर्ण है
Voice stack (voice-based workflows): बड़े enterprises में इसकी demand और adoption तेज़ी से बढ़ रही है, लेकिन developer community का ध्यान अभी कम है
AI code assistants: AI-supported coding से productivity बढ़ती है, और यदि सभी members बुनियादी coding क्षमता सीख लें तो अलग-अलग roles में productivity सुधरती है
- AI Fund का उदाहरण: receptionist, CFO, lawyers तक सभी coding सीखकर अपने काम की efficiency बढ़ा रहे हैं

Voice Application की विशेषताएँ

Text prompts की तुलना में voice input उपयोगकर्ता का burden कम करता है और जानकारी तेज़ी से दर्ज कराने में मदद करता है
Voice-based agents में response time (latency) बहुत महत्वपूर्ण है (आदर्श रूप से 1 second से कम), और real-time interaction के लिए कई UX tricks उपयोग की जाती हैं (जैसे pre-response, background noise)
Voice interfaces के लागू होने वाले क्षेत्र और उपयोग की संभावनाएँ बहुत बड़ी हैं, लेकिन developer tools और support infrastructure की अभी और आवश्यकता है

MCP: standardization और data integration

MCP(Mesh Capability Protocol): विभिन्न data sources, APIs और tools को standardized interface से जोड़ने का industry trend
MCP standard अभी शुरुआती चरण में है, लेकिन जटिल data/tool integration को सरल बनाने वाले प्रमुख आधार के रूप में विकसित होने की संभावना है
n agents और m data sources को जोड़ते समय n*m नहीं बल्कि n+m लागत पर integration संभव बनाने की दृष्टि

Agent-to-Agent systems

Multi-agent, agent-to-agent interaction अभी बहुत शुरुआती चरण में हैं, और फिलहाल ज़्यादातर ठोस success cases एक ही team के भीतर देखने को मिलते हैं
अलग-अलग teams/companies के बीच agent interaction भविष्य के विकास का क्षेत्र है

Vibe Coding और AI coding

AI assistants के साथ coding करने की 'Vibe Coding' प्रवृत्ति वास्तव में गहरे focus और intellectual work की मांग करती है; नाम के विपरीत यह केवल "feeling" के आधार पर coding नहीं है
AI code assistants के विकास से, अधिक non-developers और विभिन्न professions में coding skills का महत्व बढ़ रहा है
Coding सीखना भविष्य की productivity का मुख्य आधार है—कम से कम एक programming language (विशेष रूप से Python) सीखने की सिफारिश

Advice for Startups: AI startups के लिए सलाह

Startup सफलता की पहली प्राथमिकता execution speed, और दूसरी technology की गहरी समझ है
Marketing, sales, pricing आदि महत्वपूर्ण हैं, लेकिन technology वास्तव में कैसे काम करती है और नवीनतम technical changes क्या हैं, इसकी समझ अधिक दुर्लभ और अधिक महत्वपूर्ण है
गहरी technical fundamentals की समझ रखने वाली teams समस्याओं को तेज़ी और कुशलता से हल कर सकती हैं

2 टिप्पणियां

namojo 2025-06-16

MCP(Mesh Capability Protocol) वाला हिस्सा शायद टाइपो है, है ना?
लगता है कि एजेंट में फ़ंक्शंस और permissions को कैसे बाँटा जाए और workflow कैसे बनाया जाए, यही सफलता की कुंजी है।

beoks 2025-06-16

लगता है MCP हाल ही में उभरा हुआ term है, इसलिए LLM ने उस पर training नहीं ली और अजीब शब्द इस्तेमाल कर रहा है lol