Articles · updated monthly
Articles
On building production-grade ML: the four layers before the LLM, the routing decisions that sit above the model, and firsthand lessons from a three-agent IDP pipeline.
Currently available in English only
8 min
Four layers before the LLM: the gatekeeper pattern
How a stack of cheap classifiers cuts a document-IDP bill by an order of magnitude without ever waking the model.
6 min
The cheapest token is the one you don't send
Routing, retrieval, and a small set of decisions about which questions a model should never see.
7 min
Production ML is mostly not ML
A field report from a three-agent IDP pipeline: what breaks first, what's worth automating, and where humans still earn their seat.