Raising An Agent - Episode 10
by amp
The IDE sidebar is a dead-end interaction model for coding agents—parallel, headless agent swarms that run for 45 minutes without human input replace the one-on-one assistant workflow.
by nicholas-carlini
Sixteen Claude instances working in parallel without human supervision can produce a 100,000-line Rust-based C compiler capable of compiling the Linux kernel—but only when the task verifier and CI pipeline are nearly perfect.
by alex-shershebnev
Building AI coding agents requires only basic tooling—a ReAct loop, tool definitions, and MCP integration—and the developer's role shifts from writing code to managing autonomous virtual developers.
by michael-bolin
The agent loop—a simple cycle of LLM calls and tool execution—is the core of every AI agent, but performance requires stateless design, prompt caching, and context compaction to avoid quadratic inference costs.
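The loop described in that entry fits in a few lines. A minimal sketch, where `llm_call` and the toy tool registry are illustrative stand-ins, not any real SDK:

```python
# Minimal agent loop: call the model, execute any requested tool,
# feed the result back, and repeat until the model answers in text.
# `llm_call` is a stub standing in for a real model API.

def run_tool(name, args):
    tools = {"add": lambda a, b: a + b}   # toy tool registry
    return tools[name](*args)

def llm_call(messages):
    # Stub: request a tool once, then produce a final answer.
    if not any(m["role"] == "tool" for m in messages):
        return {"tool": "add", "args": [2, 3]}
    return {"text": f"The result is {messages[-1]['content']}"}

def agent_loop(user_prompt):
    messages = [{"role": "user", "content": user_prompt}]
    while True:
        reply = llm_call(messages)
        if "tool" in reply:                       # model wants a tool
            result = run_tool(reply["tool"], reply["args"])
            messages.append({"role": "tool", "content": result})
        else:                                     # plain text: done
            return reply["text"]

print(agent_loop("What is 2 + 3?"))  # The result is 5
```

The stateless design the entry mentions falls out naturally: the whole conversation lives in `messages`, which is re-sent on every call.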
by aicodeking
Kimi K2.5 challenges proprietary models at a fraction of the cost: trillion-parameter MoE with vision, agent swarm parallelism, and 5th place on AICodeKing's benchmark—beating Claude Sonnet 4.5 and DeepSeek V3.2.
by amp
The assistant era is over—agents now write production code. The next frontier is building 'agent-native codebases' with feedback loops that let agents verify their own work autonomously.
by adam-gospodarczyk
MCP's flaws matter less than the fundamental LLM limitations it exposes: context bloat from tool schemas, degraded instruction following in long conversations, and inference costs that balloon with agentic workflows.
by hacker-news-community
Simple retrieval often outperforms complex vector infrastructure—BM25, SQLite FTS5, and grep handle most local RAG use cases better than dedicated vector databases.
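The claim is easy to try with Python's built-in `sqlite3` module. A small sketch, assuming your SQLite build ships the FTS5 extension (most modern builds do):

```python
import sqlite3

# Full-text search over a few notes with SQLite FTS5:
# no embeddings, no vector database, just BM25-ranked keyword matching.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE VIRTUAL TABLE notes USING fts5(title, body)")
conn.executemany(
    "INSERT INTO notes VALUES (?, ?)",
    [
        ("agents", "An agent loop calls an LLM and executes tools."),
        ("rag", "Retrieval-augmented generation feeds documents to a model."),
        ("grep", "Plain-text search often beats heavyweight infrastructure."),
    ],
)
# FTS5 ranks matches by BM25; lower `rank` means more relevant.
rows = conn.execute(
    "SELECT title FROM notes WHERE notes MATCH ? ORDER BY rank", ("search",)
).fetchall()
print(rows)  # [('grep',)]
```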
by alexander-opalic
Learn how to build a conversational AI that queries your personal knowledge base using Nuxt, Nuxt Content, and the Anthropic SDK.
by david-fant
You can extract and reconstruct any React component from a production website by leveraging React Fiber's internal tree structure combined with LLMs.
by mattpocockuk, dex-horthy
Live conversation exploring practical approaches to AI-assisted coding, context engineering, and building reliable agents in complex codebases.
by github
Build AI coding assistants by connecting to Copilot CLI through JSON-RPC, letting you embed GitHub's coding agent into any application.
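JSON-RPC itself is a thin envelope. A sketch of the 2.0 request format such integrations exchange, with a made-up method name (not Copilot CLI's actual API surface):

```python
import json

def jsonrpc_request(method, params, req_id):
    # JSON-RPC 2.0 request envelope, the wire format used by
    # editor/agent protocols. Transport (stdio, socket) is separate.
    return json.dumps(
        {"jsonrpc": "2.0", "id": req_id, "method": method, "params": params}
    )

# "session/prompt" is an illustrative method name, not a real one.
msg = jsonrpc_request("session/prompt", {"text": "fix the failing test"}, 1)
print(msg)
```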
by geoffrey-huntley
Coding agents are just 300 lines of code running in a loop—demystifying AI tooling reveals that the model does the heavy lifting, and understanding these primitives transforms you from AI consumer to AI producer.
by simon-willison
Claude Cowork repackages Claude Code's powerful agentic capabilities for general audiences through accessible design rather than technical innovation—a pragmatic approach to unlock untapped value.
by geoffrey-huntley
A hands-on Go workshop that builds a coding agent incrementally through six files, each adding one capability - proving agents need only simple primitives composed in a loop.
by simon-willison
Reasoning models have made LLM-generated code undeniably good, and 2026 will bring both a major security incident from coding agents and the resolution of the sandboxing problem.
by humanlayer-team
Systematic context management—through frequent intentional compaction and a Research-Plan-Implement workflow—enables productive AI-assisted development in complex production codebases.
by langchain
Context engineering—filling the context window with the right information at each step—determines agent performance more than model choice or complex frameworks.
by jediah-katz
Coding agents perform better when they pull context on demand rather than receiving everything upfront—files serve as a simple, future-proof abstraction for this dynamic retrieval.
by thariq-shihipar
Bash is the most powerful agent tool. The Claude Agent SDK packages Claude Code's battle-tested patterns—tools, file system, skills, sandboxing—for building coding and non-coding agents alike.
by philipp-schmid
As AI models converge in benchmark performance, the infrastructure managing them—Agent Harnesses—becomes the competitive differentiator for building reliable, multi-day workflows.
by gergely-orosz, chip-huyen
Chip Huyen explains how AI engineering differs from ML engineering, walking through the practical developmental path from prompts to RAG to fine-tuning.
by chip-huyen
A practitioner's guide to building applications on foundation models, covering prompt engineering, RAG, finetuning, agents, and evaluation.
by trey-grainger, doug-turnbull, max-irwin
The holy grail for AI-powered search lies at the intersection of semantic search, personalized search, and domain-aware recommendations—systems that understand the domain, the user, and can match arbitrary queries to any content.
by gergely-orosz, martin-fowler
Martin Fowler argues AI represents the biggest shift in software engineering since assembly to high-level languages—not because of abstraction level, but because we now work with non-deterministic systems.
by chip-huyen
A comprehensive guide explaining the three-phase process (pretraining, supervised fine-tuning, RLHF) used to train models like ChatGPT.
by stephen-wolfram
Explains how ChatGPT works by breaking down neural networks, embeddings, and training—connecting modern AI to foundational questions about language and thought.
by steven-bartlett
Tristan Harris warns that AI companies are racing to build a 'digital god' that could automate all human cognitive labor, with insiders believing this will happen within 2-10 years while publicly downplaying the risks.
by andrej-karpathy
A comprehensive introduction to how large language models work, covering the entire pipeline from internet data collection through tokenization, neural network training, and inference.
by mo-gawdat, steven-bartlett
Mo Gawdat, former Chief Business Officer at Google X, warns that AI represents humanity's greatest existential challenge—bigger than climate change—and outlines his 'three inevitables' for why we're approaching a point of no return.
by carl-assmann
CLI tool that enables LLM conversations through markdown files in your preferred editor, storing chat history as readable documents.
by dex-horthy
A practical framework for getting AI coding agents to work reliably in brownfield codebases through context engineering, intentional compaction, and the Research-Plan-Implement workflow.
by varin-nair
Frontier models cap out at 1-2 million tokens, yet enterprise codebases span several million. Factory's solution: a five-layer context stack that delivers the right information at the right time.
by antonio-gulli
A practical guide presenting 21 design patterns for building AI agents, covering prompt chaining, tool use, multi-agent collaboration, and self-correction techniques with examples in LangChain, CrewAI, and Google ADK.
Patterns and practices for building autonomous AI agents with prompt chaining, routing, and reflection
by andrej-karpathy, dwarkesh-patel
Karpathy argues we're building 'ghosts' that imitate internet documents rather than evolved animals, and that practical AI agents will take a decade—not a year—to fully mature.
by maham-codes
A practical guide to building AI agents using prompt chaining and basic primitives instead of heavy frameworks.
by mark-anthony-cianfrani
A first-principles guide to building AI agents in TypeScript, covering LLM integration, conversation memory, tool calling, and agentic loops.
by erik-schluntz, barry-zhang
Anthropic's guide to building agentic LLM systems, advocating for simple composable patterns over complex frameworks.
by andrej-karpathy
Andrej Karpathy's practical guide to using LLMs effectively: understanding them as lossy internet zip files, managing token windows, selecting models across providers, and leveraging thinking models for complex problems.
by mark-winteringham
Practical guide to using generative AI for test design, synthetic data generation, and automation without the hype.
by yacine-mahdid
Prompt chaining breaks complex LLM tasks into sequential steps, each with a specific job and structured input/output, trading latency for reliability.
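The pattern in that entry can be sketched with each stage stubbed as a plain function standing in for a model call (the `extract_facts`/`summarize` names are illustrative):

```python
# Prompt chaining: each stage has one job and a structured handoff.
# A real system would replace each function body with an LLM call
# whose prompt asks for exactly this structured output.

def extract_facts(text: str) -> dict:
    # Stage 1: pull structured fields out of free text.
    words = text.split()
    return {"word_count": len(words), "first_word": words[0]}

def summarize(facts: dict) -> str:
    # Stage 2: turn the structured fields into a final answer.
    return f"{facts['word_count']} words, starting with '{facts['first_word']}'"

def chain(text: str) -> str:
    # Sequential composition: each step's output feeds the next.
    return summarize(extract_facts(text))

print(chain("prompt chaining trades latency for reliability"))
# 6 words, starting with 'prompt'
```

Two calls instead of one is the latency cost; the reliability gain is that each prompt validates a narrow, checkable contract.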
by humanlayer-team
A manifesto for building production-grade LLM agents, arguing that effective agents combine mostly deterministic software with strategic LLM decision-making rather than naive 'loop until solved' patterns.
by simon-willison
Simon Willison's comprehensive year-in-review analyzing how reasoning models, autonomous agents, and Chinese AI competition fundamentally reshaped the landscape in 2025.
Techniques for providing AI agents and LLMs with optimized context
by humanlayer-team
Guidelines for crafting an effective CLAUDE.md file, emphasizing brevity, universal applicability, and progressive disclosure to maximize Claude Code's instruction-following capacity.
by andrew-qu
Stripping a text-to-SQL agent down to a single bash tool produced a 3.5x speedup, 100% success rate, and 37% fewer tokens—proving that simpler agent architectures outperform elaborate tooling.
by dennis-stanoev
Understanding AI agents requires building one from scratch: an agent is a wrapper around an LLM that makes decisions and takes actions through a simple tool-use loop.
by xiwei-xu
Context engineering—not model fine-tuning—should be the central challenge for generative AI systems, solved through a Unix-inspired file system abstraction that treats all context components uniformly.
by mario-zechner
Minimal coding agents outperform bloated ones because frontier models already understand agentic coding—so the harness should stay out of the way with fewer than 1,000 tokens of system prompt and just four tools.
by ryan-x-charles
Markdown has become a general-purpose programming language—AI agents like Claude Code compile structured specifications into working applications.
by anthropic
Context is a finite resource in LLM agents; treating tokens as precious budget rather than limitless capacity enables reliable long-horizon task completion.
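The budget framing suggests a simple guard: measure the transcript and compact the oldest turns once a threshold is crossed. A toy sketch, using word count as a stand-in for a real tokenizer and a placeholder line instead of a model-written summary:

```python
BUDGET = 10  # pretend token budget (words, for illustration)

def size(messages):
    return sum(len(m.split()) for m in messages)

def compact(messages):
    # Drop the oldest turns until the transcript fits the budget,
    # leaving a single marker where a real agent would put a
    # model-generated summary of what was removed.
    messages = list(messages)
    dropped = 0
    while size(messages) > BUDGET and len(messages) > 1:
        messages.pop(0)
        dropped += 1
    if dropped:
        messages.insert(0, f"[{dropped} earlier turns compacted]")
    return messages

history = [
    "user: explain agent loops in detail please",
    "assistant: an agent loop calls the model repeatedly",
    "user: and what about tools",
]
print(compact(history))
# ['[2 earlier turns compacted]', 'user: and what about tools']
```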
by geoffrey-huntley
Coding agents require only 300 lines of code in a loop with LLM tokens - understanding these fundamentals transforms you from AI consumer to producer.
by thorsten-ball
Building a functional coding agent requires only an LLM, a loop, and a handful of tool definitions—the complexity lies in refinement, not architecture.