HomeAboutBlogContact
← Back to blog
ollama tutorialvs code ai agentscontinue extension setupfree local coding assistantcustom ai agents for programmingui ux ai agentfull stack ai developerai code reviewerai project memoryollama qwen coder

Build Your Own AI Dev Team for Free: 5 Custom Agents with Ollama & Continue.dev

Stop waiting for Claude or Copilot. Assemble your own free, private AI development team inside VS Code — a UI/UX expert, full-stack developer, debugger, historian, and a wise old advisor. Here's exactly how to build them with Ollama and Continue.

Mahtamun Hoque Fahim·May 6, 2026·10 min read

Build Your Own AI Dev Team for Free: 5 Custom Agents with Ollama & Continue

Imagine having a dedicated team of AI experts sitting inside your VS Code, ready to help you at a moment's notice:

  • A UI/UX designer who delivers beautiful, modern interfaces—not average-looking ones
  • A full-stack developer who writes production-ready code for your preferred stack
  • A debugger/code reviewer who finds bugs and explains why they happened
  • A memorizer who remembers important project decisions across sessions
  • An "old man" advisor who always knows the best free, open-source alternative

And the best part? They're all free, private, and run entirely on your own laptop.

This isn't science fiction. It's the power of Ollama (local LLM runner) + Continue (VS Code AI extension) + custom agent prompts. In this guide, I'll show you exactly how to assemble your AI dream team—no cloud subscriptions, no data leaks, no monthly fees.


Who This Is For

You're a developer who:

  • Wants to use AI for coding but doesn't want to pay $20/month for Claude or Copilot
  • Cares about privacy (your code never leaves your machine)
  • Enjoys tinkering and customizing tools to work your way
  • Wants to learn how to build agents, not just use them

If that sounds like you, let's dive in.


What You'll Need (All Free)

ToolPurpose
OllamaRuns LLMs locally on your laptop
Qwen2.5-Coder 7BOur base model (smart, efficient, great at coding)
VS CodeYour editor
Continue extensionTurns VS Code into an AI agent hub
That's it. No cloud accounts. No credit cards.

Step 1: Install Ollama and Pull a Model

If you haven't already, install Ollama from ollama.com. Then open your terminal and run:

bashollama pull qwen2.5-coder:7b

This downloads about 4.5GB. Wait for it to finish.

Test that it works:

bashollama run qwen2.5-coder:7b

Type: "Explain the difference between var, let, and const in JavaScript"

If you get an answer, you're ready.

Note: On CPU-only laptops, responses take a few seconds. That's normal. You're running a 7-billion parameter brain on your own hardware. Be patient—it's worth it.

Step 2: Install Continue in VS Code

Open VS Code, go to the Extensions view (Ctrl+Shift+X), search for "Continue" , and install it.

Once installed, you'll see a new Continue icon on the left sidebar. Click it.


Step 3: Configure Your Custom Agents

Click the gear icon (⚙️) in the Continue panel. This opens your config.json file. Replace the entire contents with the configuration below.

The Complete Configuration (Copy-Paste Ready)

{
  "models": [
    {
      "title": "Qwen2.5-Coder 7B (The Brain)",
      "provider": "ollama",
      "model": "qwen2.5-coder:7b",
      "apiBase": "http://localhost:11434"
    }
  ],
  "customCommands": [
    {
      "name": "design",
      "description": "/design – UI/UX expert for beautiful, modern interfaces",
      "prompt": "[[[You are an expert, high-end UI/UX Designer. Your philosophy: modern design systems, micro-interactions, accessibility, and visual delight. When given a project description: 1) Suggest a coherent color palette and typography. 2) Propose spacing, hierarchy, and user flow. 3) Aim for 'best looking' over 'average'. Output rich guidance with HTML/CSS code examples.]]]"
    },
    {
      "name": "fullstack",
      "description": "/fullstack [stack] [task] – Principal full-stack engineer",
      "prompt": "[[[You are a Principal Full-Stack Developer. For any task: 1) Clarify architecture (frontend/backend/db). 2) Generate production-ready code with error handling and security. 3) Use best practices: env vars, dependency injection, SQL injection prevention. Output deployable blueprints.]]]"
    },
    {
      "name": "review",
      "description": "/review [code] – Thorough code review and debugging",
      "prompt": "[[[You are a Senior Code Reviewer and Debugger. Review code by: 1) Static security/performance analysis. 2) Style guide compliance. 3) Act as a debugger: ask clarifying questions, propose root causes, suggest fixes step-by-step. Always explain the *why* behind bugs.]]]"
    },
    {
      "name": "memorize",
      "description": "/memorize [fact] – Project historian (cross-session memory)",
      "prompt": "[[[You are the project's Historian. Workflow: 1) When user says '/memorize {info}', store it as a concise fact. 2) At start of each conversation, recall the last 3-5 key facts. 3) When answering, proactively reference this memory for consistent advice. You are the memory.]]]"
    },
    {
      "name": "oldman",
      "description": "/oldman [problem] – Wise advisor for best free resources",
      "prompt": "[[[You are 'Old Man'—a wise, practical developer. When asked for help: 1) Always lead with the best free/open-source alternative. 2) Be honest about trade-offs but champion 'good enough' free solutions. 3) Provide direct links and setup commands. Tone: friendly, gruff, relentlessly practical.]]]"
    }
  ]
}

Save the file. Continue will automatically reload your new team.


Meet Your AI Dream Team

Now you have five custom agents you can invoke from the Continue chat panel.

🎨 Agent 1: UI/UX Designer

Command: /design

What it does: Turns your vague ideas into beautiful, modern interface concepts. It thinks about color palettes, typography, spacing, micro-interactions, and accessibility—not just "put a button here."

Example: /design Create a landing page for a meditation app. Calm, nature-inspired, with a call-to-action for a free trial.

Output: A detailed design rationale + HTML/CSS snippet with glassmorphism, soft shadows, and a serene palette.

💻 Agent 2: Full-Stack Developer

Command: /fullstack

What it does: Writes production-ready, secure, and scalable code for your chosen stack (React+Node, Django, Laravel, etc.). It assumes environment variables, proper error handling, and security best practices.

Example: /fullstack React+Express Build a user registration endpoint with password hashing and JWT tokens.

Output: Complete backend route, frontend form component, and instructions for integration.

🐞 Agent 3: Debugger / Code Reviewer

Command: /review

What it does: Doesn't just spot bugs—it explains why they happened. It performs static analysis, checks style guides, and walks you through a systematic debugging process.

Example: /review [paste your buggy function here]

Output: A list of issues, their root causes, and line-by-line fixes.

🧠 Agent 4: The Memorizer (Historian)

Command: /memorize

What it does: Remembers important project decisions across chat sessions. Tell it something once, and it will recall that context in future conversations.

Example: /memorize The user prefers Tailwind CSS over plain CSS and uses PostgreSQL for all databases.

Then later, when you ask about styling a component, the agent will automatically recommend Tailwind classes.

👴 Agent 5: "Old Man" (Free Resource Advisor)

Command: /oldman

What it does: Suggests the best free, open-source alternatives to paid tools. Need a database? Hosting? Auth service? Old Man knows the free way.

Example: /oldman What's the best free alternative to Firebase for a small project?

Output: "For a free backend, check out Supabase. Self-host it with Docker, or use their generous free tier. For auth, try Lucia. Here's the link and a quick start command..."


Workflow Example: Building a Real Feature

Let's walk through a realistic scenario: building a user profile card for a dashboard.

Step 1 – Design: /design Create a modern user profile card with avatar, name, bio, and edit button. Glassmorphism style, dark theme.

The UI/UX agent responds with color suggestions, spacing, and HTML/CSS.

Step 2 – Implement: /fullstack React+Tailwind Turn that HTML into a reusable ProfileCard component. Make it responsive.

The full-stack agent outputs a complete React component with Tailwind classes, state management, and an edit modal.

Step 3 – Review: /review [paste the component code]

The debugger checks for accessibility, performance, and common React pitfalls.

Step 4 – Memorize: /memorize The user prefers dark theme and glassmorphism for all UI components.

Step 5 – Ask Old Man: /oldman What's a free image hosting service for user avatars?

Old Man suggests Cloudinary's free tier or self-hosted Thumbor.


From Mockups to Code: Visualizing the Workflow

mermaidflowchart LR
    A[User: /design "meditation app"] --> B[UI/UX Agent]
    B --> C[HTML/CSS mockup + palette]
    C --> D[User: /fullstack React+Node]
    D --> E[Full-Stack Agent]
    E --> F[Production-ready components + API]
    F --> G[User: /review]
    G --> H[Debugger Agent]
    H --> I[Fixed, secure code]
    I --> J[User: /memorize preferences]
    J --> K[Historian Agent]
    K --> L[Context persists]
    
    M[User: /oldman "free auth"] --> N[Old Man Agent]
    N --> O[Supabase / Lucia links]

Can These Agents Really Replace Claude or Copilot?

Short answer: For many real-world coding tasks, yes—especially when you combine them.

Long answer: A single agent running a 7B model on your laptop won't match the raw brainpower of Claude 3.7 Sonnet (a cloud giant). But here's the secret: agents multiply intelligence.

A focused UI/UX agent with a strong system prompt will often outperform a generic assistant like Claude at design tasks. A dedicated debugger with a step-by-step methodology is better at finding bugs. And the memorizer solves the biggest weakness of local models: short context windows.

Plus, your agents are:

  • Private – No code leaves your machine
  • Free – Forever
  • Offline-capable – Work anywhere
  • Customizable – Tweak their prompts for your exact needs

For the vast majority of indie developers, freelancers, and hobbyists, this setup is not just "good enough"—it's genuinely powerful.


Advanced Tips (After You Get Comfortable)

Tweak the Prompts

Each agent's prompt is plain English. You can:

  • Add your preferred tech stack to the full-stack agent
  • Change the UI designer's style (minimalist, brutalist, neumorphism)
  • Make Old Man more sarcastic or more detailed

Add More Agents

The customCommands array can hold as many as you want. Ideas:

  • /docs – Generate technical documentation
  • /test – Write unit tests
  • /security – Audit for vulnerabilities
  • /sql – Optimize database queries

Use Different Models Per Agent

You can specify different models for different commands. For example, use a smaller 1.5B model for fast /review commands and the 7B for complex /fullstack tasks.

json{
  "name": "review",
  "model": "qwen2.5-coder:1.5b",  // faster
  "prompt": "..."
}

Persist Memory Beyond the Chat

The memorizer prompt works within a single Continue session. To make memory permanent across VS Code restarts, consider:

  • A simple JSON file that agents can read/write
  • The continue-memory MCP server (experimental)

But start with the prompt-based memory above. It's surprisingly effective.


Troubleshooting

Q: The agents respond very slowly. A: You're running on CPU. Use smaller models (1.5B or 3B) for faster responses. Or accept the delay—it's the price of privacy and zero cost.

Q: The UI/UX agent's HTML looks basic. A: Improve the prompt! Add "Use Tailwind CSS, include hover states, add smooth transitions." The more specific you are, the better the output.

Q: The memorizer forgets after I close VS Code. A: Yes—the prompt-based memory is session-only. For permanent memory, you'll need to implement file-based storage. But for most workflows, session memory is enough.

Q: Can I use these agents with other editors? A: Continue works with JetBrains IDEs too. For other editors, you can use Ollama's API directly.


Final Verdict: Is This Worth the Setup Effort?

Absolutely. You're not just getting an AI assistant—you're learning how to orchestrate AI. You're building a team that works exactly the way you want, respects your privacy, and costs nothing.

The 30 minutes you spend setting this up will pay back hundreds of hours of saved coding time.

And when your friends ask, "How do you code so fast?" you can smile and say, "I've got a team."


What's Next?

  1. Install Ollama and pull qwen2.5-coder:7b
  2. Set up Continue in VS Code
  3. Copy the config above
  4. Try /design with your own project idea
  5. Customize the prompts to match your style

Then come back and tell me which agent surprised you the most. My bet is on Old Man.


Enjoyed this guide? Share it with a fellow developer who's tired of paying for AI. Or tweet it with the hashtag #LocalAIDevTeam.

Found a bug or have a suggestion? The Continue and Ollama communities are incredibly welcoming. Jump in and contribute.

Happy building — and may your agents serve you well. 🚀

← All posts