Before you start: pick the right first agent
The biggest mistake first-time agent builders make is picking a task that's too ambitious for a first pass. A good first agent has three properties:
- It runs on data you have lots of. Email, calendar, Slack messages, documents. Not specialized data you'd need to scrape or buy.
- Its job is repetitive. You've done this task a hundred times. You know the right answer 80% of the time. That last 20% is where the agent will need help, and you can spot its mistakes.
- Mistakes are recoverable. If the agent does something wrong, you notice quickly and can undo. Email drafts, not sent emails. Suggestions, not irreversible actions.
Inbox triage, calendar prep briefs, research summaries, meeting note restructuring — all great first agents. Anything that involves publishing publicly, sending money, or making decisions for other people — not great first agents.
The 6 steps
Pick the right platform
We use Lindy for this tutorial. Alternatives that would work just as well for first-timers: Cowork (inside Claude Desktop, if you already have Claude Pro) and the ChatGPT Custom GPT builder. All three are genuinely no-code. We picked Lindy because it has the clearest templates for inbox work.
Define the agent's job in one sentence
Write one sentence that covers: what triggers it, what it does, what it outputs. For our inbox-triage example:
This sentence will become the agent's system prompt. The clearer it is now, the less tuning you'll do later.
Connect the data sources
In Lindy, click "New Agent" → "From Template" → "Inbox Assistant". Lindy walks you through:
- Google or Outlook OAuth — authorize read + draft access (not send)
- Label scope — pick which Gmail labels or Outlook categories the agent uses
- Trigger — "when a new email arrives" (other triggers: "at 8am daily", "when I flag")
Total time: 3-5 minutes.
Write the agent's instructions
Paste your one-sentence definition into the system prompt field. Then add explicit rules for edge cases:
The more specific you are about what NOT to do, the better the agent behaves.
Test in dry-run mode
Lindy has a "Test Mode" that replays the last 20 emails through the agent and shows you what it would have done — labels, drafts, skips. Review every single action.
- Did it correctly label? If 18/20+ are right, you're good.
- Are the drafts in your voice? If not, feed it 3-5 examples of real replies you've sent.
- Did it correctly skip sensitive/urgent emails? This is the most important check.
If accuracy is under 80% in test mode, tune the instructions and re-test. Expect 1-3 rounds of tuning.
Deploy and set a review cadence
Once test mode is solid, deploy. Set a daily review reminder for week 1 — spend 5 minutes each morning reviewing what the agent did yesterday. By week 2, you'll know which behaviors are trustworthy and which need supervision.
After 30 days of stable behavior, you can expand its scope (add more labels, enable auto-send on clearly routine replies, chain it to other agents).
What the final setup looks like
After following the steps above, you'll have:
- A Lindy agent running against your inbox in the background
- Automated labeling of incoming email (~30-60 seconds after arrival)
- Draft replies waiting in your drafts folder for ~30-50% of emails, ready for you to review and send
- A review dashboard showing agent actions over the past 7 days
Typical time savings: 45-90 minutes per week on inbox work for someone with 50+ daily emails.
Common failure modes (and how to fix them)
"Drafts don't sound like me"
Feed the agent 5-10 real examples of emails you've sent in different categories. Most platforms let you attach a "voice sample" file. If yours doesn't, paste the examples directly into the system prompt under a "Tone examples" section.
"It labels too aggressively/conservatively"
Be more explicit in instructions. Add borderline examples: "Email from newsletter X = noise, even though it has a client name in it." The agent will follow specific examples over general rules.
"It's missing obvious urgent emails"
Explicitly whitelist escalation triggers: specific words, specific sender domains, specific time windows. Don't assume the agent shares your urgency intuition.
"It's taking actions I didn't authorize"
This shouldn't happen with read + draft permissions. If it does, immediately revoke access and contact platform support — it's a real bug. In practice: don't grant send permissions for at least 30 days of watching drafts.
Stepping up from here
Once your first agent is running smoothly, the natural progressions are:
- More agents. Calendar prep, meeting summary, weekly client report. Lindy makes each one a copy of your first agent with different instructions.
- Chaining. Output of agent A triggers agent B. Example: agent A extracts meeting notes, agent B drafts follow-up emails.
- Platform upgrade. If you hit Lindy's free tier ceiling, move to $49/month Pro or switch to self-hosted n8n for unlimited runs at $5-20/month VPS.
Is no-code the right path for you?
No-code agent builders are the fastest path from idea to working agent. Their limits: complex multi-step logic, tight integration with systems that don't have pre-built connectors, cost at high volume.
If you're a solo operator with 5-50 agent tasks per week and don't write code — absolutely start no-code. You'll be productive within an hour.
If you write code and expect to run serious volume (5,000+ operations/month) — skip no-code and go straight to n8n self-hosted + Claude API, or custom builds. The learning curve is longer but the long-term cost and flexibility are better.