Research you can trust: Deep Research, citations, verification
Yesterday you picked a daily driver. Today you turn your assistants into a research department — quick sourced answers for small questions, full investigations for big ones — and you learn the 60-second habit that keeps a confident mistake from ever reaching your real life.
AI research is a two-mode skill — quick web-grounded answers for small questions, Deep Research for big ones — and it's only as good as your verification habit: click the sources.
1Why a brilliant assistant gets facts wrong
Back on Day 1 we met the Confident Intern: brilliant, tireless, well-read — and sometimes confidently wrong. Today is the day we deal with the "confidently wrong" part properly, because today is the day you start asking AI for facts — facts you might act on, spend money on, or repeat to someone you respect.
Here's the first-principles reason an assistant can flub a fact while sounding completely certain. When you ask a question and the assistant does not search the web, it answers from training memory — everything it absorbed when it was built. That memory is enormous, but it has two built-in weaknesses:
- It's compressed. The assistant didn't store the world's text like a library stores books; it absorbed patterns from it, the way you remember the gist of a novel rather than every sentence. Gist is wonderful for explaining, drafting and brainstorming. It is shaky for exact prices, dates, names and numbers — and the assistant will state the shaky version with the same calm fluency as the solid one. That's the confident mistake we named on Day 1.
- It has a cutoff. Every assistant has a knowledge cutoff — the date its training memory ends, after which an un-searched assistant knows nothing. Ask it about anything that changed recently — a price, a law, a product, an opening time — and an answer from memory can be perfectly phrased and quietly out of date.
Neither weakness is a malfunction. It's what answering-from-memory is. The fix isn't a smarter assistant — it's changing where the answer comes from: have the assistant search the live web first, and then have you check the parts that matter. Those are the two skills of today, and together they're the difference between people AI helps and people AI embarrasses.
2Mode 1 — the quick grounded answer: search first, cite sources
The first research mode is the everyday one. A web search (grounded answer) means the assistant searches the live web before answering, reads what it finds, writes you a short answer — and attaches citations: the clickable references behind each claim. Depending on the assistant, citations show up as small numbered chips after a sentence, as link icons, or as a list of sources at the end. They all mean the same thing: "here's where I got this — go look."
All three of your assistants from Day 3 can do this on their free plans — for Claude, web search is its free research mode. Two things to know about how to read what comes back:
- Citations are for clicking, not decorating. A wall of source chips looks trustworthy, and that look is exactly the trap. A citation only means "I read something there" — it does not guarantee the sentence in front of it faithfully reports what the source says. The only way to know is to open it.
- A claim and its source can drift apart. The most common small failure isn't a fabricated source — it's a real source that says something slightly different: "up to 120 km of range" becomes "120 km of range"; "free for the first year" becomes "free". The direction of the drift is usually toward the cleaner, more confident sentence. Your click catches it in seconds.
One more practical note: if you want a grounded answer, say so. The words "search the web" in your prompt are the reliable trigger. If an answer comes back with no sources at all, that's your signal it came from training memory — tell the assistant to search and cite, and ask again in the same thread (iteration, the Day 1 superpower).
3Mode 2 — Deep Research: a researcher, not a search box
The second mode is the one most beginners have never tried, and it's the closest thing in this course to hiring staff. Deep Research is a different way of working: instead of one search and one answer, the assistant runs a multi-step investigation. It drafts a research plan and shows it to you for approval. (In Gemini you approve a written plan; in ChatGPT the same steering moment looks like a few clarifying questions before it starts — answer them, then let it run.) Then it searches, reads dozens of sources, follows leads, cross-checks — and 5 to 15 minutes later hands you a structured, multi-page report with a citation behind every claim. Minutes, not seconds. A report, not a reply.
Where you can use it free (the workflow is what matters; the exact allowances shift): Gemini gives you a small free allowance of Deep Research runs per month — that's this course's default home for it. ChatGPT's free plan includes a similar small allowance of lightweight runs a month (a lighter version of the same idea). Claude's equivalent (called Research) is paid-only — on the free plan, Claude does web search, not Deep Research. Free allowances change often — check the current limits inside the app, and either way the workflow works the same.
So which mode when? Use the simplest sorting rule there is: minutes-questions vs. money-questions. A minutes-question ("is this museum open on Mondays?", "what does this ingredient do?") deserves a quick grounded answer. A money-question — anything where the answer steers a purchase, a plan, a health choice, or your reputation — deserves Deep Research and your full attention. A small monthly allowance is not a limitation; it's a budget that forces good taste. Spend it on questions that matter.
4The Three-Click Check — the habit that makes both modes safe
Now the verification habit itself. Whatever mode produced the answer, before any AI-sourced fact leaves your screen and enters your real life — an email, a purchase, a conversation, a post — it earns the Three-Click Check. This is the course's canonical verification routine, and it takes about a minute:
Notice what the rule of thumb does: it scales the habit instead of making everything paranoid. Dinner-table trivia? Zero clicks, enjoy. A fact going into an email to your boss? The full three. A claim steering real money, your health, or a legal question? Three clicks, a second opinion, and — for the truly big ones — a human professional. Verification isn't distrust of the tool; it's the same thing you'd do with any talented intern's memo before it goes out under your name. The work stays theirs. The judgment stays yours.
Imagine a sharp junior researcher hands you a memo an hour before your meeting. A bad boss does one of two things: re-does the whole research personally (and wastes the researcher), or forwards the memo unread (and gambles their reputation on it). A good boss does neither. They skim the memo, find the two or three load-bearing claims — the numbers and facts the conclusion stands on — and check those footnotes personally before repeating anything in the meeting. Five minutes, not five hours. Same memo in all three cases; completely different risk. The Three-Click Check is you being the good boss.
Start with the everyday mode in your daily driver from Day 3 — this works in ChatGPT, Claude and Gemini alike. One convention before your first prompt: square brackets like [this] always mean "replace with your own words before sending" — that holds for every prompt in this course.
- Open your daily driver and start a new chat (the new-chat button, usually near the sidebar with your chat history). an empty message box, ready for input.
- Ask a real question you actually need answered this week — and ask for sources explicitly:
Copy-paste promptSearch the web for current information: [something real you need to know — e.g. "what documents do I need to travel to Spain with a child" or "current prices for a yearly gym membership in my city"]. Give me a short answer, and list your sources as links.a short answer with source links — small numbered chips after sentences, or a list of links at the end.
- Click one of the sources. Read enough of the page to confirm it really says what the assistant said it says. (Tip: press Ctrl+F — Cmd+F on a Mac — and search the page for a keyword from the claim.) the claim, or something close to it, on the source page. That click was your first verification click — the habit starts here.
- If the answer came back with no sources, it came from training memory. Reply in the same thread: "Please search the web for this and cite your sources as links." a re-grounded answer, now with links. Asking is allowed; iteration is the superpower.
Now the big mode. You get a small free allowance of these per month (check the current limit inside the app), so point it at a question that deserves it: a real decision you face this month.
- Open gemini.google.com and sign in (you made this account on Day 1). the Gemini chat window with an empty message box.
- Find Deep Research — it lives in the tools menu in the message box or in the model picker, usually at the top of the chat (interfaces move; the name is what to look for). a Deep Research label or chip confirming the mode is active.
- Paste your research brief — note it follows the Day 2 Briefing Formula: a clear goal, real context, an explicit format:
Copy-paste prompt · Gemini · Deep ResearchResearch this properly: [a real decision you face this month — e.g. "the best e-bike under €2,000 for a daily 12 km commute" or "after-school sports options for a 7-year-old in (your city)"]. Compare the main options on price, quality and practicality. Cite a source for every claim. End with a shortlist of 3 and your reasoning.not an answer — a research PLAN: a short list of steps it intends to take.
- Read the plan. This is your steering moment: if it's missing an angle you care about, say so before approving — e.g. "Also compare warranty and service options." Then approve / start the research. a progress view as it searches and reads — source counts ticking up.
- Wait 5–15 minutes. You don't have to watch it work — open another tab, make tea. This is the first AI in this course that works while you don't. a structured, multi-page report: headings, comparisons, a shortlist — with citations attached to its claims.
- Skim the report's shape first — headings, then the shortlist, then the reasoning. Don't read it like a novel; read it like a memo from your researcher. The checking comes next. a clear recommendation you could act on — which is exactly why you won't act on it unchecked.
Out of free runs this month? A quick grounded answer plus a careful Three-Click Check covers most decisions until the counter resets.
The report looks finished. It is not finished until you've checked it — so run the full routine on it now, while it's open.
- Find the 2–3 load-bearing claims — the facts the recommendation stands on: the price, the key spec, the deadline, the rule. Ignore the filler; check what you'd act on. two or three short, concrete claims you can point at.
- Click one — the minimal version; on bigger decisions open a source for each load-bearing claim: open the source behind claim #1 and find the fact on the page (Ctrl+F / Cmd+F a keyword). Does the page say what the report says — exactly? either a clean match, or a small drift ("up to 120 km" vs "120 km"). Both outcomes are wins — now you know.
- Click two: for your most time-sensitive claim, check the source's date. Is it recent enough for this question? a publication date — and a decision: fresh enough, or find a newer source.
- Click three (stakes are real here, so yes): get a second opinion. Take the core claims to your daily driver:
Copy-paste prompt · second opinionI'm deciding [your decision]. A research report tells me: [paste the 2–3 load-bearing claims]. Search the web and check each claim. Tell me plainly which hold up, which look outdated or wrong, and what I should double-check before acting.agreement on most claims — and friction on one or two. The friction points are your homework, and there are usually very few.
- Expect to find one claim the sources don't quite support. There is almost always one — not because the tool is bad, but because that's what compressed, confident summarizing does. Finding it is not the system failing. Finding it is the system working, with you as its final step. your own verdict, written in one line: what's confirmed, what's shaky.
Mistake 1 — shipping unchecked: copy-pasting AI claims straight into real life — into the email to your boss, the purchase, the public post — without clicking a single source. Today is the day the Confident Intern burns people who skip the check: the answer reads finished, so it gets treated as finished, and the one drifted fact travels under your name.
Mistake 2 — the backwards budget: spending your research effort exactly the wrong way round — burning Deep Research runs on trivia ("history of the croissant"), then making a €2,000 decision on a ten-second chat answer. Match the mode to the stakes: quick answers for minutes-questions, Deep Research for money-questions, and the Three-Click Check on anything that leaves your screen.
That one-minute habit is your edge — most people will never build it, and you'll be able to tell.
If you ran the Deep Research walkthrough above on a real decision, that run is your mission — go straight to the two-line note below. If you practiced on a throwaway question, run one real Deep Research now: in Gemini, on a decision you genuinely face this month — a purchase, a plan, a choice between options. Steer the plan before approving it. When the report lands, run the full Three-Click Check: verify the load-bearing claims, check the dates, get a second opinion from your daily driver on the claims that matter.
Then write two lines in the note you started on Day 2 (tomorrow it officially becomes your Prompt Notebook): one claim you confirmed, and one claim you couldn't. That pair — confirmed / couldn't — is the habit in its smallest form. Keep the report; you may genuinely use this decision.
Today's recap — 30 seconds
- AI research has two modes: the quick web-grounded answer (seconds, a few sources) and Deep Research (minutes, a plan you approve, dozens of cited sources, a structured report).
- Free homes: Gemini gives a small free Deep Research allowance (the course default), ChatGPT a similar small lightweight allowance, Claude = web search only on free — free limits change often, so check them inside the app.
- Un-searched answers come from training memory — compressed and ending at a knowledge cutoff — which is why they can be confidently stale or confidently wrong.
- The Three-Click Check before any AI fact enters real life: click the sources behind the 2–3 load-bearing claims, check the dates, second opinion when stakes are high.
- Stakes pick the mode, and stakes price the clicks: minutes-questions get quick answers; money-questions get Deep Research; the more a wrong answer would cost you, the more clicks it earns.