Version: TECHNEST 2026

FUNCOOL

Add an AI Clone of Yourself

PDFWeek 3 Handout — Printable PDFThe full Week 3 lecture handout: streaming Gemini 2.5 Flash into a floating chat widget, persona system prompt design.Download PDF· 647 KB

Learning Objectives

By the end of this session, students should be able to:

Obtain an API key from Google AI Studio and wire it into both local .env.local and Vercel's production env — with the AI doing the plumbing and the student doing only the human-only OAuth moment.
Write a persona system prompt that makes Gemini 2.5 Flash answer as you, in your voice, with your background — the student's first serious piece of prompt engineering.
Describe a streaming chat UI in English and have Cursor implement it using the Vercel AI SDK, with no hand-written React.

Core Topics

System prompts vs user prompts: why the invisible instructions matter more than the visible ones.
Streaming responses — why "tokens arrive one by one" is the default mode for modern chat UIs.
Secret management: where API keys go, where they don't go, and how the AI helps you not commit them.
The "persona file" pattern — keeping your AI-self definition in one editable Markdown file.

Tools / Stack

Tool	Role this week
Cursor	Where you talk; AI writes the chat component.
Gemini 2.5 Flash	The model answering as your AI-clone. Fast, cheap, good enough.
Google AI Studio	Where you fetch the free API key. One manual moment.
Vercel AI SDK	Abstracts streaming, state, and provider differences.
`.env.local` + Vercel env	Two places your API key lives — both set by AI via CLI.

Session Plan

Time	Activity
0 – 15 min	Recap & Check-in. Quick round: everyone reads their Week 2 site URL out loud and opens the lite AI-clone they wired up at the end of Week 2. Note one off-brand reply from your clone — we're going to fix that feeling today.
15 – 40 min	Concept Teaching. What a system prompt is and why it matters. Why streaming feels alive and batch replies feel dead. How env vars flow through Next.js server actions.
40 – 75 min	Live Demo. Instructor adds the chat widget to her own site in one sitting — from "no chat" to "chat with my AI-clone streaming live" in twelve minutes.
75 – 105 min	Hands-On Lab. Students add the widget to their Week 2 site. The fun part is writing their persona prompt — we'll share favourites at the end.
105 – 120 min	Q&A + Wrap. Each student demos their AI-clone answering one question from a classmate.

Hands-On Lab

Task. By the end of class your portfolio site will have a floating chat widget in the bottom-right corner. Anyone can open it and ask "tell me about yourself" — the answer arrives streaming, in your voice, grounded in your real background.

Phase 1 — Get the API key

Already have a key from Week 2?

If you created a Gemini API key at the end of Week 2, reuse it — skip to Phase 3 (persona design). Phases 2's .env.local and Vercel-env wiring are also already done if your Week 2 lite-clone runs locally.

MANUALStep 1 · Manual (human only):

Open aistudio.google.com/apikey, sign in with your Google account, click Create API key, and copy it. Keep the tab open — you'll paste the key into Cursor in the next step.

Why manual: Google's API-key creation requires a human to accept their usage terms. No CLI or MCP can press 'Agree' for you.

Phase 2 — Give the key to AI

PROMPTStep 2 · Say to Cursor:

Here is my Gemini API key: [paste key here]. Please:

Add it as GOOGLE_GENERATIVE_AI_API_KEY to my .env.local (create the file if it doesn't exist). Make sure .env.local is in .gitignore — I never want this key on GitHub.

Add the same key to my Vercel production environment using the Vercel CLI.

Install the Vercel AI SDK and the Google provider (ai, @ai-sdk/google).

Confirm both environments have the key set before continuing.

VERIFYStep 3 · Verify:

.env.local contains the key; .gitignore lists .env.local.
vercel env ls (Cursor will run this for you) shows GOOGLE_GENERATIVE_AI_API_KEY in production.
package.json has ai and @ai-sdk/google under dependencies.

Phase 3 — Design the persona

PROMPTStep 4 · Say to Cursor:

Create a new file content/persona.md that will define my AI-clone's personality. Write it as a system prompt — instructions to the model about who I am, how I speak, and what I know. Use this information about me:

Who I am: [name, year, degree, school]

What I'm into: [3–5 topics you actually care about — not what sounds cool]

What I'm building: [your portfolio + your Week 1 "hello TECHNEST" + whatever side projects are real for you]

Voice: [pick two adjectives — e.g., "warm but precise," "playful and nerdy," "direct and low-hype"]

What I won't answer as: ["don't pretend to know things I don't," "decline to roleplay as someone else"]

Write the file as one clear, long-ish paragraph in second-person instructions ("You are Chan Meng…"), not first-person bio. When you're done, paste the file contents back into chat so I can read it.

VERIFYStep 5 · Verify:

content/persona.md exists and reads like instructions to a model, not a bio.
The adjectives you chose are reflected in the phrasing.
It includes explicit "don't do this" clauses.

PROMPTStep 6 · Say to Cursor:

Now add a floating chat widget to the site:

A round launcher button in the bottom-right corner with my accent colour.

Click opens a panel ~360 px wide × 480 px tall with a clean chat UI: messages bubble list, input at the bottom, close button at top.

Wire it to a Next.js route handler at /api/chat that uses the Vercel AI SDK to stream responses from Gemini 2.5 Flash.

Load the system prompt from content/persona.md on each request.

When the model is generating, show a small animated dots indicator.

Persist the current session to localStorage so a refresh doesn't wipe it.

Make sure it works on mobile (the panel becomes full-screen on narrow viewports). Commit in small logical commits — one for the API route, one for the UI, one for the localStorage wiring.

VERIFYStep 7 · Verify:

Open your site at http://localhost:3000. A launcher button appears bottom-right.
Click it. Type "What are you building this term?". The reply streams in token-by-token in your voice.
Refresh. The conversation persists.
Open the same page on your phone — the panel becomes full-screen; input is tappable.

Phase 5 — Deploy & demo

PROMPTStep 8 · Say to Cursor:

Ship this to production. Commit all remaining changes, push to GitHub, and confirm the Vercel auto-deploy succeeds. When the production URL is live, open it and test the chat from the live site — same question as before to make sure the env vars carried over.

VERIFYStep 9 · Verify:

Your production site now has the chat widget.
The live chat streams responses correctly from the production URL.
No GOOGLE_GENERATIVE_AI_API_KEY references in any file that got committed.

RECOVERStep 10 · If stuck, say to AI:

The chat says "500 internal server error" on the live site but works locally. Please read the Vercel function logs, find the root cause (most likely a missing env var in production), fix it through the Vercel CLI, trigger a redeploy, and confirm the chat works.

Iterate the persona, not the code

When your AI-clone says something off-brand, don't edit the component. Open content/persona.md and describe the new constraint in plain English: "Don't use the word 'passionate'" or "Always answer in under 80 words unless I ask for detail." Save and reload. The persona file is where your prompt engineering lives.

Career · Screen-record your AI-clone

Record a 20-second clip of you opening the chat and asking "What's something you're working on right now?" This clip is gold for LinkedIn posts and hiring conversations — it's concrete proof you can build a streaming LLM feature end to end.

Weekly Assignment

Build / Implement.

Floating chat widget on your live site powered by Gemini 2.5 Flash.
A content/persona.md file committed to your repo (yes, your persona lives in source control — future-you will thank present-you).

Requirements.

The widget streams responses (not a single dump at the end).
Session persists across page refresh.
API key is in Vercel production env, not in any committed file.
20-second screen recording of the live chat answering one question.

Submission. Live URL + short recording in the course Slack channel before Week 4.

Resources

Docs	Videos	Repos
Vercel AI SDK — streaming with Google provider	Instructor demo: "Wiring persona + streaming in 12 minutes"	`vercel/ai` — SDK source
Google AI Studio — API keys + rate limits		`her-waka/tutorial/vibe-coding/build-with-claude.mdx` — deeper dive
Next.js route handlers — streaming responses

Real-World Application

Every AI product you'll build in your career has a system prompt somewhere. The skill of writing clear instructions to a model — constraining voice, scope, and failure modes — is the single most durable AI-engineering skill. The "AI-clone" framing makes it concrete: you learn prompt engineering by tuning a thing that sounds like you, not by reading a textbook.

Career

Recruiters who visit your portfolio will now have a chat widget that answers "Why should I hire this person?" in your voice, at 2 AM their time. That's not a gimmick — that's an asymmetric advantage.

Looking ahead to Week 4 — RAG

Today's clone is purely prompt-based — your content/persona.md is the entire knowledge base. In Week 4 we wire up Neon Postgres + pgvector and start storing real source material the AI can retrieve from. That's the RAG (Retrieval-Augmented Generation) step the instructor flagged at the end of Week 2 with "in the enterprise world we use vector databases — not today". Today is the bridge: a persona that fits in a single prompt. Next week: a persona that scales to thousands of documents.

Challenges & Tips

"The AI says things I didn't tell it to." Tighten the persona file. Add a line: "If you don't know the answer, say 'I haven't talked about that yet — ask me next week.'" Models respect explicit fallbacks.
"Streaming shows nothing, then the whole reply drops at once." You're probably returning the response instead of piping the stream. Say to Cursor: "The response isn't streaming — it arrives as a single chunk. Use toDataStreamResponse from the Vercel AI SDK."
"I get API_KEY_INVALID on production." The key wasn't added to Vercel env. Say: "Show me which env vars are set on my Vercel project, and make sure GOOGLE_GENERATIVE_AI_API_KEY is set for the Production environment specifically."
"The widget looks ugly on my phone." Describe the broken layout in specific terms: "On my iPhone the input gets hidden behind the keyboard." Cursor knows the viewport-fit pattern to fix it.
"The model's answers are too long / too formal." Add constraints to the persona: "Default reply length: 2 short paragraphs max. Avoid corporate phrasing."

If chat works locally but breaks in production after deploy

Say to Cursor: "Compare my .env.local with my Vercel production env vars and list every difference." This catches 90% of deploy-time AI bugs.

Learning Objectives​

Core Topics​

Tools / Stack​

Session Plan​

Hands-On Lab​

Phase 1 — Get the API key​

Phase 2 — Give the key to AI​

Phase 3 — Design the persona​

Phase 4 — Build the chat widget​

Phase 5 — Deploy & demo​

Weekly Assignment​

Resources​

Real-World Application​

Challenges & Tips​