| This put up first appeared on Aman Khan’s AI Product Playbook publication and is being republished right here with the writer’s permission. |
Let me begin with some honesty. When folks ask me “Ought to I change into an AI PM?” I inform them they’re asking the flawed query.
Right here’s what I’ve discovered: Changing into an AI PM isn’t about chasing a classy job title. It’s about creating concrete abilities that make you simpler at constructing merchandise in a world the place AI touches every part.
Each PM is changing into an AI PM, whether or not they understand it or not. Your cost move may have fraud detection. Your search bar may have semantic understanding. Your buyer help may have chatbots.
Consider AI product administration as much less of an OR and as an alternative extra of an AND. For instance: AI x well being tech PM or AI x fintech PM.
The 5 Abilities I Truly Use Each Day
| This put up was tailored from a dialog with Aakash Gupta on The Progress Podcast. You will discover the episode right here. |
After ~9 years of constructing AI merchandise (the final three of which have been an entire ramp-up utilizing LLMs and brokers), listed here are the abilities I exploit continuously—not those that sound good in a weblog put up however the ones I actually used yesterday.
- AI prototyping
- Observability, akin to telemetry
- AI evals: The brand new PRD for AI PMs
- RAG versus fine-tuning versus immediate engineering
- Working with AI engineers
1. Prototyping: Why I code each week
Final month, our design staff spent two weeks creating lovely mocks for an AI agent interface. It appeared good. Then I spent half-hour in Cursor constructing a practical prototype, and we instantly found three elementary UX issues the mocks hadn’t revealed.
The talent: Utilizing AI-powered coding instruments to construct tough prototypes.
The software: Cursor. (It’s VS Code however you possibly can describe what you need in plain English.)
Why it issues: AI habits is not possible to know from static mocks.
Learn how to begin this week:
- Obtain Cursor.
- Construct one thing stupidly easy. (I began with a private web site touchdown web page.)
- Present it to an engineer and ask what you probably did flawed.
- Repeat.
You’re not making an attempt to change into an engineer. You’re making an attempt to know constraints and potentialities.
2. Observability: Debugging the black field
Observability is the way you truly peek beneath the hood and see how your agent is working.
The talent: Utilizing traces to know what your AI truly did.
The software: Any APM that helps LLM tracing. (We use our personal at Arize, however there are a lot of.)
Why it issues: “The AI is damaged” isn’t actionable. “The context retrieval returned the flawed doc” is.
Your first observability train:
- Choose any AI product you utilize each day.
- Attempt to set off an edge case or error.
- Write down what you assume went flawed internally.
- This psychological mannequin constructing is 80% of the talent.
3. Evaluations: Your new definition of “accomplished”
Vibe coding works if you happen to’re delivery prototypes. It doesn’t actually work if you happen to’re delivery manufacturing code.
The talent: Turning subjective high quality into measurable metrics.
The software: Begin with spreadsheets, graduate to correct eval frameworks.
Why it issues: You may’t enhance what you possibly can’t measure.
Construct your first eval:
- Choose one high quality dimension (conciseness, friendliness, accuracy).
- Create 20 examples of excellent and dangerous. Label them “verbose” or “concise.”
- Rating your present system. Set a goal: 85% of responses needs to be “excellent.”
- That quantity is now your new North Star. Iterate till you hit it.
4. Technical instinct: Figuring out your choices

Immediate engineering (1 day): Add model voice tips to the system immediate.
Few-shot examples (3 days): Embrace examples of on-brand responses.
RAG with type information (1 week): Pull from our precise model documentation.
Tremendous-tuning (1 month): Practice a mannequin on our help transcripts.
Every has totally different prices, timelines, and trade-offs. My job is understanding which to suggest.
Constructing instinct with out constructing fashions:
- Once you see an AI function you want, write down 3 ways they may have constructed it.
- Ask an AI engineer if you happen to’re proper.
- Improper guesses train you greater than proper ones.
5. The brand new PM-engineer partnership
The most important shift? How I work with engineers.
Previous means: I write necessities. They construct it. We take a look at it. Ship.
New means: We label coaching knowledge collectively. We outline success metrics collectively. We debug failures collectively. We personal outcomes collectively.
Final month, I spent two hours with an engineer labeling whether or not responses had been “useful” or not. We disagreed on quite a lot of them. This taught me that I want to begin collaborating on evals with my AI engineers.
Begin collaborating otherwise:
- Subsequent function: Ask to hitch a mannequin analysis session.
- Supply to assist label take a look at knowledge.
- Share buyer suggestions when it comes to eval metrics.
- Have a good time eval enhancements such as you used to rejoice function launches.
Your 4-Week Transition Plan
Week 1: Software setup
- Set up Cursor.
- Get entry to your organization’s LLM playground.
- Discover the place your AI logs/traces reside.
- Construct one tiny prototype (took me three hours to construct my first).
Week 2: Statement
- Hint 5 AI interactions in merchandise you utilize.
- Doc what you assume occurred versus what truly occurred.
- Share findings with an AI engineer for suggestions.
Week 3: Measurement
- Create your first 20-example eval set.
- Rating an current function.
- Suggest one enchancment based mostly on the scores.
Week 4: Collaboration
- Be a part of an engineering mannequin evaluate.
- Volunteer to label 50 examples.
- Body your subsequent function request as eval standards.
Week 5: Iteration
- Take your learnings from prototyping and construct them right into a manufacturing proposal.
- Set the bar with evals.
- Use your AI Instinct for iteration—Which knobs do you have to flip?
The Uncomfortable Fact
Right here’s what I want somebody had instructed me three years in the past: You’ll really feel like a newbie once more. After years of being the skilled within the room, you’ll be the particular person asking primary questions. That’s precisely the place you have to be.
The PMs who reach AI are those who’re comfy being uncomfortable. They’re those who construct dangerous prototypes, ask “dumb” questions, and deal with each complicated mannequin output as a studying alternative.
Begin this week
Don’t look ahead to the right course, the perfect position, or for AI to “stabilize.” The abilities you want are sensible, learnable, and instantly relevant.
Choose one factor from this put up, decide to doing it this week, after which inform somebody what you discovered. That is the way you’ll start to speed up your individual suggestions loop for AI product administration.
The hole between PMs who discuss AI and PMs who construct with AI is smaller than you assume. It’s measured in hours of hands-on follow, not years of examine.
See you on the opposite facet.
