A Voice-to-Contact App That Actually Understands What You Said

Last updated May 2026 · by Emily Hyeamang, founder of YourPond

Most voice-to-text apps give you a transcript. YourPond gives you a contact. On iPhone, just tap the microphone and describe someone in your life — “I met Maya Chen at Aisha’s wedding, she’s a designer at Figma in Brooklyn, has two kids and a corgi named Bear” — and YourPond extracts the name, job, location, relationships, kids, and pet automatically. You review what it captured, fix anything wrong, and save. YourPond is a personal CRM and relationship mapping app that helps you organize the people in your life, track how they're all connected, and remember the details that matter.

The “I just met someone, hands are full” problem

People don’t enter your life at a desk. They show up at weddings and dinners, at a kid’s birthday party, on the sideline of a soccer game, on a walk with the dog. You shake a hand, you learn three interesting things about someone, and then the moment passes — because your hands are full, you’re holding a drink, you’re pushing a stroller, or it would just be rude to start typing.

So you tell yourself you’ll remember. You almost never do. Phone-typing a new contact is slow and awkward, and it forces you to drop the rich detail — the company, the city, the spouse’s name, how you actually met — down to a bare name and number, if you capture anything at all. The detail is exactly the part that matters, and it’s the first thing to evaporate.

Why voice + structured extraction is different

Voice memos feel like a fix, but they aren’t. A voice memo is a linear blob of audio you have to listen back to, and it’s unsearchable — six months later you can’t find “the designer from the wedding” without scrubbing through recordings. Standard voice-to-text apps are only marginally better: they hand you a wall of transcript text that you still have to read, parse, and manually reorganize into something usable.

YourPond does the parsing for you. It extracts the structure hidden inside what you said. A name becomes a contact. “Wife” becomes a relationship. “Figma” becomes a job. “Brooklyn” becomes a location. “Bear the corgi” becomes a pet on that person’s profile. This is the difference that matters: it’s not transcription, it’s structured extraction. And nothing is automatic — you review the extracted fields before a single thing is saved.

How it works on iPhone

  1. Open YourPond on your iPhone.
  2. Tap the microphone icon.
  3. Describe the person or people you just met. One person or many — YourPond extracts each one separately.
  4. Stop speaking. YourPond shows you what it extracted, laid out in structured form: name, job, location, relationships, kids, pets, notes.
  5. Review, edit anything that’s off, and save.

The whole loop takes seconds, and it happens while the conversation is still fresh — on the walk to the car, in the elevator, the moment you sit down. That’s the point: capture at the speed you actually live, not the speed you can thumb-type.

What YourPond can extract from your voice

  • Names — including spouse, kids, siblings, and parents. You can describe a whole relationship chain in one breath.
  • Jobs — company and title together (“designer at Figma”).
  • Schools — the university or high school someone attended.
  • Locations — cities, states, and countries.
  • Relationships to you — friend, sister-in-law, college roommate.
  • Relationships between people — “Maya’s husband is Theo” links the two contacts to each other.
  • Kids’ names and ages.
  • Pets’ names and species — Bear the corgi, Ezra the cat.
  • Notes — “she’s allergic to shellfish,” “he just moved to Austin.”
  • How you met — “at Aisha’s wedding,” “through Theo.”

Privacy — your voice doesn’t get stored

Your voice is transcribed locally on your iPhone using iOS speech recognition, and the audio isn’t kept. The resulting transcript is sent to Anthropic’s Claude API purely to extract the structured fields, and that transcript is auto-deleted within 7 days. Anthropic does nottrain models on YourPond data. Nothing saves to your YourPond account without your explicit approval, and there are no ads and no data sales — ever.

How this fits the bigger picture

Voice is one of three ways to add people to YourPond. The other two are typing a paragraph — the “Describe Your People” text mode, which works exactly like voice but with your keyboard — and bulk import from your phone’s contacts. All three feed the same place. A contact you added by voice gets the same self-building family tree, the same place on the relationship map, and the same birthday and follow-up reminders as one you typed by hand. Voice is just the fastest door into the same house. If you’re weighing YourPond against other tools, the personal CRM comparison lays out where each one fits.

When to use voice vs. typing

  • Voice — right after meeting someone, on a walk, in the car, any time your hands are busy and the details are fresh.
  • Typing — quiet, deliberate moments when you want to describe a whole group at once and see the text as you go.
  • Bulk import — onboarding day, when you want to migrate your existing iPhone contacts in one pass.

Frequently asked questions

Can I add contacts to my personal CRM with my voice?

Yes — on iPhone, YourPond lets you tap a microphone and describe people. YourPond extracts the names, relationships, jobs, locations, kids, and pets automatically. You review and approve before anything saves.

Does YourPond work like a voice memo app?

No. Voice memo apps give you audio files or transcripts. YourPond extracts structured contact data from what you said — names become contacts, “wife” becomes a relationship, “Figma” becomes a job. It’s structured extraction, not transcription.

Is voice entry available on Android or web?

Currently iOS-only. Voice entry is an iOS-native feature that uses iOS speech recognition. Web users can use “Describe Your People” text entry, which works the same way.

What happens to my voice recording?

Your voice is transcribed locally on your iPhone (using iOS speech recognition), the transcript is sent to Anthropic’s Claude API for structured extraction, and the transcript is auto-deleted within 7 days. Anthropic does not train models on YourPond data.

Can I add multiple people in one voice entry?

Yes. Say “I met Maya and her husband Theo at the wedding” and YourPond extracts both contacts and the relationship between them.

What if YourPond gets something wrong?

You review every extraction before saving. Nothing is automatic. If YourPond extracts “Maya, designer at Figma” but you meant “Maya, designer at Discord,” you edit it before approving.

Talk to add your people

Free for your first 25 contacts. No credit card required.