Source (Bluesky)

Transcript
Here’s an example that Google’s Josh Woodward, VP of the Gemini app, Google Labs, and AI Studio, shared in a blog post about how Personal Intelligence can work. Google also put together a similar example in a video that I’ve embedded below:
For example, we needed new tires for our 2019 Honda minivan two weeks ago. Standing in line at the shop, I realized I didn’t know the tire size. I asked Gemini. These days any chatbot can find these tire specs, but Gemini went further. It suggested different options: one for daily driving and another for all-weather conditions, referencing our family road trips to Oklahoma found in Google Photos. It then neatly pulled ratings and prices for each. As I got to the counter, I needed our license plate. Instead of searching for it or losing my spot in line to walk back to the parking lot, I asked Gemini. It pulled the seven-digit number from a picture in Photos and also helped me identify the van’s specific trim by searching Gmail. Just like that, we were set.


Can we talk about the Transcript feature? How does that happen? As a frequent mobile user subjected to people's 1920x1080 full-screen text grabs, I greatly appreciate it.
I think people are just adding it themselves inside a spoiler tag for accessibility.
In the past I’ve used combinations of OCR and manual transcription. OCR saves a lot of time, but usually requires some postprocessing. It’s not really a feature, it’s the OP putting in effort to make our lives easier. Thanks @ThefuzzyFurryComrade@pawb.social!