Ever wish you had a personal assistant to handle all those annoying, multi-step tasks? You know, like planning a date night, buying a new outfit, AND finding the perfect gift all at once? Well, I just saw a video where an AI professional put ChatGPT’s brand new Agent Mode to the ultimate test, and the results were… something else.
This is one of the most exciting AI features to drop recently, so I was super curious to see if it could actually deliver on its promises.
⚙️ The Ultimate Agent Test
The YouTuber didn’t hold back. He threw a super complex, real-world task at the new agent to see how far it could get. I was blown away by the ambition here!
Here’s the exact prompt the creator used:
Book a date night for my wife’s birthday in 2 weeks. Find a restaurant that allows online booking and is a top rated restaurant in San Diego. It shouldn’t be more than $100 total per person. Look for openings on Thursday or Friday of next week around 6 p.m. Book the reservation for two. Also, find me a new pair of pants and matching long sleeve shirt that goes well with the pants. I’m 6′ 3 and have a 34 inch waist and 34 inch length. I typically wear XL shirts, make sure they look classy, and order them for me. Also, buy my wife a birthday present that’s $200 max. She loves new video games, camping gear, travel, and reading.
So, what happened? The agent worked for a whopping 50 minutes! It found a restaurant, picked out clothes, and even selected a Kindle as a gift. But it wasn’t a perfect victory. It got stuck and needed the YouTuber to take over to enter login and payment details. His other tests, like creating a slide deck and analyzing his channel, produced functional but flawed results.
My take? This is a huge step forward, but it’s not quite ready to be your fully autonomous assistant. It’s still a bit buggy and slow.
✨ A Whirlwind of New AI Tools
This video was packed with demos of other new tools, and some were seriously impressive. The mind behind this channel tested a bunch, but a few really stood out:
- 👽 Runway Act-Two: This new motion capture model lets you drive an animation with a video of yourself. The expert recorded himself talking and moving, and it worked pretty well for re-skinning him as an astronaut. But when he tried a full-body video with a toy sword… things got weird. We’re talking floating swords and extra hands. It’s fun, but best for upper-body shots for now.
- 😵💫 MirageLSD: This was my favorite part! The creator demoed a tool that re-skins your live video in real-time. He turned his office into a “yarn world,” a “goo world,” and even a world of “zombies” just by typing prompts. It’s wild, trippy, and happens instantly.
- 🎤 Hume AI Personality Cloning: This tool clones your voice and speaking style in seconds. The result was pretty wild: the AI was so chatty it wouldn’t let the person who shared it get a word in! It had personality, that’s for sure.
- 🎬 Invideo AI Twin: This innovator created a digital clone of himself just by uploading a 60-second video. He then used the clone to make a fun ad for a fake product. I think this is a game-changer for content creators who need to do quick reshoots or simple ads.
Some of the other tools the YouTuber tried to test, like Anthropic’s new connectors and XAI’s Grok companion, were unfortunately down due to server overload. It’s a good reminder that we’re still in the early days!
This video is an awesome tour of what’s new and what’s possible right now. For the full demos and to see these tools in action (especially the fails!), you have to see the original video from the creator. Go check it out!