Episode Transcript
Available transcripts are automatically generated. Complete accuracy is not guaranteed.
Josh:
So Ejaz you're wealthy you've just won the (00:03):
undefined
Josh:
lotto you have all this money what's one of the first things you (00:06):
undefined
Josh:
do and I know you're probably going to say a lot of answers but there's one particular answer (00:08):
undefined
Josh:
that I want you to say which is you're going to get yourself an assistant right (00:12):
undefined
Josh:
to kind of help you do all the things that you don't like to do they're going (00:15):
undefined
Josh:
to manage your calendar they're going to order your groceries they're going (00:20):
undefined
Josh:
to book your flights they're going to take care of all the essential annoyances (00:24):
undefined
Josh:
that you just don't want to deal with. (00:27):
undefined
Josh:
Now, what's cool about the news from this week is that OpenAI has released something (00:29):
undefined
Josh:
that essentially commoditizes that part of wealth. (00:34):
undefined
Josh:
It gives you an assistant that allows you to do all of these tedious things (00:37):
undefined
Josh:
that you normally don't like to do, but only for the cost of $20 a month. (00:42):
undefined
Josh:
And it's a product called Agent. (00:46):
undefined
Josh:
And here will show up the announcement post that they shared. (00:47):
undefined
Josh:
And what's really interesting about agent is it kind of builds (00:50):
undefined
Josh:
upon a lot of other things that openai has been building right (00:53):
undefined
Josh:
they have a browser feature they have a deep research feature (00:56):
undefined
Josh:
but they haven't created a fully comprehensive assistant (00:59):
undefined
Josh:
and that is what i'm assuming openai is (01:03):
undefined
Josh:
trying to do with their agent feature now the news dropped last week i was watching (01:05):
undefined
Josh:
the live stream ijaz we were we were kind of chatting throughout the whole thing (01:10):
undefined
Josh:
you were slightly underwhelmed because it didn't seem as cool as you wanted (01:12):
undefined
Josh:
i was more like in awe at the user interface i thought it just looked very are (01:16):
undefined
Josh:
you pretty more than anything else? (01:22):
undefined
Josh:
And we're kind of seeing on screen the announcement video. (01:23):
undefined
Josh:
What were your first takes when you saw this, the agent product from OpenAI? (01:25):
undefined
Ejaaz:
OK, so we were both watching that live stream at the same time, (01:29):
undefined
Ejaaz:
and we were sharing our thoughts. (01:32):
undefined
Ejaaz:
Do you remember that one sentence that I had for you, Josh, which was, if this is another, (01:34):
undefined
Ejaaz:
Google Drive connector, another agent that claims to do a ton of things and (01:41):
undefined
Ejaaz:
really it just kind of connects to your Google Drive, I'm going to be really mad. (01:46):
undefined
Ejaaz:
And what are we looking at on our screens right now, Josh? Tell me what we're looking at. (01:49):
undefined
Josh:
That's so funny because I was reading your message. You just sent this about (01:53):
undefined
Josh:
the first minute of the presentation and maybe 10 minutes later, (01:56):
undefined
Josh:
they showed the exact connection that he was describing, which is a Google Drive (01:59):
undefined
Josh:
connector. And what does that mean for the people who aren't sure why that's (02:02):
undefined
Josh:
not super exciting to you? Why is that annoying? (02:05):
undefined
Ejaaz:
So everyone uses email, right? And the most popular email is probably Gmail. (02:08):
undefined
Ejaaz:
And it comes with this suite of different products, right? You've got Google (02:12):
undefined
Ejaaz:
Docs where people write up documents at different company jobs. (02:14):
undefined
Ejaaz:
You've got Google Meet, people do video conferencing. It's a whole thing, right? (02:18):
undefined
Ejaaz:
And when agents started becoming popular, Josh, do you remember one of the main (02:21):
undefined
Ejaaz:
claims to fame was that you could now use AI in your Google account, (02:26):
undefined
Ejaaz:
which means that it kind of becomes chat GPT, (02:30):
undefined
Ejaaz:
but it can reference all your documents, it understands all your Excel sheets, (02:34):
undefined
Ejaaz:
your emails, it writes emails for you, whatever, right? (02:38):
undefined
Ejaaz:
But it ended up being more for PR purposes than what the agent actually did. (02:41):
undefined
Ejaaz:
When you actually tried some of these agents, all it kind of did was, (02:47):
undefined
Ejaaz:
it felt like a very friction filled process very fragmented still josh i still (02:51):
undefined
Ejaaz:
had to tell the ai no i meant i want you to reference this document or i want (02:56):
undefined
Ejaaz:
you to talk about this particular thing and i just ended up doing it all myself (03:01):
undefined
Ejaaz:
so i kind of have a shaded experience with using agents so when i saw this on (03:05):
undefined
Ejaaz:
the live stream i was like oh. (03:09):
undefined
Josh:
God is this (03:11):
undefined
Ejaaz:
The same thing. (03:11):
undefined
Josh:
And is it the same thing? Do you feel it's the same thing? Or is this a slight (03:13):
undefined
Josh:
enhancement? Because it appears to me like it's different. This time it is a little bit different. (03:16):
undefined
Ejaaz:
Okay, I will humor you. It actually, this time actually is a bit different. (03:19):
undefined
Ejaaz:
So you mentioned earlier, Josh, that this is a combination of their operator (03:24):
undefined
Ejaaz:
agent, which is, if I'm not mistaken, a computer use agent, right? (03:30):
undefined
Ejaaz:
So if you imagine it is an AI that can look at your desktop or your laptop screen (03:34):
undefined
Ejaaz:
the same way that I'm looking at it right now, and it can scroll with its own (03:40):
undefined
Ejaaz:
cursor, it can open up new tabs, it can open up documents, (03:45):
undefined
Ejaaz:
it can type your keyboard virtually and write a message, right? (03:48):
undefined
Ejaaz:
And it combines this feature with something called deep research, (03:52):
undefined
Ejaaz:
which you and I have spoken about a lot on this show, Josh, which is kind of (03:55):
undefined
Ejaaz:
like a PhD level researcher for whatever requests that you might have. (03:58):
undefined
Ejaaz:
So in theory, that sounded really good. (04:02):
undefined
Ejaaz:
Well, we just got access to this. (04:05):
undefined
Ejaaz:
I'm just gonna pull up the suite so that people have more context here, (04:07):
undefined
Ejaaz:
because I'm sure a lot of our listeners are kind of like asking, you know (04:10):
undefined
Ejaaz:
can i use this right now the feature just rolled (04:12):
undefined
Ejaaz:
out to all pro users so all anyone that's (04:15):
undefined
Ejaaz:
on the gpt pro plan will get access to this and so i (04:18):
undefined
Ejaaz:
was playing around with this for the last couple of days josh and (04:21):
undefined
Ejaaz:
to answer your question yes it is actually quite useful i kind of use some of (04:24):
undefined
Ejaaz:
the presented prompts so you know kind of like when you haven't typed anything (04:29):
undefined
Ejaaz:
yet they kind of suggest things that you have to type josh and i got it to organize (04:32):
undefined
Ejaaz:
bouquet of flowers to be delivered to my apartment for my girlfriend. (04:38):
undefined
Ejaaz:
And that's all I asked her to do. I was like, you know, off you go. (04:44):
undefined
Ejaaz:
This is not something I want to spend my time doing. So you go do it for me, right? (04:48):
undefined
Ejaaz:
Exactly, right? And so I set it. And then I'll tell you my firsthand experience, (04:53):
undefined
Ejaaz:
Josh, which is kind of funny. (04:58):
undefined
Ejaaz:
I just stared at the screen and I watched it create this virtual browser. (04:59):
undefined
Ejaaz:
So for those of you who are listening to the audio here, kind of imagine a pop-up (05:05):
undefined
Ejaaz:
appearing and it's like a mini desktop. (05:08):
undefined
Ejaaz:
But all the actions, all the movements, all the screen grabbing and all that (05:11):
undefined
Ejaaz:
kind of stuff is completely autonomous. (05:15):
undefined
Ejaaz:
You're just watching the AI do its thing. So it opened up a browser, (05:17):
undefined
Ejaaz:
Josh, and it started scrolling through different links from Google, (05:21):
undefined
Ejaaz:
from GPT, flower websites in my local area. (05:25):
undefined
Ejaaz:
It started using my location tracker to figure out where I was so it could figure (05:28):
undefined
Ejaaz:
out kind of like my vague general address and then figure out what flower shops (05:33):
undefined
Ejaaz:
were near me. It then kind of like started passing through a bunch of different (05:37):
undefined
Ejaaz:
bouquet flowers, all that kind of stuff. (05:41):
undefined
Ejaaz:
Kind of figuring out what's seasonal and what's not, what's within kind of a (05:43):
undefined
Ejaaz:
reasonable price point. (05:47):
undefined
Ejaaz:
And then it gave me a bunch of suggestions. And it said, hey, (05:48):
undefined
Ejaaz:
okay, these are some pretty good suggestions that I've got for you, (05:51):
undefined
Ejaaz:
but I kind of want to see whether I'm on track, whether I'm kind of hitting your vibe. (05:54):
undefined
Ejaaz:
Can you give me some more information to let me know? (05:59):
undefined
Ejaaz:
And so what I basically did, Josh, was I reviewed what it had for me. (06:01):
undefined
Ejaaz:
I scrolled through some of the examples that I had and I said, (06:05):
undefined
Ejaaz:
yeah, I actually kind of like this. (06:08):
undefined
Ejaaz:
And options one and four are kind of like really my vibe. (06:09):
undefined
Ejaaz:
And it said, okay, say no more. And it went away and it started like loading (06:12):
undefined
Ejaaz:
up the checkout sites and all that. (06:16):
undefined
Ejaaz:
And I could see the way it thought Josh, which was like one of my biggest takeaways (06:18):
undefined
Ejaaz:
was it kind of gave me comfort being able to watch it do its own thing. (06:22):
undefined
Ejaaz:
So it took about 10 minutes in this entire process. And by the end of it, (06:28):
undefined
Ejaaz:
I had a pretty good idea of which bouquet I wanted. (06:32):
undefined
Ejaaz:
And it just needed me to fill out some wallet details and off I went. (06:35):
undefined
Ejaaz:
So my initial take was, you've just saved me a lot of time. You did that in (06:39):
undefined
Ejaaz:
10 minutes where I could have just technically looked away from the screen and (06:43):
undefined
Ejaaz:
got on with my own thing, right? And then come back to it. (06:47):
undefined
Ejaaz:
The other takeaway that I have, and I'm going to take the other side of this, (06:50):
undefined
Ejaaz:
right? Is it still required me to do things, Josh. (06:53):
undefined
Ejaaz:
It didn't know me entirely. And maybe I'm being too hard on the AI, (06:57):
undefined
Ejaaz:
but ideally I want an AI agent that just kind of like knows what I want and (07:02):
undefined
Ejaaz:
gets it done for me, right? (07:05):
undefined
Ejaaz:
And like has access to my wallet and pays for me. (07:07):
undefined
Ejaaz:
We're not quite there yet, but this is a noticeable step change from what we had before. (07:09):
undefined
Josh:
Okay, so good, not great. It'll get you most of the way there, (07:15):
undefined
Josh:
but it won't quite finish the entire thing. (07:18):
undefined
Josh:
And it was interesting hearing the process because during the stream, (07:20):
undefined
Josh:
I was listening to how it works. And you described the combination between operator (07:23):
undefined
Josh:
and deep research, how they're complimentary. (07:28):
undefined
Josh:
I kind of want to unpack that a little bit because operator was the browser-based (07:30):
undefined
Josh:
feature. can control a browser. (07:34):
undefined
Josh:
So now you have this tool that can control a browser, but it can't really read (07:36):
undefined
Josh:
long articles. It's not good at reading long articles. It doesn't have a lot of context. (07:39):
undefined
Josh:
It's not that good at things that deep research was good at. (07:42):
undefined
Josh:
Deep research is good at reading huge amounts of information, (07:47):
undefined
Josh:
compressing it down, coming up with links, and feeding that information back to the browser. (07:49):
undefined
Josh:
So we have this very complementary thing where deep research can't interact (07:53):
undefined
Josh:
with visual web pages, but it can understand a lot of context. (07:57):
undefined
Josh:
An operator can interact with visuals, but it doesn't have the ability to read a lot. (08:00):
undefined
Josh:
And that creates this complementary tool set that we're seeing on screen right (08:04):
undefined
Josh:
now, which is basically what OpenAd calls the agent. (08:07):
undefined
Josh:
It's given a tool set. So the agent spins up a virtual machine. (08:11):
undefined
Josh:
So you can imagine when you prompt in a query, it created this virtual computer (08:14):
undefined
Josh:
that started talking through how to get the bouquet of flowers. (08:17):
undefined
Josh:
It has a text-based browser tool, it has an image-based browser tool, (08:21):
undefined
Josh:
and there's access to its own terminal to create complementary things. (08:24):
undefined
Josh:
So let's say you were for example doing a spreadsheet and you wanted to see (08:29):
undefined
Josh:
what the costs of all the bouquet of flowers are throughout the neighborhood. (08:32):
undefined
Josh:
It can actually do that because it has all these complementary tool sets. (08:37):
undefined
Josh:
So while I can do all of this, (08:39):
undefined
Josh:
Well, it's not completely finished, like you said, you still have to kind of nudge it through. (08:41):
undefined
Josh:
So an example that they used in the live stream was trying to get a wedding outfit for a wedding. (08:47):
undefined
Josh:
And it'll get you kind of close, but then it doesn't quite know the sizing. (08:52):
undefined
Josh:
It doesn't quite know your style preferences. (08:59):
undefined
Josh:
And there's more context that it needs (09:02):
undefined
Ejaaz:
To complete this. It's lacking a bit of context, right? Yeah, exactly. (09:05):
undefined
Josh:
Yes, it's lacking the context and it's lacking the ability to click pay. (09:07):
undefined
Josh:
So we're seeing here in the example, it's stuck on the pay screen because it (09:09):
undefined
Josh:
doesn't have your credit card information and they just don't have that capability stored yet. (09:12):
undefined
Josh:
So I would imagine this is probably just, I mean, this is version one. (09:16):
undefined
Josh:
It will get better. It will have the context. It will get your credit card info. (09:20):
undefined
Josh:
One of the things I loved, and you described this briefly, is when it loads (09:23):
undefined
Josh:
up this browser window, you can actually replay it. (09:27):
undefined
Josh:
So Ejaz, you were at your browser for 10 minutes kind of watching it do its thing. (09:29):
undefined
Josh:
But in the case that you stepped away and you left you could (09:32):
undefined
Josh:
come back and you could actually re-watch all the steps that (09:35):
undefined
Josh:
it took as if you were watching a youtube video which is something that i really (09:38):
undefined
Josh:
liked a lot about this i think of all the things that i liked the most it was (09:42):
undefined
Josh:
the interface it was very pretty it was very easy to understand and it's really (09:46):
undefined
Josh:
funny the example they they were able to just kind of pan through like it was (09:49):
undefined
Josh:
a youtube video see where it made the searches see what it was looking for walk (09:52):
undefined
Josh:
through that step process to see how exactly it arrived at the answer. (09:55):
undefined
Josh:
And that was pretty neat. So, I mean, I haven't gotten a chance to try yet. (09:58):
undefined
Josh:
My account still does not have enabled. (10:02):
undefined
Josh:
Please give it to me. But based on your experience, it seems like it's good, not great. (10:04):
undefined
Josh:
And I think that's a step in the right direction because we went from pretty (10:08):
undefined
Josh:
mediocre to good. And that's a big jump in the right direction. Yeah. (10:11):
undefined
Ejaaz:
So let me unpack my thoughts a little more for you, Josh. Basically, (10:14):
undefined
Ejaaz:
I think it's good for people that have high agency. (10:17):
undefined
Ejaaz:
And what I mean by that is people who know what exactly they need to get done and, you know, (10:22):
undefined
Ejaaz:
maybe what they need to get done requires general access to a browser or and (10:29):
undefined
Ejaaz:
a few different tools will get a ton of value out of this agent product. (10:34):
undefined
Ejaaz:
I'm not saying it's useless. It's definitely way more useful than any other (10:39):
undefined
Ejaaz:
agent that's been released before. (10:42):
undefined
Ejaaz:
For the general public, who is so used to, I feel like at this point, (10:45):
undefined
Ejaaz:
Josh, magic when it comes to the internet. (10:50):
undefined
Ejaaz:
It's like, you know, when you open up an app and you're like, (10:53):
undefined
Ejaaz:
whoa, like snaps of photos that disappear after six seconds. (10:55):
undefined
Ejaaz:
That's so novel, right? Back in the day when Snapchat was created. (10:59):
undefined
Ejaaz:
You know, they want like that kind of like easy to understand kind of like basic (11:02):
undefined
Ejaaz:
level thinking. I don't need to think too much. It just wows me. (11:06):
undefined
Ejaaz:
We're not there yet with agents. But I have a feeling that when you combine (11:09):
undefined
Ejaaz:
agent with deep research as they did today, but with the context of memory, (11:14):
undefined
Ejaaz:
which as you and I have spoken about many times, (11:20):
undefined
Ejaaz:
and that OpenAI has the kind of like forefront leadership on, (11:22):
undefined
Ejaaz:
when you combine all those three things together, then you're going to see that (11:26):
undefined
Ejaaz:
magical experience where all you have to do, well, maybe you don't even have to prompt it, Josh. (11:29):
undefined
Ejaaz:
Maybe you just kind of open your screen and the agent is like, (11:33):
undefined
Ejaaz:
hey, I kind of ordered your coffee for you already. (11:36):
undefined
Ejaaz:
It's actually outside the Korea. I've been tracking him is arriving and I've (11:38):
undefined
Ejaaz:
like ordered your Tesla robo taxi to arrive. (11:44):
undefined
Ejaaz:
So, you know, you'll be ready to go and leave at 730 a.m. or whatever it is to go to the office. (11:46):
undefined
Ejaaz:
That's when people are going to start being like. (11:51):
undefined
Ejaaz:
I will pay hundreds of dollars for this thing. And I won't even think twice. (11:53):
undefined
Josh:
Yes. So we have great examples of people using this so far. (11:58):
undefined
Josh:
I see you have one pulled up right here. I want you to walk me through how you (12:02):
undefined
Josh:
can actually use this thing. Because (12:05):
undefined
Josh:
a lot of these things are kind of constrained by your own creativity. (12:06):
undefined
Josh:
I do a lot of stuff, but I'm not quite sure what I'm able to optimize for. (12:09):
undefined
Josh:
So what are some people using this to optimize for currently? (12:13):
undefined
Ejaaz:
So in this example, this person is using it to shop, which isn't exactly something that I can relate to. (12:15):
undefined
Ejaaz:
I don't do a lot of online shopping, but i have a girlfriend who does (12:21):
undefined
Ejaaz:
and i know that there are many different occasions that she (12:25):
undefined
Ejaaz:
needs to kind of cater and figure out what to buy for and there's so (12:28):
undefined
Ejaaz:
many sites that she needs to browse so in this example over here um you can (12:31):
undefined
Ejaaz:
see like you know she puts in this problem we pause it she goes find a beige (12:35):
undefined
Ejaaz:
trench coat for under 500 any website is fine must be under 500 including shipping (12:39):
undefined
Ejaaz:
must have a belt and double-breasted buttons optional to have a hood but must be detachable, (12:44):
undefined
Ejaaz:
basically all these different specificities that she needs for this comic, right? (12:49):
undefined
Ejaaz:
And kind of reading this kind of gives me a little anxiety because I'm like, (12:54):
undefined
Ejaaz:
oh my god, like I do not have the energy to search for something like this myself. (12:57):
undefined
Ejaaz:
And you see the agent responding being like, understood, I'll start searching (13:02):
undefined
Ejaaz:
for the beige coat under 500 bucks, and off we go, right? (13:05):
undefined
Ejaaz:
So that's happening. It's doing its own desktop thing. And we see it kind of (13:08):
undefined
Ejaaz:
unfolding. It's going through Amazon. (13:12):
undefined
Ejaaz:
It's kind of like going through a bunch of different options, (13:14):
undefined
Ejaaz:
looking at different colors, making sure it fits all the different criteria. (13:16):
undefined
Ejaaz:
And in the meantime, she's timing up another prompt and she's running up another prompt. (13:20):
undefined
Ejaaz:
So she's basically running like a bunch of different prompts for like her own chat GPT discussions. (13:25):
undefined
Ejaaz:
And what she's demonstrating here is she can go back and forth between windows, Josh. (13:30):
undefined
Ejaaz:
And she can like basically multiply her time and effort over a ton of different (13:34):
undefined
Ejaaz:
tasks while she has this agent doing it for her in the background. (13:40):
undefined
Ejaaz:
And if we get to the end of it... (13:42):
undefined
Ejaaz:
As we see over here, we see that it's pulled up a bunch of options. (13:46):
undefined
Ejaaz:
I think it's primarily used Amazon as its main kind of retailer. (13:49):
undefined
Ejaaz:
And then she ends up kind of like making a decision around what kind of coat that she wants to buy. (13:53):
undefined
Ejaaz:
So I thought that was like a really generally accessible or publicly accessible (13:59):
undefined
Ejaaz:
example that anyone can get into. (14:03):
undefined
Ejaaz:
I thought this was pretty fun. So this is something that like really relates (14:05):
undefined
Ejaaz:
to me, right? I'm like getting old. (14:09):
undefined
Ejaaz:
I'm like, okay, I need to, okay, I'm not that old. But like I'm starting to (14:11):
undefined
Ejaaz:
think about major financial decisions, right? I'm like, can I afford a house (14:14):
undefined
Ejaaz:
or am I going to be eternally renting for the rest of my life? Stuff like that. (14:18):
undefined
Ejaaz:
So I'm like, okay, what do I do with my finances? I don't even know the first (14:20):
undefined
Ejaaz:
place to consider or look into. (14:24):
undefined
Ejaaz:
And this demo basically says that it took 20 minutes for this agent to run a (14:27):
undefined
Ejaaz:
task to figure out what a healthy retirement plan might look like or financial (14:33):
undefined
Ejaaz:
plan might look like for this particular user that is demonstrating this example. (14:39):
undefined
Ejaaz:
And what I found really interesting here, and I'm highlighting it on this tweet, (14:43):
undefined
Ejaaz:
is within 20 minutes, it found local tax laws in Vancouver, which is where this guy must be based. (14:47):
undefined
Ejaaz:
It analyzed average monthly spend rates. It calculated savings needed to retire at 30 years old. (14:53):
undefined
Ejaaz:
It researched optimal investment allocations. It found taxed optimized strategies. (15:00):
undefined
Ejaaz:
It built multiple retirement scenarios. (15:04):
undefined
Ejaaz:
And then it created a downloadable presentation with all the results, Josh. (15:08):
undefined
Ejaaz:
This would have cost me $5,000 from a financial advisor and taken weeks. (15:11):
undefined
Ejaaz:
But here I have ChatGPT doing this all for whatever, $50, $100 subscription. (15:16):
undefined
Ejaaz:
And I'm going to bring up the video here where we basically see it kind of go (15:21):
undefined
Ejaaz:
through its thought process and then create a final finished deck, (15:24):
undefined
Ejaaz:
which you see on your screens right now. (15:28):
undefined
Ejaaz:
It looks pretty rudimentary, doesn't it, Josh? Let's be honest. (15:29):
undefined
Josh:
It looks kind of like a... I wouldn't give it a passing grade in terms of style, (15:32):
undefined
Josh:
but perhaps in terms of the actual content. Very impressive. (15:35):
undefined
Ejaaz:
Very impressive. Very impressive. (15:39):
undefined
Josh:
And this third example that we have is pretty funny because just before we started (15:41):
undefined
Josh:
recording, EJs and I were going through what we were going to order for lunch. (15:43):
undefined
Josh:
And it was this kind of intensive process. (15:46):
undefined
Josh:
We were reading off the menu items and choosing what we liked. (15:47):
undefined
Josh:
And it took a few minutes. And this one is a little bit different. (15:50):
undefined
Josh:
This one, the prompt was to order a roast dinner, which I think is very fitting (15:53):
undefined
Josh:
for you to describe EJs because I'm not quite sure what a roast dinner is. (15:56):
undefined
Josh:
This seems like a British thing. (15:59):
undefined
Ejaaz:
It's a Sunday meal, Josh, where the family comes together. It's either roast (16:00):
undefined
Ejaaz:
chicken, roast lamb or roast beef and it is a lengthy process all right and (16:05):
undefined
Ejaaz:
trust me i've watched my mom do it a million times. (16:10):
undefined
Josh:
Well according to this example it does not have to (16:13):
undefined
Josh:
be as lengthy as you imagine it used to be (16:15):
undefined
Josh:
what i'm seeing this agent do is it's actually it's going through (16:18):
undefined
Josh:
a grocery list it's choosing the chicken it's scheduling a delivery time (16:21):
undefined
Josh:
and it's giving you the the prompts in which you would (16:24):
undefined
Josh:
need to just finish filling it out so it's like hey what are the login details what (16:27):
undefined
Josh:
are the credit card information what time do you want delivered but it's doing (16:30):
undefined
Josh:
everything and it's amazing you're watching it click the browser it is going (16:33):
undefined
Josh:
through the thinking it is selecting all the things that you want really impressive (16:36):
undefined
Josh:
stuff so in this case like that solves you your your roast dinner problem where (16:40):
undefined
Josh:
you don't have to worry about it anymore you set this up once it knows your (16:44):
undefined
Josh:
preferences and then you just type in the prompt when you're ready to go and (16:48):
undefined
Josh:
it will place another order (16:50):
undefined
Ejaaz:
You know what's really impressive about this, Josh? I've been on this website, (16:52):
undefined
Ejaaz:
this exact website before actually, Tesco, when I was a university kid. (16:56):
undefined
Ejaaz:
And I remember doing bulk orders because I had like no money back then. (17:00):
undefined
Ejaaz:
So I was like, any kind of like bundle deal, I'll order in bulk and then like (17:04):
undefined
Ejaaz:
store it in the freezer or whatever. (17:09):
undefined
Ejaaz:
There are so many products. It's like going to Costco or whatever. (17:11):
undefined
Ejaaz:
It's Costco here, right? Where it's like a bulk ordering thing. and (17:16):
undefined
Ejaaz:
you know you just spend millions of hours just staring in (17:19):
undefined
Ejaaz:
the eyes being like i don't know what i need do i need this many toilet (17:22):
undefined
Ejaaz:
roll i have no idea this just like abstracts all (17:25):
undefined
Ejaaz:
of that away from you and i just know that there are a ton of people where this (17:28):
undefined
Ejaaz:
would be kind of like super useful for right the other thing was that kind of (17:32):
undefined
Ejaaz:
like popped into my mind is like how relevant does this make like supermarket (17:36):
undefined
Ejaaz:
websites these days right or just like general retailers in general Like, (17:41):
undefined
Ejaaz:
do the Amazons and Tescos, in this example, of the world need to now try and (17:46):
undefined
Ejaaz:
cater to these agents, Josh? Like, how are you thinking about that? (17:52):
undefined
Josh:
Yeah, I think the answer is yes. I think directionally the answer is yes for (17:56):
undefined
Josh:
things much further than just grocery stores. (17:59):
undefined
Josh:
Where if you're building a (18:01):
undefined
Josh:
website, you kind of want a version of your website to be readable by AIs. (18:03):
undefined
Josh:
A lot of websites are visually complex. They have a lot of visual. (18:07):
undefined
Josh:
Well, they're meant to be aesthetically pleasing, but they're not meant to serve (18:10):
undefined
Josh:
data in the most optimized way. (18:13):
undefined
Josh:
So what I've been seeing with a lot of developers they've started doing is they've (18:15):
undefined
Josh:
actually created a .md file, which is a markdown file that just has the raw (18:18):
undefined
Josh:
text data of the website. (18:22):
undefined
Josh:
No way. So when an agent is scraping... (18:24):
undefined
Josh:
No images at all, just the raw text data. Because when an AI is scraping a website, (18:26):
undefined
Josh:
all it wants is the tokens. (18:31):
undefined
Josh:
It doesn't care for the visuals. It can't see in most cases. (18:33):
undefined
Josh:
So the most optimal way of serving these models is just by allowing the robot (18:36):
undefined
Josh:
to come and read your website as it wants to. (18:40):
undefined
Josh:
And I think a similar thing is probably going to happen with delivery, (18:43):
undefined
Josh:
with groceries, with everything is just kind of creating a dual experience, this split experience. (18:46):
undefined
Josh:
There's an experience you build for the human, and then there's an experience (18:51):
undefined
Josh:
you build for the AI model. (18:54):
undefined
Josh:
And the ai model is going to be really boring and plain it's (18:56):
undefined
Josh:
just going to be a massive block of text that has (18:59):
undefined
Josh:
no spacing no prompts it's just text but (19:01):
undefined
Josh:
that's really all the ai model wants it ingests the tokens it sorts them and (19:05):
undefined
Josh:
it pops new ones out and i think this is probably a trend we're going to be (19:09):
undefined
Josh:
seeing with a lot of things is is the user interface the visual element of it (19:12):
undefined
Josh:
is going to matter less and less and i can't talk about this without thinking (19:16):
undefined
Josh:
about an episode that we're going to be filming soon, (19:20):
undefined
Josh:
which is about browsers and AI browsers in general, and how we kind of interact with these things. (19:22):
undefined
Josh:
Because it's becoming more and more clear that the future of engaging with the (19:27):
undefined
Josh:
internet will be requesting something from it and expecting to be delivered (19:33):
undefined
Josh:
that thing without needing to search and go through all the tedious efforts (19:37):
undefined
Josh:
of actively going to the website, (19:41):
undefined
Josh:
searching through the million different SKUs that they have on Tesco, choosing what you want. (19:43):
undefined
Josh:
You could just say, hey, I have this grocery list or hey, I want to make this (19:46):
undefined
Josh:
for dinner. just go find the stuff for me and just make sure it shows about my doorstep by 6 p.m. (19:50):
undefined
Josh:
And that is a really cool trend that I think we are starting to see and we're (19:54):
undefined
Josh:
going to see more as more tools like this get rolled out. (19:57):
undefined
Ejaaz:
I can't help but think that this is going to completely disrupt the advertising industry, Josh. (20:00):
undefined
Ejaaz:
Isn't that like the main way that all these internet companies make all their money? (20:06):
undefined
Ejaaz:
YouTube, Google on search ad revenue, Meta on advertising on all their news (20:10):
undefined
Ejaaz:
feeds and all that kind of stuff. It's advertisers, right? (20:16):
undefined
Ejaaz:
And now I'm trying to think, how do you prove that your advert has made an impression (20:19):
undefined
Ejaaz:
on an artificial intelligent agent, right? (20:26):
undefined
Ejaaz:
To your point, you go from making flashy ads, which were designed for human (20:31):
undefined
Ejaaz:
eyes to see and walk at, to agents that are just reading a bunch of .md text (20:35):
undefined
Ejaaz:
files, as you put it, right? (20:42):
undefined
Ejaaz:
Which is kind of like insane to think about. (20:44):
undefined
Ejaaz:
The other thing that I'm thinking about is I think services are going to look (20:48):
undefined
Ejaaz:
very different now, Josh, right? (20:53):
undefined
Ejaaz:
It's going to become less human-like. So we're going to, you know, (20:56):
undefined
Ejaaz:
we're not naturally going to go on websites. (21:00):
undefined
Ejaaz:
And I kind of have a comment on your browser, a topic that you mentioned. (21:03):
undefined
Ejaaz:
I kind of think of it as like an intermediary step, right? (21:07):
undefined
Ejaaz:
Almost like it's kind of like trying to bridge the gap between humans and AI. (21:10):
undefined
Ejaaz:
Oh, we're used to browsers, scrolling browsers. (21:15):
undefined
Ejaaz:
So, you know, let's add a few AI features to these browsers. (21:18):
undefined
Ejaaz:
And I'm really excited to do that episode. (21:21):
undefined
Ejaaz:
But then ultimately, we're going to (21:22):
undefined
Ejaaz:
end up in this world where it's just advertisers trying to pitch agents. (21:24):
undefined
Ejaaz:
Your maybe your own personal agent to sell something to you and i wonder whether (21:28):
undefined
Ejaaz:
it skews incentives in different ways right maybe you can pay the agent shill (21:33):
undefined
Ejaaz:
a certain product to the individual i wonder how all of that ends up mapping (21:37):
undefined
Ejaaz:
out it's a weird thing to think. (21:40):
undefined
Josh:
About i would actually love to have a specialist on advertising on the show (21:42):
undefined
Josh:
to talk about this that just knows more about it than we do because the model (21:45):
undefined
Josh:
is already starting to break we've seen it a lot with ad blockers now being (21:49):
undefined
Josh:
built into browsers a lot of companies are kind of optimizing out of displaying ads, so (21:52):
undefined
Josh:
On YouTube, a lot of people will just pay for their premium service to not even have to deal with it. (21:58):
undefined
Josh:
So there's this trend towards avoiding ads. (22:02):
undefined
Josh:
Now that these agents are browsing the website, do they insert the ads into (22:06):
undefined
Josh:
the markdown files of the website and hope that they get served? (22:09):
undefined
Josh:
Is the agent going to be able to filter out from there? Because that's this (22:13):
undefined
Josh:
thing called injection poisoning, right? (22:16):
undefined
Josh:
Where you kind of inject these things into the words that it's reading to give it commands. (22:18):
undefined
Josh:
And that was actually part of the presentation that OpenAI had was about the (22:22):
undefined
Josh:
security and defensiveness of these agents. (22:25):
undefined
Josh:
Because when you have an agent that is equipped with your credit card information, (22:28):
undefined
Josh:
all of your personal information, everything about you, and it has read and (22:32):
undefined
Josh:
write ability on the open internet, (22:35):
undefined
Josh:
well, you kind of want to make sure the agent is not giving this stuff up and (22:37):
undefined
Josh:
telling a malicious actor what your address is, what your credit card information (22:41):
undefined
Josh:
is, what all your preferences are. (22:46):
undefined
Josh:
So there's the second side of this conversation, which (22:47):
undefined
Josh:
is the security of allowing an agent to go (22:51):
undefined
Josh:
out into the wild with your entire profile in its (22:53):
undefined
Josh:
brain in the hopes that some malicious actor (22:57):
undefined
Josh:
can't can't prompt inject and create this malicious intention that serves up (23:00):
undefined
Josh:
the information to a website that they're just that's not real there's there's (23:05):
undefined
Josh:
this whole defensible side of this argument to be had also it seems like they're (23:09):
undefined
Josh:
doing a lot of work to prevent this but i guess we'll see as time goes on how it actually plays out (23:13):
undefined
Ejaaz:
Okay, so, so far we've spoken about retail consumption of these agents, (23:19):
undefined
Ejaaz:
Josh. We've spoken about the security. (23:24):
undefined
Ejaaz:
One thing that we haven't covered is kind of like the workforce and the enterprise world, right? (23:26):
undefined
Ejaaz:
Presumably, right, if you can connect an agent to, you know, (23:33):
undefined
Ejaaz:
very neatly use your Google Drive, your Slack, your LinkedIn, (23:37):
undefined
Ejaaz:
your Salesforce CRM, what are you doing from nine to five every day, right? (23:43):
undefined
Ejaaz:
I pulled up this demo here, Josh. I don't know if you can see it. (23:49):
undefined
Ejaaz:
But it's this demo that shows someone using the GPT agent to basically make (23:54):
undefined
Ejaaz:
personalized invite connections to a bunch of different people on LinkedIn. (24:00):
undefined
Ejaaz:
And so it kind of sets the scenario with the prompt saying, like, (24:04):
undefined
Ejaaz:
hey, I want you to basically reach out to individuals that have this kind of (24:08):
undefined
Ejaaz:
a background, that have held this kind of role, that has this many years of (24:11):
undefined
Ejaaz:
experience, and I want you to pitch them on A, B, and C, right? (24:15):
undefined
Ejaaz:
And like throughout the demo, it kind of like goes through, personalizes each (24:18):
undefined
Ejaaz:
request to each individual person. (24:22):
undefined
Ejaaz:
And the first thought that I had in this, Josh, was if you're on the sales team (24:24):
undefined
Ejaaz:
of some company, your job just got automated, dude. (24:29):
undefined
Ejaaz:
Because a lot of the time you spend, if you're a recruitment person, (24:32):
undefined
Ejaaz:
work for a recruitment firm and you're headhunting someone, 90% of their time (24:37):
undefined
Ejaaz:
is spent on LinkedIn. I get a ton of these requests day in, day out, right? (24:41):
undefined
Ejaaz:
And so I'm thinking, well, now your job can be completely automated and it could (24:45):
undefined
Ejaaz:
probably do a much better job (24:49):
undefined
Ejaaz:
and reach thousands of people that you reach over months within an hour, (24:50):
undefined
Ejaaz:
within 20 minutes, within however long this video is, which is two minutes and (24:56):
undefined
Ejaaz:
36 seconds. So I think that we're going to start to. (25:01):
undefined
Josh:
See a lot of (25:04):
undefined
Ejaaz:
These agents in work environments. I don't know if we're going to have as much insight into that. (25:04):
undefined
Ejaaz:
It's going to be at the discretion of all these different companies as to what (25:10):
undefined
Ejaaz:
they want to reveal to us. But that's something that we should keep our eyes on, Josh. (25:13):
undefined
Ejaaz:
And, you know, I would love to get someone on the show, (25:17):
undefined
Ejaaz:
knows about how it's transforming their own company right at a massive scale. (25:20):
undefined
Josh:
Yeah it's the so there's there's two elements of (25:24):
undefined
Josh:
this agent right it's like the personal unlock that we talked about at the top of the show (25:27):
undefined
Josh:
where now it's this super assistant that you have to take care of all of your (25:29):
undefined
Josh:
yes irons that you don't want to run all the things that are annoying and then (25:34):
undefined
Josh:
you have the professional version of it where it can go and it can make sales (25:37):
undefined
Josh:
funnels and sales calls and it could do a lot of the professional things it (25:40):
undefined
Josh:
can query data it can create nice spreadsheets it can create google slide presentations (25:43):
undefined
Josh:
and granted they don't look that pretty, (25:47):
undefined
Josh:
but I mean, directionally, they're heading there and they will eventually get there. (25:49):
undefined
Josh:
So it's this new trend that is really exciting because it's now here and it's (25:53):
undefined
Josh:
accessible for $20 a month and you can actually go and try it and you could (25:57):
undefined
Josh:
push those limitations, see (26:00):
undefined
Josh:
where the outer bounds lie and see how it can mostly optimize your life. (26:01):
undefined
Josh:
And then as we go, it'll just unlock more and more and more. (26:05):
undefined
Josh:
It'll start taking down your credit card. (26:08):
undefined
Josh:
It'll take down your whole preference stack and just become this supercharged agent. (26:10):
undefined
Josh:
So I think that is OpenAI's agent feature. It's live. It is out right now. (26:14):
undefined
Josh:
I'm hoping my account gets it soon because I really want to use it. (26:19):
undefined
Ejaaz:
Yeah, dude. You know what? I am officially more optimistic than I was at the (26:23):
undefined
Ejaaz:
start of this episode about this thing. (26:27):
undefined
Ejaaz:
I think you've successfully, or rather we're both successfully convinced me, (26:29):
undefined
Ejaaz:
that this is going to become a much larger thing. (26:33):
undefined
Ejaaz:
And I'm just looking at the end of this demo here, Josh, like this screenshot (26:37):
undefined
Ejaaz:
of like, you know, this virtual browser. (26:40):
undefined
Ejaaz:
Even though it's so rudimentary and basic, I kind of get it. (26:43):
undefined
Ejaaz:
Do you know what I mean? it's like running a simulation on my screen and like (26:47):
undefined
Ejaaz:
having it like just set and forget on the top left of my screen and just letting it do its thing. (26:51):
undefined
Ejaaz:
Or like if I'm doing work and I want to watch a stream of something, (26:57):
undefined
Ejaaz:
I can have like a YouTube video like there on the corner of my screen. (26:59):
undefined
Ejaaz:
It's kind of similar to that. So even just me saying that, it's behaviorally getting to me. (27:02):
undefined
Ejaaz:
I'm understanding it subconsciously and I'm like, yeah, you know what? (27:08):
undefined
Ejaaz:
This is going to be fun. So I'm bullish is what I'm saying. (27:11):
undefined
Josh:
Great. I am bullish too. My hot take before we leave, I'm just going to mic (27:14):
undefined
Josh:
drop this is I think the agent is much more impressive and much more important (27:18):
undefined
Josh:
than the AI powered browser. (27:21):
undefined
Josh:
I think the browsers are kind of dumb. The agent hops right over the browser. (27:22):
undefined
Ejaaz:
That's probably where this goes. Dude, that was going to be you. (27:25):
undefined
Ejaaz:
That's going to be my take on the future episode. (27:28):
undefined
Josh:
I was going to show it. We're going to stop before we spoil it. (27:30):
undefined
Josh:
The browser episode is coming soon. As soon as we get access to Comet, (27:33):
undefined
Josh:
we want to use Comet from Complexity. (27:35):
undefined
Josh:
We're working on getting access to it. Once we do, we will have a fully comprehensive (27:37):
undefined
Josh:
episode covering AI browsers. (27:40):
undefined
Josh:
But for now, that was OpenAI's agent feature. (27:41):
undefined
Josh:
I would encourage you if you have a plus plan, which is the $20 a month plan. (27:44):
undefined
Josh:
Go check it out. Give it a try. See where it can automate parts of your life. (27:47):
undefined
Josh:
See where it improves it. See where it detracts from it. I don't know. (27:50):
undefined
Josh:
I'd love to hear feedback. I'm sure both Ejaz and I would love to hear more (27:54):
undefined
Josh:
examples of how you're using it. (27:56):
undefined
Josh:
Because at the end of the day, the only real constraint on this is the creativity (27:57):
undefined
Josh:
that you use when you're using it. So I hope you enjoyed it. (28:01):
undefined
Josh:
That has been another episode. We will be back again soon talking about more (28:03):
undefined
Josh:
AI news in the future. I'll talk to you guys soon. (28:06):
undefined
Music:
Music (28:08):
undefined