Robin Li, the founder and CEO of Chinese search engine Baidu, said that artificial intelligence (AI) will innovate all kinds of industries, including Baidu’s search engine, in the next decade, in his visit to Indian Institute of Technology Madras (IITM) on January 4.
“We think in the age of AI, search will be very different from what we see today. Once we can transform search into a very different product, we will be ready to launch that internationally,” Li said.
China’s search engine giant Baidu has expanded into markets in countries like Japan, Thailand, and Egypt. However, their main product is still mainly domestic, faced with challenges from information-flow-driven companies such as ByteDance.
“In the future, you will not be able to distinguish between search and feed, or search and recommendation. Search is giving you the right answer right away, but recommendation will let you consume a lot more content and let you learn a lot more knowledge. You will have much better access to a lot of new knowledge,” Li said.
In the previous decade when consumers were dominated by the mobile internet. As for the coming decade, Li believed that AI will cut out humans’ dependence on the mobile phones. He would label it as “intelligent economy.”
“What does that mean, is that if we can see that internet changed the way we consume, or internet changed the way we entertain ourselves, the intelligent economy will change the way we produce. It will significantly improve productivity for humans,” He said.
During the trip, Li also confirmed that he plans to seek collaborations with IITM as Baidu is trying to grow its AI capacities.
Li’s full speech:
Robin Li at HAASTRA: Innovation in the Age of AI
Good afternoon everyone. It’s so great to see you all come to my talk today on a Saturday afternoon. I heard people who cannot get into IIT go to MIT. I know why today because you are so eager to learn new things.
Today I’m going to talk about innovation in the age of AI because everyone knows that AI is the major wave these days.
Before I talk about AI, I would like to go over with you what’s happened to the internet after smartphones. Or, how has mobile changed the internet? Of course, this is pretty much a China perspective because I guess most of you are more familiar with the US landscape, but China is slightly different because we have a relatively independent ecosystem in mobile.
The first change is apps becoming isolated islands. What that means is that there are quite a number of large apps that are wrapped around isolated (islands) – the contents, the services are not so easily accessible by search engines or third-party programs. We see that as a trend that more and more apps are doing things independently instead of relying on a search engine.
And the second is that content is linked to an author. What that means is in the PC era, we pretty much interacted with web sites or web pages. We know there’s a webmaster behind the website but we probably never think about directly communicating with the webmaster. But in the age of mobile, content is closely linked to authors, especially on social media. And even today, especially in China, news feeds, or content feeds, are very popular. When you search for things, not only (will) you find the relevant content, it’s easy for you to find the author behind the content. So today, when you find the relevant content you can ask questions and most likely that author will directly respond to your questions. This is increasingly the case for mobile internet.
And the third one is video. Video is becoming the main form of content. We used to see text, then more and more images became available on the internet, and today video has become the most important form of content on the internet. People’s mindsets are also changing toward video content. Today, if you search for, let’s say, the general relativity theory, you probably would imagine that a Wikipedia entry would come up as the first result. In the case of Baidu, a Baidu encyclopedia entry would come up as the first result. But if you think about it, a video clip, video content, could be a better answer for this query because we can probably find a very good talker, very good teacher, to talk about the relativity theory in a good way, very easy-to-understand way. You feel you’re connected to the teacher, to the person who created that content, instead of just hard text. This kind of theory is relatively hard to understand. And video provides a lower barrier to entry for this kind of knowledge and content.
So this is what we see during the mobile internet age.
And in the age of AI, search is evolving too. So, how is AI changing search?
We’re also seeing a number of trends. The first is that the first result is typically the right answer. Right now about 60% of queries are answered by the first result.
So, we are increasingly giving direct answers instead of a very large number of links for the users to find the right answer. And I believe this kind of scenario will become more and more popular, or, an increasing number of queries will be answered directly by the first result or by a paragraph of content.
So right now it’s like 60%, it will go to 70%, 80%, or even 90%. So increasingly your query will be answered directly instead of going through a list of websites or links. Because, if you think about the search problem, it’s essentially an AI problem. Although, 20, 25 years ago, when search engines became popular, the technology behind it had nothing to do with AI. But search is essentially an AI problem because you basically, humans, express their request, their interest, in the form of queries or text, then we use computers to guess what that human or what user means, then come up with the relevant answer. And if you think about AI, that’s pretty much the definition of AI, letting computers understand humans and serve humans.
So solving the search problem is pretty much like solving the general AI problem. It is a hard problem, but we are getting closer and closer.
Then second, content feed blends with search results. What that means – given that in a lot of cases, in most cases, the first result is the right answer, or we can directly answer your question without having you go through a large number of links, so the rest of the links becomes redundant.
We actually don’t need to give you a lot of redundant content. So once your query is answered, what we would like to give you is knowledge related to that topic, but not directly on that topic.
For example, if you search for Van Gogh, and the first result is about the general introduction of Van Gogh, then the second one can be a general introduction about Monet. It doesn’t have to have the word Van Gogh in it. Once your question is answered, we can expand the content based on your interest, not necessarily related to your query, based on our understanding of your interests, of you as a user. In the age of mobile, we actually know a lot more about our users than the PC era, so we can actually extend the user’s interests a lot. We can give them more and let users spend more time. In China, on average every user spends about five hours on the mobile phone (per day) and that’s still increasing. People spend more and more time, and for search, we can directly answer users’ queries in one shot, so we are giving more and more relevant content to our users.
Then the third, I think many of you already have this kind of experience, the camera and microphone become the new keyboard. You don’t have to express your interest in text only, you can express your interest in speech, in images, or in video. If you are interested in a certain plant and wonder what the name is, you can just use your camera and point to that flower and it will tell you. This has increasingly become accurate because of AI.
So if we have to look back for the past 10 years, as we just entered 2020, I think if we need to put a label on the economy, I would call it the internet economy, because internet changed our lives, changed a lot of things over the past 10 years. Let me show you a video clip to demonstrate that:
It changed payment, food delivery, retail, ride-hailing.
And more importantly, I think entertainment. Internet changed entertainment. Ten years ago most of us spent a lot of time watching TV. Today, I was at a forum a couple of weeks ago, it’s about this size, about 400 people. And I asked, who of you watched TV last night, and none of them raised their hand.
Today they spend, you know, five hours playing games or watching short videos just using their mobile phone. They don’t watch TV anymore. So the internet fundamentally changed the way people entertain themselves. But, going forward, I think we are entering a new age, the age of AI. So, the characteristics of the economy will also change. So in the coming decade, I would label it as “intelligent economy”. What does that mean, is that if we can see that internet changed the way we consume, or internet changed the way we entertain ourselves, the intelligent economy will change the way we produce. It will significantly improve productivity for humans.
There are also three layers I’d like to go through. The first one is the new mode of human-machine interaction, the second one is how AI transforms industry after industry, and the last one I’d like to talk about is the infrastructure for AI.
The new human-computer interaction. I think many of you already have this kind of experience. Today, new cars sold on the market are all connected cars, meaning that they are connected to the internet. When you get into a car, you have a screen (that is) bigger and better than your mobile phone screen. You have more expensive microphones, you have cameras, all kinds of sensors in the car, so essentially when you get into the car, you don’t need to use your mobile phone anymore. This video shows that.
So you can see that it’s pretty much all voice controlled. It connects with all kinds of car services, content, and it responds on a continuous basis. You don’t have to use wake words every time. And this is an experience that’s already on the market today.
And at home, you will also have an experience that is very different from today’s mobile internet.
So when you have a smart display at home like this, chances are that you will use your mobile phone less. If you want to know the weather tomorrow, you ask this kind of smart display and it will answer you directly. But if you want to get the weather report from your mobile phone, you typically need to pull out your mobile phone from your pocket, unlock it, find the right app, and type in the destination. It requires a lot of steps.
But for a voice-first device like this, it’s much more direct and more convenient. The barrier to entry is also lower. You don’t even need to be literate. You use talk and it will get you the answer.
So because of this, for the past 10 years, we humans are increasingly dependent on mobile phones. I would say over the next 10 years we will be less dependent on the mobile phone, less and less, because wherever you go, there are surrounding sensors, there are infrastructure, that can answer your question, that can serve you. So you don’t have to pull out your mobile phone every time. This is the power of AI.
In production, we also have this kind of new human-machine interaction. We call it “digital person”. It’s essentially a virtual assistant in the form of human, and doing things that complete your task, like this:
Why is this useful? In this case, we’re using it for bank services. A lot of banks cannot afford to open all kinds of different branches in many cities. It’s very expensive to rent that kind of real estate and hire lots of people. But we can establish this kind of virtual assistant, if you want to open a bank account or if you want to borrow money, or any kind of bank services that require human assistance, you can do that through this kind of virtual assistant.
And we found that people, users, feel more comfortable to deal with a virtual person than a real person. So not only does it save money, save space, it also becomes more user-friendly. You don’t have any pressure. You can say whatever you want and do whatever you want.
So all of these are changing the way we interact with computers or machines. And AI is also transforming a lot of industries, in the sense of higher efficiency and lower risk. Let me go through a couple of them.
Customer service. You’ve seen the virtual assistant case for banking, but in many other industries, customer service can be transformed by AI. We’ve been working with a number of telecom operators to assist their customer service using virtual assistants. You know in China, I think in India too, a typical telecom operator has like 100 different plans. When a customer calls in, the customer service people can typically recommend a plan that is suitable for that person.
But how do you figure out what’s the best plan for that user in one or two minutes? It’s very challenging for a real person. But for a virtual assistant, it’s actually very easy and quick, and we can use this kind of virtual assistant to do a much more efficient customer service. That’s for the telecom industry, and for many other industries we can also find similar cases.
For education, it’s a similar thing. We can come up with a personal tutor, personal assistant, to help students to learn new things. When the student has any kind of questions or problems, we use this kind of virtual assistant to help walk through all kinds of knowledge points and help the students learn.
Also, for the pharmaceutical industry, AI will accelerate the pace of drug discovery. We see a lot of startups doing this. Using AI, you can come up with all kinds of different combinations of molecules as drug targets. So you can very quickly generate a lot of potential drug targets and let the biologists, the scientists, to sift through and validate those drug targets.
AI is transforming transportation. This is a very big deal in China because in China we have built a lot of transport-related infrastructure: highways, metros, overpasses. It costs a lot of money. But the software layer of the transportation has not been improved much. In the age of AI, we think that’s going to change dramatically. This is a video showing you that.
(This is the so-called V2X, vehicle to everything, especially V2I, or vehicle to infrastructure. The roadside units will communicate with cars to improve the efficiency of transportation, avoid blind points on the road, assist self-driving, manage parking)
Apollo is an open source platform for automated driving. But it’s not just for driving, I think it’s for the whole transportation system. It’s going to take many more years for fully autonomous cars to be available everywhere. But before that we can already use AI to significantly improve transportation.
Today, every year, more than a million people get killed in car accidents. We think using AI we can significantly reduce the fatality rate for that. Using AI, if you take over the traffic lights you can in real time get a sense of how many cars are there, which direction are they driving, and at what speed, and you can intelligently remind cars that are at risk using the roadside sensors. You can also in real time adjust the traffic light time so that the whole city works in a harmonious way, that the delay will be significantly reduced.
In a Chinese city called Baoding we took over almost all the traffic lights in that city and we were able to reduce the wait time by 20% to 30% during peak hours, so reduce traffic delays by 20% to 30%.
Now let’s talk about the infrastructure. We know that infrastructure is very important. Highways and high speed rail significantly propelled the growth of China’s economy over the past few decades, but going forward, I think the infrastructure for AI will significantly propel the speed of innovation. That includes the app development platform, deep learning framework, general AI technology, and chips designed specifically for AI.
At Baidu, we have more than 2,000 engineers working on our AI platform. The goal is to let all the other developers, we have millions of developers, to develop all kinds of applications in a more convenient way, a faster way, and a lower cost way.
For conversational AI, we have DuerOS that’s used for smart speakers, smart display, or any kind of IOT devices. For Baidu Cloud, it’s optimized for all kinds of AI applications. Apollo, I’ve talked about it, it’s an open source platform for autonomous driving. We now have more than 175 eco partners, including all of the major OEMs, Mercedes, BMW, Toyota, Ford. And for Baidu Brain we provide all kinds of basic AI capabilities such as voice recognition, computer vision, natural language processing, and all kinds of recommendation platforms that we use for mobile content.
And PaddlePaddle is the deep learning framework originated from China, like TensorFlow or PyTorch.
This video shows our AI platform. It’s being used in all kinds of scenarios.
So AI is a big wave, but not every company, not everyone has the power to develop a full-fledged cutting edge AI technology. That’s why AI platform is very important and that’s why we’ve devoted a lot of resources to this kind of open source, open platform so that everyone can take advantage of that.
We also use AI for public welfare.
We use AI to help find missing people. In China, we’ve already found more than 9,000 missing people using AI technology, pretty much facial recognition technology. Even if after a person is missing for more than 20-years, we had a case, a boy, he was lost at age four and at age 25 he was identified as that missing person.
And we use AI to help the visually impaired people. We’ve installed the Baidu Xiaodu smart speaker in a lot of the blind massage parlors. Those massage therapists who are visually impaired can use voice to control air conditioning, control the curtains, control a lot of IoT devices, which makes their life much easier.
So AI can be used in a lot of these public welfare cases.
I also have a claim, AI will make you immortal. What does that mean? It means that machines can become smarter and smarter, can learn from humans. And today, storage has become cheaper and cheaper, and we can afford to store a lot of personal information. For example, I make a speech here and it is video-taped, it can be stored for a long time. And your voice can be stored, your video can be stored, your text, your articles, everything about yourself can be digitized.
And later on, based on this kind of digital information or content, computers can learn how you think. So after a while, it’s not hard to imagine when Tim Cook wants to evaluate whether Apple should work on an autonomous driving project, he can actually ask Steve Jobs, the digital copy of Steve Jobs, if that’s a good idea. Because there is a lot of information about Steve Jobs stored on the internet, and computers can learn the way Jobs thinks. So this makes Jobs immortal. But it’s not just Jobs, anyone, anyone’s information can be stored, can be learned, and made available when necessary. So in a sense, AI will make you immortal.
That’s how fascinating innovation is, that’s how fascinating AI is. India is one of the fastest-growing smartphone markets in the world, and India is also a very large developing country right next to China. We’ve seen fast growth for both countries over the past few decades. And I think for next decade, there will be more opportunities for us. So we at Baidu are very much looking forward to working with Indian institutions to make a better world through innovation. Thank you all.
Host: Firstly, what are the future plans of Baidu? Does it plan to grow internationally?
Robin: We already have a presence in a number of markets, like Japan, Thailand, and Egypt, but our main product, search, is pretty much domestic. We think in the age of AI, search will be very different from what we see today. Once we can transform search into a very different product, we will be ready to launch that internationally.
Host: Secondly, Baidu’s search engine has been custom made in the Chinese language. Can it be edited[GM1] so that it can be used in Hindi or English as well?
Robin: The fundamental technology I think is very similar, but like I said, it’s still evolving very quickly. In the past, I think search technology is pretty much a statistic technology. Today, everything is machine learning. As we progress, as AI technology progresses, as natural language understanding technology progresses, like I talked about during the presentation, more and more user requests will be answered by one shot. In that case, I think search will be very different and that will create a lot of opportunities. In the future, you will not be able to distinguish between search and feed, or search and recommendation. Search is giving you the right answer right away, but recommendation will let you consume a lot more content and let you learn a lot more knowledge. You will have much better access to a lot of new knowledge.
Host: Finally, what would be your message to all these Indian entrepreneurs who aspire to make a bid in the technology world?
Robin: I’m very happy to see you all here on a Saturday afternoon, I think you are very talented and India is a very large market. We are very fortunate to live in an age full of innovation opportunities, especially in the area of artificial intelligence. I think once you get into this field, focus on things that you are familiar with, focus on things that you really love, you will be able to find great opportunities.