Fei-Fei Li: The Woman Behind the AI Boom

Category Machine Learning

tldr #

Fei-Fei Li is the woman behind today's artificial intelligence boom and also the founding director of Stanford University's Institute for Human-Centered Artificial Intelligence. Li's new memoir recounts her pioneering work in curating the ImageNet dataset that accelerated the computer vision branch of AI. Li has spoken with the Associated Press about all the misconceptions of AI and how tech should be built around universal human values of dignity and a better life.


content #

She's an important figure behind today's artificial intelligence boom, but not all computer scientists thought Fei-Fei Li was on the right track when she came up with the idea for a giant visual database called ImageNet that took years to build. Li, now a founding director of Stanford University's Institute for Human-Centered Artificial Intelligence, is out with a new memoir that recounts her pioneering work in curating the dataset that accelerated the computer vision branch of AI.

Before becoming the director of the Institute for Human-Centered AI, Li served as the Chief Scientist of AI/ML at Google Cloud from 2017 to 2018.

The book, "The World I See," also portrays her formative years that abruptly shifted from China to New Jersey and follows her through academia, Silicon Valley and the halls of Congress as growing commercialization of AI technology brought public attention and a backlash. She spoke with The Associated Press about the book and the current AI moment. The interview has been edited for length and clarity.

Li was the inaugural Sequoia Professor of Computer Science at Stanford University.

Q: Your book describes how you envisioned ImageNet as more than just a huge data set. Can you explain? .

A: ImageNet really is the quintessential story of identifying the North Star of an AI problem and then finding a way to get there. The North Star for me was to really rethink how we can solve the problem of visual intelligence. One of the most fundamental problems in visual intelligence is understanding, or seeing, objects because the world is made of objects. Human vision is grounded in our understanding of objects. And there are many, many, many of them. ImageNet is really an attempt to define the problem of object recognition and also to provide a path to solve it, which is the big data path.

Li was the first female director of the Stanford Artificial Intelligence Lab (SAIL)

Q: If I could time travel back 15 years ago when you're hard at work on ImageNet and told you about DALL-E, Stable Diffusion, Google Gemini and ChatGPT—what would most surprise you? .

A: What does not surprise me is that everything you mention—DALL-E, ChatGPT, Gemini—is large-data based. They are pretrained on a large amount of data. That's exactly what I was hoping for. What surprised me is we got to generative AI faster than most of us thought. Generation for humans is actually not that easy. Most of us are not natural artists. The easiest generation for humans are words because speaking is generative, but drawing and painting is not generative for normal humans. We need the Van Goghs of the world.

In 2010 Li was the sole woman on the list of 100 most influential people in the world by Time Magazine.

Q: What do you think most people want from intelligent machines and is that aligned with what scientists and tech companies are building? .

A: I think fundamentally people want dignity and a good life. That's almost the founding principle of our country. Machines and tech should be aligned with universal human values—dignity and a better life, including freedom and all of those things. Sometimes when we talk about tech or sometimes when we build tech, whether it's intended or unintended, we don't talk enough about that. When I say 'we,' it includes technologists, it includes businesses, but also includes journalists. It's our collective responsibility.

Li co-founded the non-profit organization AI4All, which is focused on making AI education more accessible.

Q: What are the biggest misconceptions about AI? .

A: The biggest misconception of AI in journalism is when journalists use the subject AI and a verb and put humans in the object. Human agency is not fully appreciated if the media is not balanced. AI doesn't think, AI doesn't do anything, AI doesn't feel, AI doesn't fix things. AI needs humans.


hashtags #
worddensity #

Share