AI class unit 6: Difference between revisions
No edit summary |
No edit summary |
||
Line 6: | Line 6: | ||
{{#ev:youtubehd|s4Ou3NRJc-s}} | {{#ev:youtubehd|s4Ou3NRJc-s}} | ||
So welcome to the class on unsupervised learning. We talked a lot about supervised learning in which we are given data and target labels. In unsupervised learning we're just given data. So here is a data matrix of data items of n features each. There's m in total. | |||
[[File:AI class 2011-11-03-173600.PNG]] | |||
So the task of unsupervised learning is to find structure in data of this type. To illustrate why this is an interesting problem let me start with a quiz. Suppose we have two feature values. One over here, and one over here, and our data looks as follows. Even though we haven't been told anything in unsupervised learning, I'd like to quiz your intuition on the following two questions: First, is there structure? Or put differently do you think there's something to be learned about data like this, or is it entirely random? And second, to narrow this down, it feels that there are clusters of data the way I do it. So how many clusters can you see? And I give you a could of choices, 1, 2, 3, 4, or none. | |||
[[File:AI class 2011-11-03-174200.PNG]] |
Revision as of 14:49, 3 November 2011
These are my notes for unit 6 of the AI class.
Unsupervised Learning
Unsupervised Learning
{{#ev:youtubehd|s4Ou3NRJc-s}}
So welcome to the class on unsupervised learning. We talked a lot about supervised learning in which we are given data and target labels. In unsupervised learning we're just given data. So here is a data matrix of data items of n features each. There's m in total.
So the task of unsupervised learning is to find structure in data of this type. To illustrate why this is an interesting problem let me start with a quiz. Suppose we have two feature values. One over here, and one over here, and our data looks as follows. Even though we haven't been told anything in unsupervised learning, I'd like to quiz your intuition on the following two questions: First, is there structure? Or put differently do you think there's something to be learned about data like this, or is it entirely random? And second, to narrow this down, it feels that there are clusters of data the way I do it. So how many clusters can you see? And I give you a could of choices, 1, 2, 3, 4, or none.